Why We Built Vanna 2.0
The Journey So Far
When we open-sourced Vanna, our goal was clear: become the default tool people reach for when they want to generate SQL using LLMs. We achieved that goal. With over 20,000 stars on GitHub, Vanna has surpassed and outlasted other options. Developers around the world choose Vanna every day for text-to-SQL generation.
What We Learned
The original Vanna was optimized for GPT-4 and single-user demos. While it continues to gain new users daily, itâs fundamentally a product of its eraâbuilt before we truly understood what production AI agents needed.
We learned three critical lessons from our enterprise customers:
1. User Identity Canât Be Bolted On
Every enterprise customer asked the same question: âHow do I make this work for multiple users with different permissions?â Most agent frameworks are built for single-user demos or operate in âglobalâ scope. They simply donât have multi-tenant isolation built in. Adding it yourself means building your own wrapper framework on top of the frameworkâyou end up rewriting everything when you go from prototype to production.
2. LLMs and Humans Need Different Outputs
When a SQL query returns 10,000 rows, dumping that into the LLMâs context is a waste of tokens. But showing the user just âQuery succeededâ is a waste of their time.
We discovered that tools need to return two things: a concise summary for the LLM (âQuery returned 10,000 rows, hereâs a sampleâŚâ) and rich interactive components for the user (full sortable data table with export).
Nobody else was solving this problem. Other agent frameworks focused on streaming text. But production applications need status cards that update, progress bars that fill, and artifacts that render externallyânot append-only chat logs.
3. Enterprises Need the Full Stack
When an enterprise says âwe want to self-host,â they donât just mean the agent loop. They mean:
- Secure Python sandboxes for code execution
- Audit logs for compliance
- Quota management per user
- Row-level security for database access
- Conversation persistence with encryption
- LLM failover when Claude is down
These arenât nice-to-haves. Theyâre table stakes for production. And every customer was building the same infrastructure from scratch.
The GPT-5 Moment
Weâre at the threshold of GPT-5 class modelsâmodels with dramatically improved reasoning, longer context windows, and true agentic capabilities. But hereâs the problem: most agent frameworks were designed for GPT-4âs limitations, not GPT-5âs possibilities.
Vanna 2.0 is our answer. Itâs not an incremental updateâitâs a complete rethinking of what an agent framework should be in the GPT-5 era.
Why Vanna 2.0 Exists
The core problem with existing agent frameworks is that theyâre built for single-user demos or operate in global scope. Multi-tenant isolation is an afterthought, if itâs considered at all. This creates a gap between prototyping and production that costs months of engineering time and introduces security risks.
POC Hell: The Enterprise Agent Crisis
Most enterprise AI agent deployments never make it past proof-of-concept. The demo works perfectly for a single user. But when it's time to deploy to real users with different permissions, everything breaks.
Teams spend 6+ months rewriting their entire agent architecture to add user isolation, permission enforcement, and audit logging. By the time they're ready to deploy, the business has moved on or the budget is gone.
Identity-First Architecture
Every request in Vanna 2.0 carries user identity, permissions, and workspace context from the moment it enters the system until tools finish executing. User isolation isnât something you configureâitâs impossible to build an agent without it.
Key Benefits:
- User context flows automatically to every tool
- Tools enforce their own permission logic based on user identity
- No manual plumbing required - itâs built into the framework
- Multi-tenant by design, not an afterthought
Dual-Output Model
Tools return results optimized for both LLM reasoning and user interaction. The LLM sees: âQuery returned 1,247 rows.â The user sees: an interactive data table with sort, filter, and export.
This saves tokens, improves UX, and keeps your LLM context clean. No other framework does this.
Production-Ready from Line One
Unlike other frameworks where you prototype with one architecture and rewrite for production, Vanna 2.0 uses the same codebase from first prototype to multi-tenant production:
- Prototype: Test with MockLlmService, no API keys needed
- Add Tools: Register SQL, file system, or custom tools
- Production: Add your real LLM, enable streaming, deploy as-is
No rewrites. No surprises. The same agent that works on your laptop scales to 10,000 users.
Beyond SQL
Hereâs what we realized: thereâs nothing about Vanna 2.0âs architecture thatâs specific to SQL or databases. The identity-first design, dual outputs, and rich components work for any enterprise agent use case:
- Coding assistants that respect workspace boundaries
- Document Q&A that enforces document-level permissions
- Workflow automation with user-scoped execution
- Customer support with quota enforcement
- Data visualization that generates Plotly charts and dashboards
The technical foundation we built for SQL agents turned out to be exactly what enterprises need for any AI agent.
What Weâre Not
Weâre not trying to be:
- A low-code builder - You write Python
- A ChatGPT wrapper - Weâre a framework for building your own agents
- SQL-only - Thatâs just one use case we happen to know well
What We Are
Weâre the agent framework for enterprises that need to ship to production.
- Open source - MIT licensed, no vendor lock-in
- Identity-first - Multi-tenant by design, not as an afterthought
- Production-ready - Quota management, permissions, audit logs built-in
- Developer-friendly - Start in 5 minutes, scale without rewrites
The Road Ahead
Weâre not anchored to SQL anymore. Vanna 2.0 is a platform for building any kind of enterprise agent.
Our roadmap:
- Launch Vanna 2.0 open source framework
- Launch Cloud Platform with premium features
- Enterprise VPC deployments
- First-party tools marketplace (Python sandboxes, RLS, failover)
- Ecosystem of community-built tools and templates
Join Us
We learned from older versions of Vanna. We listened to our enterprise customers. We saw where the market is heading.
Vanna 2.0 isnât just a better version of what existsâitâs what should have existed from the beginning.
If youâre building AI agents for production, we built this for you.
Quickstart Notebook â Open in Colab â View on GitHub â Join Discord â