Deployment Overview

📋 Planned

Documentation Under Construction

This page will guide you through deploying Vanna 2.0 to production, covering FastAPI and Flask servers, environment configuration, and common deployment scenarios.

Planned Content

  • FastAPI deployment (development vs. production with Uvicorn + Gunicorn)
  • Flask deployment options
  • Environment variables and configuration
  • Docker containerization
  • Health checks and monitoring
  • Scaling considerations (stateless agents, connection pooling)
  • Links to cloud-specific guides (AWS, GCP, Azure)
  • Production checklist

Want to contribute or suggest improvements? Open an issue on GitHub

Deployment Options

When complete, this will cover:

  • Development: uvicorn app:app --reload
  • Production: Uvicorn + Gunicorn workers
  • Docker container example

Flask

  • Development: flask run
  • Production: Gunicorn or uWSGI
  • Docker container example

Common Scenarios

  • Internal tool: Deploy to AWS ECS with SSO
  • Multi-tenant SaaS: Deploy to GCP Cloud Run with row-level security
  • Embedded feature: Add routes to existing FastAPI app

Production Checklist

Coming soon:

  • Environment variables configured
  • Authentication implemented
  • Database connection pooling enabled
  • Health checks configured
  • Audit logging enabled
  • Monitoring/observability set up