Deployment Options
Choose the deployment method that best fits your needs:
LangGraph Platform
One-command serverless deployment
- Best for: Quick production deployments
- Complexity: ★☆☆☆☆ Minimal
- Time: ~2 minutes*
- Cost: Pay-per-use
Google Cloud Run
GCP serverless container platform
- Best for: GCP-native applications
- Complexity: ★★☆☆☆ Low
- Time: ~10 minutes*
- Cost: Pay-per-use
Kubernetes
Production-grade K8s deployment
- Best for: Enterprise, self-hosted
- Complexity: ★★★★☆ High
- Time: ~1-2 hours*
- Cost: Fixed + usage
Docker
Quick Docker Compose setup
- Best for: Development, testing
- Complexity: ★☆☆☆☆ Minimal
- Time: ~15 minutes*
- Cost: Hosting only
Helm
Flexible Helm chart deployment
- Best for: Customizable K8s
- Complexity: ★★★☆☆ Medium
- Time: ~30 minutes*
- Cost: Fixed + usage
Kustomize
Environment-specific overlays
- Best for: Multi-environment K8s
- Complexity: ★★★☆☆ Medium
- Time: ~45 minutes*
- Cost: Fixed + usage
*Time estimates assume prerequisites are configured (accounts, credentials, CLI tools installed). First-time setup may take longer. See individual deployment guides for detailed requirements.
Platform Comparison
| Feature | LangGraph Platform | Cloud Run | Kubernetes | Docker |
|---|---|---|---|---|
| Setup Time | ~2 min* | ~10 min* | ~1-2 hrs* | ~15 min* |
| Infrastructure | ✅ None | ⚠️ Minimal | ❌ Complex | ⚠️ Basic |
| Scaling | ✅ Auto | ✅ Auto | ⚠️ Manual config | ❌ Manual |
| LangSmith Integration | ✅ Built-in | ⚠️ Manual | ⚠️ Manual | ⚠️ Manual |
| Versioning | ✅ Built-in | ⚠️ Manual | ⚠️ Manual | ❌ None |
| Cost | Pay-per-use | Pay-per-use | Fixed + usage | Hosting only |
| Best For | Quick production | GCP apps | Enterprise | Development |
Recommendation: Start with LangGraph Platform for fastest time-to-production, or Cloud Run if you’re already on GCP. Use Kubernetes for enterprise self-hosted deployments.
Architecture
Supported Platforms
- Google Cloud
- AWS
- Azure
- Other
Google Kubernetes Engine (GKE)
Fully supported and tested
- ✅ Autopilot and Standard clusters
- ✅ Workload Identity for secrets
- ✅ Cloud Armor for DDoS protection
- ✅ Cloud Load Balancing
- ✅ Managed Prometheus
Pre-Deployment Checklist
1. Review Security Audit
Run the security audit checklist and ensure all checks pass before deploying.
2. Configure Secrets
Set up secrets management:
- Configure Infisical project
- Generate JWT secret: `openssl rand -base64 32`
- Get LLM API keys
- Set up OpenFGA store and model IDs
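As a minimal sketch of the JWT step above, the snippet below generates the secret, checks its length, and writes it to a file only the current user can read. The variable name `JWT_SECRET` and the file name `jwt-secret.env` are assumptions for illustration; in practice the value would be stored in the Infisical project rather than left on disk.

```shell
# Generate 32 random bytes, base64-encoded (44 characters including padding).
JWT_SECRET="$(openssl rand -base64 32)"

# Sanity-check the length before storing the value anywhere.
if [ "${#JWT_SECRET}" -ne 44 ]; then
  echo "unexpected secret length: ${#JWT_SECRET}" >&2
fi

# Write the secret with owner-only permissions.
# jwt-secret.env is a hypothetical file name, not one the project defines.
( umask 077; printf 'JWT_SECRET=%s\n' "$JWT_SECRET" > jwt-secret.env )
```

Delete the file once the value has been copied into your secrets manager.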
3. Build Container Image
4. Prepare Infrastructure
- Provision Kubernetes cluster
- Set up DNS records
- Configure TLS certificates
- Deploy monitoring stack
- Set up OpenFGA with PostgreSQL backend
5. Review Configuration
Update configuration for production:
- Set `ENVIRONMENT=production`
- Configure resource limits
- Enable autoscaling
- Set up network policies
- Configure ingress and TLS
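To illustrate the production settings listed above, here is a hedged sketch of a Deployment manifest that sets `ENVIRONMENT=production` and declares resource requests and limits. The name `my-agent`, the image reference, and all CPU and memory figures are placeholders, not values taken from this project; size them from your own measurements.

```shell
# Write an illustrative Deployment manifest to the current directory.
# Every name and number below is a placeholder for demonstration.
cat > deployment-production.yaml <<'EOF'
apiVersion: apps/v1
kind: Deployment
metadata:
  name: my-agent
spec:
  replicas: 3
  selector:
    matchLabels:
      app: my-agent
  template:
    metadata:
      labels:
        app: my-agent
    spec:
      containers:
        - name: my-agent
          image: registry.example.com/my-agent:1.0.0
          env:
            - name: ENVIRONMENT
              value: production
          resources:
            requests:
              cpu: 250m
              memory: 512Mi
            limits:
              cpu: "1"
              memory: 1Gi
EOF

# Validate and apply once the values are adjusted:
#   kubectl apply --dry-run=client -f deployment-production.yaml
#   kubectl apply -f deployment-production.yaml
```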
Quick Deploy (Docker)
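For a quick test deployment, use the Docker Compose setup described in the Docker deployment guide.
Production Deploy (Kubernetes)
For production deployment, follow the Kubernetes deployment guide.
Environment Configuration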
- Development
- Staging
- Production
Monitoring & Observability
Once the monitoring stack is deployed, the dashboards are available at:
- Grafana: http://grafana.yourdomain.com (default login admin/admin; change it after first sign-in)
- Jaeger: http://jaeger.yourdomain.com
- Prometheus: http://prometheus.yourdomain.com
Scaling
Horizontal Pod Autoscaling
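A minimal sketch of a HorizontalPodAutoscaler using the `autoscaling/v2` API. It assumes a Deployment named `my-agent` (a hypothetical name) and scales on average CPU utilization; the replica bounds and the 70% target are illustrative values only.

```shell
# Write an illustrative HPA manifest; my-agent and all thresholds are assumptions.
cat > hpa.yaml <<'EOF'
apiVersion: autoscaling/v2
kind: HorizontalPodAutoscaler
metadata:
  name: my-agent
spec:
  scaleTargetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: my-agent
  minReplicas: 3
  maxReplicas: 10
  metrics:
    - type: Resource
      resource:
        name: cpu
        target:
          type: Utilization
          averageUtilization: 70
EOF

# Apply with:
#   kubectl apply -f hpa.yaml
```

CPU utilization is measured against the container's CPU request, so the target only works if the pods declare one.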
Vertical Pod Autoscaling
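A hedged sketch of a VerticalPodAutoscaler in recommendation-only mode. It assumes the VPA components are installed in the cluster (they are not part of core Kubernetes) and that the target Deployment is named `my-agent`, which is a hypothetical name.

```shell
# Write an illustrative VPA manifest; my-agent is a placeholder name.
cat > vpa.yaml <<'EOF'
apiVersion: autoscaling.k8s.io/v1
kind: VerticalPodAutoscaler
metadata:
  name: my-agent
spec:
  targetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: my-agent
  updatePolicy:
    # "Off" only publishes recommendations and never evicts pods.
    updateMode: "Off"
EOF

# Apply, then read the recommendations:
#   kubectl apply -f vpa.yaml
#   kubectl describe vpa my-agent
```

Recommendation-only mode is chosen here because letting VPA and HPA both act on CPU or memory for the same workload can make them work against each other.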
High Availability
Ensure high availability with:
- Multiple Replicas: Run at least 3 replicas
- Pod Disruption Budget: Maintain minimum availability
- Multi-Zone Deployment: Spread across availability zones
- Health Checks: Liveness and readiness probes
- Graceful Shutdown: Handle SIGTERM properly
- Circuit Breakers: Fail fast with timeouts
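The Pod Disruption Budget item above could look like the following sketch. It assumes pods labelled `app: my-agent` (a hypothetical label) and pairs with the three-replica minimum: with `minAvailable: 2`, at most one pod can be evicted at a time during voluntary disruptions such as node drains.

```shell
# Write an illustrative PodDisruptionBudget; the label and name are assumptions.
cat > pdb.yaml <<'EOF'
apiVersion: policy/v1
kind: PodDisruptionBudget
metadata:
  name: my-agent
spec:
  minAvailable: 2
  selector:
    matchLabels:
      app: my-agent
EOF

# Apply with:
#   kubectl apply -f pdb.yaml
```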
Disaster Recovery
1. Backup Strategy
- OpenFGA PostgreSQL database
- Configuration and secrets
- Persistent volumes
2. Recovery Procedure
- Restore database from backup
- Redeploy application
- Verify functionality
3. Testing
- Test disaster recovery quarterly
- Document RTO/RPO
- Update runbooks
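The database backup and restore steps above can be sketched with the standard PostgreSQL tools. This is an illustration under assumptions: `OPENFGA_DATABASE_URL` is a hypothetical environment variable holding the PostgreSQL connection URI, and the backup file naming scheme is invented for the example.

```shell
# Build a UTC-timestamped backup file name.
backup_file="openfga-$(date -u +%Y%m%dT%H%M%SZ).dump"

# OPENFGA_DATABASE_URL is a hypothetical variable; this project may name it differently.
if [ -n "${OPENFGA_DATABASE_URL:-}" ] && command -v pg_dump >/dev/null 2>&1; then
  # Custom format is compressed and can be restored selectively with pg_restore.
  pg_dump --format=custom --file="$backup_file" "$OPENFGA_DATABASE_URL"
  echo "wrote $backup_file"
else
  echo "skipping dump: set OPENFGA_DATABASE_URL and install pg_dump" >&2
fi

# To restore from the backup:
#   pg_restore --clean --if-exists --no-owner \
#     --dbname="$OPENFGA_DATABASE_URL" "$backup_file"
```

Timing a restore into a scratch database during the quarterly test gives a measured figure to record as the RTO.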
Security Hardening
Production security checklist:
- TLS enabled for all endpoints
- Network policies applied
- Pod Security Standards enforced (PodSecurityPolicy was removed in Kubernetes 1.25)
- RBAC configured with least privilege
- Secrets encrypted at rest
- Audit logging enabled
- Regular security scans
- Rate limiting configured
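Two of the checklist items above, network policies and pod security enforcement, can be sketched as manifests. The namespace name `my-agent` is a placeholder. The NetworkPolicy denies all ingress to pods in the namespace by default, so additional policies are needed to allow the traffic you expect.

```shell
# Write an illustrative namespace and default-deny policy; my-agent is a placeholder.
cat > security-baseline.yaml <<'EOF'
apiVersion: v1
kind: Namespace
metadata:
  name: my-agent
  labels:
    # Pod Security Admission: reject pods that violate the "restricted" profile.
    pod-security.kubernetes.io/enforce: restricted
---
apiVersion: networking.k8s.io/v1
kind: NetworkPolicy
metadata:
  name: default-deny-ingress
  namespace: my-agent
spec:
  # An empty podSelector matches every pod in the namespace.
  podSelector: {}
  policyTypes:
    - Ingress
EOF

# Apply with:
#   kubectl apply -f security-baseline.yaml
```

NetworkPolicy objects only take effect when the cluster's network plugin enforces them.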
Cost Optimization
Cost-saving options vary by platform (GKE, EKS, AKS). On GKE:
- Use Autopilot for hands-off management
- Enable cluster autoscaling
- Use preemptible nodes for dev/staging
- Configure resource requests accurately
- Use committed use discounts