Documentation Index
Fetch the complete documentation index at: https://mcp-server-langgraph.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Interactive TCO Calculator
Calculate the Total Cost of Ownership (TCO) for deploying AI agents across different frameworks. Adjust the sliders to match your expected usage and see real-time cost comparisons.Usage Configuration
Monthly Request Volume
10K1M requests/month10M
Average Tokens per Request
1001,000 tokens/request10K
Team Size (Developers)
13 developers20
Workflow Complexity
Deployment Region
LLM Model Selection
Cost Breakdown by Framework
The costs below are baseline estimates for 1M requests/month with Gemini 2.5 Flash. Use the sliders and dropdowns above to see dynamic calculations based on your specific configuration.
MCP Server with LangGraph (Self-Hosted - Kubernetes)
| Cost Component | Monthly Cost | Notes |
|---|---|---|
| Infrastructure | $350 | GKE cluster (4 pods, n2-standard-4) |
| LLM API Costs | $420 | Gemini 2.5 Flash @ 2.50 per 1M tokens |
| Observability | $50 | LangSmith + Prometheus + Grafana |
| Storage | $20 | PostgreSQL + Redis (state/checkpoints) |
| Networking | $15 | Load balancer + egress |
| DevOps Time | $500 | ~10 hours/month @ $50/hr (maintenance) |
| Total TCO | $1,355/month | $1.36 per 1,000 requests |
💰 Cost Savings: $3,745/month vs LangGraph Cloud (73% cheaper)
MCP Server with LangGraph (Cloud Run - Serverless)
| Cost Component | Monthly Cost | Notes |
|---|---|---|
| Cloud Run | $600 | Serverless container hosting |
| LLM API Costs | $420 | Gemini 2.5 Flash @ 2.50 per 1M tokens |
| Observability | $50 | LangSmith + Cloud Monitoring |
| Storage | $30 | Cloud SQL + Memorystore |
| Networking | $25 | Cloud Load Balancer |
| DevOps Time | $200 | ~4 hours/month (less maintenance) |
| Total TCO | $1,325/month | $1.33 per 1,000 requests |
💰 Cost Savings: $3,775/month vs LangGraph Cloud (74% cheaper)
LangGraph Cloud (Managed Platform)
| Cost Component | Monthly Cost | Notes |
|---|---|---|
| Platform Fees | $5,000 | $0.001/node × 5M node executions |
| Uptime Fees | $100 | Always-on deployment |
| LLM API Costs | $0 | Included in platform fees |
| Observability | $0 | LangSmith included |
| Storage | $0 | Included in platform |
| DevOps Time | $0 | Fully managed |
| Total TCO | $5,100/month | $5.10 per 1,000 requests |
⚠️ Trade-off: Zero DevOps effort, but 4x higher cost at scale
OpenAI AgentKit (Platform)
| Cost Component | Monthly Cost | Notes |
|---|---|---|
| Platform Fees | $0 | No separate AgentKit fee |
| GPT-5.1 API | $1,688 | 1B tokens @ 10 per 1M tokens |
| Web Search | $500 | 50K searches @ $10/1K calls |
| ChatKit Storage | $20 | 200 GB-days @ $0.10/GB-day |
| Observability | $0 | Basic Evals included |
| DevOps Time | $0 | Fully managed |
| Total TCO | $2,208/month | $2.21 per 1,000 requests |
- ❌ Vendor Lock-in: OpenAI models only, limited flexibility
CrewAI (Self-Hosted)
| Cost Component | Monthly Cost | Notes |
|---|---|---|
| Infrastructure | $200 | Single VM (n2-standard-2) |
| LLM API Costs | $420 | Gemini 2.5 Flash @ 2.50 per 1M tokens |
| Observability | $0 | Basic logging only |
| Storage | $10 | Local SQLite (not production-ready) |
| Networking | $5 | Minimal |
| DevOps Time | $800 | ~16 hours/month (manual setup) |
| Total TCO | $1,435/month | $1.44 per 1,000 requests |
⚠️ Note: Low infrastructure cost, but high DevOps burden for production features
Google ADK (Vertex AI Agent Engine)
| Cost Component | Monthly Cost | Notes |
|---|---|---|
| Vertex AI Agent Engine | $1,500 | Platform fees |
| Gemini 2.5 Pro API | $1,688 | 1B tokens @ 10 per 1M tokens |
| Cloud Storage | $30 | Agent state/artifacts |
| Networking | $20 | VPC, load balancer |
| Observability | $100 | Cloud Monitoring + Trace |
| DevOps Time | $300 | ~6 hours/month |
| Total TCO | $3,638/month | $3.64 per 1,000 requests |
- ⚠️ Trade-off: Good for GCP-native, but 2.5x more expensive than self-hosted MCP Server
Cost Comparison Chart
Loading dynamic cost comparison chart…
Best Value: Calculating based on your configuration…
TCO Factors Explained
Infrastructure Costs
Infrastructure Costs
Kubernetes (GKE/EKS/AKS):
- Cluster management: $70-150/month
- Node pools: $250-400/month (4 pods, n2-standard-4)
- Load balancer: $15-30/month
- Total: ~$350/month for 1M requests
- Pay-per-use: $0.40-2.00 per 1M requests
- Always-on: Add ~$200/month for warm instances
- Total: ~$600/month for 1M requests
- Node executions: 5,000)
- API markup: Often included but higher per-request cost
LLM API Costs
LLM API Costs
Model Selection Impact:
- Gemini Flash: $0.075/1M tokens (cheapest)
- Claude Haiku: $0.25/1M tokens
- GPT-4o Mini: $0.15/1M tokens
- GPT-4: $10-30/1M tokens (most expensive)
- Gemini Flash: $75/month
- GPT-4: $10,000/month (133x more expensive!)
DevOps Time Costs
DevOps Time Costs
Self-Hosted (Kubernetes):
- Initial setup: 20-40 hours (one-time)
- Monthly maintenance: 8-12 hours
- Troubleshooting: 2-5 hours
- Total: ~10 hours/month @ 500-1,000
- Initial setup: 4-8 hours
- Monthly maintenance: 2-4 hours
- Total: ~4 hours/month = $200-400
- Zero DevOps time (fully managed)
- Trade-off: 5-10x higher platform fees
Observability Costs
Observability Costs
MCP Server (Dual Stack):
- LangSmith: $0-50/month (usage-based)
- Prometheus + Grafana: Self-hosted (free) or Cloud ($50-100/month)
- Total: ~$50-100/month
- Included in platform fees
- Limited customization
- No infrastructure metrics (only LLM tracing)
Hidden Costs
Hidden Costs
Break-Even Analysis
When Does Self-Hosting Pay Off?
- Low Volume (< 100K/month)
- Medium Volume (100K - 1M/month)
- High Volume (> 1M/month)
Recommendation: Managed platform (LangGraph Cloud or OpenAI AgentKit)Why:
- DevOps overhead dominates costs at low volume
- Platform fees are low (< $500/month)
- Time-to-market matters more than cost optimization
- 50K requests/month
- LangGraph Cloud: $250/month (platform fees)
- MCP Server: $400/month (infrastructure + DevOps)
- Winner: LangGraph Cloud (simpler, cheaper)
3-Year TCO Projection
Total Cost Over 3 Years (assuming 1M req/month growth)
| Framework | Year 1 | Year 2 | Year 3 | 3-Year Total |
|---|---|---|---|---|
| MCP Server (K8s) | $12,000 | $18,000 | $24,000 | $54,000 |
| MCP Server (Cloud Run) | $12,000 | $18,000 | $24,000 | $54,000 |
| LangGraph Cloud | $61,000 | $122,000 | $183,000 | $366,000 |
| OpenAI AgentKit | $126,000 | $252,000 | $378,000 | $756,000 |
| Google ADK | $25,000 | $40,000 | $60,000 | $125,000 |
ROI Calculator
Return on Investment for Self-Hosting
Scenario: Migrating from LangGraph Cloud to MCP Server (K8s) Initial Investment:- Setup time: 40 hours @ 4,000
- Training: $3,000
- Total: $7,000 (one-time)
- Platform cost reduction: 1,010 = $4,090/month
- 4,090 = 1.7 months
- Savings: 49,080
- Investment: $7,000
- ROI: 601% ($42,080 profit)
Cost Optimization Strategies
Choose Cheaper LLMs
Use Gemini Flash ($0.075/1M tokens) for 90% of requests, GPT-4 for complex reasoning only.Savings: 10-20x on LLM costs
Implement Caching
Cache common queries with Redis TTL.Savings: 30-50% reduction in LLM API calls
Batch Requests
Group multiple requests for parallel processing.Savings: 20% infrastructure efficiency gain
Use Spot Instances
Kubernetes node pools with spot/preemptible VMs.Savings: 60-80% on compute costs
Frequently Asked Questions
Why is MCP Server cheaper than managed platforms?
Why is MCP Server cheaper than managed platforms?
Three reasons:
- No Platform Markup: You pay cloud costs directly (no 10x markup)
- Model Flexibility: Use cheaper LLMs (Gemini Flash vs GPT-4)
- Economies of Scale: Self-hosting benefits from volume (managed platforms charge per-request)
Are these costs accurate?
Are these costs accurate?
These estimates are based on:
- Public pricing (GCP, AWS, Azure, LLM providers) as of 2025
- Typical deployment patterns from real-world usage
- Average assumptions (e.g., 1,000 tokens/request)
- Negotiated enterprise pricing
- Regional pricing differences
- Actual token usage (can be 100-10,000+ tokens/request)
- Custom configurations and optimizations
What about egress costs?
What about egress costs?
Included in estimates but can vary significantly:
- GCP: $0.12/GB (North America)
- AWS: $0.09/GB (first 10 TB)
- Azure: $0.087/GB (first 5 GB free)
- Average response: 2 KB
- Total egress: 2 GB = $0.18-0.24/month (negligible)
How do I reduce DevOps time?
How do I reduce DevOps time?
Strategies:
- Use Cloud Run instead of Kubernetes (4 hours vs 10 hours/month)
- Automate deployments with CI/CD (GitHub Actions, ArgoCD)
- Use Helm charts provided by MCP Server (pre-configured)
- Enable auto-scaling to reduce manual intervention
- Invest in monitoring to catch issues early
Next Steps
Try MCP Server
Get started in 5 minutes
Deployment Guide
Deploy to Kubernetes
Multi-LLM Setup
Advanced cost reduction strategies
Framework Comparison
Full framework comparison