Pricing
Pay per request. Not per token.
Zero markup on LLM costs. They pass through at cost. You pay only for what Toolken adds: full visibility and control over every request.
Developer
$0
Free forever
For prototyping and side projects. Not for production.
Start for free- Universal API: all providers & models
- Logs & metrics: cost, tokens, latency, errors/status
- Cost attribution & filtering by any metadata (feature, customer, agent…)
- Usage KPIs + dashboards
- Retention: 7 days
- 1 API key · solo (no team)
Coming soon
- automatic fallback
Starter
$25
/ month · billed monthly
For solo builders going live with real users.
Get startedEverything in Developer, plus:
- Team members: invite your workspace
- Multiple API keys & environments (prod/staging)
- Retention: 30 days
Coming soon
- monthly spend cap
- response cache (exact)
- agent tracing (session + trace)
Production
$49
/ month · billed monthly
For teams that need cost visibility per customer, agent, and feature.
Upgrade nowEverything in Starter, plus:
- RBAC (roles & permissions)
- Retention: 90 days
- Production support, 24h response
Coming soon
- budgets & caps per key / team / dimension
- smart routing + load balancing + fallback
- Vault (enforced budgets & key control)
- semantic caching · PII anonymizer
- MCP Gateway + coding-CLI routing
- full transcript logging
Enterprise
Custom
Custom pricing
For orgs with compliance, scale, and multiple teams.
Talk to usEverything in Production, plus:
- Compliance & private deployment: on roadmap, talk to us
- Custom retention · guaranteed SLA · dedicated CS
Coming soon
- SSO (Okta / Google) + SCIM
Questions? [email protected]
Compare plans
Compare every plan.
Live feature · Soon On the roadmap · — Not included
| Feature | Developer Free 10,000 req / mo | Starter $25/mo 50,000 req / mo | Production $49/mo 100,000 req / mo | Enterprise Custom 10M+ req / mo |
|---|---|---|---|---|
| Included requests | ||||
| Requests / month | 10,000 | 50,000 | 100,000 | 10M+ |
| Overage | — | — | +$9 / 100k req | Custom |
| AI Gateway | ||||
| Universal API (all providers & models) | ||||
| Automatic fallback | Soon | Soon | Soon | Soon |
| Smart routing & load balancing | — | — | Soon | Soon |
| Retry dedup & rate-limit passthrough | — | — | Soon | Soon |
| Caching | ||||
| Response cache (exact) | — | Soon | Soon | Soon |
| Semantic cache | — | — | Soon | Soon |
| Observability | ||||
| Logs & metrics (cost, tokens, latency, errors/status) | ||||
| Cost attribution & filtering by any metadata | ||||
| Usage KPIs & dashboards | ||||
| Agent tracing (session + trace) | — | Soon | Soon | Soon |
| Full transcript logging | — | — | Soon | Soon |
| Data retention | 7 days | 30 days | 90 days | Custom |
| Cost controls | ||||
| Monthly spend cap | — | Soon | Soon | Soon |
| Budgets & caps per key / team / dimension | — | — | Soon | Soon |
| Vault (enforced budgets & key control) | — | — | Soon | Soon |
| Agent gateway | ||||
| MCP Gateway (govern agent tool traffic) | — | — | Soon | Soon |
| Coding-CLI routing (Claude Code, Codex, Cursor) | — | — | Soon | Soon |
| Team & access | ||||
| Team members (invite your workspace) | — | |||
| Multiple API keys & environments | — | |||
| RBAC (roles & permissions) | — | — | ||
| Security & compliance | ||||
| PII anonymizer | — | — | Soon | Soon |
| SSO (Okta / Google) + SCIM | — | — | — | Soon |
| Compliance & private deployment (SOC2 · HIPAA · VPC) | — | — | — | On roadmap, talk to us |
| Support | ||||
| Support | Docs | Priority · 24h | Dedicated CS + SLA | |
Ready to get started?
Free forever for development. No credit card required.