Simple, transparent pricing
Start free with Lite, pay only for what you use, scale without limits.
Pricing
Simple, transparent pricing
Pay only for what you use. No hidden fees.
MonthlyAnnualSave 20%
Most Popular
Developer
For production workloads with pay-as-you-go
$49/ month + usage
5% usage discount
- 5% usage discount
- All models (70B+, vision, code)
- 25 RAG knowledge bases
- 25 GB vector storage
- 100 GB document storage
- SSO authentication
- Hybrid search + reranking
- Streaming & function calling
- Code execution (120s max)
- Email + Discord support
- 99.9% uptime SLA
Per-model pricing
Prices per 1 million tokens. Input and output priced separately.
| Model | Input | Output | Context |
|---|---|---|---|
| Llama 3.3 70B | $0.20 | $0.60 | 128K |
| Llama 3.3 8B | $0.05 | $0.10 | 128K |
| Qwen 3 32B | $0.10 | $0.30 | 128K |
| Qwen 3 8B | $0.04 | $0.08 | 128K |
| Mistral Large 2 | $0.30 | $0.90 | 128K |
| Mistral 7B | $0.04 | $0.08 | 32K |
| DeepSeek V3 | $0.15 | $0.45 | 128K |
RAG storage & limits
| Resource | Lite | Developer | Pro | Enterprise |
|---|---|---|---|---|
| Vector storage | 1 GB | 25 GB | 100 GB | Unlimited |
| Document storage | 5 GB | 100 GB | 500 GB | Unlimited |
| Knowledge bases | 5 | 25 | 100 | Unlimited |
| Extra vector storage | $0.10 / GB / mo | $0.10 / GB / mo | $0.10 / GB / mo | Custom |
| Extra document storage | $0.02 / GB / mo | $0.02 / GB / mo | $0.02 / GB / mo | Custom |
Embedding models
| Model | Price |
|---|---|
| BGE Large EN v1.5 | $0.01 / 1M tokens |
| E5 Large v2 | $0.01 / 1M tokens |
| Cohere Embed v3 | $0.10 / 1M tokens |
Reranking models
| Model | Price |
|---|---|
| BGE Reranker Large | $0.02 / 1K queries |
| Cohere Rerank v3 | $0.10 / 1K queries |
Audio pricing
Speech-to-text and text-to-speech at simple per-unit rates.
Speech-to-Text
| Model | Price |
|---|---|
| Whisper Large v3 | $0.01 / minute |
Text-to-Speech
| Model | Price |
|---|---|
| Kokoro 82M | $15.00 / 1M characters |
Image generation pricing
| Model | Price | Sizes |
|---|---|---|
| FLUX.1 Schnell | $0.02 / image | 256x256, 512x512, 1024x1024 |
Code execution pricing
| Environment | Price | Description |
|---|---|---|
| Python | $0.03 / minute | Python 3.12 with data science packages |
