Simple, transparent pricing

Start free with Lite, pay only for what you use, scale without limits.

Pricing

Simple, transparent pricing

Pay only for what you use. No hidden fees.

MonthlyAnnualSave 20%

Lite

For experimentation and prototyping

$0pay-as-you-go
  • Pay-as-you-go inference
  • Community models (Llama, Qwen, Mistral)
  • 5 RAG knowledge bases
  • 1 GB vector storage
  • 5 GB document storage
  • SSO authentication
  • Code execution (30s max)
  • Community support
Most Popular

Developer

For production workloads with pay-as-you-go

$49/ month + usage
5% usage discount
  • 5% usage discount
  • All models (70B+, vision, code)
  • 25 RAG knowledge bases
  • 25 GB vector storage
  • 100 GB document storage
  • SSO authentication
  • Hybrid search + reranking
  • Streaming & function calling
  • Code execution (120s max)
  • Email + Discord support
  • 99.9% uptime SLA

Pro

For scaling teams with advanced needs

$99/ month + usage
10% usage discount
  • 10% usage discount
  • Everything in Developer
  • 100 RAG knowledge bases
  • 100 GB vector storage
  • 500 GB document storage
  • SSO authentication
  • Priority support
  • 3,000 requests/min rate limit
  • Code execution (180s max)
  • Advanced analytics

Enterprise

For teams with custom requirements

Custom
  • Custom usage discount
  • Everything in Pro
  • Dedicated GPU clusters
  • Custom model fine-tuning
  • SSO / SAML / SCIM
  • VPC peering & private endpoints
  • Unlimited RAG storage
  • Code execution (300s max)
  • Dedicated account manager
  • SLA up to 99.99%

Per-model pricing

Prices per 1 million tokens. Input and output priced separately.

ModelInputOutputContext
Llama 3.3 70B$0.20$0.60128K
Llama 3.3 8B$0.05$0.10128K
Qwen 3 32B$0.10$0.30128K
Qwen 3 8B$0.04$0.08128K
Mistral Large 2$0.30$0.90128K
Mistral 7B$0.04$0.0832K
DeepSeek V3$0.15$0.45128K

RAG storage & limits

ResourceLiteDeveloperProEnterprise
Vector storage1 GB25 GB100 GBUnlimited
Document storage5 GB100 GB500 GBUnlimited
Knowledge bases525100Unlimited
Extra vector storage$0.10 / GB / mo$0.10 / GB / mo$0.10 / GB / moCustom
Extra document storage$0.02 / GB / mo$0.02 / GB / mo$0.02 / GB / moCustom

Embedding models

ModelPrice
BGE Large EN v1.5$0.01 / 1M tokens
E5 Large v2$0.01 / 1M tokens
Cohere Embed v3$0.10 / 1M tokens

Reranking models

ModelPrice
BGE Reranker Large$0.02 / 1K queries
Cohere Rerank v3$0.10 / 1K queries

Audio pricing

Speech-to-text and text-to-speech at simple per-unit rates.

Speech-to-Text

ModelPrice
Whisper Large v3$0.01 / minute

Text-to-Speech

ModelPrice
Kokoro 82M$15.00 / 1M characters

Image generation pricing

ModelPriceSizes
FLUX.1 Schnell$0.02 / image256x256, 512x512, 1024x1024

Code execution pricing

EnvironmentPriceDescription
Python$0.03 / minutePython 3.12 with data science packages

Frequently asked questions

Build the fastest apps

Join thousands of developers using Tensoras to ship AI-powered products that feel instant. Start free, scale without limits.