Levain LabsLevain Labs
Models

Models

The AI models available on Levain, grouped by provider, with token prices.

Levain routes every agent's LLM traffic for you, so you can pick the model that fits each task. Claude models are first-class on every agent; other providers are available too — some out of the box, others when you bring your own key.

Provider list prices per 1M tokens, shown for reference as of July 2026. See Pricing for what you'll actually pay. Models marked Your own key need a connected provider key; the rest are available by default.

Anthropic

Frontier reasoning and agentic-coding leader; long-horizon autonomy and 1M-token context. First-class on every Levain lane.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Claude Fable 5
claude-fable-51M$10.00$50.00Available
Claude Opus 4.8
claude-opus-4-81M$5.00$25.00Available
Claude Sonnet 5
claude-sonnet-51M$3.00$15.00Available
Claude Haiku 4.5
claude-haiku-4-5200K$1.00$5.00Available
Claude Opus 4.1
claude-opus-4-1200K$15.00$75.00Available
Claude Opus 4.5
claude-opus-4-5200K$5.00$25.00Available
Claude Opus 4.6
claude-opus-4-6-v11M$5.00$25.00Available
Claude Opus 4.7
claude-opus-4-71M$5.00$25.00Available
Claude Sonnet 4.5
claude-sonnet-4-5200K$3.00$15.00Available
Claude Sonnet 4.6
claude-sonnet-4-61M$3.00$15.00Available

OpenAI

Broad ecosystem; strong general-purpose flagships.

ModelModel IDContextInput / 1MOutput / 1MAvailability
GPT OSS Safeguard 120B
openai.gpt-oss-safeguard-120b128K$0.15$0.60Available
GPT OSS Safeguard 20B
openai.gpt-oss-safeguard-20b128K$0.070$0.20Available
gpt-oss-120b
openai.gpt-oss-120b-1:0128K$0.15$0.60Available
gpt-oss-20b
openai.gpt-oss-20b-1:0128K$0.070$0.30Available
GPT-5.5
gpt-5.51M$5.00$30.00Your own key
GPT-5.4
gpt-5.41M$2.50$15.00Your own key

Google

Gemma open models run by default; Gemini flagships run with your own Google key.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Gemma 3 12B IT
google.gemma-3-12b-it128K$0.090$0.29Available
Gemma 3 27B IT
google.gemma-3-27b-it128K$0.23$0.38Available
Gemma 3 4B IT
google.gemma-3-4b-it128K$0.040$0.080Available
Gemini 3 Pro
gemini-3-pro-preview1M$2.00$12.00Your own key
Gemini 3 Flash
gemini-3-flash-preview1M$0.50$3.00Your own key

Meta

Self-hostable open-weight Llama models.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Llama 3 70B Instruct
meta.llama3-70b-instruct-v1:08K$2.65$3.50Available
Llama 3 8B Instruct
meta.llama3-8b-instruct-v1:08K$0.30$0.60Available
Llama 3.1 70B Instruct
meta.llama3-1-70b-instruct-v1:0128K$0.99$0.99Available
Llama 3.1 8B Instruct
meta.llama3-1-8b-instruct-v1:0128K$0.22$0.22Available
Llama 3.3 70B Instruct
meta.llama3-3-70b-instruct-v1:0128K$0.72$0.72Available
Llama 4 Maverick 17B Instruct
meta.llama4-maverick-17b-instruct-v1:0128K$0.24$0.97Available
Llama 4 Scout 17B Instruct
meta.llama4-scout-17b-instruct-v1:0128K$0.17$0.66Available

Mistral

European provider; Devstral targets coding.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Devstral 2 123B
mistral.devstral-2-123b256K$0.40$2.00Available
Magistral Small 2509
mistral.magistral-small-2509128K$0.50$1.50Available
Ministral 14B 3.0
mistral.ministral-3-14b-instruct128K$0.20$0.20Available
Ministral 3 8B
mistral.ministral-3-8b-instruct128K$0.15$0.15Available
Ministral 3B
mistral.ministral-3-3b-instruct128K$0.10$0.10Available
Mistral 7B Instruct
mistral.mistral-7b-instruct-v0:232K$0.15$0.20Available
Mistral Large (24.02)
mistral.mistral-large-2402-v1:032K$8.00$24.00Available
Mistral Large 3
mistral.mistral-large-3-675b-instruct256K$0.50$1.50Available
Mistral Small (24.02)
mistral.mistral-small-2402-v1:032K$1.00$3.00Available
Mixtral 8x7B Instruct
mistral.mixtral-8x7b-instruct-v0:132K$0.45$0.70Available
Pixtral Large (25.02)
mistral.pixtral-large-2502-v1:0128K$2.00$6.00Available
Voxtral Mini 3B 2507
mistral.voxtral-mini-3b-2507128K$0.040$0.040Available
Voxtral Small 24B 2507
mistral.voxtral-small-24b-2507128K$0.10$0.30Available

DeepSeek

Extreme price/performance for extraction and coding tiers; open weights.

ModelModel IDContextInput / 1MOutput / 1MAvailability
DeepSeek V3.2
deepseek.v3.2164K$0.62$1.85Available
DeepSeek-R1
deepseek.r1-v1:0128K$1.35$5.40Available

Alibaba (Qwen)

Open-weight coder and vision lineup; budget tier.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Qwen3 32B (dense)
qwen.qwen3-32b-v1:0131K$0.15$0.60Available
Qwen3 Coder Next
qwen.qwen3-coder-next262K$0.50$1.20Available
Qwen3 Next 80B A3B
qwen.qwen3-next-80b-a3b128K$0.15$1.20Available
Qwen3 VL 235B A22B
qwen.qwen3-vl-235b-a22b128K$0.53$2.66Available
Qwen3-Coder-30B-A3B-Instruct
qwen.qwen3-coder-30b-a3b-v1:0262K$0.15$0.60Available

xAI

The Grok family; runs with your own xAI key.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Grok 4
xai/grok-4256K$3.00$15.00Your own key

MiniMax

Cost outlier for agentic coding.

ModelModel IDContextInput / 1MOutput / 1MAvailability
MiniMax M2
minimax.minimax-m2128K$0.30$1.20Available
MiniMax M2.1
minimax.minimax-m2.1196K$0.30$1.20Available
MiniMax M2.5
minimax.minimax-m2.51M$0.30$1.20Available

Amazon

The Nova family — fast, low-cost multimodal models.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Nova 2 Lite
amazon.nova-2-lite-v1:01M$0.30$2.50Available
Nova Lite
amazon.nova-lite-v1:0300K$0.060$0.24Available
Nova Micro
amazon.nova-micro-v1:0128K$0.035$0.14Available
Nova Pro
amazon.nova-pro-v1:0300K$0.80$3.20Available

Z.AI

The GLM family; open-weight reasoning and coding.

ModelModel IDContextInput / 1MOutput / 1MAvailability
GLM 4.7
zai.glm-4.7200K$0.60$2.20Available
GLM 4.7 Flash
zai.glm-4.7-flash200K$0.070$0.40Available
GLM 5
zai.glm-5200K$1.00$3.20Available

Moonshot AI

The Kimi family; long-context agentic models.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Kimi K2 Thinking
moonshot.kimi-k2-thinking128K$0.60$2.50Available
Kimi K2.5
moonshot.kimi-k2.5262K$0.60$3.00Available

NVIDIA

The Nemotron family; open reasoning models.

ModelModel IDContextInput / 1MOutput / 1MAvailability
NVIDIA Nemotron 3 Super 120B A12B
nvidia.nemotron-super-3-120b256K$0.15$0.65Available
NVIDIA Nemotron Nano 12B v2 VL BF16
nvidia.nemotron-nano-12b-v2128K$0.20$0.60Available
NVIDIA Nemotron Nano 9B v2
nvidia.nemotron-nano-9b-v2128K$0.060$0.23Available
Nemotron Nano 3 30B
nvidia.nemotron-nano-3-30b262K$0.060$0.24Available

Writer

The Palmyra family; enterprise and domain models.

ModelModel IDContextInput / 1MOutput / 1MAvailability
Palmyra X4
writer.palmyra-x4-v1:0128K$2.50$10.00Available
Palmyra X5
writer.palmyra-x5-v1:01M$0.60$6.00Available

Choosing a model

Model choice is per node in a recipe. A few rules of thumb:

  • Default to the cheapest model that survives the task. Reserve the top tier for deep reasoning, planning, and long-horizon autonomy.
  • Tune effort before jumping tiers. A smaller model at higher effort often matches a bigger one at a fraction of the cost.
  • Cost concentrates in loops. A model choice on a node that runs once is a rounding error; the same choice inside a per-item or retry loop dominates the run's cost.

For how billing works — provider cost plus a flat platform fee, or margin-only when you bring your own key — see Pricing and Bring Your Own Key.

On this page