Model Catalog
Models zot Knows About.
Generated from zot's built-in provider catalog in the source repo:packages/provider.
Models
943 / 943
| Model | Context | Input $/m | Output $/m | Cache Read $/m | Cache Write $/m |
|---|---|---|---|---|---|
| amazon-bedrock (90 models) | |||||
AU Anthropic Claude Opus 4.6 au.anthropic.claude-opus-4-6-v1 reasoning | 1,000,000 | $16.5 | $82.5 | $0.5 | $6.25 |
AU Anthropic Claude Opus 4.8 au.anthropic.claude-opus-4-8 reasoning | 1,000,000 | $16.5 | $82.5 | $0.5 | $6.25 |
AU Anthropic Claude Sonnet 4.6 au.anthropic.claude-sonnet-4-6 reasoning | 1,000,000 | $3.3 | $16.5 | $0.33 | $4.125 |
Claude Haiku 4.5 anthropic.claude-haiku-4-5-20251001-v1:0 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Haiku 4.5 (AU) au.anthropic.claude-haiku-4-5-20251001-v1:0 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Haiku 4.5 (EU) eu.anthropic.claude-haiku-4-5-20251001-v1:0 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Haiku 4.5 (Global) global.anthropic.claude-haiku-4-5-20251001-v1:0 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Haiku 4.5 (US) us.anthropic.claude-haiku-4-5-20251001-v1:0 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Opus 4.1 anthropic.claude-opus-4-1-20250805-v1:0 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.1 (US) us.anthropic.claude-opus-4-1-20250805-v1:0 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.5 anthropic.claude-opus-4-5-20251101-v1:0 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.5 (EU) eu.anthropic.claude-opus-4-5-20251101-v1:0 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.5 (Global) global.anthropic.claude-opus-4-5-20251101-v1:0 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.5 (US) us.anthropic.claude-opus-4-5-20251101-v1:0 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 anthropic.claude-opus-4-6-v1 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 (EU) eu.anthropic.claude-opus-4-6-v1 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 (Global) global.anthropic.claude-opus-4-6-v1 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 (US) us.anthropic.claude-opus-4-6-v1 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 anthropic.claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 (EU) eu.anthropic.claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 (Global) global.anthropic.claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 (JP) jp.anthropic.claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 (US) us.anthropic.claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 anthropic.claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 (EU) eu.anthropic.claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 (Global) global.anthropic.claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 (JP) jp.anthropic.claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 (US) us.anthropic.claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Sonnet 4.5 anthropic.claude-sonnet-4-5-20250929-v1:0 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 (AU) au.anthropic.claude-sonnet-4-5-20250929-v1:0 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 (EU) eu.anthropic.claude-sonnet-4-5-20250929-v1:0 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 (Global) global.anthropic.claude-sonnet-4-5-20250929-v1:0 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 (JP) jp.anthropic.claude-sonnet-4-5-20250929-v1:0 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 (US) us.anthropic.claude-sonnet-4-5-20250929-v1:0 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 anthropic.claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 (EU) eu.anthropic.claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 (Global) global.anthropic.claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 (JP) jp.anthropic.claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 (US) us.anthropic.claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
DeepSeek-R1 deepseek.r1-v1:0 reasoning | 128,000 | $1.35 | $5.4 | $0 | - |
DeepSeek-R1 (US) us.deepseek.r1-v1:0 reasoning | 128,000 | $1.35 | $5.4 | $0 | - |
DeepSeek-V3.1 deepseek.v3-v1:0 reasoning | 163,840 | $0.58 | $1.68 | $0 | - |
DeepSeek-V3.2 deepseek.v3.2 reasoning | 163,840 | $0.62 | $1.85 | $0 | - |
Devstral 2 123B mistral.devstral-2-123b | 256,000 | $0.4 | $2 | $0 | - |
Gemma 3 4B IT google.gemma-3-4b-it | 128,000 | $0.04 | $0.08 | $0 | - |
GLM-4.7 zai.glm-4.7 reasoning | 204,800 | $0.6 | $2.2 | $0 | - |
GLM-4.7-Flash zai.glm-4.7-flash reasoning | 200,000 | $0.07 | $0.4 | $0 | - |
GLM-5 zai.glm-5 reasoning | 202,752 | $1 | $3.2 | $0 | - |
Google Gemma 3 27B Instruct google.gemma-3-27b-it | 202,752 | $0.12 | $0.2 | $0 | - |
GPT OSS Safeguard 120B openai.gpt-oss-safeguard-120b | 128,000 | $0.15 | $0.6 | $0 | - |
GPT OSS Safeguard 20B openai.gpt-oss-safeguard-20b | 128,000 | $0.07 | $0.2 | $0 | - |
gpt-oss-120b openai.gpt-oss-120b-1:0 | 128,000 | $0.15 | $0.6 | $0 | - |
gpt-oss-20b openai.gpt-oss-20b-1:0 | 128,000 | $0.07 | $0.3 | $0 | - |
Kimi K2 Thinking moonshot.kimi-k2-thinking reasoning | 262,143 | $0.6 | $2.5 | $0 | - |
Kimi K2.5 moonshotai.kimi-k2.5 reasoning | 262,143 | $0.6 | $3 | $0 | - |
Llama 3.1 70B Instruct meta.llama3-1-70b-instruct-v1:0 | 128,000 | $0.72 | $0.72 | $0 | - |
Llama 3.1 8B Instruct meta.llama3-1-8b-instruct-v1:0 | 128,000 | $0.22 | $0.22 | $0 | - |
Llama 3.3 70B Instruct meta.llama3-3-70b-instruct-v1:0 | 128,000 | $0.72 | $0.72 | $0 | - |
Llama 4 Maverick 17B Instruct meta.llama4-maverick-17b-instruct-v1:0 | 1,000,000 | $0.24 | $0.97 | $0 | - |
Llama 4 Maverick 17B Instruct (US) us.meta.llama4-maverick-17b-instruct-v1:0 | 1,000,000 | $0.24 | $0.97 | $0 | - |
Llama 4 Scout 17B Instruct meta.llama4-scout-17b-instruct-v1:0 | 3,500,000 | $0.17 | $0.66 | $0 | - |
Llama 4 Scout 17B Instruct (US) us.meta.llama4-scout-17b-instruct-v1:0 | 3,500,000 | $0.17 | $0.66 | $0 | - |
Magistral Small 1.2 mistral.magistral-small-2509 reasoning | 128,000 | $0.5 | $1.5 | $0 | - |
MiniMax M2 minimax.minimax-m2 reasoning | 204,608 | $0.3 | $1.2 | $0 | - |
MiniMax M2.1 minimax.minimax-m2.1 reasoning | 204,800 | $0.3 | $1.2 | $0 | - |
MiniMax M2.5 minimax.minimax-m2.5 reasoning | 196,608 | $0.3 | $1.2 | $0 | - |
Ministral 14B 3.0 mistral.ministral-3-14b-instruct | 128,000 | $0.2 | $0.2 | $0 | - |
Ministral 3 3B mistral.ministral-3-3b-instruct | 256,000 | $0.1 | $0.1 | $0 | - |
Ministral 3 8B mistral.ministral-3-8b-instruct | 128,000 | $0.15 | $0.15 | $0 | - |
Mistral Large 3 mistral.mistral-large-3-675b-instruct | 256,000 | $0.5 | $1.5 | $0 | - |
Nova 2 Lite amazon.nova-2-lite-v1:0 | 128,000 | $0.33 | $2.75 | $0 | - |
Nova Lite amazon.nova-lite-v1:0 | 300,000 | $0.06 | $0.24 | $0.015 | - |
Nova Micro amazon.nova-micro-v1:0 | 128,000 | $0.035 | $0.14 | $0.00875 | - |
Nova Pro amazon.nova-pro-v1:0 | 300,000 | $0.8 | $3.2 | $0.2 | - |
NVIDIA Nemotron 3 Super 120B A12B nvidia.nemotron-super-3-120b reasoning | 262,144 | $0.15 | $0.65 | $0 | - |
NVIDIA Nemotron Nano 12B v2 VL BF16 nvidia.nemotron-nano-12b-v2 | 128,000 | $0.2 | $0.6 | $0 | - |
NVIDIA Nemotron Nano 3 30B nvidia.nemotron-nano-3-30b reasoning | 128,000 | $0.06 | $0.24 | $0 | - |
NVIDIA Nemotron Nano 9B v2 nvidia.nemotron-nano-9b-v2 | 128,000 | $0.06 | $0.23 | $0 | - |
Palmyra X4 writer.palmyra-x4-v1:0 reasoning | 122,880 | $2.5 | $10 | $0 | - |
Palmyra X5 writer.palmyra-x5-v1:0 reasoning | 1,040,000 | $0.6 | $6 | $0 | - |
Pixtral Large (25.02) mistral.pixtral-large-2502-v1:0 | 128,000 | $2 | $6 | $0 | - |
Qwen/Qwen3-Next-80B-A3B-Instruct qwen.qwen3-next-80b-a3b | 262,000 | $0.14 | $1.4 | $0 | - |
Qwen/Qwen3-VL-235B-A22B-Instruct qwen.qwen3-vl-235b-a22b | 262,000 | $0.3 | $1.5 | $0 | - |
Qwen3 235B A22B 2507 qwen.qwen3-235b-a22b-2507-v1:0 | 262,144 | $0.22 | $0.88 | $0 | - |
Qwen3 32B (dense) qwen.qwen3-32b-v1:0 reasoning | 16,384 | $0.15 | $0.6 | $0 | - |
Qwen3 Coder 30B A3B Instruct qwen.qwen3-coder-30b-a3b-v1:0 | 262,144 | $0.15 | $0.6 | $0 | - |
Qwen3 Coder 480B A35B Instruct qwen.qwen3-coder-480b-a35b-v1:0 | 131,072 | $0.22 | $1.8 | $0 | - |
Qwen3 Coder Next qwen.qwen3-coder-next reasoning | 131,072 | $0.22 | $1.8 | $0 | - |
Voxtral Mini 3B 2507 mistral.voxtral-mini-3b-2507 | 128,000 | $0.04 | $0.04 | $0 | - |
Voxtral Small 24B 2507 mistral.voxtral-small-24b-2507 | 32,000 | $0.15 | $0.35 | $0 | - |
| anthropic (24 models) | |||||
Claude Haiku 3 claude-3-haiku-20240307 | 200,000 | $0.25 | $1.25 | $0.03 | $0.3 |
Claude Haiku 3.5 claude-3-5-haiku-20241022 | 200,000 | $0.8 | $4 | $0.08 | $1 |
Claude Haiku 3.5 (latest) claude-3-5-haiku-latest | 200,000 | $0.8 | $4 | $0.08 | $1 |
Claude Haiku 4.5 claude-haiku-4-5-20251001 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Haiku 4.5 (latest) claude-haiku-4-5 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Opus 3 claude-3-opus-20240229 | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4 claude-opus-4-20250514 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4 (latest) claude-opus-4-0 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.1 claude-opus-4-1-20250805 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.1 (latest) claude-opus-4-1 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.5 claude-opus-4-5-20251101 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.5 (latest) claude-opus-4-5 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 claude-opus-4-6 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Sonnet 3 claude-3-sonnet-20240229 | 200,000 | $3 | $15 | $0.3 | $0.3 |
Claude Sonnet 3.5 claude-3-5-sonnet-20240620 | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 3.5 v2 claude-3-5-sonnet-20241022 | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 3.7 claude-3-7-sonnet-20250219 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4 claude-sonnet-4-20250514 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4 (latest) claude-sonnet-4-0 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 claude-sonnet-4-5-20250929 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 (latest) claude-sonnet-4-5 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
| azure-openai-responses (42 models) | |||||
GPT-4 gpt-4 | 8,192 | $30 | $60 | $0 | - |
GPT-4 Turbo gpt-4-turbo | 128,000 | $10 | $30 | $0 | - |
GPT-4.1 gpt-4.1 | 1,047,576 | $2 | $8 | $0.5 | - |
GPT-4.1 mini gpt-4.1-mini | 1,047,576 | $0.4 | $1.6 | $0.1 | - |
GPT-4.1 nano gpt-4.1-nano | 1,047,576 | $0.1 | $0.4 | $0.03 | - |
GPT-4o gpt-4o | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o (2024-05-13) gpt-4o-2024-05-13 | 128,000 | $5 | $15 | $0 | - |
GPT-4o (2024-08-06) gpt-4o-2024-08-06 | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o (2024-11-20) gpt-4o-2024-11-20 | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o mini gpt-4o-mini | 128,000 | $0.15 | $0.6 | $0.08 | - |
GPT-5 gpt-5 reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5 Chat Latest gpt-5-chat-latest | 128,000 | $1.25 | $10 | $0.125 | - |
GPT-5 Mini gpt-5-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT-5 Nano gpt-5-nano reasoning | 400,000 | $0.05 | $0.4 | $0.005 | - |
GPT-5 Pro gpt-5-pro reasoning | 400,000 | $15 | $120 | $0 | - |
GPT-5-Codex gpt-5-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 gpt-5.1 reasoning | 400,000 | $1.25 | $10 | $0.13 | - |
GPT-5.1 Chat gpt-5.1-chat-latest reasoning | 128,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Codex gpt-5.1-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Codex Max gpt-5.1-codex-max reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Codex mini gpt-5.1-codex-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT-5.2 gpt-5.2 reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Chat gpt-5.2-chat-latest reasoning | 128,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Codex gpt-5.2-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Pro gpt-5.2-pro reasoning | 400,000 | $21 | $168 | $0 | - |
GPT-5.3 Chat (latest) gpt-5.3-chat-latest | 128,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex gpt-5.3-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex Spark gpt-5.3-codex-spark reasoning | 128,000 | $1.75 | $14 | $0.175 | - |
GPT-5.4 gpt-5.4 reasoning | 272,000 | $2.5 | $15 | $0.25 | - |
GPT-5.4 mini gpt-5.4-mini reasoning | 400,000 | $0.75 | $4.5 | $0.075 | - |
GPT-5.4 nano gpt-5.4-nano reasoning | 400,000 | $0.2 | $1.25 | $0.02 | - |
GPT-5.4 Pro gpt-5.4-pro reasoning | 1,050,000 | $30 | $180 | $0 | - |
GPT-5.5 gpt-5.5 reasoning | 272,000 | $5 | $30 | $0.5 | - |
GPT-5.5 Pro gpt-5.5-pro reasoning | 1,050,000 | $30 | $180 | $0 | - |
o1 o1 reasoning | 200,000 | $15 | $60 | $7.5 | - |
o1-pro o1-pro reasoning | 200,000 | $150 | $600 | $0 | - |
o3 o3 reasoning | 200,000 | $2 | $8 | $0.5 | - |
o3-deep-research o3-deep-research reasoning | 200,000 | $10 | $40 | $2.5 | - |
o3-mini o3-mini reasoning | 200,000 | $1.1 | $4.4 | $0.55 | - |
o3-pro o3-pro reasoning | 200,000 | $20 | $80 | $0 | - |
o4-mini o4-mini reasoning | 200,000 | $1.1 | $4.4 | $0.28 | - |
o4-mini-deep-research o4-mini-deep-research reasoning | 200,000 | $2 | $8 | $0.5 | - |
| cerebras (4 models) | |||||
GPT OSS 120B gpt-oss-120b reasoning | 131,072 | $0.25 | $0.69 | $0 | - |
Llama 3.1 8B llama3.1-8b | 32,000 | $0.1 | $0.1 | $0 | - |
Qwen 3 235B Instruct qwen-3-235b-a22b-instruct-2507 | 131,000 | $0.6 | $1.2 | $0 | - |
Z.AI GLM-4.7 zai-glm-4.7 | 131,072 | $2.25 | $2.75 | $0 | - |
| cloudflare-ai-gateway (36 models) | |||||
Claude Haiku 3 claude-3-haiku | 200,000 | $0.25 | $1.25 | $0.03 | $0.3 |
Claude Haiku 3.5 (latest) claude-3-5-haiku | 200,000 | $0.8 | $4 | $0.08 | $1 |
Claude Haiku 3.5 (latest) claude-3.5-haiku | 200,000 | $0.8 | $4 | $0.08 | $1 |
Claude Haiku 4.5 (latest) claude-haiku-4-5 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Opus 3 claude-3-opus | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4 (latest) claude-opus-4 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.1 (latest) claude-opus-4-1 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.5 (latest) claude-opus-4-5 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 (latest) claude-opus-4-6 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Sonnet 3 claude-3-sonnet | 200,000 | $3 | $15 | $0.3 | $0.3 |
Claude Sonnet 3.5 v2 claude-3.5-sonnet | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4 (latest) claude-sonnet-4 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 (latest) claude-sonnet-4-5 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
GLM-4.7-Flash workers-ai/@cf/zai-org/glm-4.7-flash reasoning | 131,072 | $0.06 | $0.4 | $0 | - |
GPT-4 gpt-4 | 8,192 | $30 | $60 | $0 | - |
GPT-4 Turbo gpt-4-turbo | 128,000 | $10 | $30 | $0 | - |
GPT-4o gpt-4o | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o mini gpt-4o-mini | 128,000 | $0.15 | $0.6 | $0.08 | - |
GPT-5.1 gpt-5.1 reasoning | 400,000 | $1.25 | $10 | $0.13 | - |
GPT-5.1 Codex gpt-5.1-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.2 gpt-5.2 reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Codex gpt-5.2-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex gpt-5.3-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.4 gpt-5.4 reasoning | 1,050,000 | $2.5 | $15 | $0.25 | - |
GPT-5.5 gpt-5.5 reasoning | 1,050,000 | $5 | $30 | $0.5 | - |
Kimi K2.5 workers-ai/@cf/moonshotai/kimi-k2.5 reasoning | 256,000 | $0.6 | $3 | $0.1 | - |
Kimi K2.6 workers-ai/@cf/moonshotai/kimi-k2.6 reasoning | 256,000 | $0.95 | $4 | $0.16 | - |
Nemotron 3 Super 120B workers-ai/@cf/nvidia/nemotron-3-120b-a12b reasoning | 256,000 | $0.5 | $1.5 | $0 | - |
o1 o1 reasoning | 200,000 | $15 | $60 | $7.5 | - |
o3 o3 reasoning | 200,000 | $2 | $8 | $0.5 | - |
o3-mini o3-mini reasoning | 200,000 | $1.1 | $4.4 | $0.55 | - |
o3-pro o3-pro reasoning | 200,000 | $20 | $80 | $0 | - |
o4-mini o4-mini reasoning | 200,000 | $1.1 | $4.4 | $0.28 | - |
| cloudflare-workers-ai (12 models) | |||||
Gemma 4 26B A4B IT @cf/google/gemma-4-26b-a4b-it reasoning | 256,000 | $0.1 | $0.3 | $0 | - |
GLM-4.7-Flash @cf/zai-org/glm-4.7-flash reasoning | 131,072 | $0.0605 | $0.4 | $0 | - |
GPT OSS 120B @cf/openai/gpt-oss-120b reasoning | 128,000 | $0.35 | $0.75 | $0 | - |
GPT OSS 20B @cf/openai/gpt-oss-20b reasoning | 128,000 | $0.2 | $0.3 | $0 | - |
Granite 4.0 H Micro @cf/ibm-granite/granite-4.0-h-micro | 131,000 | $0.017 | $0.112 | $0 | - |
Kimi K2.5 @cf/moonshotai/kimi-k2.5 reasoning | 256,000 | $0.6 | $3 | $0.1 | - |
Kimi K2.6 @cf/moonshotai/kimi-k2.6 reasoning | 262,144 | $0.95 | $4 | $0.16 | - |
Llama 3.3 70B Instruct fp8 Fast @cf/meta/llama-3.3-70b-instruct-fp8-fast | 24,000 | $0.293 | $2.253 | $0 | - |
Llama 4 Scout 17B 16E Instruct @cf/meta/llama-4-scout-17b-16e-instruct | 131,000 | $0.27 | $0.85 | $0 | - |
Mistral Small 3.1 24B Instruct @cf/mistralai/mistral-small-3.1-24b-instruct | 128,000 | $0.351 | $0.555 | $0 | - |
Nemotron 3 Super 120B @cf/nvidia/nemotron-3-120b-a12b reasoning | 256,000 | $0.5 | $1.5 | $0 | - |
Qwen3 30B A3b fp8 @cf/qwen/qwen3-30b-a3b-fp8 reasoning | 32,768 | $0.0509 | $0.335 | $0 | - |
| deepseek (2 models) | |||||
DeepSeek V4 Flash deepseek-v4-flash reasoning | 1,000,000 | $0.14 | $0.28 | $0.0028 | - |
DeepSeek V4 Pro deepseek-v4-pro reasoning | 1,000,000 | $0.435 | $0.87 | $0.003625 | - |
| fireworks (12 models) | |||||
DeepSeek V4 Flash accounts/fireworks/models/deepseek-v4-flash reasoning | 1,000,000 | $0.14 | $0.28 | $0.03 | - |
DeepSeek V4 Pro accounts/fireworks/models/deepseek-v4-pro reasoning | 1,000,000 | $1.74 | $3.48 | $0.145 | - |
GLM 5.1 accounts/fireworks/models/glm-5p1 reasoning | 202,800 | $1.4 | $4.4 | $0.26 | - |
GLM 5.1 Fast accounts/fireworks/routers/glm-5p1-fast reasoning | 202,800 | $2.8 | $8.8 | $0.52 | - |
GPT OSS 120B accounts/fireworks/models/gpt-oss-120b reasoning | 131,072 | $0.15 | $0.6 | $0.015 | - |
GPT OSS 20B accounts/fireworks/models/gpt-oss-20b reasoning | 131,072 | $0.07 | $0.3 | $0.035 | - |
Kimi K2.5 accounts/fireworks/models/kimi-k2p5 reasoning | 256,000 | $0.6 | $3 | $0.1 | - |
Kimi K2.6 accounts/fireworks/models/kimi-k2p6 reasoning | 262,000 | $0.95 | $4 | $0.16 | - |
Kimi K2.6 Turbo accounts/fireworks/routers/kimi-k2p6-turbo reasoning | 262,000 | $2 | $8 | $0.3 | - |
MiniMax-M2.5 accounts/fireworks/models/minimax-m2p5 reasoning | 196,608 | $0.3 | $1.2 | $0.03 | - |
MiniMax-M2.7 accounts/fireworks/models/minimax-m2p7 reasoning | 196,608 | $0.3 | $1.2 | $0.06 | - |
Qwen 3.6 Plus accounts/fireworks/models/qwen3p6-plus reasoning | 128,000 | $0.5 | $3 | $0.1 | - |
| github-copilot (21 models) | |||||
Claude Haiku 4.5 claude-haiku-4.5 reasoning | 144,000 | $0 | $0 | $0 | - |
Claude Opus 4.5 claude-opus-4.5 reasoning | 160,000 | $0 | $0 | $0 | - |
Claude Opus 4.6 claude-opus-4.6 reasoning | 1,000,000 | $0 | $0 | $0 | - |
Claude Opus 4.7 claude-opus-4.7 reasoning | 144,000 | $0 | $0 | $0 | - |
Claude Opus 4.8 claude-opus-4.8 reasoning | 144,000 | $0 | $0 | $0 | - |
Claude Sonnet 4.5 claude-sonnet-4.5 reasoning | 144,000 | $0 | $0 | $0 | - |
Claude Sonnet 4.6 claude-sonnet-4.6 reasoning | 1,000,000 | $0 | $0 | $0 | - |
Gemini 2.5 Pro gemini-2.5-pro | 128,000 | $0 | $0 | $0 | - |
Gemini 3 Flash gemini-3-flash-preview reasoning | 128,000 | $0 | $0 | $0 | - |
Gemini 3.1 Pro Preview gemini-3.1-pro-preview reasoning | 128,000 | $0 | $0 | $0 | - |
Gemini 3.5 Flash gemini-3.5-flash reasoning | 128,000 | $0 | $0 | $0 | - |
GPT-4.1 gpt-4.1 | 128,000 | $0 | $0 | $0 | - |
GPT-4o gpt-4o | 128,000 | $0 | $0 | $0 | - |
GPT-5-mini gpt-5-mini reasoning | 264,000 | $0 | $0 | $0 | - |
GPT-5.2 gpt-5.2 reasoning | 264,000 | $0 | $0 | $0 | - |
GPT-5.2-Codex gpt-5.2-codex reasoning | 400,000 | $0 | $0 | $0 | - |
GPT-5.3-Codex gpt-5.3-codex reasoning | 400,000 | $0 | $0 | $0 | - |
GPT-5.4 gpt-5.4 reasoning | 400,000 | $0 | $0 | $0 | - |
GPT-5.4 Mini gpt-5.4-mini reasoning | 400,000 | $0 | $0 | $0 | - |
GPT-5.5 gpt-5.5 reasoning | 400,000 | $0 | $0 | $0 | - |
Grok Code Fast 1 grok-code-fast-1 reasoning | 128,000 | $0 | $0 | $0 | - |
| google (16 models) | |||||
Gemini 2.0 Flash gemini-2.0-flash | 1,048,576 | $0.1 | $0.4 | $0.025 | - |
Gemini 2.0 Flash-Lite gemini-2.0-flash-lite | 1,048,576 | $0.075 | $0.3 | $0 | - |
Gemini 2.5 Flash gemini-2.5-flash reasoning | 1,048,576 | $0.3 | $2.5 | $0.03 | - |
Gemini 2.5 Flash-Lite gemini-2.5-flash-lite reasoning | 1,048,576 | $0.1 | $0.4 | $0.01 | - |
Gemini 2.5 Pro gemini-2.5-pro reasoning | 1,048,576 | $1.25 | $10 | $0.125 | - |
Gemini 3 Flash Preview gemini-3-flash-preview reasoning | 1,048,576 | $0.5 | $3 | $0.05 | - |
Gemini 3 Pro Preview gemini-3-pro-preview reasoning | 1,048,576 | $2 | $12 | $0.2 | - |
Gemini 3.1 Flash Lite gemini-3.1-flash-lite reasoning | 1,048,576 | $0.25 | $1.5 | $0.025 | - |
Gemini 3.1 Flash Lite Preview gemini-3.1-flash-lite-preview reasoning | 1,048,576 | $0.25 | $1.5 | $0.025 | - |
Gemini 3.1 Pro Preview gemini-3.1-pro-preview reasoning | 1,048,576 | $2 | $12 | $0.2 | - |
Gemini 3.1 Pro Preview Custom Tools gemini-3.1-pro-preview-customtools reasoning | 1,048,576 | $2 | $12 | $0.2 | - |
Gemini 3.5 Flash gemini-3.5-flash reasoning | 1,048,576 | $1.5 | $9 | $0.15 | - |
Gemini Flash Latest gemini-flash-latest reasoning | 1,048,576 | $0.3 | $2.5 | $0.075 | - |
Gemini Flash-Lite Latest gemini-flash-lite-latest reasoning | 1,048,576 | $0.1 | $0.4 | $0.025 | - |
Gemma 4 26B A4B IT gemma-4-26b-a4b-it reasoning | 262,144 | $0 | $0 | $0 | - |
Gemma 4 31B IT gemma-4-31b-it reasoning | 262,144 | $0 | $0 | $0 | - |
| google-vertex (13 models) | |||||
Gemini 1.5 Flash (Vertex) gemini-1.5-flash | 1,000,000 | $0.075 | $0.3 | $0.01875 | - |
Gemini 1.5 Flash-8B (Vertex) gemini-1.5-flash-8b | 1,000,000 | $0.0375 | $0.15 | $0.01 | - |
Gemini 1.5 Pro (Vertex) gemini-1.5-pro | 1,000,000 | $1.25 | $5 | $0.3125 | - |
Gemini 2.0 Flash (Vertex) gemini-2.0-flash | 1,048,576 | $0.15 | $0.6 | $0.0375 | - |
Gemini 2.0 Flash Lite (Vertex) gemini-2.0-flash-lite reasoning | 1,048,576 | $0.075 | $0.3 | $0.01875 | - |
Gemini 2.5 Flash (Vertex) gemini-2.5-flash reasoning | 1,048,576 | $0.3 | $2.5 | $0.03 | - |
Gemini 2.5 Flash Lite (Vertex) gemini-2.5-flash-lite reasoning | 1,048,576 | $0.1 | $0.4 | $0.01 | - |
Gemini 2.5 Flash Lite Preview 09-25 (Vertex) gemini-2.5-flash-lite-preview-09-2025 reasoning | 1,048,576 | $0.1 | $0.4 | $0.01 | - |
Gemini 2.5 Pro (Vertex) gemini-2.5-pro reasoning | 1,048,576 | $1.25 | $10 | $0.125 | - |
Gemini 3 Flash Preview (Vertex) gemini-3-flash-preview reasoning | 1,048,576 | $0.5 | $3 | $0.05 | - |
Gemini 3 Pro Preview (Vertex) gemini-3-pro-preview reasoning | 1,000,000 | $2 | $12 | $0.2 | - |
Gemini 3.1 Pro Preview (Vertex) gemini-3.1-pro-preview reasoning | 1,048,576 | $2 | $12 | $0.2 | - |
Gemini 3.1 Pro Preview Custom Tools (Vertex) gemini-3.1-pro-preview-customtools reasoning | 1,048,576 | $2 | $12 | $0.2 | - |
| groq (18 models) | |||||
Compound groq/compound reasoning | 131,072 | $0 | $0 | $0 | - |
Compound Mini groq/compound-mini reasoning | 131,072 | $0 | $0 | $0 | - |
DeepSeek R1 Distill Llama 70B deepseek-r1-distill-llama-70b reasoning | 131,072 | $0.75 | $0.99 | $0 | - |
Gemma 2 9B gemma2-9b-it | 8,192 | $0.2 | $0.2 | $0 | - |
GPT OSS 120B openai/gpt-oss-120b reasoning | 131,072 | $0.15 | $0.6 | $0 | - |
GPT OSS 20B openai/gpt-oss-20b reasoning | 131,072 | $0.075 | $0.3 | $0 | - |
Kimi K2 Instruct moonshotai/kimi-k2-instruct | 131,072 | $1 | $3 | $0 | - |
Kimi K2 Instruct 0905 moonshotai/kimi-k2-instruct-0905 | 262,144 | $1 | $3 | $0 | - |
Llama 3 70B llama3-70b-8192 | 8,192 | $0.59 | $0.79 | $0 | - |
Llama 3 8B llama3-8b-8192 | 8,192 | $0.05 | $0.08 | $0 | - |
Llama 3.1 8B Instant llama-3.1-8b-instant | 131,072 | $0.05 | $0.08 | $0 | - |
Llama 3.3 70B Versatile llama-3.3-70b-versatile | 131,072 | $0.59 | $0.79 | $0 | - |
Llama 4 Maverick 17B meta-llama/llama-4-maverick-17b-128e-instruct | 131,072 | $0.2 | $0.6 | $0 | - |
Llama 4 Scout 17B meta-llama/llama-4-scout-17b-16e-instruct | 131,072 | $0.11 | $0.34 | $0 | - |
Mistral Saba 24B mistral-saba-24b | 32,768 | $0.79 | $0.79 | $0 | - |
Qwen QwQ 32B qwen-qwq-32b reasoning | 131,072 | $0.29 | $0.39 | $0 | - |
Qwen3 32B qwen/qwen3-32b reasoning | 131,072 | $0.29 | $0.59 | $0 | - |
Safety GPT OSS 20B openai/gpt-oss-safeguard-20b reasoning | 131,072 | $0.075 | $0.3 | $0.037 | - |
| huggingface (22 models) | |||||
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro reasoning | 1,048,576 | $1.74 | $3.48 | $0.145 | - |
DeepSeek-R1-0528 deepseek-ai/DeepSeek-R1-0528 reasoning | 163,840 | $3 | $5 | $0 | - |
DeepSeek-V3.2 deepseek-ai/DeepSeek-V3.2 reasoning | 163,840 | $0.28 | $0.4 | $0 | - |
GLM-4.7 zai-org/GLM-4.7 reasoning | 204,800 | $0.6 | $2.2 | $0.11 | - |
GLM-4.7-Flash zai-org/GLM-4.7-Flash reasoning | 200,000 | $0 | $0 | $0 | - |
GLM-5 zai-org/GLM-5 reasoning | 202,752 | $1 | $3.2 | $0.2 | - |
GLM-5.1 zai-org/GLM-5.1 reasoning | 202,752 | $1 | $3.2 | $0.2 | - |
Kimi-K2-Instruct moonshotai/Kimi-K2-Instruct | 131,072 | $1 | $3 | $0 | - |
Kimi-K2-Instruct-0905 moonshotai/Kimi-K2-Instruct-0905 | 262,144 | $1 | $3 | $0 | - |
Kimi-K2-Thinking moonshotai/Kimi-K2-Thinking reasoning | 262,144 | $0.6 | $2.5 | $0.15 | - |
Kimi-K2.5 moonshotai/Kimi-K2.5 reasoning | 262,144 | $0.6 | $3 | $0.1 | - |
Kimi-K2.6 moonshotai/Kimi-K2.6 reasoning | 262,144 | $0.95 | $4 | $0.16 | - |
MiMo-V2-Flash XiaomiMiMo/MiMo-V2-Flash reasoning | 262,144 | $0.1 | $0.3 | $0 | - |
MiniMax-M2.1 MiniMaxAI/MiniMax-M2.1 reasoning | 204,800 | $0.3 | $1.2 | $0 | - |
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 reasoning | 204,800 | $0.3 | $1.2 | $0.03 | - |
MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | - |
Qwen3-235B-A22B-Thinking-2507 Qwen/Qwen3-235B-A22B-Thinking-2507 reasoning | 262,144 | $0.3 | $3 | $0 | - |
Qwen3-Coder-480B-A35B-Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct | 262,144 | $2 | $2 | $0 | - |
Qwen3-Coder-Next Qwen/Qwen3-Coder-Next | 262,144 | $0.2 | $1.5 | $0 | - |
Qwen3-Next-80B-A3B-Instruct Qwen/Qwen3-Next-80B-A3B-Instruct | 262,144 | $0.25 | $1 | $0 | - |
Qwen3-Next-80B-A3B-Thinking Qwen/Qwen3-Next-80B-A3B-Thinking | 262,144 | $0.3 | $2 | $0 | - |
Qwen3.5-397B-A17B Qwen/Qwen3.5-397B-A17B reasoning | 262,144 | $0.6 | $3.6 | $0 | - |
| kimi (2 models) | |||||
Kimi For Coding kimi-for-coding reasoning | 262,144 | $0 | $0 | $0 | - |
Kimi K2 Thinking kimi-k2-thinking reasoning | 262,144 | - | - | - | - |
| minimax (2 models) | |||||
MiniMax-M2.7 MiniMax-M2.7 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | $0.375 |
MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed reasoning | 204,800 | $0.6 | $2.4 | $0.06 | $0.375 |
| minimax-cn (2 models) | |||||
MiniMax-M2.7 MiniMax-M2.7 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | $0.375 |
MiniMax-M2.7-highspeed MiniMax-M2.7-highspeed reasoning | 204,800 | $0.6 | $2.4 | $0.06 | $0.375 |
| mistral (28 models) | |||||
Codestral (latest) codestral-latest | 256,000 | $0.3 | $0.9 | $0 | - |
Devstral 2 devstral-2512 | 262,144 | $0.4 | $2 | $0 | - |
Devstral 2 (latest) devstral-medium-latest | 262,144 | $0.4 | $2 | $0 | - |
Devstral Medium devstral-medium-2507 | 128,000 | $0.4 | $2 | $0 | - |
Devstral Small devstral-small-2507 | 128,000 | $0.1 | $0.3 | $0 | - |
Devstral Small 2 labs-devstral-small-2512 | 256,000 | $0 | $0 | $0 | - |
Devstral Small 2505 devstral-small-2505 | 128,000 | $0.1 | $0.3 | $0 | - |
Magistral Medium (latest) magistral-medium-latest reasoning | 128,000 | $2 | $5 | $0 | - |
Magistral Small magistral-small reasoning | 128,000 | $0.5 | $1.5 | $0 | - |
Ministral 3B (latest) ministral-3b-latest | 128,000 | $0.04 | $0.04 | $0 | - |
Ministral 8B (latest) ministral-8b-latest | 128,000 | $0.1 | $0.1 | $0 | - |
Mistral 7B open-mistral-7b | 8,000 | $0.25 | $0.25 | $0 | - |
Mistral Large (latest) mistral-large-latest | 262,144 | $0.5 | $1.5 | $0 | - |
Mistral Large 2.1 mistral-large-2411 | 131,072 | $2 | $6 | $0 | - |
Mistral Large 3 mistral-large-2512 | 262,144 | $0.5 | $1.5 | $0 | - |
Mistral Medium (latest) mistral-medium-latest reasoning | 262,144 | $1.5 | $7.5 | $0 | - |
Mistral Medium 3 mistral-medium-2505 | 131,072 | $0.4 | $2 | $0 | - |
Mistral Medium 3.1 mistral-medium-2508 | 262,144 | $0.4 | $2 | $0 | - |
Mistral Medium 3.5 mistral-medium-2604 reasoning | 262,144 | $1.5 | $7.5 | $0 | - |
Mistral Medium 3.5 mistral-medium-3.5 reasoning | 262,144 | $1.5 | $7.5 | $0 | - |
Mistral Nemo mistral-nemo | 128,000 | $0.15 | $0.15 | $0 | - |
Mistral Small (latest) mistral-small-latest reasoning | 256,000 | $0.15 | $0.6 | $0 | - |
Mistral Small 3.2 mistral-small-2506 | 128,000 | $0.1 | $0.3 | $0 | - |
Mistral Small 4 mistral-small-2603 reasoning | 256,000 | $0.15 | $0.6 | $0 | - |
Mixtral 8x22B open-mixtral-8x22b | 64,000 | $2 | $6 | $0 | - |
Mixtral 8x7B open-mixtral-8x7b | 32,000 | $0.7 | $0.7 | $0 | - |
Pixtral 12B pixtral-12b | 128,000 | $0.15 | $0.15 | $0 | - |
Pixtral Large (latest) pixtral-large-latest | 128,000 | $2 | $6 | $0 | - |
| moonshotai (7 models) | |||||
Kimi K2 0711 kimi-k2-0711-preview | 131,072 | $0.6 | $2.5 | $0.15 | - |
Kimi K2 0905 kimi-k2-0905-preview | 262,144 | $0.6 | $2.5 | $0.15 | - |
Kimi K2 Thinking kimi-k2-thinking reasoning | 262,144 | $0.6 | $2.5 | $0.15 | - |
Kimi K2 Thinking Turbo kimi-k2-thinking-turbo reasoning | 262,144 | $1.15 | $8 | $0.15 | - |
Kimi K2 Turbo kimi-k2-turbo-preview | 262,144 | $2.4 | $10 | $0.6 | - |
Kimi K2.5 kimi-k2.5 reasoning | 262,144 | $0.6 | $3 | $0.1 | - |
Kimi K2.6 kimi-k2.6 reasoning | 262,144 | $0.95 | $4 | $0.16 | - |
| moonshotai-cn (7 models) | |||||
Kimi K2 0711 kimi-k2-0711-preview | 131,072 | $0.6 | $2.5 | $0.15 | - |
Kimi K2 0905 kimi-k2-0905-preview | 262,144 | $0.6 | $2.5 | $0.15 | - |
Kimi K2 Thinking kimi-k2-thinking reasoning | 262,144 | $0.6 | $2.5 | $0.15 | - |
Kimi K2 Thinking Turbo kimi-k2-thinking-turbo reasoning | 262,144 | $1.15 | $8 | $0.15 | - |
Kimi K2 Turbo kimi-k2-turbo-preview | 262,144 | $2.4 | $10 | $0.6 | - |
Kimi K2.5 kimi-k2.5 reasoning | 262,144 | $0.6 | $3 | $0.1 | - |
Kimi K2.6 kimi-k2.6 reasoning | 262,144 | $0.95 | $4 | $0.16 | - |
| openai (42 models) | |||||
GPT-4 gpt-4 | 8,192 | $30 | $60 | $0 | - |
GPT-4 Turbo gpt-4-turbo | 128,000 | $10 | $30 | $0 | - |
GPT-4.1 gpt-4.1 | 1,047,576 | $2 | $8 | $0.5 | - |
GPT-4.1 mini gpt-4.1-mini | 1,047,576 | $0.4 | $1.6 | $0.1 | - |
GPT-4.1 nano gpt-4.1-nano | 1,047,576 | $0.1 | $0.4 | $0.03 | - |
GPT-4o gpt-4o | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o (2024-05-13) gpt-4o-2024-05-13 | 128,000 | $5 | $15 | $0 | - |
GPT-4o (2024-08-06) gpt-4o-2024-08-06 | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o (2024-11-20) gpt-4o-2024-11-20 | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o mini gpt-4o-mini | 128,000 | $0.15 | $0.6 | $0.08 | - |
GPT-5 gpt-5 reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5 Chat Latest gpt-5-chat-latest | 128,000 | $1.25 | $10 | $0.125 | - |
GPT-5 Mini gpt-5-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT-5 Nano gpt-5-nano reasoning | 400,000 | $0.05 | $0.4 | $0.005 | - |
GPT-5 Pro gpt-5-pro reasoning | 400,000 | $15 | $120 | $0 | - |
GPT-5-Codex gpt-5-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 gpt-5.1 reasoning | 400,000 | $1.25 | $10 | $0.13 | - |
GPT-5.1 Chat gpt-5.1-chat-latest reasoning | 128,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Codex gpt-5.1-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Codex Max gpt-5.1-codex-max reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Codex mini gpt-5.1-codex-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT-5.2 gpt-5.2 reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Chat gpt-5.2-chat-latest reasoning | 128,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Codex gpt-5.2-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Pro gpt-5.2-pro reasoning | 400,000 | $21 | $168 | $0 | - |
GPT-5.3 Chat (latest) gpt-5.3-chat-latest | 128,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex gpt-5.3-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex Spark gpt-5.3-codex-spark reasoning | 128,000 | $1.75 | $14 | $0.175 | - |
GPT-5.4 gpt-5.4 reasoning | 272,000 | $2.5 | $15 | $0.25 | - |
GPT-5.4 mini gpt-5.4-mini reasoning | 400,000 | $0.75 | $4.5 | $0.075 | - |
GPT-5.4 nano gpt-5.4-nano reasoning | 400,000 | $0.2 | $1.25 | $0.02 | - |
GPT-5.4 Pro gpt-5.4-pro reasoning | 1,050,000 | $30 | $180 | $0 | - |
GPT-5.5 gpt-5.5 reasoning | 272,000 | $5 | $30 | $0.5 | - |
GPT-5.5 Pro gpt-5.5-pro reasoning | 1,050,000 | $30 | $180 | $0 | - |
o1 o1 reasoning | 200,000 | $15 | $60 | $7.5 | - |
o1-pro o1-pro reasoning | 200,000 | $150 | $600 | $0 | - |
o3 o3 reasoning | 200,000 | $2 | $8 | $0.5 | - |
o3-deep-research o3-deep-research reasoning | 200,000 | $10 | $40 | $2.5 | - |
o3-mini o3-mini reasoning | 200,000 | $1.1 | $4.4 | $0.55 | - |
o3-pro o3-pro reasoning | 200,000 | $20 | $80 | $0 | - |
o4-mini o4-mini reasoning | 200,000 | $1.1 | $4.4 | $0.28 | - |
o4-mini-deep-research o4-mini-deep-research reasoning | 200,000 | $2 | $8 | $0.5 | - |
| openai-codex (6 models) | |||||
GPT-5.2 gpt-5.2 reasoning | 272,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex gpt-5.3-codex reasoning | 272,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex Spark gpt-5.3-codex-spark reasoning | 272,000 | $1.75 | $14 | $0.175 | - |
GPT-5.4 gpt-5.4 reasoning | 272,000 | $2.5 | $15 | $0.25 | - |
GPT-5.4 mini gpt-5.4-mini reasoning | 272,000 | $0.75 | $4.5 | $0.075 | - |
GPT-5.5 gpt-5.5 reasoning | 272,000 | $5 | $30 | $0.5 | - |
| openai-responses (6 models) | |||||
GPT-5 (Responses) gpt-5 reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5 Codex (Responses) gpt-5-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5 mini (Responses) gpt-5-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT-5 nano (Responses) gpt-5-nano reasoning | 400,000 | $0.05 | $0.4 | $0.005 | - |
o3 (Responses) o3 reasoning | 200,000 | $2 | $8 | $0.5 | - |
o4-mini (Responses) o4-mini reasoning | 200,000 | $1.1 | $4.4 | $0.275 | - |
| opencode (40 models) | |||||
Big Pickle big-pickle reasoning | 200,000 | $0 | $0 | $0 | - |
Claude Haiku 4.5 claude-haiku-4-5 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Opus 4.1 claude-opus-4-1 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.5 claude-opus-4-5 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 claude-opus-4-6 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 claude-opus-4-7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 claude-opus-4-8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Sonnet 4 claude-sonnet-4 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 claude-sonnet-4-5 reasoning | 200,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 claude-sonnet-4-6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
DeepSeek V4 Flash Free deepseek-v4-flash-free reasoning | 200,000 | $0 | $0 | $0 | - |
Gemini 3 Flash gemini-3-flash reasoning | 1,048,576 | $0.5 | $3 | $0.05 | - |
Gemini 3.1 Pro Preview gemini-3.1-pro reasoning | 1,048,576 | $2 | $12 | $0.2 | - |
Gemini 3.5 Flash gemini-3.5-flash reasoning | 1,048,576 | $1.5 | $9 | $0.15 | - |
GLM-5 glm-5 reasoning | 204,800 | $1 | $3.2 | $0.2 | - |
GLM-5.1 glm-5.1 reasoning | 204,800 | $1.4 | $4.4 | $0.26 | - |
GPT-5 gpt-5 reasoning | 400,000 | $1.07 | $8.5 | $0.107 | - |
GPT-5 Codex gpt-5-codex reasoning | 400,000 | $1.07 | $8.5 | $0.107 | - |
GPT-5 Nano gpt-5-nano reasoning | 400,000 | $0.05 | $0.4 | $0.005 | - |
GPT-5.1 gpt-5.1 reasoning | 400,000 | $1.07 | $8.5 | $0.107 | - |
GPT-5.1 Codex gpt-5.1-codex reasoning | 400,000 | $1.07 | $8.5 | $0.107 | - |
GPT-5.1 Codex Max gpt-5.1-codex-max reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Codex Mini gpt-5.1-codex-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT-5.2 gpt-5.2 reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.2 Codex gpt-5.2-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.3 Codex gpt-5.3-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT-5.4 gpt-5.4 reasoning | 272,000 | $2.5 | $15 | $0.25 | - |
GPT-5.4 Mini gpt-5.4-mini reasoning | 400,000 | $0.75 | $4.5 | $0.075 | - |
GPT-5.4 Nano gpt-5.4-nano reasoning | 400,000 | $0.2 | $1.25 | $0.02 | - |
GPT-5.4 Pro gpt-5.4-pro reasoning | 1,050,000 | $30 | $180 | $30 | - |
GPT-5.5 gpt-5.5 reasoning | 1,050,000 | $5 | $30 | $0.5 | - |
GPT-5.5 Pro gpt-5.5-pro reasoning | 1,050,000 | $30 | $180 | $30 | - |
Grok Build 0.1 grok-build-0.1 reasoning | 256,000 | $1 | $2 | $0.2 | - |
Kimi K2.5 kimi-k2.5 reasoning | 262,144 | $0.6 | $3 | $0.08 | - |
Kimi K2.6 kimi-k2.6 reasoning | 262,144 | $0.95 | $4 | $0.16 | - |
MiniMax M2.5 minimax-m2.5 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | - |
MiniMax M2.7 minimax-m2.7 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | - |
Nemotron 3 Super Free nemotron-3-super-free reasoning | 204,800 | $0 | $0 | $0 | - |
Qwen3.5 Plus qwen3.5-plus reasoning | 262,144 | $0.2 | $1.2 | $0.02 | $0.25 |
Qwen3.6 Plus qwen3.6-plus reasoning | 262,144 | $0.5 | $3 | $0.05 | $0.625 |
| opencode-go (12 models) | |||||
DeepSeek V4 Flash deepseek-v4-flash reasoning | 1,000,000 | $0.14 | $0.28 | $0.0028 | - |
DeepSeek V4 Pro deepseek-v4-pro reasoning | 1,000,000 | $1.74 | $3.48 | $0.0145 | - |
GLM-5 glm-5 reasoning | 202,752 | $1 | $3.2 | $0.2 | - |
GLM-5.1 glm-5.1 reasoning | 202,752 | $1.4 | $4.4 | $0.26 | - |
Kimi K2.5 kimi-k2.5 reasoning | 262,144 | $0.6 | $3 | $0.1 | - |
Kimi K2.6 kimi-k2.6 reasoning | 262,144 | $0.95 | $4 | $0.16 | - |
MiMo V2.5 mimo-v2.5 reasoning | 1,000,000 | $0.4 | $2 | $0.08 | - |
MiMo V2.5 Pro mimo-v2.5-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
MiniMax M2.5 minimax-m2.5 reasoning | 204,800 | $0.3 | $1.2 | $0.03 | - |
MiniMax M2.7 minimax-m2.7 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | - |
Qwen3.5 Plus qwen3.5-plus reasoning | 262,144 | $0.2 | $1.2 | $0.02 | $0.25 |
Qwen3.6 Plus qwen3.6-plus reasoning | 262,144 | $0.5 | $3 | $0.05 | $0.625 |
| openrouter (268 models) | |||||
AI21: Jamba Large 1.7 ai21/jamba-large-1.7 | 256,000 | $2 | $8 | $0 | - |
Amazon: Nova 2 Lite amazon/nova-2-lite-v1 reasoning | 1,000,000 | $0.3 | $2.5 | $0 | - |
Amazon: Nova Lite 1.0 amazon/nova-lite-v1 | 300,000 | $0.06 | $0.24 | $0 | - |
Amazon: Nova Micro 1.0 amazon/nova-micro-v1 | 128,000 | $0.035 | $0.14 | $0 | - |
Amazon: Nova Premier 1.0 amazon/nova-premier-v1 | 1,000,000 | $2.5 | $12.5 | $0.625 | - |
Amazon: Nova Pro 1.0 amazon/nova-pro-v1 | 300,000 | $0.8 | $3.2 | $0 | - |
Anthropic Claude Haiku Latest ~anthropic/claude-haiku-latest reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Anthropic Claude Sonnet Latest ~anthropic/claude-sonnet-latest reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Anthropic: Claude 3 Haiku anthropic/claude-3-haiku | 200,000 | $0.25 | $1.25 | $0.03 | $0.3 |
Anthropic: Claude 3.5 Haiku anthropic/claude-3.5-haiku | 200,000 | $0.8 | $4 | $0.08 | $1 |
Anthropic: Claude Haiku 4.5 anthropic/claude-haiku-4.5 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Anthropic: Claude Opus 4 anthropic/claude-opus-4 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Anthropic: Claude Opus 4.1 anthropic/claude-opus-4.1 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Anthropic: Claude Opus 4.5 anthropic/claude-opus-4.5 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Anthropic: Claude Opus 4.6 anthropic/claude-opus-4.6 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Anthropic: Claude Opus 4.6 (Fast) anthropic/claude-opus-4.6-fast reasoning | 1,000,000 | $30 | $150 | $3 | $37.5 |
Anthropic: Claude Opus 4.7 anthropic/claude-opus-4.7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Anthropic: Claude Opus 4.7 (Fast) anthropic/claude-opus-4.7-fast reasoning | 1,000,000 | $30 | $150 | $3 | $37.5 |
Anthropic: Claude Opus 4.8 anthropic/claude-opus-4.8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Anthropic: Claude Opus 4.8 (Fast) anthropic/claude-opus-4.8-fast reasoning | 1,000,000 | $30 | $150 | $3 | $37.5 |
Anthropic: Claude Opus Latest ~anthropic/claude-opus-latest reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Anthropic: Claude Sonnet 4 anthropic/claude-sonnet-4 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Anthropic: Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Anthropic: Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Arcee AI: Trinity Large Thinking arcee-ai/trinity-large-thinking reasoning | 262,144 | $0.22 | $0.85 | $0.06 | - |
Arcee AI: Trinity Large Thinking (free) arcee-ai/trinity-large-thinking:free reasoning | 262,144 | $0 | $0 | $0 | - |
Arcee AI: Trinity Mini arcee-ai/trinity-mini reasoning | 131,072 | $0.045 | $0.15 | $0 | - |
Arcee AI: Virtuoso Large arcee-ai/virtuoso-large | 131,072 | $0.75 | $1.2 | $0 | - |
Auto auto reasoning | 2,000,000 | $0 | $0 | $0 | - |
Auto Router openrouter/auto reasoning | 2,000,000 | - | - | $0 | - |
Baidu Qianfan: CoBuddy (free) baidu/cobuddy:free reasoning | 131,072 | $0 | $0 | $0 | - |
Baidu: ERNIE 4.5 21B A3B baidu/ernie-4.5-21b-a3b | 131,072 | $0.07 | $0.28 | $0 | - |
Baidu: ERNIE 4.5 VL 28B A3B baidu/ernie-4.5-vl-28b-a3b reasoning | 131,072 | $0.14 | $0.56 | $0 | - |
ByteDance Seed: Seed 1.6 bytedance-seed/seed-1.6 reasoning | 262,144 | $0.25 | $2 | $0 | - |
ByteDance Seed: Seed 1.6 Flash bytedance-seed/seed-1.6-flash reasoning | 262,144 | $0.075 | $0.3 | $0 | - |
ByteDance Seed: Seed-2.0-Lite bytedance-seed/seed-2.0-lite reasoning | 262,144 | $0.25 | $2 | $0 | - |
ByteDance Seed: Seed-2.0-Mini bytedance-seed/seed-2.0-mini reasoning | 262,144 | $0.1 | $0.4 | $0 | - |
Cohere: Command R (08-2024) cohere/command-r-08-2024 | 128,000 | $0.15 | $0.6 | $0 | - |
Cohere: Command R+ (08-2024) cohere/command-r-plus-08-2024 | 128,000 | $2.5 | $10 | $0 | - |
DeepSeek: DeepSeek V3 deepseek/deepseek-chat | 163,840 | $0.32 | $0.89 | $0 | - |
DeepSeek: DeepSeek V3 0324 deepseek/deepseek-chat-v3-0324 | 163,840 | $0.2 | $0.77 | $0.135 | - |
DeepSeek: DeepSeek V3.1 deepseek/deepseek-chat-v3.1 reasoning | 163,840 | $0.21 | $0.79 | $0.13 | - |
DeepSeek: DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus reasoning | 163,840 | $0.27 | $0.95 | $0.13 | - |
DeepSeek: DeepSeek V3.2 deepseek/deepseek-v3.2 reasoning | 131,072 | $0.252 | $0.378 | $0.0252 | - |
DeepSeek: DeepSeek V3.2 Exp deepseek/deepseek-v3.2-exp reasoning | 163,840 | $0.27 | $0.41 | $0 | - |
DeepSeek: DeepSeek V4 Flash deepseek/deepseek-v4-flash reasoning | 1,048,576 | $0.1 | $0.2 | $0.02 | - |
DeepSeek: DeepSeek V4 Flash (free) deepseek/deepseek-v4-flash:free reasoning | 1,048,576 | $0 | $0 | $0 | - |
DeepSeek: DeepSeek V4 Pro deepseek/deepseek-v4-pro reasoning | 1,048,576 | $0.435 | $0.87 | $0.003625 | - |
DeepSeek: R1 deepseek/deepseek-r1 reasoning | 163,840 | $0.7 | $2.5 | $0 | - |
DeepSeek: R1 0528 deepseek/deepseek-r1-0528 reasoning | 163,840 | $0.5 | $2.15 | $0.35 | - |
EssentialAI: Rnj 1 Instruct essentialai/rnj-1-instruct | 32,768 | $0.15 | $0.15 | $0 | - |
Free Models Router openrouter/free reasoning | 200,000 | $0 | $0 | $0 | - |
Google Gemini Flash Latest ~google/gemini-flash-latest reasoning | 1,048,576 | $1.5 | $9 | $0.15 | $0.0833333333333 |
Google Gemini Pro Latest ~google/gemini-pro-latest reasoning | 1,048,576 | $2 | $12 | $0.2 | $0.375 |
Google: Gemini 2.0 Flash google/gemini-2.0-flash-001 | 1,000,000 | $0.1 | $0.4 | $0.025 | $0.0833333333333 |
Google: Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite-001 | 1,048,576 | $0.075 | $0.3 | $0 | - |
Google: Gemini 2.5 Flash google/gemini-2.5-flash reasoning | 1,048,576 | $0.3 | $2.5 | $0.03 | $0.0833333333333 |
Google: Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite reasoning | 1,048,576 | $0.1 | $0.4 | $0.01 | $0.0833333333333 |
Google: Gemini 2.5 Flash Lite Preview 09-2025 google/gemini-2.5-flash-lite-preview-09-2025 reasoning | 1,048,576 | $0.1 | $0.4 | $0.01 | $0.0833333333333 |
Google: Gemini 2.5 Pro google/gemini-2.5-pro reasoning | 1,048,576 | $1.25 | $10 | $0.125 | $0.375 |
Google: Gemini 2.5 Pro Preview 05-06 google/gemini-2.5-pro-preview-05-06 reasoning | 1,048,576 | $1.25 | $10 | $0.125 | $0.375 |
Google: Gemini 2.5 Pro Preview 06-05 google/gemini-2.5-pro-preview reasoning | 1,048,576 | $1.25 | $10 | $0.125 | $0.375 |
Google: Gemini 3 Flash Preview google/gemini-3-flash-preview reasoning | 1,048,576 | $0.5 | $3 | $0.05 | $0.0833333333333 |
Google: Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite reasoning | 1,048,576 | $0.25 | $1.5 | $0.025 | $0.0833333333333 |
Google: Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview reasoning | 1,048,576 | $0.25 | $1.5 | $0.025 | $0.0833333333333 |
Google: Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview reasoning | 1,048,576 | $2 | $12 | $0.2 | $0.375 |
Google: Gemini 3.1 Pro Preview Custom Tools google/gemini-3.1-pro-preview-customtools reasoning | 1,048,756 | $2 | $12 | $0.2 | $0.375 |
Google: Gemini 3.5 Flash google/gemini-3.5-flash reasoning | 1,048,576 | $1.5 | $9 | $0.15 | $0.0833333333333 |
Google: Gemma 3 12B google/gemma-3-12b-it | 131,072 | $0.04 | $0.13 | $0 | - |
Google: Gemma 3 27B google/gemma-3-27b-it | 131,072 | $0.08 | $0.16 | $0 | - |
Google: Gemma 4 26B A4B google/gemma-4-26b-a4b-it reasoning | 262,144 | $0.06 | $0.33 | $0 | - |
Google: Gemma 4 26B A4B (free) google/gemma-4-26b-a4b-it:free reasoning | 262,144 | $0 | $0 | $0 | - |
Google: Gemma 4 31B google/gemma-4-31b-it reasoning | 262,144 | $0.12 | $0.37 | $0 | - |
Google: Gemma 4 31B (free) google/gemma-4-31b-it:free reasoning | 262,144 | $0 | $0 | $0 | - |
IBM: Granite 4.1 8B ibm-granite/granite-4.1-8b | 131,072 | $0.05 | $0.1 | $0.05 | - |
Inception: Mercury 2 inception/mercury-2 reasoning | 128,000 | $0.25 | $0.75 | $0.025 | - |
inclusionAI: Ling-2.6-1T inclusionai/ling-2.6-1t | 262,144 | $0.075 | $0.625 | $0.015 | - |
inclusionAI: Ling-2.6-flash inclusionai/ling-2.6-flash | 262,144 | $0.01 | $0.03 | $0.002 | - |
inclusionAI: Ring-2.6-1T inclusionai/ring-2.6-1t reasoning | 262,144 | $0.075 | $0.625 | $0.015 | - |
Kwaipilot: KAT-Coder-Pro V2 kwaipilot/kat-coder-pro-v2 | 256,000 | $0.3 | $1.2 | $0.06 | - |
Meta: Llama 3.1 70B Instruct meta-llama/llama-3.1-70b-instruct | 131,072 | $0.4 | $0.4 | $0 | - |
Meta: Llama 3.1 8B Instruct meta-llama/llama-3.1-8b-instruct | 131,072 | $0.02 | $0.05 | $0 | - |
Meta: Llama 3.3 70B Instruct meta-llama/llama-3.3-70b-instruct | 131,072 | $0.1 | $0.32 | $0 | - |
Meta: Llama 3.3 70B Instruct (free) meta-llama/llama-3.3-70b-instruct:free | 131,072 | $0 | $0 | $0 | - |
Meta: Llama 4 Scout meta-llama/llama-4-scout | 10,000,000 | $0.08 | $0.3 | $0 | - |
MiniMax: MiniMax M1 minimax/minimax-m1 reasoning | 1,000,000 | $0.4 | $2.2 | $0 | - |
MiniMax: MiniMax M2 minimax/minimax-m2 reasoning | 204,800 | $0.255 | $1 | $0.03 | - |
MiniMax: MiniMax M2.1 minimax/minimax-m2.1 reasoning | 204,800 | $0.29 | $0.95 | $0.03 | - |
MiniMax: MiniMax M2.5 minimax/minimax-m2.5 reasoning | 204,800 | $0.15 | $1.15 | $0 | - |
MiniMax: MiniMax M2.5 (free) minimax/minimax-m2.5:free reasoning | 204,800 | $0 | $0 | $0 | - |
MiniMax: MiniMax M2.7 minimax/minimax-m2.7 reasoning | 204,800 | $0.279 | $1.2 | $0 | - |
Mistral Large mistralai/mistral-large | 128,000 | $2 | $6 | $0.2 | - |
Mistral Large 2407 mistralai/mistral-large-2407 | 131,072 | $2 | $6 | $0.2 | - |
Mistral Large 2411 mistralai/mistral-large-2411 | 131,072 | $2 | $6 | $0.2 | - |
Mistral: Codestral 2508 mistralai/codestral-2508 | 256,000 | $0.3 | $0.9 | $0.03 | - |
Mistral: Devstral 2 2512 mistralai/devstral-2512 | 262,144 | $0.4 | $2 | $0.04 | - |
Mistral: Devstral Medium mistralai/devstral-medium | 131,072 | $0.4 | $2 | $0.04 | - |
Mistral: Devstral Small 1.1 mistralai/devstral-small | 131,072 | $0.1 | $0.3 | $0.01 | - |
Mistral: Ministral 3 14B 2512 mistralai/ministral-14b-2512 | 262,144 | $0.2 | $0.2 | $0.02 | - |
Mistral: Ministral 3 3B 2512 mistralai/ministral-3b-2512 | 131,072 | $0.1 | $0.1 | $0.01 | - |
Mistral: Ministral 3 8B 2512 mistralai/ministral-8b-2512 | 262,144 | $0.15 | $0.15 | $0.015 | - |
Mistral: Mistral Large 3 2512 mistralai/mistral-large-2512 | 262,144 | $0.5 | $1.5 | $0.05 | - |
Mistral: Mistral Medium 3 mistralai/mistral-medium-3 | 131,072 | $0.4 | $2 | $0.04 | - |
Mistral: Mistral Medium 3.1 mistralai/mistral-medium-3.1 | 131,072 | $0.4 | $2 | $0.04 | - |
Mistral: Mistral Medium 3.5 mistralai/mistral-medium-3-5 reasoning | 262,144 | $1.5 | $7.5 | $0 | - |
Mistral: Mistral Nemo mistralai/mistral-nemo | 131,072 | $0.02 | $0.03 | $0 | - |
Mistral: Mistral Small 3.2 24B mistralai/mistral-small-3.2-24b-instruct | 128,000 | $0.075 | $0.2 | $0 | - |
Mistral: Mistral Small 4 mistralai/mistral-small-2603 reasoning | 262,144 | $0.15 | $0.6 | $0.015 | - |
Mistral: Mixtral 8x22B Instruct mistralai/mixtral-8x22b-instruct | 65,536 | $2 | $6 | $0.2 | - |
Mistral: Pixtral Large 2411 mistralai/pixtral-large-2411 | 131,072 | $2 | $6 | $0.2 | - |
Mistral: Saba mistralai/mistral-saba | 32,768 | $0.2 | $0.6 | $0.02 | - |
Mistral: Voxtral Small 24B 2507 mistralai/voxtral-small-24b-2507 | 32,000 | $0.1 | $0.3 | $0.01 | - |
MoonshotAI Kimi Latest ~moonshotai/kimi-latest reasoning | 262,144 | $0.73 | $3.49 | $0.25 | - |
MoonshotAI: Kimi K2 0711 moonshotai/kimi-k2 | 131,072 | $0.57 | $2.3 | $0 | - |
MoonshotAI: Kimi K2 0905 moonshotai/kimi-k2-0905 | 262,144 | $0.6 | $2.5 | $0 | - |
MoonshotAI: Kimi K2 Thinking moonshotai/kimi-k2-thinking reasoning | 262,144 | $0.6 | $2.5 | $0 | - |
MoonshotAI: Kimi K2.5 moonshotai/kimi-k2.5 reasoning | 262,144 | $0.41 | $2.06 | $0.07 | - |
MoonshotAI: Kimi K2.6 moonshotai/kimi-k2.6 reasoning | 262,144 | $0.73 | $3.49 | $0.25 | - |
Nex AGI: DeepSeek V3.1 Nex N1 nex-agi/deepseek-v3.1-nex-n1 | 131,072 | $0.135 | $0.5 | $0 | - |
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 nvidia/llama-3.3-nemotron-super-49b-v1.5 reasoning | 131,072 | $0.1 | $0.4 | $0 | - |
NVIDIA: Nemotron 3 Nano 30B A3B nvidia/nemotron-3-nano-30b-a3b reasoning | 262,144 | $0.05 | $0.2 | $0 | - |
NVIDIA: Nemotron 3 Nano 30B A3B (free) nvidia/nemotron-3-nano-30b-a3b:free reasoning | 256,000 | $0 | $0 | $0 | - |
NVIDIA: Nemotron 3 Nano Omni (free) nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free reasoning | 256,000 | $0 | $0 | $0 | - |
NVIDIA: Nemotron 3 Super nvidia/nemotron-3-super-120b-a12b reasoning | 1,000,000 | $0.09 | $0.45 | $0 | - |
NVIDIA: Nemotron 3 Super (free) nvidia/nemotron-3-super-120b-a12b:free reasoning | 1,000,000 | $0 | $0 | $0 | - |
NVIDIA: Nemotron Nano 12B 2 VL (free) nvidia/nemotron-nano-12b-v2-vl:free reasoning | 128,000 | $0 | $0 | $0 | - |
NVIDIA: Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 reasoning | 131,072 | $0.04 | $0.16 | $0 | - |
NVIDIA: Nemotron Nano 9B V2 (free) nvidia/nemotron-nano-9b-v2:free reasoning | 128,000 | $0 | $0 | $0 | - |
OpenAI GPT Latest ~openai/gpt-latest reasoning | 1,050,000 | $5 | $30 | $0.5 | - |
OpenAI GPT Mini Latest ~openai/gpt-mini-latest reasoning | 400,000 | $0.75 | $4.5 | $0.075 | - |
OpenAI: GPT Audio openai/gpt-audio | 128,000 | $2.5 | $10 | $0 | - |
OpenAI: GPT Audio Mini openai/gpt-audio-mini | 128,000 | $0.6 | $2.4 | $0 | - |
OpenAI: GPT Chat Latest openai/gpt-chat-latest | 400,000 | $5 | $30 | $0.5 | - |
OpenAI: GPT-3.5 Turbo openai/gpt-3.5-turbo | 16,385 | $0.5 | $1.5 | $0 | - |
OpenAI: GPT-3.5 Turbo (older v0613) openai/gpt-3.5-turbo-0613 | 4,095 | $1 | $2 | $0 | - |
OpenAI: GPT-3.5 Turbo 16k openai/gpt-3.5-turbo-16k | 16,385 | $3 | $4 | $0 | - |
OpenAI: GPT-4 openai/gpt-4 | 8,191 | $30 | $60 | $0 | - |
OpenAI: GPT-4 (older v0314) openai/gpt-4-0314 | 8,191 | $30 | $60 | $0 | - |
OpenAI: GPT-4 Turbo openai/gpt-4-turbo | 128,000 | $10 | $30 | $0 | - |
OpenAI: GPT-4 Turbo (older v1106) openai/gpt-4-1106-preview | 128,000 | $10 | $30 | $0 | - |
OpenAI: GPT-4 Turbo Preview openai/gpt-4-turbo-preview | 128,000 | $10 | $30 | $0 | - |
OpenAI: GPT-4.1 openai/gpt-4.1 | 1,047,576 | $2 | $8 | $0.5 | - |
OpenAI: GPT-4.1 Mini openai/gpt-4.1-mini | 1,047,576 | $0.4 | $1.6 | $0.1 | - |
OpenAI: GPT-4.1 Nano openai/gpt-4.1-nano | 1,047,576 | $0.1 | $0.4 | $0.025 | - |
OpenAI: GPT-4o openai/gpt-4o | 128,000 | $2.5 | $10 | $0 | - |
OpenAI: GPT-4o (2024-05-13) openai/gpt-4o-2024-05-13 | 128,000 | $5 | $15 | $0 | - |
OpenAI: GPT-4o (2024-08-06) openai/gpt-4o-2024-08-06 | 128,000 | $2.5 | $10 | $1.25 | - |
OpenAI: GPT-4o (2024-11-20) openai/gpt-4o-2024-11-20 | 128,000 | $2.5 | $10 | $1.25 | - |
OpenAI: GPT-4o Audio openai/gpt-4o-audio-preview | 128,000 | $2.5 | $10 | $0 | - |
OpenAI: GPT-4o-mini openai/gpt-4o-mini | 128,000 | $0.15 | $0.6 | $0.075 | - |
OpenAI: GPT-4o-mini (2024-07-18) openai/gpt-4o-mini-2024-07-18 | 128,000 | $0.15 | $0.6 | $0.075 | - |
OpenAI: GPT-5 openai/gpt-5 reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
OpenAI: GPT-5 Codex openai/gpt-5-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
OpenAI: GPT-5 Mini openai/gpt-5-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
OpenAI: GPT-5 Nano openai/gpt-5-nano reasoning | 400,000 | $0.05 | $0.4 | $0.01 | - |
OpenAI: GPT-5 Pro openai/gpt-5-pro reasoning | 400,000 | $15 | $120 | $0 | - |
OpenAI: GPT-5.1 openai/gpt-5.1 reasoning | 400,000 | $1.25 | $10 | $0.13 | - |
OpenAI: GPT-5.1 Chat openai/gpt-5.1-chat | 128,000 | $1.25 | $10 | $0.125 | - |
OpenAI: GPT-5.1-Codex openai/gpt-5.1-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
OpenAI: GPT-5.1-Codex-Max openai/gpt-5.1-codex-max reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
OpenAI: GPT-5.1-Codex-Mini openai/gpt-5.1-codex-mini reasoning | 400,000 | $0.25 | $2 | $0.03 | - |
OpenAI: GPT-5.2 openai/gpt-5.2 reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
OpenAI: GPT-5.2 Chat openai/gpt-5.2-chat | 128,000 | $1.75 | $14 | $0.175 | - |
OpenAI: GPT-5.2 Pro openai/gpt-5.2-pro reasoning | 400,000 | $21 | $168 | $0 | - |
OpenAI: GPT-5.2-Codex openai/gpt-5.2-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
OpenAI: GPT-5.3 Chat openai/gpt-5.3-chat | 128,000 | $1.75 | $14 | $0.175 | - |
OpenAI: GPT-5.3-Codex openai/gpt-5.3-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
OpenAI: GPT-5.4 openai/gpt-5.4 reasoning | 1,050,000 | $2.5 | $15 | $0.25 | - |
OpenAI: GPT-5.4 Mini openai/gpt-5.4-mini reasoning | 400,000 | $0.75 | $4.5 | $0.075 | - |
OpenAI: GPT-5.4 Nano openai/gpt-5.4-nano reasoning | 400,000 | $0.2 | $1.25 | $0.02 | - |
OpenAI: GPT-5.4 Pro openai/gpt-5.4-pro reasoning | 1,050,000 | $30 | $180 | $0 | - |
OpenAI: GPT-5.5 openai/gpt-5.5 reasoning | 1,050,000 | $5 | $30 | $0.5 | - |
OpenAI: GPT-5.5 Pro openai/gpt-5.5-pro reasoning | 1,050,000 | $30 | $180 | $0 | - |
OpenAI: gpt-oss-120b openai/gpt-oss-120b reasoning | 131,072 | $0.039 | $0.18 | $0 | - |
OpenAI: gpt-oss-120b (free) openai/gpt-oss-120b:free reasoning | 131,072 | $0 | $0 | $0 | - |
OpenAI: gpt-oss-20b openai/gpt-oss-20b reasoning | 131,072 | $0.03 | $0.14 | $0 | - |
OpenAI: gpt-oss-20b (free) openai/gpt-oss-20b:free reasoning | 131,072 | $0 | $0 | $0 | - |
OpenAI: gpt-oss-safeguard-20b openai/gpt-oss-safeguard-20b reasoning | 131,072 | $0.075 | $0.3 | $0.037 | - |
OpenAI: o1 openai/o1 reasoning | 200,000 | $15 | $60 | $7.5 | - |
OpenAI: o3 openai/o3 reasoning | 200,000 | $2 | $8 | $0.5 | - |
OpenAI: o3 Deep Research openai/o3-deep-research reasoning | 200,000 | $10 | $40 | $2.5 | - |
OpenAI: o3 Mini openai/o3-mini reasoning | 200,000 | $1.1 | $4.4 | $0.55 | - |
OpenAI: o3 Mini High openai/o3-mini-high reasoning | 200,000 | $1.1 | $4.4 | $0.55 | - |
OpenAI: o3 Pro openai/o3-pro reasoning | 200,000 | $20 | $80 | $0 | - |
OpenAI: o4 Mini openai/o4-mini reasoning | 200,000 | $1.1 | $4.4 | $0.275 | - |
OpenAI: o4 Mini Deep Research openai/o4-mini-deep-research reasoning | 200,000 | $2 | $8 | $0.5 | - |
OpenAI: o4 Mini High openai/o4-mini-high reasoning | 200,000 | $1.1 | $4.4 | $0.275 | - |
Owl Alpha openrouter/owl-alpha | 1,048,756 | $0 | $0 | $0 | - |
Poolside: Laguna M.1 (free) poolside/laguna-m.1:free reasoning | 131,072 | $0 | $0 | $0 | - |
Poolside: Laguna XS.2 (free) poolside/laguna-xs.2:free reasoning | 131,072 | $0 | $0 | $0 | - |
Prime Intellect: INTELLECT-3 prime-intellect/intellect-3 reasoning | 131,072 | $0.2 | $1.1 | $0 | - |
Qwen: Qwen Plus 0728 qwen/qwen-plus-2025-07-28 | 1,000,000 | $0.26 | $0.78 | $0 | $0.325 |
Qwen: Qwen Plus 0728 (thinking) qwen/qwen-plus-2025-07-28:thinking reasoning | 1,000,000 | $0.26 | $0.78 | $0 | $0.325 |
Qwen: Qwen-Plus qwen/qwen-plus | 1,000,000 | $0.26 | $0.78 | $0.052 | $0.325 |
Qwen: Qwen2.5 7B Instruct qwen/qwen-2.5-7b-instruct | 131,072 | $0.04 | $0.1 | $0 | - |
Qwen: Qwen3 14B qwen/qwen3-14b reasoning | 131,702 | $0.1 | $0.24 | $0 | - |
Qwen: Qwen3 235B A22B qwen/qwen3-235b-a22b reasoning | 131,072 | $0.455 | $1.82 | $0 | - |
Qwen: Qwen3 235B A22B Instruct 2507 qwen/qwen3-235b-a22b-2507 | 262,144 | $0.071 | $0.1 | $0 | - |
Qwen: Qwen3 235B A22B Thinking 2507 qwen/qwen3-235b-a22b-thinking-2507 reasoning | 262,144 | $0.1495 | $1.495 | $0 | - |
Qwen: Qwen3 30B A3B qwen/qwen3-30b-a3b reasoning | 131,072 | $0.09 | $0.45 | $0 | - |
Qwen: Qwen3 30B A3B Instruct 2507 qwen/qwen3-30b-a3b-instruct-2507 | 262,144 | $0.09 | $0.3 | $0 | - |
Qwen: Qwen3 30B A3B Thinking 2507 qwen/qwen3-30b-a3b-thinking-2507 reasoning | 131,072 | $0.08 | $0.4 | $0.08 | - |
Qwen: Qwen3 32B qwen/qwen3-32b reasoning | 131,072 | $0.08 | $0.28 | $0 | - |
Qwen: Qwen3 8B qwen/qwen3-8b reasoning | 131,072 | $0.05 | $0.4 | $0.05 | - |
Qwen: Qwen3 Coder 30B A3B Instruct qwen/qwen3-coder-30b-a3b-instruct | 160,000 | $0.07 | $0.27 | $0 | - |
Qwen: Qwen3 Coder 480B A35B qwen/qwen3-coder | 1,048,576 | $0.22 | $1.8 | $0 | - |
Qwen: Qwen3 Coder 480B A35B (free) qwen/qwen3-coder:free | 1,048,576 | $0 | $0 | $0 | - |
Qwen: Qwen3 Coder Flash qwen/qwen3-coder-flash | 1,000,000 | $0.195 | $0.975 | $0.039 | $0.24375 |
Qwen: Qwen3 Coder Next qwen/qwen3-coder-next | 262,144 | $0.11 | $0.8 | $0.07 | - |
Qwen: Qwen3 Coder Plus qwen/qwen3-coder-plus | 1,000,000 | $0.65 | $3.25 | $0.13 | $0.8125 |
Qwen: Qwen3 Max qwen/qwen3-max | 262,144 | $0.78 | $3.9 | $0.156 | $0.975 |
Qwen: Qwen3 Max Thinking qwen/qwen3-max-thinking reasoning | 262,144 | $0.78 | $3.9 | $0 | - |
Qwen: Qwen3 Next 80B A3B Instruct qwen/qwen3-next-80b-a3b-instruct | 262,144 | $0.09 | $1.1 | $0 | - |
Qwen: Qwen3 Next 80B A3B Instruct (free) qwen/qwen3-next-80b-a3b-instruct:free | 262,144 | $0 | $0 | $0 | - |
Qwen: Qwen3 Next 80B A3B Thinking qwen/qwen3-next-80b-a3b-thinking reasoning | 262,144 | $0.0975 | $0.78 | $0 | - |
Qwen: Qwen3 VL 235B A22B Instruct qwen/qwen3-vl-235b-a22b-instruct | 262,144 | $0.2 | $0.88 | $0.11 | - |
Qwen: Qwen3 VL 235B A22B Thinking qwen/qwen3-vl-235b-a22b-thinking reasoning | 131,072 | $0.26 | $2.6 | $0 | - |
Qwen: Qwen3 VL 30B A3B Instruct qwen/qwen3-vl-30b-a3b-instruct | 262,144 | $0.13 | $0.52 | $0 | - |
Qwen: Qwen3 VL 30B A3B Thinking qwen/qwen3-vl-30b-a3b-thinking reasoning | 131,072 | $0.13 | $1.56 | $0 | - |
Qwen: Qwen3 VL 32B Instruct qwen/qwen3-vl-32b-instruct | 262,144 | $0.104 | $0.416 | $0 | - |
Qwen: Qwen3 VL 8B Instruct qwen/qwen3-vl-8b-instruct | 256,000 | $0.08 | $0.5 | $0 | - |
Qwen: Qwen3 VL 8B Thinking qwen/qwen3-vl-8b-thinking reasoning | 256,000 | $0.117 | $1.365 | $0 | - |
Qwen: Qwen3.5 397B A17B qwen/qwen3.5-397b-a17b reasoning | 262,144 | $0.39 | $2.34 | $0 | - |
Qwen: Qwen3.5 Plus 2026-02-15 qwen/qwen3.5-plus-02-15 reasoning | 1,000,000 | $0.26 | $1.56 | $0 | $0.325 |
Qwen: Qwen3.5 Plus 2026-04-20 qwen/qwen3.5-plus-20260420 reasoning | 1,000,000 | $0.3 | $1.8 | $0 | - |
Qwen: Qwen3.5-122B-A10B qwen/qwen3.5-122b-a10b reasoning | 262,144 | $0.26 | $2.08 | $0 | - |
Qwen: Qwen3.5-27B qwen/qwen3.5-27b reasoning | 262,144 | $0.195 | $1.56 | $0 | - |
Qwen: Qwen3.5-35B-A3B qwen/qwen3.5-35b-a3b reasoning | 262,144 | $0.139 | $1 | $0 | - |
Qwen: Qwen3.5-9B qwen/qwen3.5-9b reasoning | 262,144 | $0.04 | $0.15 | $0 | - |
Qwen: Qwen3.5-Flash qwen/qwen3.5-flash-02-23 reasoning | 1,000,000 | $0.065 | $0.26 | $0 | $0.08125 |
Qwen: Qwen3.6 27B qwen/qwen3.6-27b reasoning | 262,144 | $0.3 | $3.2 | $0 | - |
Qwen: Qwen3.6 35B A3B qwen/qwen3.6-35b-a3b reasoning | 262,144 | $0.15 | $1 | $0 | - |
Qwen: Qwen3.6 Flash qwen/qwen3.6-flash reasoning | 1,000,000 | $0.1875 | $1.125 | $0 | $0.234375 |
Qwen: Qwen3.6 Max Preview qwen/qwen3.6-max-preview reasoning | 262,144 | $1.04 | $6.24 | $0 | $1.3 |
Qwen: Qwen3.6 Plus qwen/qwen3.6-plus reasoning | 1,000,000 | $0.325 | $1.95 | $0 | $0.40625 |
Qwen: Qwen3.7 Max qwen/qwen3.7-max reasoning | 1,000,000 | $2.5 | $7.5 | $0 | $3.125 |
Qwen2.5 72B Instruct qwen/qwen-2.5-72b-instruct | 131,072 | $0.36 | $0.4 | $0 | - |
Reka Edge rekaai/reka-edge | 16,384 | $0.1 | $0.1 | $0 | - |
Relace: Relace Search relace/relace-search | 256,000 | $1 | $3 | $0 | - |
Sao10k: Llama 3 Euryale 70B v2.1 sao10k/l3-euryale-70b | 8,192 | $1.48 | $1.48 | $0 | - |
Sao10K: Llama 3.1 Euryale 70B v2.2 sao10k/l3.1-euryale-70b | 131,072 | $0.85 | $0.85 | $0 | - |
StepFun: Step 3.5 Flash stepfun/step-3.5-flash reasoning | 262,144 | $0.09 | $0.3 | $0.02 | - |
Tencent: Hy3 preview tencent/hy3-preview reasoning | 262,144 | $0.066 | $0.26 | $0.029 | - |
TheDrummer: Rocinante 12B thedrummer/rocinante-12b | 32,768 | $0.17 | $0.43 | $0 | - |
TheDrummer: UnslopNemo 12B thedrummer/unslopnemo-12b | 32,768 | $0.4 | $0.4 | $0 | - |
Tongyi DeepResearch 30B A3B alibaba/tongyi-deepresearch-30b-a3b reasoning | 131,072 | $0.09 | $0.45 | $0.09 | - |
Upstage: Solar Pro 3 upstage/solar-pro-3 reasoning | 128,000 | $0.15 | $0.6 | $0.015 | - |
xAI: Grok 4.20 x-ai/grok-4.20 reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
xAI: Grok 4.3 x-ai/grok-4.3 reasoning | 1,000,000 | $1.25 | $2.5 | $0.2 | - |
xAI: Grok Build 0.1 x-ai/grok-build-0.1 reasoning | 256,000 | $1 | $2 | $0.2 | - |
Xiaomi: MiMo-V2-Flash xiaomi/mimo-v2-flash reasoning | 262,144 | $0.1 | $0.3 | $0.01 | - |
Xiaomi: MiMo-V2-Omni xiaomi/mimo-v2-omni reasoning | 262,144 | $0.4 | $2 | $0.08 | - |
Xiaomi: MiMo-V2-Pro xiaomi/mimo-v2-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
Xiaomi: MiMo-V2.5 xiaomi/mimo-v2.5 reasoning | 1,048,576 | $0.4 | $2 | $0.08 | - |
Xiaomi: MiMo-V2.5-Pro xiaomi/mimo-v2.5-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
Z.ai: GLM 4 32B z-ai/glm-4-32b | 128,000 | $0.1 | $0.1 | $0 | - |
Z.ai: GLM 4.5 z-ai/glm-4.5 reasoning | 131,072 | $0.6 | $2.2 | $0.11 | - |
Z.ai: GLM 4.5 Air z-ai/glm-4.5-air reasoning | 131,072 | $0.13 | $0.85 | $0.025 | - |
Z.ai: GLM 4.5 Air (free) z-ai/glm-4.5-air:free reasoning | 131,072 | $0 | $0 | $0 | - |
Z.ai: GLM 4.5V z-ai/glm-4.5v reasoning | 65,536 | $0.6 | $1.8 | $0.11 | - |
Z.ai: GLM 4.6 z-ai/glm-4.6 reasoning | 202,752 | $0.43 | $1.74 | $0.08 | - |
Z.ai: GLM 4.6V z-ai/glm-4.6v reasoning | 131,072 | $0.3 | $0.9 | $0.05 | - |
Z.ai: GLM 4.7 z-ai/glm-4.7 reasoning | 202,752 | $0.4 | $1.75 | $0.08 | - |
Z.ai: GLM 4.7 Flash z-ai/glm-4.7-flash reasoning | 202,752 | $0.06 | $0.4 | $0.01 | - |
Z.ai: GLM 5 z-ai/glm-5 reasoning | 202,752 | $0.6 | $1.9 | $0.119 | - |
Z.ai: GLM 5 Turbo z-ai/glm-5-turbo reasoning | 202,752 | $1.2 | $4 | $0.24 | - |
Z.ai: GLM 5.1 z-ai/glm-5.1 reasoning | 202,752 | $0.98 | $3.08 | $0.182 | - |
Z.ai: GLM 5V Turbo z-ai/glm-5v-turbo reasoning | 202,752 | $1.2 | $4 | $0.24 | - |
| together (18 models) | |||||
DeepSeek V3 deepseek-ai/DeepSeek-V3 reasoning | 131,072 | $1.25 | $1.25 | $0 | - |
DeepSeek V3.1 deepseek-ai/DeepSeek-V3-1 reasoning | 131,072 | $0.6 | $1.7 | $0 | - |
DeepSeek V4 Pro deepseek-ai/DeepSeek-V4-Pro reasoning | 512,000 | $2.1 | $4.4 | $0.2 | - |
Gemma 4 31B Instruct google/gemma-4-31B-it reasoning | 262,144 | $0.2 | $0.5 | $0 | - |
GLM-5.1 zai-org/GLM-5.1 reasoning | 202,752 | $1.4 | $4.4 | $0 | - |
GPT OSS 120B openai/gpt-oss-120b reasoning | 131,072 | $0.15 | $0.6 | $0 | - |
Kimi K2.5 moonshotai/Kimi-K2.5 reasoning | 262,144 | $0.5 | $2.8 | $0 | - |
Kimi K2.6 moonshotai/Kimi-K2.6 reasoning | 262,144 | $1.2 | $4.5 | $0.2 | - |
Llama 3.3 70B meta-llama/Llama-3.3-70B-Instruct-Turbo | 131,072 | $0.88 | $0.88 | $0 | - |
MiniMax-M2.5 MiniMaxAI/MiniMax-M2.5 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | - |
MiniMax-M2.7 MiniMaxAI/MiniMax-M2.7 reasoning | 202,752 | $0.3 | $1.2 | $0.06 | - |
Qwen3 235B A22B Instruct 2507 FP8 Qwen/Qwen3-235B-A22B-Instruct-2507-tput reasoning | 262,144 | $0.2 | $0.6 | $0 | - |
Qwen3 Coder 480B A35B Instruct Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8 | 262,144 | $2 | $2 | $0 | - |
Qwen3 Coder Next FP8 Qwen/Qwen3-Coder-Next-FP8 reasoning | 262,144 | $0.5 | $1.2 | $0 | - |
Qwen3.5 397B A17B Qwen/Qwen3.5-397B-A17B reasoning | 262,144 | $0.6 | $3.6 | $0 | - |
Qwen3.6 Plus Qwen/Qwen3.6-Plus reasoning | 1,000,000 | $0.5 | $3 | $0 | - |
Qwen3.7 Max Qwen/Qwen3.7-Max reasoning | 1,000,000 | $2.5 | $7.5 | $0 | - |
Rnj-1 Instruct essentialai/Rnj-1-Instruct | 32,768 | $0.15 | $0.15 | $0 | - |
| vercel-ai-gateway (159 models) | |||||
Claude 3 Haiku anthropic/claude-3-haiku | 200,000 | $0.25 | $1.25 | $0.03 | $0.3 |
Claude 3.5 Haiku anthropic/claude-3.5-haiku | 200,000 | $0.8 | $4 | $0.08 | $1 |
Claude Haiku 4.5 anthropic/claude-haiku-4.5 reasoning | 200,000 | $1 | $5 | $0.1 | $1.25 |
Claude Opus 4 anthropic/claude-opus-4 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.1 anthropic/claude-opus-4.1 reasoning | 200,000 | $15 | $75 | $1.5 | $18.75 |
Claude Opus 4.5 anthropic/claude-opus-4.5 reasoning | 200,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.6 anthropic/claude-opus-4.6 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.7 anthropic/claude-opus-4.7 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Opus 4.8 anthropic/claude-opus-4.8 reasoning | 1,000,000 | $5 | $25 | $0.5 | $6.25 |
Claude Sonnet 4 anthropic/claude-sonnet-4 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 reasoning | 1,000,000 | $3 | $15 | $0.3 | $3.75 |
Command A cohere/command-a | 256,000 | $2.5 | $10 | $0 | - |
DeepSeek V3 0324 deepseek/deepseek-v3 | 163,840 | $0.77 | $0.77 | $0 | - |
DeepSeek V3.1 Terminus deepseek/deepseek-v3.1-terminus reasoning | 131,072 | $0.27 | $1 | $0.135 | - |
DeepSeek V3.2 deepseek/deepseek-v3.2 | 128,000 | $0.28 | $0.42 | $0.028 | - |
DeepSeek V3.2 Thinking deepseek/deepseek-v3.2-thinking | 128,000 | $0.62 | $1.85 | $0 | - |
DeepSeek V4 Flash deepseek/deepseek-v4-flash reasoning | 1,000,000 | $0.14 | $0.28 | $0.0028 | - |
DeepSeek V4 Pro deepseek/deepseek-v4-pro reasoning | 1,000,000 | $0.435 | $0.87 | $0.0036 | - |
DeepSeek-R1 deepseek/deepseek-r1 reasoning | 128,000 | $1.35 | $5.4 | $0 | - |
DeepSeek-V3.1 deepseek/deepseek-v3.1 reasoning | 163,840 | $0.56 | $1.68 | $0.28 | - |
Devstral 2 mistral/devstral-2 | 256,000 | $0.4 | $2 | $0 | - |
Devstral Small 1.1 mistral/devstral-small | 128,000 | $0.1 | $0.3 | $0 | - |
Devstral Small 2 mistral/devstral-small-2 | 256,000 | $0.1 | $0.3 | $0 | - |
Gemini 2.0 Flash google/gemini-2.0-flash | 1,048,576 | $0.15 | $0.6 | $0.025 | - |
Gemini 2.0 Flash Lite google/gemini-2.0-flash-lite | 1,048,576 | $0.075 | $0.3 | $0.02 | - |
Gemini 2.5 Flash google/gemini-2.5-flash reasoning | 1,000,000 | $0.3 | $2.5 | $0.03 | - |
Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite reasoning | 1,048,576 | $0.1 | $0.4 | $0.01 | - |
Gemini 2.5 Pro google/gemini-2.5-pro reasoning | 1,048,576 | $1.25 | $10 | $0.125 | - |
Gemini 3 Flash google/gemini-3-flash reasoning | 1,000,000 | $0.5 | $3 | $0.05 | - |
Gemini 3 Pro Preview google/gemini-3-pro-preview reasoning | 1,000,000 | $2 | $12 | $0.2 | - |
Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite reasoning | 1,000,000 | $0.25 | $1.5 | $0.03 | - |
Gemini 3.1 Flash Lite Preview google/gemini-3.1-flash-lite-preview reasoning | 1,000,000 | $0.25 | $1.5 | $0.03 | - |
Gemini 3.1 Pro Preview google/gemini-3.1-pro-preview reasoning | 1,000,000 | $2 | $12 | $0.2 | - |
Gemini 3.5 Flash google/gemini-3.5-flash reasoning | 1,000,000 | $1.5 | $9 | $0.15 | - |
Gemma 4 26B A4B IT google/gemma-4-26b-a4b-it | 262,144 | $0.13 | $0.4 | $0 | - |
Gemma 4 31B IT google/gemma-4-31b-it | 262,144 | $0.14 | $0.4 | $0 | - |
GLM 4.5 Air zai/glm-4.5-air reasoning | 128,000 | $0.2 | $1.1 | $0.03 | - |
GLM 4.5V zai/glm-4.5v | 66,000 | $0.6 | $1.8 | $0.11 | - |
GLM 4.6 zai/glm-4.6 reasoning | 200,000 | $0.6 | $2.2 | $0.11 | - |
GLM 4.7 zai/glm-4.7 reasoning | 131,000 | $2.25 | $2.75 | $2.25 | - |
GLM 4.7 Flash zai/glm-4.7-flash reasoning | 200,000 | $0.07 | $0.4 | $0 | - |
GLM 4.7 FlashX zai/glm-4.7-flashx reasoning | 200,000 | $0.06 | $0.4 | $0.01 | - |
GLM 5 zai/glm-5 reasoning | 202,800 | $1 | $3.2 | $0.2 | - |
GLM 5 Turbo zai/glm-5-turbo reasoning | 202,800 | $1.2 | $4 | $0.24 | - |
GLM 5.1 zai/glm-5.1 reasoning | 202,800 | $1.4 | $4.4 | $0.26 | - |
GLM 5V Turbo zai/glm-5v-turbo reasoning | 200,000 | $1.2 | $4 | $0.24 | - |
GLM-4.5 zai/glm-4.5 reasoning | 128,000 | $0.6 | $2.2 | $0.11 | - |
GLM-4.6V zai/glm-4.6v reasoning | 128,000 | $0.3 | $0.9 | $0.05 | - |
GLM-4.6V-Flash zai/glm-4.6v-flash reasoning | 128,000 | $0 | $0 | $0 | - |
GPT 5 Chat openai/gpt-5-chat reasoning | 128,000 | $1.25 | $10 | $0.125 | - |
GPT 5.1 Codex Max openai/gpt-5.1-codex-max reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT 5.1 Codex Mini openai/gpt-5.1-codex-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT 5.1 Thinking openai/gpt-5.1-thinking reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT 5.2 openai/gpt-5.2 reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT 5.2 openai/gpt-5.2-pro reasoning | 400,000 | $21 | $168 | $0 | - |
GPT 5.2 Chat openai/gpt-5.2-chat reasoning | 128,000 | $1.75 | $14 | $0.175 | - |
GPT 5.2 Codex openai/gpt-5.2-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT 5.3 Codex openai/gpt-5.3-codex reasoning | 400,000 | $1.75 | $14 | $0.175 | - |
GPT 5.4 openai/gpt-5.4 reasoning | 1,050,000 | $2.5 | $15 | $0.25 | - |
GPT 5.4 Mini openai/gpt-5.4-mini reasoning | 400,000 | $0.75 | $4.5 | $0.075 | - |
GPT 5.4 Nano openai/gpt-5.4-nano reasoning | 400,000 | $0.2 | $1.25 | $0.02 | - |
GPT 5.4 Pro openai/gpt-5.4-pro reasoning | 1,050,000 | $30 | $180 | $0 | - |
GPT 5.5 openai/gpt-5.5 reasoning | 1,000,000 | $5 | $30 | $0.5 | - |
GPT 5.5 Pro openai/gpt-5.5-pro reasoning | 1,000,000 | $30 | $180 | $0 | - |
GPT OSS 20B openai/gpt-oss-20b reasoning | 131,072 | $0.05 | $0.2 | $0 | - |
GPT OSS Safeguard 20B openai/gpt-oss-safeguard-20b reasoning | 131,072 | $0.075 | $0.3 | $0.037 | - |
GPT-4 Turbo openai/gpt-4-turbo | 128,000 | $10 | $30 | $0 | - |
GPT-4.1 openai/gpt-4.1 | 1,047,576 | $2 | $8 | $0.5 | - |
GPT-4.1 mini openai/gpt-4.1-mini | 1,047,576 | $0.4 | $1.6 | $0.1 | - |
GPT-4.1 nano openai/gpt-4.1-nano | 1,047,576 | $0.1 | $0.4 | $0.025 | - |
GPT-4o openai/gpt-4o | 128,000 | $2.5 | $10 | $1.25 | - |
GPT-4o mini openai/gpt-4o-mini | 128,000 | $0.15 | $0.6 | $0.075 | - |
GPT-5 openai/gpt-5 reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5 mini openai/gpt-5-mini reasoning | 400,000 | $0.25 | $2 | $0.025 | - |
GPT-5 nano openai/gpt-5-nano reasoning | 400,000 | $0.05 | $0.4 | $0.005 | - |
GPT-5 pro openai/gpt-5-pro reasoning | 400,000 | $15 | $120 | $0 | - |
GPT-5-Codex openai/gpt-5-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1 Instant openai/gpt-5.1-instant reasoning | 128,000 | $1.25 | $10 | $0.125 | - |
GPT-5.1-Codex openai/gpt-5.1-codex reasoning | 400,000 | $1.25 | $10 | $0.125 | - |
GPT-5.3 Chat openai/gpt-5.3-chat reasoning | 128,000 | $1.75 | $14 | $0.175 | - |
Grok 4.1 Fast Non-Reasoning xai/grok-4.1-fast-non-reasoning | 1,000,000 | $0.2 | $0.5 | $0.05 | - |
Grok 4.1 Fast Reasoning xai/grok-4.1-fast-reasoning reasoning | 1,000,000 | $0.2 | $0.5 | $0.05 | - |
Grok 4.20 Beta Non-Reasoning xai/grok-4.20-non-reasoning-beta | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.20 Beta Reasoning xai/grok-4.20-reasoning-beta reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.20 Multi Agent Beta xai/grok-4.20-multi-agent-beta reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.20 Multi-Agent xai/grok-4.20-multi-agent reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.20 Non-Reasoning xai/grok-4.20-non-reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.20 Reasoning xai/grok-4.20-reasoning reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.3 xai/grok-4.3 reasoning | 1,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok Build 0.1 xai/grok-build-0.1 reasoning | 256,000 | $1 | $2 | $0.2 | - |
Kat Coder Pro V2 kwaipilot/kat-coder-pro-v2 reasoning | 256,000 | $0.3 | $1.2 | $0.06 | - |
Kimi K2 Instruct moonshotai/kimi-k2 | 131,072 | $0.57 | $2.3 | $0 | - |
Kimi K2 Thinking moonshotai/kimi-k2-thinking reasoning | 262,114 | $0.6 | $2.5 | $0.15 | - |
Kimi K2 Thinking Turbo moonshotai/kimi-k2-thinking-turbo reasoning | 262,114 | $1.15 | $8 | $0.15 | - |
Kimi K2 Turbo moonshotai/kimi-k2-turbo | 256,000 | $1.15 | $8 | $0.15 | - |
Kimi K2.5 moonshotai/kimi-k2.5 reasoning | 262,114 | $0.6 | $3 | $0.1 | - |
Kimi K2.6 moonshotai/kimi-k2.6 reasoning | 262,000 | $0.95 | $4 | $0.16 | - |
Llama 3.1 70B Instruct meta/llama-3.1-70b | 128,000 | $0.72 | $0.72 | $0 | - |
Llama 3.1 8B Instruct meta/llama-3.1-8b | 128,000 | $0.22 | $0.22 | $0 | - |
Llama 3.2 11B Vision Instruct meta/llama-3.2-11b | 128,000 | $0.16 | $0.16 | $0 | - |
Llama 3.2 90B Vision Instruct meta/llama-3.2-90b | 128,000 | $0.72 | $0.72 | $0 | - |
Llama 3.3 70B Instruct meta/llama-3.3-70b | 128,000 | $0.72 | $0.72 | $0 | - |
Llama 4 Maverick 17B Instruct meta/llama-4-maverick | 128,000 | $0.24 | $0.97 | $0 | - |
Llama 4 Scout 17B Instruct meta/llama-4-scout | 128,000 | $0.17 | $0.66 | $0 | - |
LongCat Flash Chat meituan/longcat-flash-chat | 128,000 | $0 | $0 | $0 | - |
Mercury 2 inception/mercury-2 reasoning | 128,000 | $0.25 | $0.75 | $0.025 | - |
Mercury Coder Small Beta inception/mercury-coder-small | 32,000 | $0.25 | $1 | $0 | - |
MiMo M2.5 xiaomi/mimo-v2.5 reasoning | 1,050,000 | $0.4 | $2 | $0.08 | - |
MiMo V2 Flash xiaomi/mimo-v2-flash reasoning | 262,144 | $0.1 | $0.3 | $0.01 | - |
MiMo V2 Pro xiaomi/mimo-v2-pro reasoning | 1,000,000 | $1 | $3 | $0.2 | - |
MiMo V2.5 Pro xiaomi/mimo-v2.5-pro reasoning | 1,050,000 | $1 | $3 | $0.2 | - |
MiniMax M2 minimax/minimax-m2 reasoning | 205,000 | $0.3 | $1.2 | $0.03 | $0.375 |
MiniMax M2.1 minimax/minimax-m2.1 reasoning | 204,800 | $0.3 | $1.2 | $0.03 | $0.375 |
MiniMax M2.1 Lightning minimax/minimax-m2.1-lightning reasoning | 204,800 | $0.3 | $2.4 | $0.03 | $0.375 |
MiniMax M2.5 minimax/minimax-m2.5 reasoning | 204,800 | $0.3 | $1.2 | $0.03 | $0.375 |
MiniMax M2.5 High Speed minimax/minimax-m2.5-highspeed reasoning | 204,800 | $0.6 | $2.4 | $0.03 | $0.375 |
MiniMax M2.7 minimax/minimax-m2.7 reasoning | 204,800 | $0.3 | $1.2 | $0.06 | $0.375 |
MiniMax M2.7 High Speed minimax/minimax-m2.7-highspeed reasoning | 204,800 | $0.6 | $2.4 | $0.06 | $0.375 |
Ministral 3B mistral/ministral-3b | 128,000 | $0.1 | $0.1 | $0 | - |
Ministral 8B mistral/ministral-8b | 128,000 | $0.15 | $0.15 | $0 | - |
Mistral Codestral mistral/codestral | 128,000 | $0.3 | $0.9 | $0 | - |
Mistral Medium 3.1 mistral/mistral-medium | 128,000 | $0.4 | $2 | $0 | - |
Mistral Medium Latest mistral/mistral-medium-3.5 reasoning | 256,000 | $1.5 | $7.5 | $0 | - |
Mistral Small mistral/mistral-small | 32,000 | $0.1 | $0.3 | $0 | - |
Nvidia Nemotron Nano 12B V2 VL nvidia/nemotron-nano-12b-v2-vl reasoning | 131,072 | $0.2 | $0.6 | $0 | - |
Nvidia Nemotron Nano 9B V2 nvidia/nemotron-nano-9b-v2 reasoning | 131,072 | $0.06 | $0.23 | $0 | - |
o1 openai/o1 reasoning | 200,000 | $15 | $60 | $7.5 | - |
o3 openai/o3 reasoning | 200,000 | $2 | $8 | $0.5 | - |
o3 Pro openai/o3-pro reasoning | 200,000 | $20 | $80 | $0 | - |
o3-deep-research openai/o3-deep-research reasoning | 200,000 | $10 | $40 | $2.5 | - |
o3-mini openai/o3-mini reasoning | 200,000 | $1.1 | $4.4 | $0.55 | - |
o4-mini openai/o4-mini reasoning | 200,000 | $1.1 | $4.4 | $0.275 | - |
Pixtral 12B 2409 mistral/pixtral-12b | 128,000 | $0.15 | $0.15 | $0 | - |
Pixtral Large mistral/pixtral-large | 128,000 | $2 | $6 | $0 | - |
Qwen 3 32B alibaba/qwen-3-32b reasoning | 128,000 | $0.16 | $0.64 | $0 | - |
Qwen 3 Coder 30B A3B Instruct alibaba/qwen3-coder-30b-a3b reasoning | 262,144 | $0.15 | $0.6 | $0 | - |
Qwen 3 Max Thinking alibaba/qwen3-max-thinking reasoning | 256,000 | $1.2 | $6 | $0.24 | - |
Qwen 3.5 Flash alibaba/qwen3.5-flash reasoning | 1,000,000 | $0.1 | $0.4 | $0.001 | $0.125 |
Qwen 3.5 Plus alibaba/qwen3.5-plus reasoning | 1,000,000 | $0.4 | $2.4 | $0.04 | $0.5 |
Qwen 3.6 27B alibaba/qwen3.6-27b reasoning | 256,000 | $0.6 | $3.6 | $0 | - |
Qwen 3.6 Max Preview alibaba/qwen-3.6-max-preview reasoning | 240,000 | $1.3 | $7.8 | $0.26 | $1.625 |
Qwen 3.6 Plus alibaba/qwen3.6-plus reasoning | 1,000,000 | $0.5 | $3 | $0.1 | $0.625 |
Qwen 3.7 Max alibaba/qwen3.7-max reasoning | 991,000 | $1.25 | $3.75 | $0.25 | $1.5625 |
Qwen3 235B A22b Instruct 2507 alibaba/qwen-3-235b | 131,000 | $0.6 | $1.2 | $0.6 | - |
Qwen3 Coder 480B A35B Instruct alibaba/qwen3-coder | 262,144 | $1.5 | $7.5 | $0.3 | - |
Qwen3 Coder Next alibaba/qwen3-coder-next | 256,000 | $0.5 | $1.2 | $0 | - |
Qwen3 Coder Plus alibaba/qwen3-coder-plus | 1,000,000 | $1 | $5 | $0.2 | - |
Qwen3 Max alibaba/qwen3-max | 262,144 | $1.2 | $6 | $0.24 | - |
Qwen3 Max Preview alibaba/qwen3-max-preview | 262,144 | $1.2 | $6 | $0.24 | - |
Qwen3 VL 235B A22B Thinking alibaba/qwen3-235b-a22b-thinking reasoning | 131,072 | $0.4 | $4 | $0 | - |
Qwen3 VL 235B A22B Thinking alibaba/qwen3-vl-thinking reasoning | 131,072 | $0.4 | $4 | $0 | - |
Qwen3-14B alibaba/qwen-3-14b reasoning | 40,960 | $0.12 | $0.24 | $0 | - |
Qwen3-30B-A3B alibaba/qwen-3-30b reasoning | 40,960 | $0.08 | $0.29 | $0 | - |
Seed 1.6 bytedance/seed-1.6 reasoning | 256,000 | $0.25 | $2 | $0.05 | - |
Sonar perplexity/sonar | 127,000 | $0 | $0 | $0 | - |
Sonar Pro perplexity/sonar-pro | 200,000 | $0 | $0 | $0 | - |
Trinity Large Preview arcee-ai/trinity-large-preview | 131,000 | $0.25 | $1 | $0 | - |
Trinity Large Thinking arcee-ai/trinity-large-thinking reasoning | 262,100 | $0.25 | $0.9 | $0 | - |
| xai (7 models) | |||||
Grok 3 grok-3 | 131,072 | $3 | $15 | $0.75 | - |
Grok 3 Fast grok-3-fast | 131,072 | $5 | $25 | $1.25 | - |
Grok 4.20 (Non-Reasoning) grok-4.20-0309-non-reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.20 (Reasoning) grok-4.20-0309-reasoning reasoning | 2,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok 4.3 grok-4.3 reasoning | 1,000,000 | $1.25 | $2.5 | $0.2 | - |
Grok Build 0.1 grok-build-0.1 reasoning | 256,000 | $1 | $2 | $0.2 | - |
Grok Code Fast 1 grok-code-fast-1 | 32,768 | $0.2 | $1.5 | $0.02 | - |
| xiaomi (5 models) | |||||
MiMo-V2-Flash mimo-v2-flash reasoning | 262,144 | $0.1 | $0.3 | $0.01 | - |
MiMo-V2-Omni mimo-v2-omni reasoning | 262,144 | $0.4 | $2 | $0.08 | - |
MiMo-V2-Pro mimo-v2-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
MiMo-V2.5 mimo-v2.5 reasoning | 1,048,576 | $0.4 | $2 | $0.08 | - |
MiMo-V2.5-Pro mimo-v2.5-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
| xiaomi-token-plan-ams (5 models) | |||||
MiMo-V2-Flash mimo-v2-flash reasoning | 262,144 | $0.1 | $0.3 | $0.01 | - |
MiMo-V2-Omni mimo-v2-omni reasoning | 262,144 | $0.4 | $2 | $0.08 | - |
MiMo-V2-Pro mimo-v2-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
MiMo-V2.5 mimo-v2.5 reasoning | 1,048,576 | $0.4 | $2 | $0.08 | - |
MiMo-V2.5-Pro mimo-v2.5-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
| xiaomi-token-plan-cn (5 models) | |||||
MiMo-V2-Flash mimo-v2-flash reasoning | 262,144 | $0.1 | $0.3 | $0.01 | - |
MiMo-V2-Omni mimo-v2-omni reasoning | 262,144 | $0.4 | $2 | $0.08 | - |
MiMo-V2-Pro mimo-v2-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
MiMo-V2.5 mimo-v2.5 reasoning | 1,048,576 | $0.4 | $2 | $0.08 | - |
MiMo-V2.5-Pro mimo-v2.5-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
| xiaomi-token-plan-sgp (5 models) | |||||
MiMo-V2-Flash mimo-v2-flash reasoning | 262,144 | $0.1 | $0.3 | $0.01 | - |
MiMo-V2-Omni mimo-v2-omni reasoning | 262,144 | $0.4 | $2 | $0.08 | - |
MiMo-V2-Pro mimo-v2-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
MiMo-V2.5 mimo-v2.5 reasoning | 1,048,576 | $0.4 | $2 | $0.08 | - |
MiMo-V2.5-Pro mimo-v2.5-pro reasoning | 1,048,576 | $1 | $3 | $0.2 | - |
| zai (5 models) | |||||
GLM-4.5-Air glm-4.5-air reasoning | 131,072 | $0 | $0 | $0 | - |
GLM-4.7 glm-4.7 reasoning | 204,800 | $0 | $0 | $0 | - |
GLM-5-Turbo glm-5-turbo reasoning | 200,000 | $0 | $0 | $0 | - |
GLM-5.1 glm-5.1 reasoning | 200,000 | $0 | $0 | $0 | - |
GLM-5V-Turbo glm-5v-turbo reasoning | 200,000 | $0 | $0 | $0 | - |