Model Catalog

Models zot Knows About.

Generated from zot's built-in provider catalog in the source repo:packages/provider.

Models

943 / 943

ModelContextInput $/mOutput $/mCache Read $/mCache Write $/m
amazon-bedrock (90 models)
AU Anthropic Claude Opus 4.6
au.anthropic.claude-opus-4-6-v1
reasoning
1,000,000$16.5$82.5$0.5$6.25
AU Anthropic Claude Opus 4.8
au.anthropic.claude-opus-4-8
reasoning
1,000,000$16.5$82.5$0.5$6.25
AU Anthropic Claude Sonnet 4.6
au.anthropic.claude-sonnet-4-6
reasoning
1,000,000$3.3$16.5$0.33$4.125
Claude Haiku 4.5
anthropic.claude-haiku-4-5-20251001-v1:0
reasoning
200,000$1$5$0.1$1.25
Claude Haiku 4.5 (AU)
au.anthropic.claude-haiku-4-5-20251001-v1:0
reasoning
200,000$1$5$0.1$1.25
Claude Haiku 4.5 (EU)
eu.anthropic.claude-haiku-4-5-20251001-v1:0
reasoning
200,000$1$5$0.1$1.25
Claude Haiku 4.5 (Global)
global.anthropic.claude-haiku-4-5-20251001-v1:0
reasoning
200,000$1$5$0.1$1.25
Claude Haiku 4.5 (US)
us.anthropic.claude-haiku-4-5-20251001-v1:0
reasoning
200,000$1$5$0.1$1.25
Claude Opus 4.1
anthropic.claude-opus-4-1-20250805-v1:0
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.1 (US)
us.anthropic.claude-opus-4-1-20250805-v1:0
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.5
anthropic.claude-opus-4-5-20251101-v1:0
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.5 (EU)
eu.anthropic.claude-opus-4-5-20251101-v1:0
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.5 (Global)
global.anthropic.claude-opus-4-5-20251101-v1:0
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.5 (US)
us.anthropic.claude-opus-4-5-20251101-v1:0
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.6
anthropic.claude-opus-4-6-v1
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.6 (EU)
eu.anthropic.claude-opus-4-6-v1
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.6 (Global)
global.anthropic.claude-opus-4-6-v1
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.6 (US)
us.anthropic.claude-opus-4-6-v1
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7
anthropic.claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7 (EU)
eu.anthropic.claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7 (Global)
global.anthropic.claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7 (JP)
jp.anthropic.claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7 (US)
us.anthropic.claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8
anthropic.claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8 (EU)
eu.anthropic.claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8 (Global)
global.anthropic.claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8 (JP)
jp.anthropic.claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8 (US)
us.anthropic.claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Sonnet 4.5
anthropic.claude-sonnet-4-5-20250929-v1:0
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5 (AU)
au.anthropic.claude-sonnet-4-5-20250929-v1:0
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5 (EU)
eu.anthropic.claude-sonnet-4-5-20250929-v1:0
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5 (Global)
global.anthropic.claude-sonnet-4-5-20250929-v1:0
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5 (JP)
jp.anthropic.claude-sonnet-4-5-20250929-v1:0
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5 (US)
us.anthropic.claude-sonnet-4-5-20250929-v1:0
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.6
anthropic.claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
Claude Sonnet 4.6 (EU)
eu.anthropic.claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
Claude Sonnet 4.6 (Global)
global.anthropic.claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
Claude Sonnet 4.6 (JP)
jp.anthropic.claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
Claude Sonnet 4.6 (US)
us.anthropic.claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
DeepSeek-R1
deepseek.r1-v1:0
reasoning
128,000$1.35$5.4$0-
DeepSeek-R1 (US)
us.deepseek.r1-v1:0
reasoning
128,000$1.35$5.4$0-
DeepSeek-V3.1
deepseek.v3-v1:0
reasoning
163,840$0.58$1.68$0-
DeepSeek-V3.2
deepseek.v3.2
reasoning
163,840$0.62$1.85$0-
Devstral 2 123B
mistral.devstral-2-123b
256,000$0.4$2$0-
Gemma 3 4B IT
google.gemma-3-4b-it
128,000$0.04$0.08$0-
GLM-4.7
zai.glm-4.7
reasoning
204,800$0.6$2.2$0-
GLM-4.7-Flash
zai.glm-4.7-flash
reasoning
200,000$0.07$0.4$0-
GLM-5
zai.glm-5
reasoning
202,752$1$3.2$0-
Google Gemma 3 27B Instruct
google.gemma-3-27b-it
202,752$0.12$0.2$0-
GPT OSS Safeguard 120B
openai.gpt-oss-safeguard-120b
128,000$0.15$0.6$0-
GPT OSS Safeguard 20B
openai.gpt-oss-safeguard-20b
128,000$0.07$0.2$0-
gpt-oss-120b
openai.gpt-oss-120b-1:0
128,000$0.15$0.6$0-
gpt-oss-20b
openai.gpt-oss-20b-1:0
128,000$0.07$0.3$0-
Kimi K2 Thinking
moonshot.kimi-k2-thinking
reasoning
262,143$0.6$2.5$0-
Kimi K2.5
moonshotai.kimi-k2.5
reasoning
262,143$0.6$3$0-
Llama 3.1 70B Instruct
meta.llama3-1-70b-instruct-v1:0
128,000$0.72$0.72$0-
Llama 3.1 8B Instruct
meta.llama3-1-8b-instruct-v1:0
128,000$0.22$0.22$0-
Llama 3.3 70B Instruct
meta.llama3-3-70b-instruct-v1:0
128,000$0.72$0.72$0-
Llama 4 Maverick 17B Instruct
meta.llama4-maverick-17b-instruct-v1:0
1,000,000$0.24$0.97$0-
Llama 4 Maverick 17B Instruct (US)
us.meta.llama4-maverick-17b-instruct-v1:0
1,000,000$0.24$0.97$0-
Llama 4 Scout 17B Instruct
meta.llama4-scout-17b-instruct-v1:0
3,500,000$0.17$0.66$0-
Llama 4 Scout 17B Instruct (US)
us.meta.llama4-scout-17b-instruct-v1:0
3,500,000$0.17$0.66$0-
Magistral Small 1.2
mistral.magistral-small-2509
reasoning
128,000$0.5$1.5$0-
MiniMax M2
minimax.minimax-m2
reasoning
204,608$0.3$1.2$0-
MiniMax M2.1
minimax.minimax-m2.1
reasoning
204,800$0.3$1.2$0-
MiniMax M2.5
minimax.minimax-m2.5
reasoning
196,608$0.3$1.2$0-
Ministral 14B 3.0
mistral.ministral-3-14b-instruct
128,000$0.2$0.2$0-
Ministral 3 3B
mistral.ministral-3-3b-instruct
256,000$0.1$0.1$0-
Ministral 3 8B
mistral.ministral-3-8b-instruct
128,000$0.15$0.15$0-
Mistral Large 3
mistral.mistral-large-3-675b-instruct
256,000$0.5$1.5$0-
Nova 2 Lite
amazon.nova-2-lite-v1:0
128,000$0.33$2.75$0-
Nova Lite
amazon.nova-lite-v1:0
300,000$0.06$0.24$0.015-
Nova Micro
amazon.nova-micro-v1:0
128,000$0.035$0.14$0.00875-
Nova Pro
amazon.nova-pro-v1:0
300,000$0.8$3.2$0.2-
NVIDIA Nemotron 3 Super 120B A12B
nvidia.nemotron-super-3-120b
reasoning
262,144$0.15$0.65$0-
NVIDIA Nemotron Nano 12B v2 VL BF16
nvidia.nemotron-nano-12b-v2
128,000$0.2$0.6$0-
NVIDIA Nemotron Nano 3 30B
nvidia.nemotron-nano-3-30b
reasoning
128,000$0.06$0.24$0-
NVIDIA Nemotron Nano 9B v2
nvidia.nemotron-nano-9b-v2
128,000$0.06$0.23$0-
Palmyra X4
writer.palmyra-x4-v1:0
reasoning
122,880$2.5$10$0-
Palmyra X5
writer.palmyra-x5-v1:0
reasoning
1,040,000$0.6$6$0-
Pixtral Large (25.02)
mistral.pixtral-large-2502-v1:0
128,000$2$6$0-
Qwen/Qwen3-Next-80B-A3B-Instruct
qwen.qwen3-next-80b-a3b
262,000$0.14$1.4$0-
Qwen/Qwen3-VL-235B-A22B-Instruct
qwen.qwen3-vl-235b-a22b
262,000$0.3$1.5$0-
Qwen3 235B A22B 2507
qwen.qwen3-235b-a22b-2507-v1:0
262,144$0.22$0.88$0-
Qwen3 32B (dense)
qwen.qwen3-32b-v1:0
reasoning
16,384$0.15$0.6$0-
Qwen3 Coder 30B A3B Instruct
qwen.qwen3-coder-30b-a3b-v1:0
262,144$0.15$0.6$0-
Qwen3 Coder 480B A35B Instruct
qwen.qwen3-coder-480b-a35b-v1:0
131,072$0.22$1.8$0-
Qwen3 Coder Next
qwen.qwen3-coder-next
reasoning
131,072$0.22$1.8$0-
Voxtral Mini 3B 2507
mistral.voxtral-mini-3b-2507
128,000$0.04$0.04$0-
Voxtral Small 24B 2507
mistral.voxtral-small-24b-2507
32,000$0.15$0.35$0-
anthropic (24 models)
Claude Haiku 3
claude-3-haiku-20240307
200,000$0.25$1.25$0.03$0.3
Claude Haiku 3.5
claude-3-5-haiku-20241022
200,000$0.8$4$0.08$1
Claude Haiku 3.5 (latest)
claude-3-5-haiku-latest
200,000$0.8$4$0.08$1
Claude Haiku 4.5
claude-haiku-4-5-20251001
reasoning
200,000$1$5$0.1$1.25
Claude Haiku 4.5 (latest)
claude-haiku-4-5
reasoning
200,000$1$5$0.1$1.25
Claude Opus 3
claude-3-opus-20240229
200,000$15$75$1.5$18.75
Claude Opus 4
claude-opus-4-20250514
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4 (latest)
claude-opus-4-0
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.1
claude-opus-4-1-20250805
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.1 (latest)
claude-opus-4-1
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.5
claude-opus-4-5-20251101
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.5 (latest)
claude-opus-4-5
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.6
claude-opus-4-6
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7
claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8
claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Sonnet 3
claude-3-sonnet-20240229
200,000$3$15$0.3$0.3
Claude Sonnet 3.5
claude-3-5-sonnet-20240620
200,000$3$15$0.3$3.75
Claude Sonnet 3.5 v2
claude-3-5-sonnet-20241022
200,000$3$15$0.3$3.75
Claude Sonnet 3.7
claude-3-7-sonnet-20250219
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4
claude-sonnet-4-20250514
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4 (latest)
claude-sonnet-4-0
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5
claude-sonnet-4-5-20250929
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5 (latest)
claude-sonnet-4-5
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.6
claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
azure-openai-responses (42 models)
GPT-4
gpt-4
8,192$30$60$0-
GPT-4 Turbo
gpt-4-turbo
128,000$10$30$0-
GPT-4.1
gpt-4.1
1,047,576$2$8$0.5-
GPT-4.1 mini
gpt-4.1-mini
1,047,576$0.4$1.6$0.1-
GPT-4.1 nano
gpt-4.1-nano
1,047,576$0.1$0.4$0.03-
GPT-4o
gpt-4o
128,000$2.5$10$1.25-
GPT-4o (2024-05-13)
gpt-4o-2024-05-13
128,000$5$15$0-
GPT-4o (2024-08-06)
gpt-4o-2024-08-06
128,000$2.5$10$1.25-
GPT-4o (2024-11-20)
gpt-4o-2024-11-20
128,000$2.5$10$1.25-
GPT-4o mini
gpt-4o-mini
128,000$0.15$0.6$0.08-
GPT-5
gpt-5
reasoning
400,000$1.25$10$0.125-
GPT-5 Chat Latest
gpt-5-chat-latest
128,000$1.25$10$0.125-
GPT-5 Mini
gpt-5-mini
reasoning
400,000$0.25$2$0.025-
GPT-5 Nano
gpt-5-nano
reasoning
400,000$0.05$0.4$0.005-
GPT-5 Pro
gpt-5-pro
reasoning
400,000$15$120$0-
GPT-5-Codex
gpt-5-codex
reasoning
400,000$1.25$10$0.125-
GPT-5.1
gpt-5.1
reasoning
400,000$1.25$10$0.13-
GPT-5.1 Chat
gpt-5.1-chat-latest
reasoning
128,000$1.25$10$0.125-
GPT-5.1 Codex
gpt-5.1-codex
reasoning
400,000$1.25$10$0.125-
GPT-5.1 Codex Max
gpt-5.1-codex-max
reasoning
400,000$1.25$10$0.125-
GPT-5.1 Codex mini
gpt-5.1-codex-mini
reasoning
400,000$0.25$2$0.025-
GPT-5.2
gpt-5.2
reasoning
400,000$1.75$14$0.175-
GPT-5.2 Chat
gpt-5.2-chat-latest
reasoning
128,000$1.75$14$0.175-
GPT-5.2 Codex
gpt-5.2-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.2 Pro
gpt-5.2-pro
reasoning
400,000$21$168$0-
GPT-5.3 Chat (latest)
gpt-5.3-chat-latest
128,000$1.75$14$0.175-
GPT-5.3 Codex
gpt-5.3-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.3 Codex Spark
gpt-5.3-codex-spark
reasoning
128,000$1.75$14$0.175-
GPT-5.4
gpt-5.4
reasoning
272,000$2.5$15$0.25-
GPT-5.4 mini
gpt-5.4-mini
reasoning
400,000$0.75$4.5$0.075-
GPT-5.4 nano
gpt-5.4-nano
reasoning
400,000$0.2$1.25$0.02-
GPT-5.4 Pro
gpt-5.4-pro
reasoning
1,050,000$30$180$0-
GPT-5.5
gpt-5.5
reasoning
272,000$5$30$0.5-
GPT-5.5 Pro
gpt-5.5-pro
reasoning
1,050,000$30$180$0-
o1
o1
reasoning
200,000$15$60$7.5-
o1-pro
o1-pro
reasoning
200,000$150$600$0-
o3
o3
reasoning
200,000$2$8$0.5-
o3-deep-research
o3-deep-research
reasoning
200,000$10$40$2.5-
o3-mini
o3-mini
reasoning
200,000$1.1$4.4$0.55-
o3-pro
o3-pro
reasoning
200,000$20$80$0-
o4-mini
o4-mini
reasoning
200,000$1.1$4.4$0.28-
o4-mini-deep-research
o4-mini-deep-research
reasoning
200,000$2$8$0.5-
cerebras (4 models)
GPT OSS 120B
gpt-oss-120b
reasoning
131,072$0.25$0.69$0-
Llama 3.1 8B
llama3.1-8b
32,000$0.1$0.1$0-
Qwen 3 235B Instruct
qwen-3-235b-a22b-instruct-2507
131,000$0.6$1.2$0-
Z.AI GLM-4.7
zai-glm-4.7
131,072$2.25$2.75$0-
cloudflare-ai-gateway (36 models)
Claude Haiku 3
claude-3-haiku
200,000$0.25$1.25$0.03$0.3
Claude Haiku 3.5 (latest)
claude-3-5-haiku
200,000$0.8$4$0.08$1
Claude Haiku 3.5 (latest)
claude-3.5-haiku
200,000$0.8$4$0.08$1
Claude Haiku 4.5 (latest)
claude-haiku-4-5
reasoning
200,000$1$5$0.1$1.25
Claude Opus 3
claude-3-opus
200,000$15$75$1.5$18.75
Claude Opus 4 (latest)
claude-opus-4
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.1 (latest)
claude-opus-4-1
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.5 (latest)
claude-opus-4-5
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.6 (latest)
claude-opus-4-6
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7
claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8
claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Sonnet 3
claude-3-sonnet
200,000$3$15$0.3$0.3
Claude Sonnet 3.5 v2
claude-3.5-sonnet
200,000$3$15$0.3$3.75
Claude Sonnet 4 (latest)
claude-sonnet-4
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5 (latest)
claude-sonnet-4-5
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.6
claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
GLM-4.7-Flash
workers-ai/@cf/zai-org/glm-4.7-flash
reasoning
131,072$0.06$0.4$0-
GPT-4
gpt-4
8,192$30$60$0-
GPT-4 Turbo
gpt-4-turbo
128,000$10$30$0-
GPT-4o
gpt-4o
128,000$2.5$10$1.25-
GPT-4o mini
gpt-4o-mini
128,000$0.15$0.6$0.08-
GPT-5.1
gpt-5.1
reasoning
400,000$1.25$10$0.13-
GPT-5.1 Codex
gpt-5.1-codex
reasoning
400,000$1.25$10$0.125-
GPT-5.2
gpt-5.2
reasoning
400,000$1.75$14$0.175-
GPT-5.2 Codex
gpt-5.2-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.3 Codex
gpt-5.3-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.4
gpt-5.4
reasoning
1,050,000$2.5$15$0.25-
GPT-5.5
gpt-5.5
reasoning
1,050,000$5$30$0.5-
Kimi K2.5
workers-ai/@cf/moonshotai/kimi-k2.5
reasoning
256,000$0.6$3$0.1-
Kimi K2.6
workers-ai/@cf/moonshotai/kimi-k2.6
reasoning
256,000$0.95$4$0.16-
Nemotron 3 Super 120B
workers-ai/@cf/nvidia/nemotron-3-120b-a12b
reasoning
256,000$0.5$1.5$0-
o1
o1
reasoning
200,000$15$60$7.5-
o3
o3
reasoning
200,000$2$8$0.5-
o3-mini
o3-mini
reasoning
200,000$1.1$4.4$0.55-
o3-pro
o3-pro
reasoning
200,000$20$80$0-
o4-mini
o4-mini
reasoning
200,000$1.1$4.4$0.28-
cloudflare-workers-ai (12 models)
Gemma 4 26B A4B IT
@cf/google/gemma-4-26b-a4b-it
reasoning
256,000$0.1$0.3$0-
GLM-4.7-Flash
@cf/zai-org/glm-4.7-flash
reasoning
131,072$0.0605$0.4$0-
GPT OSS 120B
@cf/openai/gpt-oss-120b
reasoning
128,000$0.35$0.75$0-
GPT OSS 20B
@cf/openai/gpt-oss-20b
reasoning
128,000$0.2$0.3$0-
Granite 4.0 H Micro
@cf/ibm-granite/granite-4.0-h-micro
131,000$0.017$0.112$0-
Kimi K2.5
@cf/moonshotai/kimi-k2.5
reasoning
256,000$0.6$3$0.1-
Kimi K2.6
@cf/moonshotai/kimi-k2.6
reasoning
262,144$0.95$4$0.16-
Llama 3.3 70B Instruct fp8 Fast
@cf/meta/llama-3.3-70b-instruct-fp8-fast
24,000$0.293$2.253$0-
Llama 4 Scout 17B 16E Instruct
@cf/meta/llama-4-scout-17b-16e-instruct
131,000$0.27$0.85$0-
Mistral Small 3.1 24B Instruct
@cf/mistralai/mistral-small-3.1-24b-instruct
128,000$0.351$0.555$0-
Nemotron 3 Super 120B
@cf/nvidia/nemotron-3-120b-a12b
reasoning
256,000$0.5$1.5$0-
Qwen3 30B A3b fp8
@cf/qwen/qwen3-30b-a3b-fp8
reasoning
32,768$0.0509$0.335$0-
deepseek (2 models)
DeepSeek V4 Flash
deepseek-v4-flash
reasoning
1,000,000$0.14$0.28$0.0028-
DeepSeek V4 Pro
deepseek-v4-pro
reasoning
1,000,000$0.435$0.87$0.003625-
fireworks (12 models)
DeepSeek V4 Flash
accounts/fireworks/models/deepseek-v4-flash
reasoning
1,000,000$0.14$0.28$0.03-
DeepSeek V4 Pro
accounts/fireworks/models/deepseek-v4-pro
reasoning
1,000,000$1.74$3.48$0.145-
GLM 5.1
accounts/fireworks/models/glm-5p1
reasoning
202,800$1.4$4.4$0.26-
GLM 5.1 Fast
accounts/fireworks/routers/glm-5p1-fast
reasoning
202,800$2.8$8.8$0.52-
GPT OSS 120B
accounts/fireworks/models/gpt-oss-120b
reasoning
131,072$0.15$0.6$0.015-
GPT OSS 20B
accounts/fireworks/models/gpt-oss-20b
reasoning
131,072$0.07$0.3$0.035-
Kimi K2.5
accounts/fireworks/models/kimi-k2p5
reasoning
256,000$0.6$3$0.1-
Kimi K2.6
accounts/fireworks/models/kimi-k2p6
reasoning
262,000$0.95$4$0.16-
Kimi K2.6 Turbo
accounts/fireworks/routers/kimi-k2p6-turbo
reasoning
262,000$2$8$0.3-
MiniMax-M2.5
accounts/fireworks/models/minimax-m2p5
reasoning
196,608$0.3$1.2$0.03-
MiniMax-M2.7
accounts/fireworks/models/minimax-m2p7
reasoning
196,608$0.3$1.2$0.06-
Qwen 3.6 Plus
accounts/fireworks/models/qwen3p6-plus
reasoning
128,000$0.5$3$0.1-
github-copilot (21 models)
Claude Haiku 4.5
claude-haiku-4.5
reasoning
144,000$0$0$0-
Claude Opus 4.5
claude-opus-4.5
reasoning
160,000$0$0$0-
Claude Opus 4.6
claude-opus-4.6
reasoning
1,000,000$0$0$0-
Claude Opus 4.7
claude-opus-4.7
reasoning
144,000$0$0$0-
Claude Opus 4.8
claude-opus-4.8
reasoning
144,000$0$0$0-
Claude Sonnet 4.5
claude-sonnet-4.5
reasoning
144,000$0$0$0-
Claude Sonnet 4.6
claude-sonnet-4.6
reasoning
1,000,000$0$0$0-
Gemini 2.5 Pro
gemini-2.5-pro
128,000$0$0$0-
Gemini 3 Flash
gemini-3-flash-preview
reasoning
128,000$0$0$0-
Gemini 3.1 Pro Preview
gemini-3.1-pro-preview
reasoning
128,000$0$0$0-
Gemini 3.5 Flash
gemini-3.5-flash
reasoning
128,000$0$0$0-
GPT-4.1
gpt-4.1
128,000$0$0$0-
GPT-4o
gpt-4o
128,000$0$0$0-
GPT-5-mini
gpt-5-mini
reasoning
264,000$0$0$0-
GPT-5.2
gpt-5.2
reasoning
264,000$0$0$0-
GPT-5.2-Codex
gpt-5.2-codex
reasoning
400,000$0$0$0-
GPT-5.3-Codex
gpt-5.3-codex
reasoning
400,000$0$0$0-
GPT-5.4
gpt-5.4
reasoning
400,000$0$0$0-
GPT-5.4 Mini
gpt-5.4-mini
reasoning
400,000$0$0$0-
GPT-5.5
gpt-5.5
reasoning
400,000$0$0$0-
Grok Code Fast 1
grok-code-fast-1
reasoning
128,000$0$0$0-
google (16 models)
Gemini 2.0 Flash
gemini-2.0-flash
1,048,576$0.1$0.4$0.025-
Gemini 2.0 Flash-Lite
gemini-2.0-flash-lite
1,048,576$0.075$0.3$0-
Gemini 2.5 Flash
gemini-2.5-flash
reasoning
1,048,576$0.3$2.5$0.03-
Gemini 2.5 Flash-Lite
gemini-2.5-flash-lite
reasoning
1,048,576$0.1$0.4$0.01-
Gemini 2.5 Pro
gemini-2.5-pro
reasoning
1,048,576$1.25$10$0.125-
Gemini 3 Flash Preview
gemini-3-flash-preview
reasoning
1,048,576$0.5$3$0.05-
Gemini 3 Pro Preview
gemini-3-pro-preview
reasoning
1,048,576$2$12$0.2-
Gemini 3.1 Flash Lite
gemini-3.1-flash-lite
reasoning
1,048,576$0.25$1.5$0.025-
Gemini 3.1 Flash Lite Preview
gemini-3.1-flash-lite-preview
reasoning
1,048,576$0.25$1.5$0.025-
Gemini 3.1 Pro Preview
gemini-3.1-pro-preview
reasoning
1,048,576$2$12$0.2-
Gemini 3.1 Pro Preview Custom Tools
gemini-3.1-pro-preview-customtools
reasoning
1,048,576$2$12$0.2-
Gemini 3.5 Flash
gemini-3.5-flash
reasoning
1,048,576$1.5$9$0.15-
Gemini Flash Latest
gemini-flash-latest
reasoning
1,048,576$0.3$2.5$0.075-
Gemini Flash-Lite Latest
gemini-flash-lite-latest
reasoning
1,048,576$0.1$0.4$0.025-
Gemma 4 26B A4B IT
gemma-4-26b-a4b-it
reasoning
262,144$0$0$0-
Gemma 4 31B IT
gemma-4-31b-it
reasoning
262,144$0$0$0-
google-vertex (13 models)
Gemini 1.5 Flash (Vertex)
gemini-1.5-flash
1,000,000$0.075$0.3$0.01875-
Gemini 1.5 Flash-8B (Vertex)
gemini-1.5-flash-8b
1,000,000$0.0375$0.15$0.01-
Gemini 1.5 Pro (Vertex)
gemini-1.5-pro
1,000,000$1.25$5$0.3125-
Gemini 2.0 Flash (Vertex)
gemini-2.0-flash
1,048,576$0.15$0.6$0.0375-
Gemini 2.0 Flash Lite (Vertex)
gemini-2.0-flash-lite
reasoning
1,048,576$0.075$0.3$0.01875-
Gemini 2.5 Flash (Vertex)
gemini-2.5-flash
reasoning
1,048,576$0.3$2.5$0.03-
Gemini 2.5 Flash Lite (Vertex)
gemini-2.5-flash-lite
reasoning
1,048,576$0.1$0.4$0.01-
Gemini 2.5 Flash Lite Preview 09-25 (Vertex)
gemini-2.5-flash-lite-preview-09-2025
reasoning
1,048,576$0.1$0.4$0.01-
Gemini 2.5 Pro (Vertex)
gemini-2.5-pro
reasoning
1,048,576$1.25$10$0.125-
Gemini 3 Flash Preview (Vertex)
gemini-3-flash-preview
reasoning
1,048,576$0.5$3$0.05-
Gemini 3 Pro Preview (Vertex)
gemini-3-pro-preview
reasoning
1,000,000$2$12$0.2-
Gemini 3.1 Pro Preview (Vertex)
gemini-3.1-pro-preview
reasoning
1,048,576$2$12$0.2-
Gemini 3.1 Pro Preview Custom Tools (Vertex)
gemini-3.1-pro-preview-customtools
reasoning
1,048,576$2$12$0.2-
groq (18 models)
Compound
groq/compound
reasoning
131,072$0$0$0-
Compound Mini
groq/compound-mini
reasoning
131,072$0$0$0-
DeepSeek R1 Distill Llama 70B
deepseek-r1-distill-llama-70b
reasoning
131,072$0.75$0.99$0-
Gemma 2 9B
gemma2-9b-it
8,192$0.2$0.2$0-
GPT OSS 120B
openai/gpt-oss-120b
reasoning
131,072$0.15$0.6$0-
GPT OSS 20B
openai/gpt-oss-20b
reasoning
131,072$0.075$0.3$0-
Kimi K2 Instruct
moonshotai/kimi-k2-instruct
131,072$1$3$0-
Kimi K2 Instruct 0905
moonshotai/kimi-k2-instruct-0905
262,144$1$3$0-
Llama 3 70B
llama3-70b-8192
8,192$0.59$0.79$0-
Llama 3 8B
llama3-8b-8192
8,192$0.05$0.08$0-
Llama 3.1 8B Instant
llama-3.1-8b-instant
131,072$0.05$0.08$0-
Llama 3.3 70B Versatile
llama-3.3-70b-versatile
131,072$0.59$0.79$0-
Llama 4 Maverick 17B
meta-llama/llama-4-maverick-17b-128e-instruct
131,072$0.2$0.6$0-
Llama 4 Scout 17B
meta-llama/llama-4-scout-17b-16e-instruct
131,072$0.11$0.34$0-
Mistral Saba 24B
mistral-saba-24b
32,768$0.79$0.79$0-
Qwen QwQ 32B
qwen-qwq-32b
reasoning
131,072$0.29$0.39$0-
Qwen3 32B
qwen/qwen3-32b
reasoning
131,072$0.29$0.59$0-
Safety GPT OSS 20B
openai/gpt-oss-safeguard-20b
reasoning
131,072$0.075$0.3$0.037-
huggingface (22 models)
DeepSeek V4 Pro
deepseek-ai/DeepSeek-V4-Pro
reasoning
1,048,576$1.74$3.48$0.145-
DeepSeek-R1-0528
deepseek-ai/DeepSeek-R1-0528
reasoning
163,840$3$5$0-
DeepSeek-V3.2
deepseek-ai/DeepSeek-V3.2
reasoning
163,840$0.28$0.4$0-
GLM-4.7
zai-org/GLM-4.7
reasoning
204,800$0.6$2.2$0.11-
GLM-4.7-Flash
zai-org/GLM-4.7-Flash
reasoning
200,000$0$0$0-
GLM-5
zai-org/GLM-5
reasoning
202,752$1$3.2$0.2-
GLM-5.1
zai-org/GLM-5.1
reasoning
202,752$1$3.2$0.2-
Kimi-K2-Instruct
moonshotai/Kimi-K2-Instruct
131,072$1$3$0-
Kimi-K2-Instruct-0905
moonshotai/Kimi-K2-Instruct-0905
262,144$1$3$0-
Kimi-K2-Thinking
moonshotai/Kimi-K2-Thinking
reasoning
262,144$0.6$2.5$0.15-
Kimi-K2.5
moonshotai/Kimi-K2.5
reasoning
262,144$0.6$3$0.1-
Kimi-K2.6
moonshotai/Kimi-K2.6
reasoning
262,144$0.95$4$0.16-
MiMo-V2-Flash
XiaomiMiMo/MiMo-V2-Flash
reasoning
262,144$0.1$0.3$0-
MiniMax-M2.1
MiniMaxAI/MiniMax-M2.1
reasoning
204,800$0.3$1.2$0-
MiniMax-M2.5
MiniMaxAI/MiniMax-M2.5
reasoning
204,800$0.3$1.2$0.03-
MiniMax-M2.7
MiniMaxAI/MiniMax-M2.7
reasoning
204,800$0.3$1.2$0.06-
Qwen3-235B-A22B-Thinking-2507
Qwen/Qwen3-235B-A22B-Thinking-2507
reasoning
262,144$0.3$3$0-
Qwen3-Coder-480B-A35B-Instruct
Qwen/Qwen3-Coder-480B-A35B-Instruct
262,144$2$2$0-
Qwen3-Coder-Next
Qwen/Qwen3-Coder-Next
262,144$0.2$1.5$0-
Qwen3-Next-80B-A3B-Instruct
Qwen/Qwen3-Next-80B-A3B-Instruct
262,144$0.25$1$0-
Qwen3-Next-80B-A3B-Thinking
Qwen/Qwen3-Next-80B-A3B-Thinking
262,144$0.3$2$0-
Qwen3.5-397B-A17B
Qwen/Qwen3.5-397B-A17B
reasoning
262,144$0.6$3.6$0-
kimi (2 models)
Kimi For Coding
kimi-for-coding
reasoning
262,144$0$0$0-
Kimi K2 Thinking
kimi-k2-thinking
reasoning
262,144----
minimax (2 models)
MiniMax-M2.7
MiniMax-M2.7
reasoning
204,800$0.3$1.2$0.06$0.375
MiniMax-M2.7-highspeed
MiniMax-M2.7-highspeed
reasoning
204,800$0.6$2.4$0.06$0.375
minimax-cn (2 models)
MiniMax-M2.7
MiniMax-M2.7
reasoning
204,800$0.3$1.2$0.06$0.375
MiniMax-M2.7-highspeed
MiniMax-M2.7-highspeed
reasoning
204,800$0.6$2.4$0.06$0.375
mistral (28 models)
Codestral (latest)
codestral-latest
256,000$0.3$0.9$0-
Devstral 2
devstral-2512
262,144$0.4$2$0-
Devstral 2 (latest)
devstral-medium-latest
262,144$0.4$2$0-
Devstral Medium
devstral-medium-2507
128,000$0.4$2$0-
Devstral Small
devstral-small-2507
128,000$0.1$0.3$0-
Devstral Small 2
labs-devstral-small-2512
256,000$0$0$0-
Devstral Small 2505
devstral-small-2505
128,000$0.1$0.3$0-
Magistral Medium (latest)
magistral-medium-latest
reasoning
128,000$2$5$0-
Magistral Small
magistral-small
reasoning
128,000$0.5$1.5$0-
Ministral 3B (latest)
ministral-3b-latest
128,000$0.04$0.04$0-
Ministral 8B (latest)
ministral-8b-latest
128,000$0.1$0.1$0-
Mistral 7B
open-mistral-7b
8,000$0.25$0.25$0-
Mistral Large (latest)
mistral-large-latest
262,144$0.5$1.5$0-
Mistral Large 2.1
mistral-large-2411
131,072$2$6$0-
Mistral Large 3
mistral-large-2512
262,144$0.5$1.5$0-
Mistral Medium (latest)
mistral-medium-latest
reasoning
262,144$1.5$7.5$0-
Mistral Medium 3
mistral-medium-2505
131,072$0.4$2$0-
Mistral Medium 3.1
mistral-medium-2508
262,144$0.4$2$0-
Mistral Medium 3.5
mistral-medium-2604
reasoning
262,144$1.5$7.5$0-
Mistral Medium 3.5
mistral-medium-3.5
reasoning
262,144$1.5$7.5$0-
Mistral Nemo
mistral-nemo
128,000$0.15$0.15$0-
Mistral Small (latest)
mistral-small-latest
reasoning
256,000$0.15$0.6$0-
Mistral Small 3.2
mistral-small-2506
128,000$0.1$0.3$0-
Mistral Small 4
mistral-small-2603
reasoning
256,000$0.15$0.6$0-
Mixtral 8x22B
open-mixtral-8x22b
64,000$2$6$0-
Mixtral 8x7B
open-mixtral-8x7b
32,000$0.7$0.7$0-
Pixtral 12B
pixtral-12b
128,000$0.15$0.15$0-
Pixtral Large (latest)
pixtral-large-latest
128,000$2$6$0-
moonshotai (7 models)
Kimi K2 0711
kimi-k2-0711-preview
131,072$0.6$2.5$0.15-
Kimi K2 0905
kimi-k2-0905-preview
262,144$0.6$2.5$0.15-
Kimi K2 Thinking
kimi-k2-thinking
reasoning
262,144$0.6$2.5$0.15-
Kimi K2 Thinking Turbo
kimi-k2-thinking-turbo
reasoning
262,144$1.15$8$0.15-
Kimi K2 Turbo
kimi-k2-turbo-preview
262,144$2.4$10$0.6-
Kimi K2.5
kimi-k2.5
reasoning
262,144$0.6$3$0.1-
Kimi K2.6
kimi-k2.6
reasoning
262,144$0.95$4$0.16-
moonshotai-cn (7 models)
Kimi K2 0711
kimi-k2-0711-preview
131,072$0.6$2.5$0.15-
Kimi K2 0905
kimi-k2-0905-preview
262,144$0.6$2.5$0.15-
Kimi K2 Thinking
kimi-k2-thinking
reasoning
262,144$0.6$2.5$0.15-
Kimi K2 Thinking Turbo
kimi-k2-thinking-turbo
reasoning
262,144$1.15$8$0.15-
Kimi K2 Turbo
kimi-k2-turbo-preview
262,144$2.4$10$0.6-
Kimi K2.5
kimi-k2.5
reasoning
262,144$0.6$3$0.1-
Kimi K2.6
kimi-k2.6
reasoning
262,144$0.95$4$0.16-
openai (42 models)
GPT-4
gpt-4
8,192$30$60$0-
GPT-4 Turbo
gpt-4-turbo
128,000$10$30$0-
GPT-4.1
gpt-4.1
1,047,576$2$8$0.5-
GPT-4.1 mini
gpt-4.1-mini
1,047,576$0.4$1.6$0.1-
GPT-4.1 nano
gpt-4.1-nano
1,047,576$0.1$0.4$0.03-
GPT-4o
gpt-4o
128,000$2.5$10$1.25-
GPT-4o (2024-05-13)
gpt-4o-2024-05-13
128,000$5$15$0-
GPT-4o (2024-08-06)
gpt-4o-2024-08-06
128,000$2.5$10$1.25-
GPT-4o (2024-11-20)
gpt-4o-2024-11-20
128,000$2.5$10$1.25-
GPT-4o mini
gpt-4o-mini
128,000$0.15$0.6$0.08-
GPT-5
gpt-5
reasoning
400,000$1.25$10$0.125-
GPT-5 Chat Latest
gpt-5-chat-latest
128,000$1.25$10$0.125-
GPT-5 Mini
gpt-5-mini
reasoning
400,000$0.25$2$0.025-
GPT-5 Nano
gpt-5-nano
reasoning
400,000$0.05$0.4$0.005-
GPT-5 Pro
gpt-5-pro
reasoning
400,000$15$120$0-
GPT-5-Codex
gpt-5-codex
reasoning
400,000$1.25$10$0.125-
GPT-5.1
gpt-5.1
reasoning
400,000$1.25$10$0.13-
GPT-5.1 Chat
gpt-5.1-chat-latest
reasoning
128,000$1.25$10$0.125-
GPT-5.1 Codex
gpt-5.1-codex
reasoning
400,000$1.25$10$0.125-
GPT-5.1 Codex Max
gpt-5.1-codex-max
reasoning
400,000$1.25$10$0.125-
GPT-5.1 Codex mini
gpt-5.1-codex-mini
reasoning
400,000$0.25$2$0.025-
GPT-5.2
gpt-5.2
reasoning
400,000$1.75$14$0.175-
GPT-5.2 Chat
gpt-5.2-chat-latest
reasoning
128,000$1.75$14$0.175-
GPT-5.2 Codex
gpt-5.2-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.2 Pro
gpt-5.2-pro
reasoning
400,000$21$168$0-
GPT-5.3 Chat (latest)
gpt-5.3-chat-latest
128,000$1.75$14$0.175-
GPT-5.3 Codex
gpt-5.3-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.3 Codex Spark
gpt-5.3-codex-spark
reasoning
128,000$1.75$14$0.175-
GPT-5.4
gpt-5.4
reasoning
272,000$2.5$15$0.25-
GPT-5.4 mini
gpt-5.4-mini
reasoning
400,000$0.75$4.5$0.075-
GPT-5.4 nano
gpt-5.4-nano
reasoning
400,000$0.2$1.25$0.02-
GPT-5.4 Pro
gpt-5.4-pro
reasoning
1,050,000$30$180$0-
GPT-5.5
gpt-5.5
reasoning
272,000$5$30$0.5-
GPT-5.5 Pro
gpt-5.5-pro
reasoning
1,050,000$30$180$0-
o1
o1
reasoning
200,000$15$60$7.5-
o1-pro
o1-pro
reasoning
200,000$150$600$0-
o3
o3
reasoning
200,000$2$8$0.5-
o3-deep-research
o3-deep-research
reasoning
200,000$10$40$2.5-
o3-mini
o3-mini
reasoning
200,000$1.1$4.4$0.55-
o3-pro
o3-pro
reasoning
200,000$20$80$0-
o4-mini
o4-mini
reasoning
200,000$1.1$4.4$0.28-
o4-mini-deep-research
o4-mini-deep-research
reasoning
200,000$2$8$0.5-
openai-codex (6 models)
GPT-5.2
gpt-5.2
reasoning
272,000$1.75$14$0.175-
GPT-5.3 Codex
gpt-5.3-codex
reasoning
272,000$1.75$14$0.175-
GPT-5.3 Codex Spark
gpt-5.3-codex-spark
reasoning
272,000$1.75$14$0.175-
GPT-5.4
gpt-5.4
reasoning
272,000$2.5$15$0.25-
GPT-5.4 mini
gpt-5.4-mini
reasoning
272,000$0.75$4.5$0.075-
GPT-5.5
gpt-5.5
reasoning
272,000$5$30$0.5-
openai-responses (6 models)
GPT-5 (Responses)
gpt-5
reasoning
400,000$1.25$10$0.125-
GPT-5 Codex (Responses)
gpt-5-codex
reasoning
400,000$1.25$10$0.125-
GPT-5 mini (Responses)
gpt-5-mini
reasoning
400,000$0.25$2$0.025-
GPT-5 nano (Responses)
gpt-5-nano
reasoning
400,000$0.05$0.4$0.005-
o3 (Responses)
o3
reasoning
200,000$2$8$0.5-
o4-mini (Responses)
o4-mini
reasoning
200,000$1.1$4.4$0.275-
opencode (40 models)
Big Pickle
big-pickle
reasoning
200,000$0$0$0-
Claude Haiku 4.5
claude-haiku-4-5
reasoning
200,000$1$5$0.1$1.25
Claude Opus 4.1
claude-opus-4-1
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.5
claude-opus-4-5
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.6
claude-opus-4-6
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7
claude-opus-4-7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8
claude-opus-4-8
reasoning
1,000,000$5$25$0.5$6.25
Claude Sonnet 4
claude-sonnet-4
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.5
claude-sonnet-4-5
reasoning
200,000$3$15$0.3$3.75
Claude Sonnet 4.6
claude-sonnet-4-6
reasoning
1,000,000$3$15$0.3$3.75
DeepSeek V4 Flash Free
deepseek-v4-flash-free
reasoning
200,000$0$0$0-
Gemini 3 Flash
gemini-3-flash
reasoning
1,048,576$0.5$3$0.05-
Gemini 3.1 Pro Preview
gemini-3.1-pro
reasoning
1,048,576$2$12$0.2-
Gemini 3.5 Flash
gemini-3.5-flash
reasoning
1,048,576$1.5$9$0.15-
GLM-5
glm-5
reasoning
204,800$1$3.2$0.2-
GLM-5.1
glm-5.1
reasoning
204,800$1.4$4.4$0.26-
GPT-5
gpt-5
reasoning
400,000$1.07$8.5$0.107-
GPT-5 Codex
gpt-5-codex
reasoning
400,000$1.07$8.5$0.107-
GPT-5 Nano
gpt-5-nano
reasoning
400,000$0.05$0.4$0.005-
GPT-5.1
gpt-5.1
reasoning
400,000$1.07$8.5$0.107-
GPT-5.1 Codex
gpt-5.1-codex
reasoning
400,000$1.07$8.5$0.107-
GPT-5.1 Codex Max
gpt-5.1-codex-max
reasoning
400,000$1.25$10$0.125-
GPT-5.1 Codex Mini
gpt-5.1-codex-mini
reasoning
400,000$0.25$2$0.025-
GPT-5.2
gpt-5.2
reasoning
400,000$1.75$14$0.175-
GPT-5.2 Codex
gpt-5.2-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.3 Codex
gpt-5.3-codex
reasoning
400,000$1.75$14$0.175-
GPT-5.4
gpt-5.4
reasoning
272,000$2.5$15$0.25-
GPT-5.4 Mini
gpt-5.4-mini
reasoning
400,000$0.75$4.5$0.075-
GPT-5.4 Nano
gpt-5.4-nano
reasoning
400,000$0.2$1.25$0.02-
GPT-5.4 Pro
gpt-5.4-pro
reasoning
1,050,000$30$180$30-
GPT-5.5
gpt-5.5
reasoning
1,050,000$5$30$0.5-
GPT-5.5 Pro
gpt-5.5-pro
reasoning
1,050,000$30$180$30-
Grok Build 0.1
grok-build-0.1
reasoning
256,000$1$2$0.2-
Kimi K2.5
kimi-k2.5
reasoning
262,144$0.6$3$0.08-
Kimi K2.6
kimi-k2.6
reasoning
262,144$0.95$4$0.16-
MiniMax M2.5
minimax-m2.5
reasoning
204,800$0.3$1.2$0.06-
MiniMax M2.7
minimax-m2.7
reasoning
204,800$0.3$1.2$0.06-
Nemotron 3 Super Free
nemotron-3-super-free
reasoning
204,800$0$0$0-
Qwen3.5 Plus
qwen3.5-plus
reasoning
262,144$0.2$1.2$0.02$0.25
Qwen3.6 Plus
qwen3.6-plus
reasoning
262,144$0.5$3$0.05$0.625
opencode-go (12 models)
DeepSeek V4 Flash
deepseek-v4-flash
reasoning
1,000,000$0.14$0.28$0.0028-
DeepSeek V4 Pro
deepseek-v4-pro
reasoning
1,000,000$1.74$3.48$0.0145-
GLM-5
glm-5
reasoning
202,752$1$3.2$0.2-
GLM-5.1
glm-5.1
reasoning
202,752$1.4$4.4$0.26-
Kimi K2.5
kimi-k2.5
reasoning
262,144$0.6$3$0.1-
Kimi K2.6
kimi-k2.6
reasoning
262,144$0.95$4$0.16-
MiMo V2.5
mimo-v2.5
reasoning
1,000,000$0.4$2$0.08-
MiMo V2.5 Pro
mimo-v2.5-pro
reasoning
1,048,576$1$3$0.2-
MiniMax M2.5
minimax-m2.5
reasoning
204,800$0.3$1.2$0.03-
MiniMax M2.7
minimax-m2.7
reasoning
204,800$0.3$1.2$0.06-
Qwen3.5 Plus
qwen3.5-plus
reasoning
262,144$0.2$1.2$0.02$0.25
Qwen3.6 Plus
qwen3.6-plus
reasoning
262,144$0.5$3$0.05$0.625
openrouter (268 models)
AI21: Jamba Large 1.7
ai21/jamba-large-1.7
256,000$2$8$0-
Amazon: Nova 2 Lite
amazon/nova-2-lite-v1
reasoning
1,000,000$0.3$2.5$0-
Amazon: Nova Lite 1.0
amazon/nova-lite-v1
300,000$0.06$0.24$0-
Amazon: Nova Micro 1.0
amazon/nova-micro-v1
128,000$0.035$0.14$0-
Amazon: Nova Premier 1.0
amazon/nova-premier-v1
1,000,000$2.5$12.5$0.625-
Amazon: Nova Pro 1.0
amazon/nova-pro-v1
300,000$0.8$3.2$0-
Anthropic Claude Haiku Latest
~anthropic/claude-haiku-latest
reasoning
200,000$1$5$0.1$1.25
Anthropic Claude Sonnet Latest
~anthropic/claude-sonnet-latest
reasoning
1,000,000$3$15$0.3$3.75
Anthropic: Claude 3 Haiku
anthropic/claude-3-haiku
200,000$0.25$1.25$0.03$0.3
Anthropic: Claude 3.5 Haiku
anthropic/claude-3.5-haiku
200,000$0.8$4$0.08$1
Anthropic: Claude Haiku 4.5
anthropic/claude-haiku-4.5
reasoning
200,000$1$5$0.1$1.25
Anthropic: Claude Opus 4
anthropic/claude-opus-4
reasoning
200,000$15$75$1.5$18.75
Anthropic: Claude Opus 4.1
anthropic/claude-opus-4.1
reasoning
200,000$15$75$1.5$18.75
Anthropic: Claude Opus 4.5
anthropic/claude-opus-4.5
reasoning
200,000$5$25$0.5$6.25
Anthropic: Claude Opus 4.6
anthropic/claude-opus-4.6
reasoning
1,000,000$5$25$0.5$6.25
Anthropic: Claude Opus 4.6 (Fast)
anthropic/claude-opus-4.6-fast
reasoning
1,000,000$30$150$3$37.5
Anthropic: Claude Opus 4.7
anthropic/claude-opus-4.7
reasoning
1,000,000$5$25$0.5$6.25
Anthropic: Claude Opus 4.7 (Fast)
anthropic/claude-opus-4.7-fast
reasoning
1,000,000$30$150$3$37.5
Anthropic: Claude Opus 4.8
anthropic/claude-opus-4.8
reasoning
1,000,000$5$25$0.5$6.25
Anthropic: Claude Opus 4.8 (Fast)
anthropic/claude-opus-4.8-fast
reasoning
1,000,000$30$150$3$37.5
Anthropic: Claude Opus Latest
~anthropic/claude-opus-latest
reasoning
1,000,000$5$25$0.5$6.25
Anthropic: Claude Sonnet 4
anthropic/claude-sonnet-4
reasoning
1,000,000$3$15$0.3$3.75
Anthropic: Claude Sonnet 4.5
anthropic/claude-sonnet-4.5
reasoning
1,000,000$3$15$0.3$3.75
Anthropic: Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
reasoning
1,000,000$3$15$0.3$3.75
Arcee AI: Trinity Large Thinking
arcee-ai/trinity-large-thinking
reasoning
262,144$0.22$0.85$0.06-
Arcee AI: Trinity Large Thinking (free)
arcee-ai/trinity-large-thinking:free
reasoning
262,144$0$0$0-
Arcee AI: Trinity Mini
arcee-ai/trinity-mini
reasoning
131,072$0.045$0.15$0-
Arcee AI: Virtuoso Large
arcee-ai/virtuoso-large
131,072$0.75$1.2$0-
Auto
auto
reasoning
2,000,000$0$0$0-
Auto Router
openrouter/auto
reasoning
2,000,000--$0-
Baidu Qianfan: CoBuddy (free)
baidu/cobuddy:free
reasoning
131,072$0$0$0-
Baidu: ERNIE 4.5 21B A3B
baidu/ernie-4.5-21b-a3b
131,072$0.07$0.28$0-
Baidu: ERNIE 4.5 VL 28B A3B
baidu/ernie-4.5-vl-28b-a3b
reasoning
131,072$0.14$0.56$0-
ByteDance Seed: Seed 1.6
bytedance-seed/seed-1.6
reasoning
262,144$0.25$2$0-
ByteDance Seed: Seed 1.6 Flash
bytedance-seed/seed-1.6-flash
reasoning
262,144$0.075$0.3$0-
ByteDance Seed: Seed-2.0-Lite
bytedance-seed/seed-2.0-lite
reasoning
262,144$0.25$2$0-
ByteDance Seed: Seed-2.0-Mini
bytedance-seed/seed-2.0-mini
reasoning
262,144$0.1$0.4$0-
Cohere: Command R (08-2024)
cohere/command-r-08-2024
128,000$0.15$0.6$0-
Cohere: Command R+ (08-2024)
cohere/command-r-plus-08-2024
128,000$2.5$10$0-
DeepSeek: DeepSeek V3
deepseek/deepseek-chat
163,840$0.32$0.89$0-
DeepSeek: DeepSeek V3 0324
deepseek/deepseek-chat-v3-0324
163,840$0.2$0.77$0.135-
DeepSeek: DeepSeek V3.1
deepseek/deepseek-chat-v3.1
reasoning
163,840$0.21$0.79$0.13-
DeepSeek: DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
reasoning
163,840$0.27$0.95$0.13-
DeepSeek: DeepSeek V3.2
deepseek/deepseek-v3.2
reasoning
131,072$0.252$0.378$0.0252-
DeepSeek: DeepSeek V3.2 Exp
deepseek/deepseek-v3.2-exp
reasoning
163,840$0.27$0.41$0-
DeepSeek: DeepSeek V4 Flash
deepseek/deepseek-v4-flash
reasoning
1,048,576$0.1$0.2$0.02-
DeepSeek: DeepSeek V4 Flash (free)
deepseek/deepseek-v4-flash:free
reasoning
1,048,576$0$0$0-
DeepSeek: DeepSeek V4 Pro
deepseek/deepseek-v4-pro
reasoning
1,048,576$0.435$0.87$0.003625-
DeepSeek: R1
deepseek/deepseek-r1
reasoning
163,840$0.7$2.5$0-
DeepSeek: R1 0528
deepseek/deepseek-r1-0528
reasoning
163,840$0.5$2.15$0.35-
EssentialAI: Rnj 1 Instruct
essentialai/rnj-1-instruct
32,768$0.15$0.15$0-
Free Models Router
openrouter/free
reasoning
200,000$0$0$0-
Google Gemini Flash Latest
~google/gemini-flash-latest
reasoning
1,048,576$1.5$9$0.15$0.0833333333333
Google Gemini Pro Latest
~google/gemini-pro-latest
reasoning
1,048,576$2$12$0.2$0.375
Google: Gemini 2.0 Flash
google/gemini-2.0-flash-001
1,000,000$0.1$0.4$0.025$0.0833333333333
Google: Gemini 2.0 Flash Lite
google/gemini-2.0-flash-lite-001
1,048,576$0.075$0.3$0-
Google: Gemini 2.5 Flash
google/gemini-2.5-flash
reasoning
1,048,576$0.3$2.5$0.03$0.0833333333333
Google: Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
reasoning
1,048,576$0.1$0.4$0.01$0.0833333333333
Google: Gemini 2.5 Flash Lite Preview 09-2025
google/gemini-2.5-flash-lite-preview-09-2025
reasoning
1,048,576$0.1$0.4$0.01$0.0833333333333
Google: Gemini 2.5 Pro
google/gemini-2.5-pro
reasoning
1,048,576$1.25$10$0.125$0.375
Google: Gemini 2.5 Pro Preview 05-06
google/gemini-2.5-pro-preview-05-06
reasoning
1,048,576$1.25$10$0.125$0.375
Google: Gemini 2.5 Pro Preview 06-05
google/gemini-2.5-pro-preview
reasoning
1,048,576$1.25$10$0.125$0.375
Google: Gemini 3 Flash Preview
google/gemini-3-flash-preview
reasoning
1,048,576$0.5$3$0.05$0.0833333333333
Google: Gemini 3.1 Flash Lite
google/gemini-3.1-flash-lite
reasoning
1,048,576$0.25$1.5$0.025$0.0833333333333
Google: Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview
reasoning
1,048,576$0.25$1.5$0.025$0.0833333333333
Google: Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
reasoning
1,048,576$2$12$0.2$0.375
Google: Gemini 3.1 Pro Preview Custom Tools
google/gemini-3.1-pro-preview-customtools
reasoning
1,048,756$2$12$0.2$0.375
Google: Gemini 3.5 Flash
google/gemini-3.5-flash
reasoning
1,048,576$1.5$9$0.15$0.0833333333333
Google: Gemma 3 12B
google/gemma-3-12b-it
131,072$0.04$0.13$0-
Google: Gemma 3 27B
google/gemma-3-27b-it
131,072$0.08$0.16$0-
Google: Gemma 4 26B A4B
google/gemma-4-26b-a4b-it
reasoning
262,144$0.06$0.33$0-
Google: Gemma 4 26B A4B (free)
google/gemma-4-26b-a4b-it:free
reasoning
262,144$0$0$0-
Google: Gemma 4 31B
google/gemma-4-31b-it
reasoning
262,144$0.12$0.37$0-
Google: Gemma 4 31B (free)
google/gemma-4-31b-it:free
reasoning
262,144$0$0$0-
IBM: Granite 4.1 8B
ibm-granite/granite-4.1-8b
131,072$0.05$0.1$0.05-
Inception: Mercury 2
inception/mercury-2
reasoning
128,000$0.25$0.75$0.025-
inclusionAI: Ling-2.6-1T
inclusionai/ling-2.6-1t
262,144$0.075$0.625$0.015-
inclusionAI: Ling-2.6-flash
inclusionai/ling-2.6-flash
262,144$0.01$0.03$0.002-
inclusionAI: Ring-2.6-1T
inclusionai/ring-2.6-1t
reasoning
262,144$0.075$0.625$0.015-
Kwaipilot: KAT-Coder-Pro V2
kwaipilot/kat-coder-pro-v2
256,000$0.3$1.2$0.06-
Meta: Llama 3.1 70B Instruct
meta-llama/llama-3.1-70b-instruct
131,072$0.4$0.4$0-
Meta: Llama 3.1 8B Instruct
meta-llama/llama-3.1-8b-instruct
131,072$0.02$0.05$0-
Meta: Llama 3.3 70B Instruct
meta-llama/llama-3.3-70b-instruct
131,072$0.1$0.32$0-
Meta: Llama 3.3 70B Instruct (free)
meta-llama/llama-3.3-70b-instruct:free
131,072$0$0$0-
Meta: Llama 4 Scout
meta-llama/llama-4-scout
10,000,000$0.08$0.3$0-
MiniMax: MiniMax M1
minimax/minimax-m1
reasoning
1,000,000$0.4$2.2$0-
MiniMax: MiniMax M2
minimax/minimax-m2
reasoning
204,800$0.255$1$0.03-
MiniMax: MiniMax M2.1
minimax/minimax-m2.1
reasoning
204,800$0.29$0.95$0.03-
MiniMax: MiniMax M2.5
minimax/minimax-m2.5
reasoning
204,800$0.15$1.15$0-
MiniMax: MiniMax M2.5 (free)
minimax/minimax-m2.5:free
reasoning
204,800$0$0$0-
MiniMax: MiniMax M2.7
minimax/minimax-m2.7
reasoning
204,800$0.279$1.2$0-
Mistral Large
mistralai/mistral-large
128,000$2$6$0.2-
Mistral Large 2407
mistralai/mistral-large-2407
131,072$2$6$0.2-
Mistral Large 2411
mistralai/mistral-large-2411
131,072$2$6$0.2-
Mistral: Codestral 2508
mistralai/codestral-2508
256,000$0.3$0.9$0.03-
Mistral: Devstral 2 2512
mistralai/devstral-2512
262,144$0.4$2$0.04-
Mistral: Devstral Medium
mistralai/devstral-medium
131,072$0.4$2$0.04-
Mistral: Devstral Small 1.1
mistralai/devstral-small
131,072$0.1$0.3$0.01-
Mistral: Ministral 3 14B 2512
mistralai/ministral-14b-2512
262,144$0.2$0.2$0.02-
Mistral: Ministral 3 3B 2512
mistralai/ministral-3b-2512
131,072$0.1$0.1$0.01-
Mistral: Ministral 3 8B 2512
mistralai/ministral-8b-2512
262,144$0.15$0.15$0.015-
Mistral: Mistral Large 3 2512
mistralai/mistral-large-2512
262,144$0.5$1.5$0.05-
Mistral: Mistral Medium 3
mistralai/mistral-medium-3
131,072$0.4$2$0.04-
Mistral: Mistral Medium 3.1
mistralai/mistral-medium-3.1
131,072$0.4$2$0.04-
Mistral: Mistral Medium 3.5
mistralai/mistral-medium-3-5
reasoning
262,144$1.5$7.5$0-
Mistral: Mistral Nemo
mistralai/mistral-nemo
131,072$0.02$0.03$0-
Mistral: Mistral Small 3.2 24B
mistralai/mistral-small-3.2-24b-instruct
128,000$0.075$0.2$0-
Mistral: Mistral Small 4
mistralai/mistral-small-2603
reasoning
262,144$0.15$0.6$0.015-
Mistral: Mixtral 8x22B Instruct
mistralai/mixtral-8x22b-instruct
65,536$2$6$0.2-
Mistral: Pixtral Large 2411
mistralai/pixtral-large-2411
131,072$2$6$0.2-
Mistral: Saba
mistralai/mistral-saba
32,768$0.2$0.6$0.02-
Mistral: Voxtral Small 24B 2507
mistralai/voxtral-small-24b-2507
32,000$0.1$0.3$0.01-
MoonshotAI Kimi Latest
~moonshotai/kimi-latest
reasoning
262,144$0.73$3.49$0.25-
MoonshotAI: Kimi K2 0711
moonshotai/kimi-k2
131,072$0.57$2.3$0-
MoonshotAI: Kimi K2 0905
moonshotai/kimi-k2-0905
262,144$0.6$2.5$0-
MoonshotAI: Kimi K2 Thinking
moonshotai/kimi-k2-thinking
reasoning
262,144$0.6$2.5$0-
MoonshotAI: Kimi K2.5
moonshotai/kimi-k2.5
reasoning
262,144$0.41$2.06$0.07-
MoonshotAI: Kimi K2.6
moonshotai/kimi-k2.6
reasoning
262,144$0.73$3.49$0.25-
Nex AGI: DeepSeek V3.1 Nex N1
nex-agi/deepseek-v3.1-nex-n1
131,072$0.135$0.5$0-
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5
nvidia/llama-3.3-nemotron-super-49b-v1.5
reasoning
131,072$0.1$0.4$0-
NVIDIA: Nemotron 3 Nano 30B A3B
nvidia/nemotron-3-nano-30b-a3b
reasoning
262,144$0.05$0.2$0-
NVIDIA: Nemotron 3 Nano 30B A3B (free)
nvidia/nemotron-3-nano-30b-a3b:free
reasoning
256,000$0$0$0-
NVIDIA: Nemotron 3 Nano Omni (free)
nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free
reasoning
256,000$0$0$0-
NVIDIA: Nemotron 3 Super
nvidia/nemotron-3-super-120b-a12b
reasoning
1,000,000$0.09$0.45$0-
NVIDIA: Nemotron 3 Super (free)
nvidia/nemotron-3-super-120b-a12b:free
reasoning
1,000,000$0$0$0-
NVIDIA: Nemotron Nano 12B 2 VL (free)
nvidia/nemotron-nano-12b-v2-vl:free
reasoning
128,000$0$0$0-
NVIDIA: Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2
reasoning
131,072$0.04$0.16$0-
NVIDIA: Nemotron Nano 9B V2 (free)
nvidia/nemotron-nano-9b-v2:free
reasoning
128,000$0$0$0-
OpenAI GPT Latest
~openai/gpt-latest
reasoning
1,050,000$5$30$0.5-
OpenAI GPT Mini Latest
~openai/gpt-mini-latest
reasoning
400,000$0.75$4.5$0.075-
OpenAI: GPT Audio
openai/gpt-audio
128,000$2.5$10$0-
OpenAI: GPT Audio Mini
openai/gpt-audio-mini
128,000$0.6$2.4$0-
OpenAI: GPT Chat Latest
openai/gpt-chat-latest
400,000$5$30$0.5-
OpenAI: GPT-3.5 Turbo
openai/gpt-3.5-turbo
16,385$0.5$1.5$0-
OpenAI: GPT-3.5 Turbo (older v0613)
openai/gpt-3.5-turbo-0613
4,095$1$2$0-
OpenAI: GPT-3.5 Turbo 16k
openai/gpt-3.5-turbo-16k
16,385$3$4$0-
OpenAI: GPT-4
openai/gpt-4
8,191$30$60$0-
OpenAI: GPT-4 (older v0314)
openai/gpt-4-0314
8,191$30$60$0-
OpenAI: GPT-4 Turbo
openai/gpt-4-turbo
128,000$10$30$0-
OpenAI: GPT-4 Turbo (older v1106)
openai/gpt-4-1106-preview
128,000$10$30$0-
OpenAI: GPT-4 Turbo Preview
openai/gpt-4-turbo-preview
128,000$10$30$0-
OpenAI: GPT-4.1
openai/gpt-4.1
1,047,576$2$8$0.5-
OpenAI: GPT-4.1 Mini
openai/gpt-4.1-mini
1,047,576$0.4$1.6$0.1-
OpenAI: GPT-4.1 Nano
openai/gpt-4.1-nano
1,047,576$0.1$0.4$0.025-
OpenAI: GPT-4o
openai/gpt-4o
128,000$2.5$10$0-
OpenAI: GPT-4o (2024-05-13)
openai/gpt-4o-2024-05-13
128,000$5$15$0-
OpenAI: GPT-4o (2024-08-06)
openai/gpt-4o-2024-08-06
128,000$2.5$10$1.25-
OpenAI: GPT-4o (2024-11-20)
openai/gpt-4o-2024-11-20
128,000$2.5$10$1.25-
OpenAI: GPT-4o Audio
openai/gpt-4o-audio-preview
128,000$2.5$10$0-
OpenAI: GPT-4o-mini
openai/gpt-4o-mini
128,000$0.15$0.6$0.075-
OpenAI: GPT-4o-mini (2024-07-18)
openai/gpt-4o-mini-2024-07-18
128,000$0.15$0.6$0.075-
OpenAI: GPT-5
openai/gpt-5
reasoning
400,000$1.25$10$0.125-
OpenAI: GPT-5 Codex
openai/gpt-5-codex
reasoning
400,000$1.25$10$0.125-
OpenAI: GPT-5 Mini
openai/gpt-5-mini
reasoning
400,000$0.25$2$0.025-
OpenAI: GPT-5 Nano
openai/gpt-5-nano
reasoning
400,000$0.05$0.4$0.01-
OpenAI: GPT-5 Pro
openai/gpt-5-pro
reasoning
400,000$15$120$0-
OpenAI: GPT-5.1
openai/gpt-5.1
reasoning
400,000$1.25$10$0.13-
OpenAI: GPT-5.1 Chat
openai/gpt-5.1-chat
128,000$1.25$10$0.125-
OpenAI: GPT-5.1-Codex
openai/gpt-5.1-codex
reasoning
400,000$1.25$10$0.125-
OpenAI: GPT-5.1-Codex-Max
openai/gpt-5.1-codex-max
reasoning
400,000$1.25$10$0.125-
OpenAI: GPT-5.1-Codex-Mini
openai/gpt-5.1-codex-mini
reasoning
400,000$0.25$2$0.03-
OpenAI: GPT-5.2
openai/gpt-5.2
reasoning
400,000$1.75$14$0.175-
OpenAI: GPT-5.2 Chat
openai/gpt-5.2-chat
128,000$1.75$14$0.175-
OpenAI: GPT-5.2 Pro
openai/gpt-5.2-pro
reasoning
400,000$21$168$0-
OpenAI: GPT-5.2-Codex
openai/gpt-5.2-codex
reasoning
400,000$1.75$14$0.175-
OpenAI: GPT-5.3 Chat
openai/gpt-5.3-chat
128,000$1.75$14$0.175-
OpenAI: GPT-5.3-Codex
openai/gpt-5.3-codex
reasoning
400,000$1.75$14$0.175-
OpenAI: GPT-5.4
openai/gpt-5.4
reasoning
1,050,000$2.5$15$0.25-
OpenAI: GPT-5.4 Mini
openai/gpt-5.4-mini
reasoning
400,000$0.75$4.5$0.075-
OpenAI: GPT-5.4 Nano
openai/gpt-5.4-nano
reasoning
400,000$0.2$1.25$0.02-
OpenAI: GPT-5.4 Pro
openai/gpt-5.4-pro
reasoning
1,050,000$30$180$0-
OpenAI: GPT-5.5
openai/gpt-5.5
reasoning
1,050,000$5$30$0.5-
OpenAI: GPT-5.5 Pro
openai/gpt-5.5-pro
reasoning
1,050,000$30$180$0-
OpenAI: gpt-oss-120b
openai/gpt-oss-120b
reasoning
131,072$0.039$0.18$0-
OpenAI: gpt-oss-120b (free)
openai/gpt-oss-120b:free
reasoning
131,072$0$0$0-
OpenAI: gpt-oss-20b
openai/gpt-oss-20b
reasoning
131,072$0.03$0.14$0-
OpenAI: gpt-oss-20b (free)
openai/gpt-oss-20b:free
reasoning
131,072$0$0$0-
OpenAI: gpt-oss-safeguard-20b
openai/gpt-oss-safeguard-20b
reasoning
131,072$0.075$0.3$0.037-
OpenAI: o1
openai/o1
reasoning
200,000$15$60$7.5-
OpenAI: o3
openai/o3
reasoning
200,000$2$8$0.5-
OpenAI: o3 Deep Research
openai/o3-deep-research
reasoning
200,000$10$40$2.5-
OpenAI: o3 Mini
openai/o3-mini
reasoning
200,000$1.1$4.4$0.55-
OpenAI: o3 Mini High
openai/o3-mini-high
reasoning
200,000$1.1$4.4$0.55-
OpenAI: o3 Pro
openai/o3-pro
reasoning
200,000$20$80$0-
OpenAI: o4 Mini
openai/o4-mini
reasoning
200,000$1.1$4.4$0.275-
OpenAI: o4 Mini Deep Research
openai/o4-mini-deep-research
reasoning
200,000$2$8$0.5-
OpenAI: o4 Mini High
openai/o4-mini-high
reasoning
200,000$1.1$4.4$0.275-
Owl Alpha
openrouter/owl-alpha
1,048,756$0$0$0-
Poolside: Laguna M.1 (free)
poolside/laguna-m.1:free
reasoning
131,072$0$0$0-
Poolside: Laguna XS.2 (free)
poolside/laguna-xs.2:free
reasoning
131,072$0$0$0-
Prime Intellect: INTELLECT-3
prime-intellect/intellect-3
reasoning
131,072$0.2$1.1$0-
Qwen: Qwen Plus 0728
qwen/qwen-plus-2025-07-28
1,000,000$0.26$0.78$0$0.325
Qwen: Qwen Plus 0728 (thinking)
qwen/qwen-plus-2025-07-28:thinking
reasoning
1,000,000$0.26$0.78$0$0.325
Qwen: Qwen-Plus
qwen/qwen-plus
1,000,000$0.26$0.78$0.052$0.325
Qwen: Qwen2.5 7B Instruct
qwen/qwen-2.5-7b-instruct
131,072$0.04$0.1$0-
Qwen: Qwen3 14B
qwen/qwen3-14b
reasoning
131,702$0.1$0.24$0-
Qwen: Qwen3 235B A22B
qwen/qwen3-235b-a22b
reasoning
131,072$0.455$1.82$0-
Qwen: Qwen3 235B A22B Instruct 2507
qwen/qwen3-235b-a22b-2507
262,144$0.071$0.1$0-
Qwen: Qwen3 235B A22B Thinking 2507
qwen/qwen3-235b-a22b-thinking-2507
reasoning
262,144$0.1495$1.495$0-
Qwen: Qwen3 30B A3B
qwen/qwen3-30b-a3b
reasoning
131,072$0.09$0.45$0-
Qwen: Qwen3 30B A3B Instruct 2507
qwen/qwen3-30b-a3b-instruct-2507
262,144$0.09$0.3$0-
Qwen: Qwen3 30B A3B Thinking 2507
qwen/qwen3-30b-a3b-thinking-2507
reasoning
131,072$0.08$0.4$0.08-
Qwen: Qwen3 32B
qwen/qwen3-32b
reasoning
131,072$0.08$0.28$0-
Qwen: Qwen3 8B
qwen/qwen3-8b
reasoning
131,072$0.05$0.4$0.05-
Qwen: Qwen3 Coder 30B A3B Instruct
qwen/qwen3-coder-30b-a3b-instruct
160,000$0.07$0.27$0-
Qwen: Qwen3 Coder 480B A35B
qwen/qwen3-coder
1,048,576$0.22$1.8$0-
Qwen: Qwen3 Coder 480B A35B (free)
qwen/qwen3-coder:free
1,048,576$0$0$0-
Qwen: Qwen3 Coder Flash
qwen/qwen3-coder-flash
1,000,000$0.195$0.975$0.039$0.24375
Qwen: Qwen3 Coder Next
qwen/qwen3-coder-next
262,144$0.11$0.8$0.07-
Qwen: Qwen3 Coder Plus
qwen/qwen3-coder-plus
1,000,000$0.65$3.25$0.13$0.8125
Qwen: Qwen3 Max
qwen/qwen3-max
262,144$0.78$3.9$0.156$0.975
Qwen: Qwen3 Max Thinking
qwen/qwen3-max-thinking
reasoning
262,144$0.78$3.9$0-
Qwen: Qwen3 Next 80B A3B Instruct
qwen/qwen3-next-80b-a3b-instruct
262,144$0.09$1.1$0-
Qwen: Qwen3 Next 80B A3B Instruct (free)
qwen/qwen3-next-80b-a3b-instruct:free
262,144$0$0$0-
Qwen: Qwen3 Next 80B A3B Thinking
qwen/qwen3-next-80b-a3b-thinking
reasoning
262,144$0.0975$0.78$0-
Qwen: Qwen3 VL 235B A22B Instruct
qwen/qwen3-vl-235b-a22b-instruct
262,144$0.2$0.88$0.11-
Qwen: Qwen3 VL 235B A22B Thinking
qwen/qwen3-vl-235b-a22b-thinking
reasoning
131,072$0.26$2.6$0-
Qwen: Qwen3 VL 30B A3B Instruct
qwen/qwen3-vl-30b-a3b-instruct
262,144$0.13$0.52$0-
Qwen: Qwen3 VL 30B A3B Thinking
qwen/qwen3-vl-30b-a3b-thinking
reasoning
131,072$0.13$1.56$0-
Qwen: Qwen3 VL 32B Instruct
qwen/qwen3-vl-32b-instruct
262,144$0.104$0.416$0-
Qwen: Qwen3 VL 8B Instruct
qwen/qwen3-vl-8b-instruct
256,000$0.08$0.5$0-
Qwen: Qwen3 VL 8B Thinking
qwen/qwen3-vl-8b-thinking
reasoning
256,000$0.117$1.365$0-
Qwen: Qwen3.5 397B A17B
qwen/qwen3.5-397b-a17b
reasoning
262,144$0.39$2.34$0-
Qwen: Qwen3.5 Plus 2026-02-15
qwen/qwen3.5-plus-02-15
reasoning
1,000,000$0.26$1.56$0$0.325
Qwen: Qwen3.5 Plus 2026-04-20
qwen/qwen3.5-plus-20260420
reasoning
1,000,000$0.3$1.8$0-
Qwen: Qwen3.5-122B-A10B
qwen/qwen3.5-122b-a10b
reasoning
262,144$0.26$2.08$0-
Qwen: Qwen3.5-27B
qwen/qwen3.5-27b
reasoning
262,144$0.195$1.56$0-
Qwen: Qwen3.5-35B-A3B
qwen/qwen3.5-35b-a3b
reasoning
262,144$0.139$1$0-
Qwen: Qwen3.5-9B
qwen/qwen3.5-9b
reasoning
262,144$0.04$0.15$0-
Qwen: Qwen3.5-Flash
qwen/qwen3.5-flash-02-23
reasoning
1,000,000$0.065$0.26$0$0.08125
Qwen: Qwen3.6 27B
qwen/qwen3.6-27b
reasoning
262,144$0.3$3.2$0-
Qwen: Qwen3.6 35B A3B
qwen/qwen3.6-35b-a3b
reasoning
262,144$0.15$1$0-
Qwen: Qwen3.6 Flash
qwen/qwen3.6-flash
reasoning
1,000,000$0.1875$1.125$0$0.234375
Qwen: Qwen3.6 Max Preview
qwen/qwen3.6-max-preview
reasoning
262,144$1.04$6.24$0$1.3
Qwen: Qwen3.6 Plus
qwen/qwen3.6-plus
reasoning
1,000,000$0.325$1.95$0$0.40625
Qwen: Qwen3.7 Max
qwen/qwen3.7-max
reasoning
1,000,000$2.5$7.5$0$3.125
Qwen2.5 72B Instruct
qwen/qwen-2.5-72b-instruct
131,072$0.36$0.4$0-
Reka Edge
rekaai/reka-edge
16,384$0.1$0.1$0-
Relace: Relace Search
relace/relace-search
256,000$1$3$0-
Sao10k: Llama 3 Euryale 70B v2.1
sao10k/l3-euryale-70b
8,192$1.48$1.48$0-
Sao10K: Llama 3.1 Euryale 70B v2.2
sao10k/l3.1-euryale-70b
131,072$0.85$0.85$0-
StepFun: Step 3.5 Flash
stepfun/step-3.5-flash
reasoning
262,144$0.09$0.3$0.02-
Tencent: Hy3 preview
tencent/hy3-preview
reasoning
262,144$0.066$0.26$0.029-
TheDrummer: Rocinante 12B
thedrummer/rocinante-12b
32,768$0.17$0.43$0-
TheDrummer: UnslopNemo 12B
thedrummer/unslopnemo-12b
32,768$0.4$0.4$0-
Tongyi DeepResearch 30B A3B
alibaba/tongyi-deepresearch-30b-a3b
reasoning
131,072$0.09$0.45$0.09-
Upstage: Solar Pro 3
upstage/solar-pro-3
reasoning
128,000$0.15$0.6$0.015-
xAI: Grok 4.20
x-ai/grok-4.20
reasoning
2,000,000$1.25$2.5$0.2-
xAI: Grok 4.3
x-ai/grok-4.3
reasoning
1,000,000$1.25$2.5$0.2-
xAI: Grok Build 0.1
x-ai/grok-build-0.1
reasoning
256,000$1$2$0.2-
Xiaomi: MiMo-V2-Flash
xiaomi/mimo-v2-flash
reasoning
262,144$0.1$0.3$0.01-
Xiaomi: MiMo-V2-Omni
xiaomi/mimo-v2-omni
reasoning
262,144$0.4$2$0.08-
Xiaomi: MiMo-V2-Pro
xiaomi/mimo-v2-pro
reasoning
1,048,576$1$3$0.2-
Xiaomi: MiMo-V2.5
xiaomi/mimo-v2.5
reasoning
1,048,576$0.4$2$0.08-
Xiaomi: MiMo-V2.5-Pro
xiaomi/mimo-v2.5-pro
reasoning
1,048,576$1$3$0.2-
Z.ai: GLM 4 32B
z-ai/glm-4-32b
128,000$0.1$0.1$0-
Z.ai: GLM 4.5
z-ai/glm-4.5
reasoning
131,072$0.6$2.2$0.11-
Z.ai: GLM 4.5 Air
z-ai/glm-4.5-air
reasoning
131,072$0.13$0.85$0.025-
Z.ai: GLM 4.5 Air (free)
z-ai/glm-4.5-air:free
reasoning
131,072$0$0$0-
Z.ai: GLM 4.5V
z-ai/glm-4.5v
reasoning
65,536$0.6$1.8$0.11-
Z.ai: GLM 4.6
z-ai/glm-4.6
reasoning
202,752$0.43$1.74$0.08-
Z.ai: GLM 4.6V
z-ai/glm-4.6v
reasoning
131,072$0.3$0.9$0.05-
Z.ai: GLM 4.7
z-ai/glm-4.7
reasoning
202,752$0.4$1.75$0.08-
Z.ai: GLM 4.7 Flash
z-ai/glm-4.7-flash
reasoning
202,752$0.06$0.4$0.01-
Z.ai: GLM 5
z-ai/glm-5
reasoning
202,752$0.6$1.9$0.119-
Z.ai: GLM 5 Turbo
z-ai/glm-5-turbo
reasoning
202,752$1.2$4$0.24-
Z.ai: GLM 5.1
z-ai/glm-5.1
reasoning
202,752$0.98$3.08$0.182-
Z.ai: GLM 5V Turbo
z-ai/glm-5v-turbo
reasoning
202,752$1.2$4$0.24-
together (18 models)
DeepSeek V3
deepseek-ai/DeepSeek-V3
reasoning
131,072$1.25$1.25$0-
DeepSeek V3.1
deepseek-ai/DeepSeek-V3-1
reasoning
131,072$0.6$1.7$0-
DeepSeek V4 Pro
deepseek-ai/DeepSeek-V4-Pro
reasoning
512,000$2.1$4.4$0.2-
Gemma 4 31B Instruct
google/gemma-4-31B-it
reasoning
262,144$0.2$0.5$0-
GLM-5.1
zai-org/GLM-5.1
reasoning
202,752$1.4$4.4$0-
GPT OSS 120B
openai/gpt-oss-120b
reasoning
131,072$0.15$0.6$0-
Kimi K2.5
moonshotai/Kimi-K2.5
reasoning
262,144$0.5$2.8$0-
Kimi K2.6
moonshotai/Kimi-K2.6
reasoning
262,144$1.2$4.5$0.2-
Llama 3.3 70B
meta-llama/Llama-3.3-70B-Instruct-Turbo
131,072$0.88$0.88$0-
MiniMax-M2.5
MiniMaxAI/MiniMax-M2.5
reasoning
204,800$0.3$1.2$0.06-
MiniMax-M2.7
MiniMaxAI/MiniMax-M2.7
reasoning
202,752$0.3$1.2$0.06-
Qwen3 235B A22B Instruct 2507 FP8
Qwen/Qwen3-235B-A22B-Instruct-2507-tput
reasoning
262,144$0.2$0.6$0-
Qwen3 Coder 480B A35B Instruct
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
262,144$2$2$0-
Qwen3 Coder Next FP8
Qwen/Qwen3-Coder-Next-FP8
reasoning
262,144$0.5$1.2$0-
Qwen3.5 397B A17B
Qwen/Qwen3.5-397B-A17B
reasoning
262,144$0.6$3.6$0-
Qwen3.6 Plus
Qwen/Qwen3.6-Plus
reasoning
1,000,000$0.5$3$0-
Qwen3.7 Max
Qwen/Qwen3.7-Max
reasoning
1,000,000$2.5$7.5$0-
Rnj-1 Instruct
essentialai/Rnj-1-Instruct
32,768$0.15$0.15$0-
vercel-ai-gateway (159 models)
Claude 3 Haiku
anthropic/claude-3-haiku
200,000$0.25$1.25$0.03$0.3
Claude 3.5 Haiku
anthropic/claude-3.5-haiku
200,000$0.8$4$0.08$1
Claude Haiku 4.5
anthropic/claude-haiku-4.5
reasoning
200,000$1$5$0.1$1.25
Claude Opus 4
anthropic/claude-opus-4
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.1
anthropic/claude-opus-4.1
reasoning
200,000$15$75$1.5$18.75
Claude Opus 4.5
anthropic/claude-opus-4.5
reasoning
200,000$5$25$0.5$6.25
Claude Opus 4.6
anthropic/claude-opus-4.6
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.7
anthropic/claude-opus-4.7
reasoning
1,000,000$5$25$0.5$6.25
Claude Opus 4.8
anthropic/claude-opus-4.8
reasoning
1,000,000$5$25$0.5$6.25
Claude Sonnet 4
anthropic/claude-sonnet-4
reasoning
1,000,000$3$15$0.3$3.75
Claude Sonnet 4.5
anthropic/claude-sonnet-4.5
reasoning
1,000,000$3$15$0.3$3.75
Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
reasoning
1,000,000$3$15$0.3$3.75
Command A
cohere/command-a
256,000$2.5$10$0-
DeepSeek V3 0324
deepseek/deepseek-v3
163,840$0.77$0.77$0-
DeepSeek V3.1 Terminus
deepseek/deepseek-v3.1-terminus
reasoning
131,072$0.27$1$0.135-
DeepSeek V3.2
deepseek/deepseek-v3.2
128,000$0.28$0.42$0.028-
DeepSeek V3.2 Thinking
deepseek/deepseek-v3.2-thinking
128,000$0.62$1.85$0-
DeepSeek V4 Flash
deepseek/deepseek-v4-flash
reasoning
1,000,000$0.14$0.28$0.0028-
DeepSeek V4 Pro
deepseek/deepseek-v4-pro
reasoning
1,000,000$0.435$0.87$0.0036-
DeepSeek-R1
deepseek/deepseek-r1
reasoning
128,000$1.35$5.4$0-
DeepSeek-V3.1
deepseek/deepseek-v3.1
reasoning
163,840$0.56$1.68$0.28-
Devstral 2
mistral/devstral-2
256,000$0.4$2$0-
Devstral Small 1.1
mistral/devstral-small
128,000$0.1$0.3$0-
Devstral Small 2
mistral/devstral-small-2
256,000$0.1$0.3$0-
Gemini 2.0 Flash
google/gemini-2.0-flash
1,048,576$0.15$0.6$0.025-
Gemini 2.0 Flash Lite
google/gemini-2.0-flash-lite
1,048,576$0.075$0.3$0.02-
Gemini 2.5 Flash
google/gemini-2.5-flash
reasoning
1,000,000$0.3$2.5$0.03-
Gemini 2.5 Flash Lite
google/gemini-2.5-flash-lite
reasoning
1,048,576$0.1$0.4$0.01-
Gemini 2.5 Pro
google/gemini-2.5-pro
reasoning
1,048,576$1.25$10$0.125-
Gemini 3 Flash
google/gemini-3-flash
reasoning
1,000,000$0.5$3$0.05-
Gemini 3 Pro Preview
google/gemini-3-pro-preview
reasoning
1,000,000$2$12$0.2-
Gemini 3.1 Flash Lite
google/gemini-3.1-flash-lite
reasoning
1,000,000$0.25$1.5$0.03-
Gemini 3.1 Flash Lite Preview
google/gemini-3.1-flash-lite-preview
reasoning
1,000,000$0.25$1.5$0.03-
Gemini 3.1 Pro Preview
google/gemini-3.1-pro-preview
reasoning
1,000,000$2$12$0.2-
Gemini 3.5 Flash
google/gemini-3.5-flash
reasoning
1,000,000$1.5$9$0.15-
Gemma 4 26B A4B IT
google/gemma-4-26b-a4b-it
262,144$0.13$0.4$0-
Gemma 4 31B IT
google/gemma-4-31b-it
262,144$0.14$0.4$0-
GLM 4.5 Air
zai/glm-4.5-air
reasoning
128,000$0.2$1.1$0.03-
GLM 4.5V
zai/glm-4.5v
66,000$0.6$1.8$0.11-
GLM 4.6
zai/glm-4.6
reasoning
200,000$0.6$2.2$0.11-
GLM 4.7
zai/glm-4.7
reasoning
131,000$2.25$2.75$2.25-
GLM 4.7 Flash
zai/glm-4.7-flash
reasoning
200,000$0.07$0.4$0-
GLM 4.7 FlashX
zai/glm-4.7-flashx
reasoning
200,000$0.06$0.4$0.01-
GLM 5
zai/glm-5
reasoning
202,800$1$3.2$0.2-
GLM 5 Turbo
zai/glm-5-turbo
reasoning
202,800$1.2$4$0.24-
GLM 5.1
zai/glm-5.1
reasoning
202,800$1.4$4.4$0.26-
GLM 5V Turbo
zai/glm-5v-turbo
reasoning
200,000$1.2$4$0.24-
GLM-4.5
zai/glm-4.5
reasoning
128,000$0.6$2.2$0.11-
GLM-4.6V
zai/glm-4.6v
reasoning
128,000$0.3$0.9$0.05-
GLM-4.6V-Flash
zai/glm-4.6v-flash
reasoning
128,000$0$0$0-
GPT 5 Chat
openai/gpt-5-chat
reasoning
128,000$1.25$10$0.125-
GPT 5.1 Codex Max
openai/gpt-5.1-codex-max
reasoning
400,000$1.25$10$0.125-
GPT 5.1 Codex Mini
openai/gpt-5.1-codex-mini
reasoning
400,000$0.25$2$0.025-
GPT 5.1 Thinking
openai/gpt-5.1-thinking
reasoning
400,000$1.25$10$0.125-
GPT 5.2
openai/gpt-5.2
reasoning
400,000$1.75$14$0.175-
GPT 5.2
openai/gpt-5.2-pro
reasoning
400,000$21$168$0-
GPT 5.2 Chat
openai/gpt-5.2-chat
reasoning
128,000$1.75$14$0.175-
GPT 5.2 Codex
openai/gpt-5.2-codex
reasoning
400,000$1.75$14$0.175-
GPT 5.3 Codex
openai/gpt-5.3-codex
reasoning
400,000$1.75$14$0.175-
GPT 5.4
openai/gpt-5.4
reasoning
1,050,000$2.5$15$0.25-
GPT 5.4 Mini
openai/gpt-5.4-mini
reasoning
400,000$0.75$4.5$0.075-
GPT 5.4 Nano
openai/gpt-5.4-nano
reasoning
400,000$0.2$1.25$0.02-
GPT 5.4 Pro
openai/gpt-5.4-pro
reasoning
1,050,000$30$180$0-
GPT 5.5
openai/gpt-5.5
reasoning
1,000,000$5$30$0.5-
GPT 5.5 Pro
openai/gpt-5.5-pro
reasoning
1,000,000$30$180$0-
GPT OSS 20B
openai/gpt-oss-20b
reasoning
131,072$0.05$0.2$0-
GPT OSS Safeguard 20B
openai/gpt-oss-safeguard-20b
reasoning
131,072$0.075$0.3$0.037-
GPT-4 Turbo
openai/gpt-4-turbo
128,000$10$30$0-
GPT-4.1
openai/gpt-4.1
1,047,576$2$8$0.5-
GPT-4.1 mini
openai/gpt-4.1-mini
1,047,576$0.4$1.6$0.1-
GPT-4.1 nano
openai/gpt-4.1-nano
1,047,576$0.1$0.4$0.025-
GPT-4o
openai/gpt-4o
128,000$2.5$10$1.25-
GPT-4o mini
openai/gpt-4o-mini
128,000$0.15$0.6$0.075-
GPT-5
openai/gpt-5
reasoning
400,000$1.25$10$0.125-
GPT-5 mini
openai/gpt-5-mini
reasoning
400,000$0.25$2$0.025-
GPT-5 nano
openai/gpt-5-nano
reasoning
400,000$0.05$0.4$0.005-
GPT-5 pro
openai/gpt-5-pro
reasoning
400,000$15$120$0-
GPT-5-Codex
openai/gpt-5-codex
reasoning
400,000$1.25$10$0.125-
GPT-5.1 Instant
openai/gpt-5.1-instant
reasoning
128,000$1.25$10$0.125-
GPT-5.1-Codex
openai/gpt-5.1-codex
reasoning
400,000$1.25$10$0.125-
GPT-5.3 Chat
openai/gpt-5.3-chat
reasoning
128,000$1.75$14$0.175-
Grok 4.1 Fast Non-Reasoning
xai/grok-4.1-fast-non-reasoning
1,000,000$0.2$0.5$0.05-
Grok 4.1 Fast Reasoning
xai/grok-4.1-fast-reasoning
reasoning
1,000,000$0.2$0.5$0.05-
Grok 4.20 Beta Non-Reasoning
xai/grok-4.20-non-reasoning-beta
2,000,000$1.25$2.5$0.2-
Grok 4.20 Beta Reasoning
xai/grok-4.20-reasoning-beta
reasoning
2,000,000$1.25$2.5$0.2-
Grok 4.20 Multi Agent Beta
xai/grok-4.20-multi-agent-beta
reasoning
2,000,000$1.25$2.5$0.2-
Grok 4.20 Multi-Agent
xai/grok-4.20-multi-agent
reasoning
2,000,000$1.25$2.5$0.2-
Grok 4.20 Non-Reasoning
xai/grok-4.20-non-reasoning
2,000,000$1.25$2.5$0.2-
Grok 4.20 Reasoning
xai/grok-4.20-reasoning
reasoning
2,000,000$1.25$2.5$0.2-
Grok 4.3
xai/grok-4.3
reasoning
1,000,000$1.25$2.5$0.2-
Grok Build 0.1
xai/grok-build-0.1
reasoning
256,000$1$2$0.2-
Kat Coder Pro V2
kwaipilot/kat-coder-pro-v2
reasoning
256,000$0.3$1.2$0.06-
Kimi K2 Instruct
moonshotai/kimi-k2
131,072$0.57$2.3$0-
Kimi K2 Thinking
moonshotai/kimi-k2-thinking
reasoning
262,114$0.6$2.5$0.15-
Kimi K2 Thinking Turbo
moonshotai/kimi-k2-thinking-turbo
reasoning
262,114$1.15$8$0.15-
Kimi K2 Turbo
moonshotai/kimi-k2-turbo
256,000$1.15$8$0.15-
Kimi K2.5
moonshotai/kimi-k2.5
reasoning
262,114$0.6$3$0.1-
Kimi K2.6
moonshotai/kimi-k2.6
reasoning
262,000$0.95$4$0.16-
Llama 3.1 70B Instruct
meta/llama-3.1-70b
128,000$0.72$0.72$0-
Llama 3.1 8B Instruct
meta/llama-3.1-8b
128,000$0.22$0.22$0-
Llama 3.2 11B Vision Instruct
meta/llama-3.2-11b
128,000$0.16$0.16$0-
Llama 3.2 90B Vision Instruct
meta/llama-3.2-90b
128,000$0.72$0.72$0-
Llama 3.3 70B Instruct
meta/llama-3.3-70b
128,000$0.72$0.72$0-
Llama 4 Maverick 17B Instruct
meta/llama-4-maverick
128,000$0.24$0.97$0-
Llama 4 Scout 17B Instruct
meta/llama-4-scout
128,000$0.17$0.66$0-
LongCat Flash Chat
meituan/longcat-flash-chat
128,000$0$0$0-
Mercury 2
inception/mercury-2
reasoning
128,000$0.25$0.75$0.025-
Mercury Coder Small Beta
inception/mercury-coder-small
32,000$0.25$1$0-
MiMo M2.5
xiaomi/mimo-v2.5
reasoning
1,050,000$0.4$2$0.08-
MiMo V2 Flash
xiaomi/mimo-v2-flash
reasoning
262,144$0.1$0.3$0.01-
MiMo V2 Pro
xiaomi/mimo-v2-pro
reasoning
1,000,000$1$3$0.2-
MiMo V2.5 Pro
xiaomi/mimo-v2.5-pro
reasoning
1,050,000$1$3$0.2-
MiniMax M2
minimax/minimax-m2
reasoning
205,000$0.3$1.2$0.03$0.375
MiniMax M2.1
minimax/minimax-m2.1
reasoning
204,800$0.3$1.2$0.03$0.375
MiniMax M2.1 Lightning
minimax/minimax-m2.1-lightning
reasoning
204,800$0.3$2.4$0.03$0.375
MiniMax M2.5
minimax/minimax-m2.5
reasoning
204,800$0.3$1.2$0.03$0.375
MiniMax M2.5 High Speed
minimax/minimax-m2.5-highspeed
reasoning
204,800$0.6$2.4$0.03$0.375
MiniMax M2.7
minimax/minimax-m2.7
reasoning
204,800$0.3$1.2$0.06$0.375
MiniMax M2.7 High Speed
minimax/minimax-m2.7-highspeed
reasoning
204,800$0.6$2.4$0.06$0.375
Ministral 3B
mistral/ministral-3b
128,000$0.1$0.1$0-
Ministral 8B
mistral/ministral-8b
128,000$0.15$0.15$0-
Mistral Codestral
mistral/codestral
128,000$0.3$0.9$0-
Mistral Medium 3.1
mistral/mistral-medium
128,000$0.4$2$0-
Mistral Medium Latest
mistral/mistral-medium-3.5
reasoning
256,000$1.5$7.5$0-
Mistral Small
mistral/mistral-small
32,000$0.1$0.3$0-
Nvidia Nemotron Nano 12B V2 VL
nvidia/nemotron-nano-12b-v2-vl
reasoning
131,072$0.2$0.6$0-
Nvidia Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2
reasoning
131,072$0.06$0.23$0-
o1
openai/o1
reasoning
200,000$15$60$7.5-
o3
openai/o3
reasoning
200,000$2$8$0.5-
o3 Pro
openai/o3-pro
reasoning
200,000$20$80$0-
o3-deep-research
openai/o3-deep-research
reasoning
200,000$10$40$2.5-
o3-mini
openai/o3-mini
reasoning
200,000$1.1$4.4$0.55-
o4-mini
openai/o4-mini
reasoning
200,000$1.1$4.4$0.275-
Pixtral 12B 2409
mistral/pixtral-12b
128,000$0.15$0.15$0-
Pixtral Large
mistral/pixtral-large
128,000$2$6$0-
Qwen 3 32B
alibaba/qwen-3-32b
reasoning
128,000$0.16$0.64$0-
Qwen 3 Coder 30B A3B Instruct
alibaba/qwen3-coder-30b-a3b
reasoning
262,144$0.15$0.6$0-
Qwen 3 Max Thinking
alibaba/qwen3-max-thinking
reasoning
256,000$1.2$6$0.24-
Qwen 3.5 Flash
alibaba/qwen3.5-flash
reasoning
1,000,000$0.1$0.4$0.001$0.125
Qwen 3.5 Plus
alibaba/qwen3.5-plus
reasoning
1,000,000$0.4$2.4$0.04$0.5
Qwen 3.6 27B
alibaba/qwen3.6-27b
reasoning
256,000$0.6$3.6$0-
Qwen 3.6 Max Preview
alibaba/qwen-3.6-max-preview
reasoning
240,000$1.3$7.8$0.26$1.625
Qwen 3.6 Plus
alibaba/qwen3.6-plus
reasoning
1,000,000$0.5$3$0.1$0.625
Qwen 3.7 Max
alibaba/qwen3.7-max
reasoning
991,000$1.25$3.75$0.25$1.5625
Qwen3 235B A22b Instruct 2507
alibaba/qwen-3-235b
131,000$0.6$1.2$0.6-
Qwen3 Coder 480B A35B Instruct
alibaba/qwen3-coder
262,144$1.5$7.5$0.3-
Qwen3 Coder Next
alibaba/qwen3-coder-next
256,000$0.5$1.2$0-
Qwen3 Coder Plus
alibaba/qwen3-coder-plus
1,000,000$1$5$0.2-
Qwen3 Max
alibaba/qwen3-max
262,144$1.2$6$0.24-
Qwen3 Max Preview
alibaba/qwen3-max-preview
262,144$1.2$6$0.24-
Qwen3 VL 235B A22B Thinking
alibaba/qwen3-235b-a22b-thinking
reasoning
131,072$0.4$4$0-
Qwen3 VL 235B A22B Thinking
alibaba/qwen3-vl-thinking
reasoning
131,072$0.4$4$0-
Qwen3-14B
alibaba/qwen-3-14b
reasoning
40,960$0.12$0.24$0-
Qwen3-30B-A3B
alibaba/qwen-3-30b
reasoning
40,960$0.08$0.29$0-
Seed 1.6
bytedance/seed-1.6
reasoning
256,000$0.25$2$0.05-
Sonar
perplexity/sonar
127,000$0$0$0-
Sonar Pro
perplexity/sonar-pro
200,000$0$0$0-
Trinity Large Preview
arcee-ai/trinity-large-preview
131,000$0.25$1$0-
Trinity Large Thinking
arcee-ai/trinity-large-thinking
reasoning
262,100$0.25$0.9$0-
xai (7 models)
Grok 3
grok-3
131,072$3$15$0.75-
Grok 3 Fast
grok-3-fast
131,072$5$25$1.25-
Grok 4.20 (Non-Reasoning)
grok-4.20-0309-non-reasoning
2,000,000$1.25$2.5$0.2-
Grok 4.20 (Reasoning)
grok-4.20-0309-reasoning
reasoning
2,000,000$1.25$2.5$0.2-
Grok 4.3
grok-4.3
reasoning
1,000,000$1.25$2.5$0.2-
Grok Build 0.1
grok-build-0.1
reasoning
256,000$1$2$0.2-
Grok Code Fast 1
grok-code-fast-1
32,768$0.2$1.5$0.02-
xiaomi (5 models)
MiMo-V2-Flash
mimo-v2-flash
reasoning
262,144$0.1$0.3$0.01-
MiMo-V2-Omni
mimo-v2-omni
reasoning
262,144$0.4$2$0.08-
MiMo-V2-Pro
mimo-v2-pro
reasoning
1,048,576$1$3$0.2-
MiMo-V2.5
mimo-v2.5
reasoning
1,048,576$0.4$2$0.08-
MiMo-V2.5-Pro
mimo-v2.5-pro
reasoning
1,048,576$1$3$0.2-
xiaomi-token-plan-ams (5 models)
MiMo-V2-Flash
mimo-v2-flash
reasoning
262,144$0.1$0.3$0.01-
MiMo-V2-Omni
mimo-v2-omni
reasoning
262,144$0.4$2$0.08-
MiMo-V2-Pro
mimo-v2-pro
reasoning
1,048,576$1$3$0.2-
MiMo-V2.5
mimo-v2.5
reasoning
1,048,576$0.4$2$0.08-
MiMo-V2.5-Pro
mimo-v2.5-pro
reasoning
1,048,576$1$3$0.2-
xiaomi-token-plan-cn (5 models)
MiMo-V2-Flash
mimo-v2-flash
reasoning
262,144$0.1$0.3$0.01-
MiMo-V2-Omni
mimo-v2-omni
reasoning
262,144$0.4$2$0.08-
MiMo-V2-Pro
mimo-v2-pro
reasoning
1,048,576$1$3$0.2-
MiMo-V2.5
mimo-v2.5
reasoning
1,048,576$0.4$2$0.08-
MiMo-V2.5-Pro
mimo-v2.5-pro
reasoning
1,048,576$1$3$0.2-
xiaomi-token-plan-sgp (5 models)
MiMo-V2-Flash
mimo-v2-flash
reasoning
262,144$0.1$0.3$0.01-
MiMo-V2-Omni
mimo-v2-omni
reasoning
262,144$0.4$2$0.08-
MiMo-V2-Pro
mimo-v2-pro
reasoning
1,048,576$1$3$0.2-
MiMo-V2.5
mimo-v2.5
reasoning
1,048,576$0.4$2$0.08-
MiMo-V2.5-Pro
mimo-v2.5-pro
reasoning
1,048,576$1$3$0.2-
zai (5 models)
GLM-4.5-Air
glm-4.5-air
reasoning
131,072$0$0$0-
GLM-4.7
glm-4.7
reasoning
204,800$0$0$0-
GLM-5-Turbo
glm-5-turbo
reasoning
200,000$0$0$0-
GLM-5.1
glm-5.1
reasoning
200,000$0$0$0-
GLM-5V-Turbo
glm-5v-turbo
reasoning
200,000$0$0$0-