Available Models

Access the best AI models through a single, unified API. All models are OpenAI-compatible.

GPT-5.4

Available

OpenAI 2026 latest flagship, 1M context, all-around champion

openai/gpt-5.4

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$2.50

Output

$15.00

GPT-5.4-Pro

Available

GPT-5.4 Pro version, ultimate reasoning and code generation

openai/gpt-5.4-pro

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$30.00

Output

$180.00

GPT-5.4-Mini

Available

GPT-5.4 lightweight version, fast and cost-effective

openai/gpt-5.4-mini

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$0.75

Output

$4.50

GPT-5.4-Nano

Available

GPT-5.4 ultra lightweight, extremely low latency and cost

openai/gpt-5.4-nano

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$0.20

Output

$1.25

GPT-5

Available

OpenAI GPT-5 flagship, breakthrough capabilities

openai/gpt-5

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$1.25

Output

$10.00

GPT-5.2

Available

GPT-5 enhanced version, stronger reasoning capabilities

openai/gpt-5.2

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$1.75

Output

$14.00

GPT-4.1

Available

OpenAI 2025 flagship, strongest overall capabilities

openai/gpt-4.1

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$2.00

Output

$8.00

GPT-4o-mini

Available

OpenAI lightweight model, cost-effective

openai/gpt-4o-mini

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.15

Output

$0.60

GPT-OSS-120B

Available

OpenAI open source model, 120B parameters

openai/gpt-oss-120b

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.05

Output

$0.25

Claude Sonnet 4.6

Available

Anthropic 2026 latest flagship, coding and reasoning fully upgraded

anthropic/claude-sonnet-4-6

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$3.00

Output

$15.00

Claude Opus 4.6

Available

Anthropic 2026 most powerful flagship, autonomous agent task completion

anthropic/claude-opus-4-6

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$5.00

Output

$25.00

Claude Haiku 4.5

Available

Claude lightweight version, fast response, most cost-effective

anthropic/claude-haiku-4-5

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$1.00

Output

$5.00

Claude Sonnet 4.5

Available

Claude main model, balancing capability and cost

anthropic/claude-sonnet-4-5

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$3.00

Output

$15.00

Claude Opus 4.5

Available

Claude flagship model, complex task handling

anthropic/claude-opus-4-5

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$5.00

Output

$25.00

Gemini 3.1 Pro

Available

Google 2026 most powerful flagship, code and reasoning breakthrough

google/gemini-3.1-pro-preview

Context: 2,000,000 tokens

Pricing (per 1M tokens)

Input

$2.00

Output

$12.00

Gemini 3.1 Flash Lite

Available

Gemini 3.1 lightweight, optimized for speed and cost

google/gemini-3.1-flash-lite-preview

Context: 2,000,000 tokens

Pricing (per 1M tokens)

Input

$0.25

Output

$1.50

Gemini 3.1 Flash Image

Available

Gemini 3.1 image generation, text-to-image capability

google/gemini-3.1-flash-image-preview

Context: 2,000,000 tokens

Pricing (per 1M tokens)

Input

$0.15

Output

$30.00

Gemini 3 Pro

Available

Google Gemini 3 flagship model

google/gemini-3-pro-preview

Context: 2,000,000 tokens

Pricing (per 1M tokens)

Input

$2.00

Output

$12.00

Gemini 3 Flash

Available

Gemini 3 lightweight, fast and efficient

google/gemini-3-flash-preview

Context: 2,000,000 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$2.50

Gemini 2.5 Pro

Available

Google 2025 flagship, strong code and reasoning

google/gemini-2.5-pro

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$1.25

Output

$10.00

Gemini 2.5 Flash

Available

Gemini 2.5 lightweight, optimized for speed and cost

google/gemini-2.5-flash

Context: 1,048,576 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$2.50

Grok 4.1 Reasoning

Available

xAI latest reasoning model, fast deep thinking

xai/grok-4.1-fast-reasoning

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.20

Output

$0.50

Grok 4.1 Fast

Available

xAI fast model, ultra-fast response

xai/grok-4-1-fast-non-reasoning

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.20

Output

$0.50

Grok Code Fast

Available

xAI code-focused model, strong programming capabilities

xai/grok-code-fast-1

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.20

Output

$1.50

GLM-4

Available

Zhipu flagship model, strong Chinese capabilities

zhipu/glm-4

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.07

Output

$0.07

GLM-4-Flash

Available

Zhipu lightweight model, cost-effective

zhipu/glm-4-flash

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.001

Output

$0.001

Moonshot-v1-8K

Available

Moonshot lightweight model, 8K context

moonshot/moonshot-v1-8k

Context: 8,000 tokens

Pricing (per 1M tokens)

Input

$0.012

Output

$0.012

Moonshot-v1-32K

Available

Moonshot medium model, 32K context

moonshot/moonshot-v1-32k

Context: 32,000 tokens

Pricing (per 1M tokens)

Input

$0.024

Output

$0.024

Moonshot-v1-128K

Available

Moonshot flagship model, 128K ultra-long context

moonshot/moonshot-v1-128k

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.06

Output

$0.06

DeepSeek-V3

Available

DeepSeek latest generation general chat model, extremely cost-effective

deepseek/deepseek-chat

Context: 64,000 tokens

Pricing (per 1M tokens)

Input

$0.27

Output

$1.10

DeepSeek-R1

Available

Deep reasoning model, excels at complex logic and math reasoning

deepseek/deepseek-reasoner

Context: 64,000 tokens

Pricing (per 1M tokens)

Input

$0.55

Output

$2.19

Qwen-Max

Available

Qwen flagship model, strongest overall capabilities

alibaba/qwen-max

Context: 32,000 tokens

Pricing (per 1M tokens)

Input

$0.35

Output

$1.39

Qwen-Plus

Available

Enhanced model, balancing capability and cost

alibaba/qwen-plus

Context: 131,072 tokens

Pricing (per 1M tokens)

Input

$0.11

Output

$0.28

Qwen-Turbo

Available

Fast response model, suitable for high-concurrency scenarios

alibaba/qwen-turbo

Context: 131,072 tokens

Pricing (per 1M tokens)

Input

$0.04

Output

$0.08

QwQ-Plus

Available

Qwen reasoning model, deep thinking and logical reasoning

alibaba/qwq-plus

Context: 131,072 tokens

Pricing (per 1M tokens)

Input

$0.22

Output

$0.56

Qwen-Long

Available

Long text specialized model, supports ultra-long context

alibaba/qwen-long

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$0.07

Output

$0.28

GLM-5-Turbo

Available

Zhipu flagship model, fastest GLM-5 variant with 200K context

zhipu/glm-5-turbo

Context: 200,000 tokens

Pricing (per 1M tokens)

Input

$1.20

Output

$4.00

GLM-5

Available

Zhipu GLM-5 flagship, advanced reasoning and generation

zhipu/glm-5

Context: 203,000 tokens

Pricing (per 1M tokens)

Input

$1.00

Output

$3.20

GLM-4.7

Available

Zhipu enhanced model with 200K context window

zhipu/glm-4.7

Context: 200,000 tokens

Pricing (per 1M tokens)

Input

$0.60

Output

$2.20

GLM-4.6

Available

Zhipu balanced model, strong Chinese language capabilities

zhipu/glm-4.6

Context: 205,000 tokens

Pricing (per 1M tokens)

Input

$0.60

Output

$2.20

GLM-4.5-Air

Available

Zhipu lightweight model, highly cost-effective

zhipu/glm-4.5-air

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.13

Output

$0.85

MiniMax-M2.7

Available

MiniMax latest flagship with 205K context, advanced reasoning

minimax/minimax-m2.7

Context: 205,000 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$1.20

MiniMax-M2.5

Available

MiniMax advanced model with 205K context window

minimax/minimax-m2.5

Context: 205,000 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$1.20

MiniMax-M2.1

Available

MiniMax enhanced model, strong multilingual support

minimax/minimax-m2.1

Context: 205,000 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$1.20

MiniMax-M2

Available

MiniMax base model, reliable and efficient

minimax/minimax-m2

Context: 200,000 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$1.20

MiMo-V2-Pro

Available

Xiaomi flagship AI model with 1M context window

xiaomi/mimo-v2-pro

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$1.00

Output

$3.00

MiMo-V2-Omni

Available

Xiaomi multimodal AI model with 262K context

xiaomi/mimo-v2-omni

Context: 262,000 tokens

Pricing (per 1M tokens)

Input

$0.40

Output

$2.00

Qwen3.5-397B

Available

Qwen3.5 largest model, 397B parameters with MoE architecture

alibaba/qwen3.5-397b-a17b

Context: 262,000 tokens

Pricing (per 1M tokens)

Input

$0.60

Output

$3.60

Qwen3-235B-Instruct

Available

Qwen3 instruction-tuned model with 1M context window

alibaba/qwen3-235b-a22b-instruct

Context: 1,000,000 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$1.50

Qwen3-Coder-480B

Available

Qwen3 code-specialized model, strongest coding capabilities

alibaba/qwen3-coder-480b-a35b

Context: 262,000 tokens

Pricing (per 1M tokens)

Input

$0.33

Output

$1.65

Kimi-K2.5

Available

Moonshot Kimi latest flagship with 262K context

kimi/kimi-k2.5

Context: 262,000 tokens

Pricing (per 1M tokens)

Input

$0.66

Output

$3.30

Kimi-K2-Thinking

Available

Kimi deep reasoning model with chain-of-thought

kimi/kimi-k2-thinking

Context: 256,000 tokens

Pricing (per 1M tokens)

Input

$0.60

Output

$2.50

Kimi-K2

Available

Kimi balanced model, strong general capabilities

kimi/kimi-k2

Context: 256,000 tokens

Pricing (per 1M tokens)

Input

$0.60

Output

$2.50

DeepSeek-R1-0528

Available

DeepSeek latest reasoning model, advanced problem solving

deepseek/deepseek-r1-0528

Context: 164,000 tokens

Pricing (per 1M tokens)

Input

$0.70

Output

$2.50

DeepSeek-V3.1

Available

DeepSeek V3.1 enhanced model, improved capabilities

deepseek/deepseek-v3.1

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.27

Output

$1.00

DeepSeek-V3.2

Available

DeepSeek V3.2 latest model, most cost-effective

deepseek/deepseek-v3.2

Context: 128,000 tokens

Pricing (per 1M tokens)

Input

$0.269

Output

$0.40

Nano-Banana-2

Available

Google experimental model, optimized for image generation

google/nano-banana-2

Context: 131,000 tokens

Pricing (per 1M tokens)

Input

$0.15

Output

$30.00

Nano-Banana-Pro

Available

Google Nano Banana Pro, advanced multimodal capabilities

google/nano-banana-pro

Context: 66,000 tokens

Pricing (per 1M tokens)

Input

$2.00

Output

$12.00

Nano-Banana

Available

Google Nano Banana base model, efficient and versatile

google/nano-banana

Context: 33,000 tokens

Pricing (per 1M tokens)

Input

$0.30

Output

$2.50

文心一言 ERNIE-4.0

Coming Soon

百度最强大模型,多模态理解与生成

Provider: 百度

GLM-4-Plus

Coming Soon

清华智谱旗舰模型,全面能力提升

Provider: 智谱AI

Moonshot-v1

Coming Soon

Kimi超长上下文模型,支持200K

Provider: 月之暗面