Rankings

What the world is actually running.

Aggregate token volume routed through AIgateway over the last day, week, and month. Pick a model the market trusts — or pick the underdog it hasn't noticed yet.

40 models · updated hourly

Top models

#ModelProviderModalityTokensp50 latencyΔ
1Claude Opus 4.7
anthropic/claude-opus-4.7
Anthropictext4.92B1299ms-37%
2Gemini 3.1 Pro
google/gemini-3.1-pro
Googletext2.50B1111ms+55%
3GPT-5.4 Mini
openai/gpt-5.4-mini
OpenAItext2.49B1374ms-16%
4Claude Sonnet 4.6
anthropic/claude-sonnet-4.6
Anthropictext2.46B1035ms+0%
5Kimi K2.6
moonshot/kimi-k2.6
Moonshottext2.34B1035ms-47%
6Claude Haiku 4.5
anthropic/claude-haiku-4.5
Anthropictext2.25B235ms+46%
7Imagen 4
google/imagen-4
Googleimage2.13B1349ms+62%
8GPT-5.4
openai/gpt-5.4
OpenAItext2.05B828ms-7%
9Gpt-Oss-120b
openai/gpt-oss-120b
OpenAItext1.42B1026ms-25%
10Llama-4-Scout-17b-16e-Instruct
meta/llama-4-scout-17b-16e-instruct
Metatext1.28B674ms+73%
11Qwen2.5-Coder-32b-Instruct
qwen/qwen2.5-coder-32b-instruct
Alibaba Qwentext1.19B829ms-9%
12Veo 3.1
google/veo-3.1
Googlevideo1.18B985ms-26%
13Sonar Deep Research
perplexity/sonar-deep-research
Perplexitytext1.08B1209ms+39%
14Resnet-50
microsoft/resnet-50
Microsoftclassification1.07B201ms-43%
15ResNet-50
microsoft/resnet-50
Microsoftimage1.07B201ms-43%
16Gemma-3-12b-IT
google/gemma-3-12b-it
Googletext1.06B198ms+37%
17Nano Banana Pro
google/nano-banana-pro
Googleimage1.05B1379ms+47%
18Glm-4.7-Flash
zai-org/glm-4.7-flash
Zhipu AItext1.04B1038ms+16%
19Whisper
openai/whisper
OpenAIaudio-stt1.03B1021ms+63%
20Bge-M3
baai/bge-m3
BAAIembedding1.02B922ms-34%
21Tinyllama-1.1b-Chat-V1.0
tinyllama/tinyllama-1.1b-chat-v1.0
TinyLlamatext1.01B1133ms+42%
22Stable-Diffusion-V1-5-Img2img
runwayml/stable-diffusion-v1-5-img2img
Runwayimage1.01B400ms-2%
23Uform-Gen2-Qwen-500m
unum/uform-gen2-qwen-500m
Unumvision1.00B328ms-51%
24Stable-Diffusion-XL-Base-1.0
stabilityai/stable-diffusion-xl-base-1.0
Stability AIimage1.00B190ms-22%
25Veo 3 Fast
google/veo-3-fast
Googlevideo998.6M389ms-3%
26Granite-4.0-H-Micro
ibm-granite/granite-4.0-h-micro
IBMtext997.7M187ms-3%
27Flux-1-Schnell
black-forest-labs/flux-1-schnell
Black Forest Labsimage993.3M182ms-34%
28Mistral Small 4
mistral/mistral-small-4-0-26-03
Mistraltext992.0M181ms-34%
29Qwen 3 Max
alibaba/qwen3-max
Alibabatext991.4M1183ms+9%
30GPT-5.4 Nano
openai/gpt-5.4-nano
OpenAItext973.7M963ms+56%
31Pixverse V5.6
pixverse/v5.6
PixVersevideo967.7M1153ms-26%
32Llama-3.1-8b-Instruct-Fp8
meta/llama-3.1-8b-instruct-fp8
Metatext950.1M1066ms+23%
33Claude Sonnet 4
anthropic/claude-sonnet-4
Anthropictext945.9M1333ms-10%
34TTS-1
openai/tts-1
OpenAIaudio-tts939.4M193ms+51%
35Deepseek-Coder-6.7b-Base-Awq
hf/thebloke/deepseek-coder-6.7b-base-awq
Hugging Facetext932.3M1247ms+2%
36Meta-Llama-3-8b-Instruct
hf/meta-llama/meta-llama-3-8b-instruct
Hugging Facetext928.2M1180ms-12%
37Llama-3.1-8b-Instruct
meta/llama-3.1-8b-instruct
Metatext925.0M909ms-31%
38o4-mini
openai/o4-mini
openaitext924.5M975ms+50%
39Vidu Q3 Turbo
vidu/q3-turbo
Viduvideo921.9M236ms+78%
40Mistral-7b-Instruct-V0.2
hf/mistral/mistral-7b-instruct-v0.2
Hugging Facetext921.4M1306ms+49%

By provider

ProviderModelsTokens
Google1615.46B
OpenAI1312.61B
Anthropic611.41B
Meta219.92B
Hugging Face157.69B
Mistral85.48B
Alibaba Qwen85.09B
Recraft42.89B
Moonshot22.73B
BAAI62.52B
Alibaba32.40B
MiniMax52.35B
How this is built

Counted at the request, not scraped.

Every request through the gateway increments a counter keyed on the routed model. We anonymize before aggregation, publish daily, and never expose a single customer's workload. If you want your app attributed on the leaderboard, opt in from Dashboard → Settings.