Real aggregate usage across the open model ecosystem — token volume, requests, and market share, refreshed regularly. Pick a model the market trusts — or pick the underdog it hasn't noticed yet.
| # | Model | Provider | Modality | Tokens | Requests | Share |
|---|---|---|---|---|---|---|
| 1 | Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | Anthropic | text | 1.99T | 59.3M | 16.5% |
| 2 | Claude Opus 4.7 anthropic/claude-opus-4.7 | Anthropic | text | 1.54T | 26.7M | 12.8% |
| 3 | Gemini 3 Flash google/gemini-3-flash | text | 1.13T | 138.3M | 9.4% | |
| 4 | Claude Opus 4.8 anthropic/claude-opus-4.8 | Anthropic | text | 1.09T | 22.4M | 9.1% |
| 5 | Gemini 2.5 Flash google/gemini-2.5-flash | text | 683.9B | 172.3M | 5.7% | |
| 6 | Gpt-Oss-120b openai/gpt-oss-120b | OpenAI | text | 677.7B | 138.2M | 5.6% |
| 7 | Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | text | 667.2B | 275.7M | 5.5% | |
| 8 | Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite | text | 659.9B | 130.8M | 5.5% | |
| 9 | Claude Opus 4.6 anthropic/claude-opus-4.6 | Anthropic | text | 579.3B | 11.7M | 4.8% |
| 10 | GPT-5.5 openai/gpt-5.5 | OpenAI | text | 516.7B | 12.6M | 4.3% |
| 11 | Kimi-K2.6 moonshot/kimi-k2.6 | Moonshot | text | 391.0B | 15.1M | 3.3% |
| 12 | Gemma-4-26b-A4b-IT google/gemma-4-26b-a4b-it | text | 277.4B | 63.4M | 2.3% | |
| 13 | Claude Haiku 4.5 anthropic/claude-haiku-4.5 | Anthropic | text | 263.7B | 30.6M | 2.2% |
| 14 | Gemini 3.1 Pro google/gemini-3.1-pro | text | 261.0B | 15.7M | 2.2% | |
| 15 | GPT-5.4 openai/gpt-5.4 | OpenAI | text | 257.6B | 14.0M | 2.1% |
| 16 | GPT-5.4 Mini openai/gpt-5.4-mini | OpenAI | text | 181.5B | 24.1M | 1.5% |
| 17 | Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | Anthropic | text | 123.1B | 8.8M | 1% |
| 18 | GPT-5.4 Nano openai/gpt-5.4-nano | OpenAI | text | 103.5B | 17.6M | 0.9% |
| 19 | Gpt-Oss-20b openai/gpt-oss-20b | OpenAI | text | 102.9B | 34.3M | 0.9% |
| 20 | GPT-5 openai/gpt-5 | OpenAI | text | 88.4B | 7.2M | 0.7% |
| 21 | Grok 4.3 xai/grok-4.3 | xAI | text | 87.9B | 13.6M | 0.7% |
| 22 | GPT-4.1 Mini openai/gpt-4.1-mini | OpenAI | text | 83.4B | 26.2M | 0.7% |
| 23 | Gemini 2.5 Pro google/gemini-2.5-pro | text | 78.4B | 6.8M | 0.7% | |
| 24 | Qwen 3.5 397B A17B alibaba/qwen3.5-397b-a17b | Alibaba | text | 54.9B | 5.0M | 0.5% |
| 25 | Claude Sonnet 4 anthropic/claude-sonnet-4 | Anthropic | text | 53.6B | 5.7M | 0.4% |
| 26 | GPT-4.1 openai/gpt-4.1 | OpenAI | text | 53.5B | 10.2M | 0.4% |
| 27 | Llama-4-Scout-17b-16e-Instruct meta/llama-4-scout-17b-16e-instruct | Meta | text | 10.5B | 5.5M | 0.1% |
| 28 | Bge-M3 baai/bge-m3 | BAAI | embedding | 9.7B | 9.0M | 0.1% |
| 29 | GPT-5.5 Pro openai/gpt-5.5-pro | OpenAI | text | 3.1B | 125.7K | 0% |
| 30 | Seedream 4.5 bytedance/seedream-4.5 | ByteDance | image | 3.0B | 155.2K | 0% |
| 31 | O4-Mini openai/o4-mini | OpenAI | text | 2.4B | 633.5K | 0% |
| 32 | Qwen 3 Max alibaba/qwen3-max | Alibaba | text | 2.0B | 316.2K | 0% |
| 33 | Llama-3.2-3b-Instruct meta/llama-3.2-3b-instruct | Meta | text | 1.4B | 5.0M | 0% |
| 34 | Grok Imagine Image Quality xai/grok-imagine-image-quality | xAI | image | 836.5M | 149.7K | 0% |
| 35 | Bge-Base-EN-V1.5 baai/bge-base-en-v1.5 | BAAI | embedding | 635.2M | 1.7M | 0% |
| 36 | Granite-4.0-H-Micro ibm-granite/granite-4.0-h-micro | Ibm-granite | text | 520.5M | 451.4K | 0% |
| 37 | FLUX.2 [Pro] Preview black-forest-labs/flux-2-pro-preview | Black Forest Labs | image | 468.3M | 45.1K | 0% |
| 38 | Flux-2-Klein-4b black-forest-labs/flux-2-klein-4b | Black Forest Labs | image | 435.2M | 65.6K | 0% |
| 39 | Deepseek-R1-Distill-Qwen-32b deepseek/deepseek-r1-distill-qwen-32b | Deepseek | text | 228.5M | 173.9K | 0% |
| 40 | FLUX.2 [Max] black-forest-labs/flux-2-max | Black Forest Labs | image | 218.3M | 13.6K | 0% |
| Provider | Models | Tokens |
|---|---|---|
| Anthropic | 7 | 5.63T |
| 8 | 3.76T | |
| OpenAI | 13 | 2.07T |
| Moonshot | 1 | 391.0B |
| xAI | 2 | 88.7B |
| Alibaba | 2 | 56.9B |
| Meta | 3 | 12.2B |
| BAAI | 3 | 10.5B |
| ByteDance | 1 | 3.0B |
| Black Forest Labs | 4 | 1.2B |
| Ibm-granite | 1 | 520.5M |
| Perplexity | 2 | 260.6M |
These rankings reflect real aggregate usage across the open model ecosystem, refreshed regularly — real market signal, not invented numbers. As first-party traffic on AIgateway grows, it's blended into the same board — opt your app in from Dashboard → Settings.