Real aggregate usage across the open model ecosystem — token volume, requests, and market share, refreshed regularly. Pick a model the market trusts — or pick the underdog it hasn't noticed yet.
| # | Model | Provider | Modality | Tokens | Requests | Share |
|---|---|---|---|---|---|---|
| 1 | Claude Opus 4.7 anthropic/claude-opus-4.7 | Anthropic | text | 2.75T | 65.2M | 22.9% |
| 2 | Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | Anthropic | text | 2.13T | 63.1M | 17.7% |
| 3 | Gemini 3 Flash google/gemini-3-flash | text | 1.05T | 127.5M | 8.7% | |
| 4 | Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | text | 649.2B | 259.9M | 5.4% | |
| 5 | Kimi-K2.6 moonshot/kimi-k2.6 | Moonshot | text | 641.3B | 19.9M | 5.3% |
| 6 | Gpt-Oss-120b openai/gpt-oss-120b | OpenAI | text | 566.6B | 112.1M | 4.7% |
| 7 | Gemini 2.5 Flash google/gemini-2.5-flash | text | 565.1B | 149.4M | 4.7% | |
| 8 | Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite | text | 548.1B | 109.4M | 4.6% | |
| 9 | GPT-5.5 openai/gpt-5.5 | OpenAI | text | 498.9B | 9.7M | 4.2% |
| 10 | Claude Opus 4.6 anthropic/claude-opus-4.6 | Anthropic | text | 483.1B | 9.6M | 4% |
| 11 | GPT-5.4 openai/gpt-5.4 | OpenAI | text | 335.8B | 14.5M | 2.8% |
| 12 | Gemini 3.1 Pro google/gemini-3.1-pro | text | 298.5B | 17.8M | 2.5% | |
| 13 | Claude Haiku 4.5 anthropic/claude-haiku-4.5 | Anthropic | text | 243.8B | 28.7M | 2% |
| 14 | Gemma-4-26b-A4b-IT google/gemma-4-26b-a4b-it | text | 201.9B | 57.6M | 1.7% | |
| 15 | GPT-5.4 Mini openai/gpt-5.4-mini | OpenAI | text | 158.4B | 20.2M | 1.3% |
| 16 | Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | Anthropic | text | 124.1B | 8.7M | 1% |
| 17 | GPT-5.4 Nano openai/gpt-5.4-nano | OpenAI | text | 115.1B | 17.1M | 1% |
| 18 | GPT-5 openai/gpt-5 | OpenAI | text | 91.3B | 7.4M | 0.8% |
| 19 | Gpt-Oss-20b openai/gpt-oss-20b | OpenAI | text | 86.6B | 27.6M | 0.7% |
| 20 | Grok 4.3 xai/grok-4.3 | xAI | text | 80.6B | 12.2M | 0.7% |
| 21 | GPT-4.1 Mini openai/gpt-4.1-mini | OpenAI | text | 78.5B | 27.1M | 0.7% |
| 22 | Gemini 2.5 Pro google/gemini-2.5-pro | text | 71.6B | 6.7M | 0.6% | |
| 23 | Qwen 3.5 397B A17B alibaba/qwen3.5-397b-a17b | Alibaba | text | 67.0B | 7.5M | 0.6% |
| 24 | GPT-4.1 openai/gpt-4.1 | OpenAI | text | 48.7B | 11.3M | 0.4% |
| 25 | Claude Sonnet 4 anthropic/claude-sonnet-4 | Anthropic | text | 47.9B | 3.7M | 0.4% |
| 26 | Claude Opus 4.8 anthropic/claude-opus-4.8 | Anthropic | text | 39.9B | 644.2K | 0.3% |
| 27 | Llama-4-Scout-17b-16e-Instruct meta/llama-4-scout-17b-16e-instruct | Meta | text | 11.7B | 6.8M | 0.1% |
| 28 | Gemma-3-12b-IT google/gemma-3-12b-it | text | 7.5B | 11.5M | 0.1% | |
| 29 | Bge-M3 baai/bge-m3 | BAAI | embedding | 6.1B | 7.1M | 0.1% |
| 30 | GPT-5.5 Pro openai/gpt-5.5-pro | OpenAI | text | 5.9B | 292.5K | 0% |
| 31 | Seedream 4.5 bytedance/seedream-4.5 | ByteDance | image | 2.9B | 147.3K | 0% |
| 32 | O4-Mini openai/o4-mini | OpenAI | text | 2.0B | 645.3K | 0% |
| 33 | Qwen 3 Max alibaba/qwen3-max | Alibaba | text | 1.5B | 251.9K | 0% |
| 34 | Llama-3.2-3b-Instruct meta/llama-3.2-3b-instruct | Meta | text | 1.4B | 4.7M | 0% |
| 35 | GPT-5.4 Pro openai/gpt-5.4-pro | OpenAI | text | 1.3B | 164.2K | 0% |
| 36 | Grok Imagine Image Quality xai/grok-imagine-image-quality | xAI | image | 703.6M | 130.3K | 0% |
| 37 | Llama-3-8b-Instruct meta/llama-3-8b-instruct | Meta | text | 692.6M | 880.9K | 0% |
| 38 | FLUX.2 [Pro] Preview black-forest-labs/flux-2-pro-preview | Black Forest Labs | image | 676.9M | 53.4K | 0% |
| 39 | Bge-Base-EN-V1.5 baai/bge-base-en-v1.5 | BAAI | embedding | 652.4M | 1.5M | 0% |
| 40 | Llama-3.2-1b-Instruct meta/llama-3.2-1b-instruct | Meta | text | 450.2M | 270.0K | 0% |
| Provider | Models | Tokens |
|---|---|---|
| Anthropic | 7 | 5.82T |
| 9 | 3.39T | |
| OpenAI | 13 | 1.99T |
| Moonshot | 1 | 641.3B |
| xAI | 2 | 81.3B |
| Alibaba | 2 | 68.5B |
| Meta | 4 | 14.2B |
| BAAI | 3 | 6.8B |
| ByteDance | 1 | 2.9B |
| Black Forest Labs | 4 | 1.4B |
| Ibm-granite | 1 | 400.0M |
| Deepseek | 1 | 290.1M |
These rankings reflect real aggregate usage across the open model ecosystem, refreshed regularly — real market signal, not invented numbers. As first-party traffic on AIgateway grows, it's blended into the same board — opt your app in from Dashboard → Settings.