Real aggregate usage across the open model ecosystem: token volume, requests, and market share, refreshed regularly. Pick a model the market trusts, or pick the underdog it hasn't noticed yet.

| # | Model | Provider | Modality | Tokens | Requests | Share |
|---|---|---|---|---|---|---|
| 1 | Claude Opus 4.7 anthropic/claude-opus-4.7 | Anthropic | text | 2.32T | 48.0M | 19% |
| 2 | Claude Sonnet 4.6 anthropic/claude-sonnet-4.6 | Anthropic | text | 2.09T | 61.4M | 17.1% |
| 3 | Gemini 3 Flash google/gemini-3-flash | Google | text | 1.04T | 132.7M | 8.5% |
| 4 | Gpt-Oss-120b openai/gpt-oss-120b | OpenAI | text | 662.7B | 133.4M | 5.4% |
| 5 | Gemini 2.5 Flash Lite google/gemini-2.5-flash-lite | Google | text | 635.9B | 259.1M | 5.2% |
| 6 | Gemini 2.5 Flash google/gemini-2.5-flash | Google | text | 622.8B | 155.2M | 5.1% |
| 7 | Gemini 3.1 Flash Lite google/gemini-3.1-flash-lite | Google | text | 602.8B | 117.9M | 4.9% |
| 8 | Claude Opus 4.6 anthropic/claude-opus-4.6 | Anthropic | text | 602.1B | 11.9M | 4.9% |
| 9 | Kimi-K2.6 moonshot/kimi-k2.6 | Moonshot | text | 563.2B | 18.0M | 4.6% |
| 10 | Claude Opus 4.8 anthropic/claude-opus-4.8 | Anthropic | text | 500.3B | 9.0M | 4.1% |
| 11 | GPT-5.5 openai/gpt-5.5 | OpenAI | text | 498.1B | 10.5M | 4.1% |
| 12 | Gemini 3.1 Pro google/gemini-3.1-pro | Google | text | 286.4B | 16.9M | 2.3% |
| 13 | GPT-5.4 openai/gpt-5.4 | OpenAI | text | 264.5B | 12.9M | 2.2% |
| 14 | Claude Haiku 4.5 anthropic/claude-haiku-4.5 | Anthropic | text | 252.7B | 29.6M | 2.1% |
| 15 | Gemma-4-26b-A4b-IT google/gemma-4-26b-a4b-it | Google | text | 249.8B | 58.5M | 2% |
| 16 | GPT-5.4 Mini openai/gpt-5.4-mini | OpenAI | text | 168.7B | 21.0M | 1.4% |
| 17 | Claude Sonnet 4.5 anthropic/claude-sonnet-4.5 | Anthropic | text | 125.2B | 8.7M | 1% |
| 18 | GPT-5.4 Nano openai/gpt-5.4-nano | OpenAI | text | 106.1B | 15.9M | 0.9% |
| 19 | Gpt-Oss-20b openai/gpt-oss-20b | OpenAI | text | 97.1B | 31.9M | 0.8% |
| 20 | GPT-5 openai/gpt-5 | OpenAI | text | 93.1B | 7.2M | 0.8% |
| 21 | Grok 4.3 xai/grok-4.3 | xAI | text | 92.7B | 13.8M | 0.8% |
| 22 | GPT-4.1 Mini openai/gpt-4.1-mini | OpenAI | text | 81.0B | 27.3M | 0.7% |
| 23 | Gemini 2.5 Pro google/gemini-2.5-pro | Google | text | 77.6B | 6.9M | 0.6% |
| 24 | Claude Sonnet 4 anthropic/claude-sonnet-4 | Anthropic | text | 56.4B | 5.4M | 0.5% |
| 25 | GPT-4.1 openai/gpt-4.1 | OpenAI | text | 50.1B | 9.7M | 0.4% |
| 26 | Qwen 3.5 397B A17B alibaba/qwen3.5-397b-a17b | Alibaba | text | 39.3B | 3.8M | 0.3% |
| 27 | Llama-4-Scout-17b-16e-Instruct meta/llama-4-scout-17b-16e-instruct | Meta | text | 9.7B | 5.7M | 0.1% |
| 28 | Bge-M3 baai/bge-m3 | BAAI | embedding | 9.3B | 7.0M | 0.1% |
| 29 | GPT-5.5 Pro openai/gpt-5.5-pro | OpenAI | text | 4.4B | 231.0K | 0% |
| 30 | Seedream 4.5 bytedance/seedream-4.5 | ByteDance | image | 2.6B | 129.8K | 0% |
| 31 | O4-Mini openai/o4-mini | OpenAI | text | 2.2B | 620.4K | 0% |
| 32 | Qwen 3 Max alibaba/qwen3-max | Alibaba | text | 1.9B | 274.4K | 0% |
| 33 | Llama-3.2-3b-Instruct meta/llama-3.2-3b-instruct | Meta | text | 1.3B | 4.7M | 0% |
| 34 | Grok Imagine Image Quality xai/grok-imagine-image-quality | xAI | image | 822.4M | 149.9K | 0% |
| 35 | Bge-Base-EN-V1.5 baai/bge-base-en-v1.5 | BAAI | embedding | 772.7M | 2.4M | 0% |
| 36 | FLUX.2 [Pro] Preview black-forest-labs/flux-2-pro-preview | Black Forest Labs | image | 724.0M | 58.1K | 0% |
| 37 | Granite-4.0-H-Micro ibm-granite/granite-4.0-h-micro | IBM Granite | text | 428.3M | 330.2K | 0% |
| 38 | Flux-2-Klein-4b black-forest-labs/flux-2-klein-4b | Black Forest Labs | image | 373.7M | 57.5K | 0% |
| 39 | GPT-5.4 Pro openai/gpt-5.4-pro | OpenAI | text | 281.2M | 21.3K | 0% |
| 40 | DeepSeek-R1-Distill-Qwen-32B deepseek/deepseek-r1-distill-qwen-32b | DeepSeek | text | 249.3M | 189.9K | 0% |

| Provider | Models | Tokens |
|---|---|---|
| Anthropic | 7 | 5.94T |
| Google | 8 | 3.52T |
| OpenAI | 13 | 2.03T |
| Moonshot | 1 | 563.2B |
| xAI | 2 | 93.5B |
| Alibaba | 2 | 41.2B |
| Meta | 3 | 11.3B |
| BAAI | 3 | 10.2B |
| ByteDance | 1 | 2.6B |
| Black Forest Labs | 4 | 1.4B |
| IBM Granite | 1 | 428.3M |
| DeepSeek | 1 | 249.3M |

These rankings reflect real aggregate usage across the open model ecosystem, not invented numbers, and are refreshed regularly. As first-party traffic on AIgateway grows, it is blended into the same board. To opt your app in, go to Dashboard → Settings.
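
The page does not say how the Share column is computed. The figures are consistent with each model's token volume divided by the sum of all provider token totals, so the Python sketch below reproduces it under that assumption. The helper names (`parse_count`, `share_percent`) are hypothetical, and the 3.52T provider row is assumed to belong to Google, matching the `google/` model ids.

```python
# Multipliers for the abbreviated counts used on the board (K, M, B, T).
SUFFIXES = {"K": 1e3, "M": 1e6, "B": 1e9, "T": 1e12}


def parse_count(text: str) -> float:
    """Convert an abbreviated count such as '662.7B' into a plain number."""
    text = text.strip()
    if text and text[-1].upper() in SUFFIXES:
        return float(text[:-1]) * SUFFIXES[text[-1].upper()]
    return float(text)


# Per-provider token totals copied from the provider table.
# Assumption: the 3.52T row is Google (its models use the google/ prefix).
PROVIDER_TOKENS = {
    "Anthropic": "5.94T",
    "Google": "3.52T",
    "OpenAI": "2.03T",
    "Moonshot": "563.2B",
    "xAI": "93.5B",
    "Alibaba": "41.2B",
    "Meta": "11.3B",
    "BAAI": "10.2B",
    "ByteDance": "2.6B",
    "Black Forest Labs": "1.4B",
    "IBM Granite": "428.3M",
    "DeepSeek": "249.3M",
}


def share_percent(tokens: str, total: float) -> float:
    """Token share in percent, rounded to one decimal like the board.

    Assumption: share = model tokens / total tokens across all providers.
    """
    return round(100 * parse_count(tokens) / total, 1)


total_tokens = sum(parse_count(v) for v in PROVIDER_TOKENS.values())

# Claude Opus 4.7 is listed with 2.32T tokens and a 19% share.
print(share_percent("2.32T", total_tokens))
```

Because the published counts are already rounded, a recomputed share can differ from the board in the last digit for some rows.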