Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 1 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.
DeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.