Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 2 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.
DeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models.
The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks.
from openai import OpenAI
client = OpenAI(
base_url="https://api.aigateway.sh/v1",
api_key="sk-aig-...",
)
# Deepseek-R1-Distill-Qwen-32b
client.chat.completions.create(
model="deepseek/deepseek-r1-distill-qwen-32b",
messages=[{"role":"user","content":"hello"}],
)
# Qwen3-Embedding-0.6b
client.chat.completions.create(
model="qwen/qwen3-embedding-0.6b",
messages=[{"role":"user","content":"hello"}],
)