Question 1

How do I compare AI models on AIgateway?

Accepted Answer

Search the picker on /compare for the models you want, click each one to add a column, and AIgateway pulls live pricing, context window, and capabilities from each provider's docs. Shareable URL — copy the link and the comparison reproduces.

Question 2

How many models can I compare at once?

Accepted Answer

Up to 4 models side-by-side. Past that the columns get unreadable; for broader exploration use /models with filters.

Question 3

Where does the comparison data come from?

Accepted Answer

Each provider's published pricing and capability docs, plus our own catalog at https://api.aigateway.sh/v1/models which we keep in sync as new models ship. Cache reads, prompt caching, vision, and tool-calling flags reflect what the upstream provider actually supports.

Question 4

How does AIgateway pricing work?

Accepted Answer

Pass-through on the underlying provider rate plus a 5% platform fee at credit top-up — no markup per call, no monthly minimum. Cached requests get a 50% discount. New accounts get $5 signup credit (no card) on a curated edge-tier shortlist.

Question 5

Can I compare models across providers?

Accepted Answer

Yes — that's the point. Side-by-side any of 1000+ models from 85+ providers (Anthropic, OpenAI, Google, Moonshot, Meta, Mistral, xAI, Deepgram, ElevenLabs, Black Forest Labs, and more) without signing up for each.

Question 6

Can I run an eval to pick a winner on my own data?

Accepted Answer

Yes. POST /v1/evals with the candidate model slugs, a dataset, and a grader. AIgateway runs each prompt against each model and returns an alias that routes to the current winner. Re-run the eval when a new frontier model lands and your alias re-binds with zero code change.

Question 7

What's the difference between Deepseek-R1-Distill-Qwen-32b and Qwen2.5-Coder-32b-Instruct?

Accepted Answer

Deepseek-R1-Distill-Qwen-32b (deepseek/deepseek-r1-distill-qwen-32b) — DeepSeek-R1-Distill-Qwen-32B is a model distilled from DeepSeek-R1 based on Qwen2.5. It outperforms OpenAI-o1-mini across various benchmarks, achieving new state-of-the-art results for dense models. Qwen2.5-Coder-32b-Instruct (qwen/qwen2.5-coder-32b-instruct) — Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). As of now, Qwen2.5-Coder has covered six mainstream model sizes, 0.5, 1.5, 3, 7, 14, 32 billion parameters, to meet the needs of different developers. Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: Input pricing: $0.500 /1M vs $0.660 /1M. Output: $4.88 /1M vs $1.00 /1M. Tools: — vs yes. Vision: — vs —. Context: 131,072 vs 131,072 tokens.

Deepseek-R1-Distill-Qwen-32b vs Qwen2.5-Coder-32b-Instruct

Compare with another

One key, all 2, one line different.