Pricing per million tokens, context window, and capabilities, pulled from each provider's public docs. Both models are available through the same AIgateway OpenAI-compatible endpoint; change the model string to switch.
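To make per-million-token pricing concrete, here is a minimal sketch of how a per-request cost could be estimated. The `Price` class, the `request_cost` helper, and the dollar figures are all hypothetical placeholders for illustration; substitute the rates each provider actually publishes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """USD per one million tokens (hypothetical container, not a gateway type)."""
    input_per_m: float
    output_per_m: float


def request_cost(price: Price, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one request, with input and output tokens billed separately."""
    total = input_tokens * price.input_per_m + output_tokens * price.output_per_m
    return total / 1_000_000


# PLACEHOLDER prices, not taken from any provider's price list.
example = Price(input_per_m=2.00, output_per_m=8.00)
cost = request_cost(example, input_tokens=2_000, output_tokens=500)
print(f"${cost:.4f}")  # prints $0.0080
```

Because output tokens are usually priced higher than input tokens, the split between prompt length and response length matters as much as the headline rate.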
Google's most capable Gemini 2.5 model with strong reasoning, thinking support, and a 1M token context window.
Claude Haiku 4.5 delivers coding performance similar to that of larger models at one-third the cost and more than twice the speed.
Source: each provider's published benchmarks. Higher is better. Run an eval to compare on your own data.
from openai import OpenAI

# Point the OpenAI SDK at the AIgateway endpoint.
client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Gemini 2.5 Pro
client.chat.completions.create(
    model="google/gemini-2.5-pro",
    messages=[{"role": "user", "content": "hello"}],
)

# Claude Haiku 4.5
client.chat.completions.create(
    model="anthropic/claude-haiku-4.5",
    messages=[{"role": "user", "content": "hello"}],
)
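Following the suggestion above to run an eval on your own data, here is a minimal sketch of a side-by-side comparison. The `accuracy` and `compare` helpers, the substring-match scoring rule, and the stub models are assumptions made for illustration, not part of the gateway or of either provider's tooling. The stubs stand in for real model calls so the sketch runs offline.

```python
from typing import Callable, Iterable

Ask = Callable[[str], str]  # takes a prompt, returns the model's reply text


def accuracy(ask: Ask, cases: Iterable[tuple[str, str]]) -> float:
    """Fraction of cases whose expected answer appears in the reply (crude metric)."""
    cases = list(cases)
    if not cases:
        raise ValueError("need at least one test case")
    hits = sum(expected.lower() in ask(prompt).lower() for prompt, expected in cases)
    return hits / len(cases)


def compare(models: dict[str, Ask], cases: Iterable[tuple[str, str]]) -> dict[str, float]:
    """Score every model on the same cases."""
    cases = list(cases)
    return {name: accuracy(ask, cases) for name, ask in models.items()}


# Hypothetical test cases and offline stubs standing in for real model calls.
cases = [("What is 2 + 2?", "4"), ("Capital of France?", "Paris")]
stub_a = lambda prompt: "4" if "2 + 2" in prompt else "Paris"
stub_b = lambda prompt: "I am not sure"

scores = compare({"stub-a": stub_a, "stub-b": stub_b}, cases)
print(scores)  # prints {'stub-a': 1.0, 'stub-b': 0.0}
```

To score the real models, each stub could be replaced by a small function that calls `client.chat.completions.create(...)` with the relevant model string and returns `response.choices[0].message.content`. Substring matching is only a starting point; a task-specific scorer will usually give a more meaningful comparison.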