Pricing per million tokens, context window, and capabilities, pulled from each provider's public docs. Both models are available through the same AIgateway OpenAI-compatible endpoint; change the model string to switch.
xAI's Grok 4.20 multi-agent model with a 2M-token context window. Multiple agents collaborate in parallel to perform deep research tasks, with function calling, structured outputs, and reasoning capabilities.
Google's most intelligent Gemini model, with improved reasoning, a medium thinking level, and a 1M-token context window.
Source: each provider's published benchmarks. Higher is better. Run an eval to compare on your own data.
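One way to run such an eval is a small exact-match harness. The sketch below is an illustration under stated assumptions, not a gateway feature: `ask` is a hypothetical callable you supply (for example, a thin wrapper around `client.chat.completions.create`), `exact_match_accuracy` is a made-up helper name, and exact match is only the simplest possible scoring rule.

```python
from typing import Callable, Dict, List, Tuple


def exact_match_accuracy(
    ask: Callable[[str, str], str],
    models: List[str],
    dataset: List[Tuple[str, str]],
) -> Dict[str, float]:
    """Return, per model, the fraction of prompts answered exactly right.

    `ask(model, prompt)` is a hypothetical callable that returns the
    model's reply as a string; it is not part of any SDK.
    """
    if not dataset:
        raise ValueError("dataset needs at least one (prompt, expected) pair")

    scores: Dict[str, float] = {}
    for model in models:
        correct = 0
        for prompt, expected in dataset:
            answer = ask(model, prompt)
            # Ignore case and surrounding whitespace so trivial
            # formatting differences are not counted as errors.
            if answer.strip().lower() == expected.strip().lower():
                correct += 1
        scores[model] = correct / len(dataset)
    return scores
```

Exact match suits short factual answers; for free-form output you would likely swap in a different scoring function while keeping the same loop.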
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Grok 4.20 Multi-Agent
client.chat.completions.create(
    model="xai/grok-4.20-multi-agent-0309",
    messages=[{"role": "user", "content": "hello"}],
)

# Gemini 3.1 Pro
client.chat.completions.create(
    model="google/gemini-3.1-pro",
    messages=[{"role": "user", "content": "hello"}],
)
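Because only the model string changes between the two calls above, they can be folded into a loop that sends one prompt to both models and collects the replies. This is a minimal sketch, assuming `client` is an OpenAI-style client like the one constructed above; `compare_models` and `MODELS` are hypothetical names introduced here, not part of the SDK or the gateway.

```python
from typing import Any, Dict, Sequence

# Model strings taken from the snippet above.
MODELS = (
    "xai/grok-4.20-multi-agent-0309",
    "google/gemini-3.1-pro",
)


def compare_models(
    client: Any,
    prompt: str,
    models: Sequence[str] = MODELS,
) -> Dict[str, str]:
    """Send the same prompt to each model and return {model: reply text}."""
    replies: Dict[str, str] = {}
    for model in models:
        response = client.chat.completions.create(
            model=model,
            messages=[{"role": "user", "content": prompt}],
        )
        # Chat completions return a list of choices; take the first one.
        replies[model] = response.choices[0].message.content
    return replies
```

Calling `compare_models(client, "hello")` returns a dict keyed by model string, which makes it easy to print or diff the two replies side by side.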