Pricing per million tokens, context window, and capabilities are pulled from each provider's public docs. Both models are available via the same AIgateway OpenAI-compatible endpoint; change the model string to switch.
xAI Grok 4 Fast — low-latency variant.
xAI's Grok 4.20 multi-agent model with a 2M-token context window. Multiple agents collaborate in parallel to perform deep research tasks, with function calling, structured outputs, and reasoning capabilities.
from openai import OpenAI

# One client for both models; only the model string changes per call.
client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

# Grok 4 Fast
client.chat.completions.create(
    model="xai/grok-4-fast",
    messages=[{"role": "user", "content": "hello"}],
)

# Grok 4.20 Multi-Agent
client.chat.completions.create(
    model="xai/grok-4.20-multi-agent-0309",
    messages=[{"role": "user", "content": "hello"}],
)
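
To make the "same endpoint, different model string" point concrete, here is a minimal sketch using only the Python standard library. It assumes the gateway follows the usual OpenAI convention of serving chat completions at `/chat/completions` under the base URL and authenticating with a Bearer token, which is what the OpenAI client does by default. `build_chat_request` and the `AIGATEWAY_API_KEY` environment variable are hypothetical names introduced for illustration. The requests are built but not sent.

```python
import json
import os
import urllib.request

# Base URL and model strings are taken from the snippet above.
BASE_URL = "https://api.aigateway.sh/v1"
MODELS = ["xai/grok-4-fast", "xai/grok-4.20-multi-agent-0309"]


def build_chat_request(model: str, prompt: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request for the given model.

    Assumes the OpenAI-style path /chat/completions under BASE_URL.
    """
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # AIGATEWAY_API_KEY is a hypothetical variable name; the fallback is
    # the placeholder key from the snippet above, not a working credential.
    key = os.environ.get("AIGATEWAY_API_KEY", "sk-aig-...")
    for model in MODELS:
        req = build_chat_request(model, "hello", key)
        body = json.loads(req.data)
        # The URL is identical for both models; only body["model"] differs.
        print(req.full_url, body["model"])
```

Passing the returned object to `urllib.request.urlopen` would send it. The sketch stops short of that so it can be run without a valid key.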