LIVE$5* free credits on signup. Try Kimi K2.6, FLUX-2 Klein, Whisper Turbo, Aura 2, and our curated edge tier.Claim your key →
85+ labs · one schema · zero lock-in

One API.
Every model.
Every modality.

OpenAI-compatible endpoints to every model and every modality — text, image, video, voice, audio, music, embeddings. Plus eval-driven routing, sub-account keys for your end users, and replay across any model.

Get your key Read the docs
99.99% uptime 47ms p50 overhead SOC 2 in progress
python
typescript
curl
aigateway-py
aigateway-cli
quickstart.py
# one-line swap: point your SDK at us.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.openai.com/v1"
             "https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

stream = client.chat.completions.create(
    model="moonshot/kimi-k2.6",
    messages=msgs, stream=True,
)
STREAMINGmodel: kimi-k2.6tok/s: 0ttft: via edge / iad1
openaianthropicgooglemoonshotmetamistraldeepseekqwenblack-forest-labsstabilityaideepgram+ 27 more
MODELS
1000+
across 85+ upstream labs
LABS
85+
normalized behind one schema
EDGE OVERHEAD
47ms
p50 routing overhead · global edge
CACHE DISCOUNT
50%
off list price on cached requests
NEWmodel: "auto"
One model name. Every modality. Always cheaper than the premium pick.
Set model:"auto" and the router reads each request, picks the cheapest model in a curated pool that still clears the quality floor, and never charges you more than the premium model you'd have called yourself. Text, image, video, speech, transcription, music, embeddings. Every response tells you what ran and what you saved.
See the Auto Router →
$5* FREE ON SIGNUP

Start with $5.
Try the free tier.*

Your free credit covers a curated edge-tier shortlist — Kimi K2.6 for chat, FLUX-2 Klein for image, Whisper Turbo + Aura 2 for voice, BGE-M3 for embeddings, Gemma 4 for vision, Llama Guard for moderation. Top up to unlock the full catalog; topups never expire.

$5
signup_creditfree7 curated models7-day window
$5* free credit · no card preauth
Use it on the 7-model edge tier — chat, image, voice, transcription, vision, embeddings, moderation.
KIMI K2.6
~2M tok
on your free $5
FLUX KLEIN 9B
~2 images
on your free $5
WHISPER TURBO
~14 hours
on your free $5
AURA 2
~416K chars
on your free $5
Claim your $5 →See the 7 free-tier modelstopups never expire
python
first_call.py
from openai import OpenAI

client = OpenAI(
    base_url="https://api.aigateway.sh/v1",
    api_key="sk-aig-...",
)

r = client.chat.completions.create(
    model="anthropic/claude-opus-4.7",
    messages=[{"role": "user",
               "content": "Plan a"
                          " research agent."}],
    stream=True,
)

# $5 free credit on signup (7-day window, curated models).
# Top up to unlock the full 1000+-model catalog.

* Applicable only on selected models — see the model list for details. Free credit expires 7 days after signup; topups never expire.

Catalog

Every model. Every modality.
One key.

Drop-in compatible with the OpenAI SDK. Rotate models mid-conversation without changing any other line of code.

showing 12 of 1000+ models
Rankings →Providers →See all 1000+ models →
Popular comparisons:Opus 4.7 vs GPT-5.4Sonnet 4.6 vs GPT-5.4-miniGemini 3.1 Pro vs Opus 4.7Kimi K2.6 vs Opus 4.7Haiku 4.5 vs GPT-5.4-mini
Honest comparison

Why swap in
one line of code.

Anonymized competitor labels — same public pricing and feature matrix, minus the trash talk. We'll let you Google who's who.

 AIgatewayCompetitor A
breadth aggregator
Competitor B
platform-native gateway
Competitor C
enterprise governance
Models / modalities1000+ · text · image · video · music · voice · audio · embeddings · vision~300 · text only~40 · text + image~80 · image, video, audio
Open-weight served at the edgeyesno · pass-throughno · pass-throughno · queue
Edge latency (p50)47ms · global edge~180ms · single region~60ms · one platform~190ms · single region
Eval-driven routing (SLO on your data)yes
Sub-account / per-user key APIyes · programmaticworkspace only
Replay + shadow A/B across modelsyes
OpenAI-compatibledrop-in, zero changesdrop-indrop-in

Sourced from each competitor's public pricing page and docs, April 2026. If something is wrong, email hello@aigateway.sh and we'll fix it.

One catalog

Every model, every modality.
One endpoint, one schema.

1000+ models across 85+ labs — text, image, video, music, voice, audio, embeddings, vision. Same OpenAI-compatible body shape for every one. Rotate between Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, Kimi K2.6, Llama 4, Flux 1 Schnell, Stable Diffusion XL, Veo 3.1, MiniMax Hailuo by changing a single string.

FRONTIER LABS
Anthropic, OpenAI, Google
Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, Claude Haiku 4.5
OPEN-WEIGHT
Moonshot, Meta, Mistral, DeepSeek, IBM, Nvidia
Kimi K2.6, Llama 4 Scout, Mistral Small 3.1, DeepSeek-R1, Granite 4
MULTIMODAL
Black Forest Labs, Stability, MyShell, Deepgram
Flux 1 Schnell, Stable Diffusion XL, MeloTTS, Aura 2, Whisper Large v3
Primitives

Four things only the
aggregator can ship.

Things single-provider SDKs physically cannot do — because they only see one provider. We see all of them.

EVAL-DRIVEN ROUTING
Let your own data pick the model.
Upload a dataset, we grade every candidate model on it, then hand you an alias that always routes to the current winner. When a new frontier model lands, rerun the eval — your prod code doesn't change.
curl -X POST https://api.aigateway.sh/v1/evals \
  -H "Authorization: Bearer $KEY" \
  -d '{
    "name": "prod-summarize",
    "candidate_models": [
      "anthropic/claude-opus-4.7",
      "openai/gpt-5.4",
      "moonshot/kimi-k2.6"
    ],
    "dataset": [...],
    "metric": "quality"
  }'

# then just use it
model = "eval:prod-summarize"
SUB-ACCOUNT API
Scoped keys for your end users.
One API call mints a key for each of your customers, with its own spend cap, rate limit, and isolated analytics. No more "I'll build a billing layer on top." We are the billing layer.
POST /v1/sub-accounts
{
  "name": "acme-corp",
  "spend_cap_cents": 50000,     // $500 / mo hard cap
  "rate_limit_rpm": 300,
  "default_tag": "acme"
}

=> { "key": "sk-aig-...",       // hand to customer
     "spend_cap_cents": 50000,
     "id": "sa_9f3k..." }
REPLAY + SHADOW A/B
Test a new model on real traffic.
Replay any past request against a different model and see output, cost, and latency side-by-side. Shadow mode fires the new model in parallel without ever reaching the user — a safe cutover switch.
POST /v1/replays
{
  "source_request_id": "req_abc123",
  "target_model": "anthropic/claude-opus-4.7",
  "shadow": true
}

=> { "source_output": "...",
     "target_output": "...",
     "cost_source_cents": 1.2,
     "cost_target_cents": 4.7,
     "score_delta": 0.82 }
COST-ATTRIBUTION TAGS
Know what every feature costs.
Tag any request with a free-form string — feature, user, tenant — via the x-aig-tag header. Query monthly spend grouped by tag. Pair with hard budget caps per tag and no feature can silently eat your margin again.
# tag each request with the feature it powers
curl https://api.aigateway.sh/v1/chat/completions \
  -H "x-aig-tag: summarize" \
  ...

GET /v1/usage/by-tag?month=2026-04
=> [{ "tag": "summarize", "cost_cents": 4210 },
    { "tag": "chat",      "cost_cents": 9830 },
    { "tag": "rerank",    "cost_cents":  118 }]
For coding agents

Point Claude Code at us.
It configures itself.

A capable coding agent reads /llms.txt, /openapi.json, or /agents.md once — and from then on it can call every modality, mint sub-accounts, tag cost, and replay requests without a human editing config.

You can call any AI model through AIgateway, a universal AI API.

- Base URL: https://api.aigateway.sh/v1
- Auth: Authorization: Bearer sk-aig-...
- SDK: drop-in for the OpenAI SDK — only change base_url
- OpenAPI spec: https://api.aigateway.sh/openapi.json
- Capability map: https://aigateway.sh/llms.txt
- Live models: https://api.aigateway.sh/v1/models
- MCP server: https://api.aigateway.sh/mcp

Install (only when the OpenAI SDK isn't enough):
  pip install aigateway-py        # Python — async jobs, sub-accounts, evals
  pnpm add aigateway-js           # Node   — same surface in TypeScript
  npm i -g aigateway-cli          # CLI    — `aig init` walks through everything

Primitives nobody else has:
  POST /v1/sub-accounts — scoped keys + spend caps per end user
  POST /v1/evals        — grade candidate models on your data
  POST /v1/replays      — re-run a past request on a new model
  GET  /v1/usage/by-tag — per-feature cost via x-aig-tag header
LLMS.TXT
Agent-readable capability map
/llms.txt
OPENAPI.JSON
Typed 3.1 spec for code-gen
https://api.aigateway.sh/openapi.json
AGENTS.MD
Integration patterns + error remediation
/agents.md
INSTALL
pip install aigateway-py · pnpm add aigateway-js · npm i -g aigateway-cli
/integrations
Full agent guide →
Public beta · April 2026

We don't fake logos.
We'd rather ship to you.

Sign up in 30 seconds — no waitlist, no sales call. Cline, Aider, and a handful of agent startups are already on it. Want a personal onboard? Drop your email and I'll reply.

we'll email you once, from a real human. no drip sequence.