Pricing

Cost + 5%. Nothing else.

Provider cost plus a 5% platform fee on every API call — that's our entire revenue model. No monthly fee, no seat count, no minimum. Cached requests get a 50% discount.

A separate payment-processor fee (~5%) applies once at credit top-up — same as anywhere you pay online with a card. That fee goes to Stripe, not us.

$5* free credits on signupTopups never expireRefund within 90 days

Free signup credit

$5*free on signup
Try the curated 7-model edge tier. Expires 7 days after signup.
  • Kimi K2.6 (chat) · BGE-M3 (embed) · FLUX-2 Klein 9B (image)
  • Gemma 4 vision · Aura 2 (TTS) · Whisper Turbo (STT) · Llama Guard
  • Full playground, logs, and analytics
  • Card required at signup; auto-topup is opt-in
Get $5 free

Enterprise

Custom/from $10k/mo committed
Everything in PAYG, plus the primitives only an aggregator can ship.
  • Evals — replay real traffic against alternate models, score against your rubric
  • Guardrails — content safety + PII redaction, one policy every provider
  • Replay + shadow A/B — deterministic re-run, canary new models safely
  • Prompt IDs — versioned prompts server-side with auto prefix caching
  • SSO · SCIM · dedicated endpoint · security posture
  • 99.95% SLA · DPA · SOC 2 · private audit export
  • Direct-provider agreements · named engineer on Slack Connect
Talk to sales
Calculator

What you'd actually pay us.

Plug in your monthly volume. We show you side-by-side cost against the aggregators we replace. Cache savings are modeled separately.

$2.50 in / $10 out
120M
40M
30%
$738.50
5.5% platform fee
$741.00
~3% + platform min
$624.75
− $113.75 vs A
BREAKDOWN · AIGATEWAY
Provider token cost$700.005% platform fee$29.75Cache savings− $110.25Net$624.75
How we compare

Cheaper than OpenRouter.
Primitives none of them have.

Numbers pulled from each provider's public pricing page in April 2026. We update this monthly.

FeatureAIgatewayOpenRouterPortkeyHeliconeRequesty
Platform fee (our revenue)5% per call5.5% on credits$49/mo flat$79/mo+ flatfree
Markup on model cost5% · transparent0% · pass-through0% · pass-through0% · pass-through0%
Cache-hit discount50% off cached requestsnononono
Every modalitytext · image · video · voice · audio · embedtext onlytext onlytext + obsvaries
Sub-account / per-user keysyes, APInoworkspace onlyworkspace onlyno
Cost-attribution tags + hard capsyes, APIbasicyesyesno
BYOKfree · no per-request fee1M free/mo then 5%includedincludedyes
Evals on your trafficEnterprisenorules onlynono
Guardrails (content + PII)Enterprise · one policy, every providernoyesnono
Replay + shadow A/B across modelsEnterprisenonoprompt-level onlyno
Prompt IDs with auto-cachingEnterprisenoworkspace onlynono
Auto Router

Let us pick the model.
Pay even less.

Cost + 5% is the floor, not the only option. Set model:"auto" and the router picks the cheapest model in a curated pool that still clears the quality floor — cheaper than the premium model you'd otherwise call, guaranteed, with the baseline acting as a hard cost ceiling. Works across text, image, video, speech, transcription, music, and embeddings.

Every routed call returns headers showing the model that ran and the exact dollars saved versus your baseline. You keep the majority of every dollar the router saves, and you never pay above that baseline. The full pricing mechanics are in the docs fine-print.
See the Auto Router →Read the docs
FAQ

Questions we actually get.

What's the catch on 5%?
There isn't one. 5% is our entire revenue model — we add it to the underlying model cost on every API call. No monthly fees, no seat fees, no per-model surcharges, no markup on tokens, no minimum. You only pay when a request succeeds. (A separate payment-processor fee applies once at credit top-up — that one goes to Stripe, not us, and you'd pay it on any online charge.)
So what does it actually cost me?
Per call: provider cost × 1.05 — debited from your wallet when the request succeeds. At top-up: whatever credits you buy plus ~5% in payment-processing fees (Stripe's standard rate for cards). Worked example: buy $100 of credits → Stripe charges $105 → wallet credited $100 → $100 of provider usage debits $105 from wallet over time.
Is there a free tier?
Every new account gets $5 in free credits redeemable on a curated set of edge-tier models (Kimi K2.6 for chat, BGE-M3 for embeddings, FLUX-2 Klein 9B for image, Gemma 4 vision, Aura 2 TTS, Whisper Turbo STT, Llama Guard for moderation). The free credit expires 7 days after signup. Top up once and the full catalog — Claude Opus 4.7, GPT-5.4, Gemini 3.1 Pro, FLUX 2, Veo 3.1, everything — unlocks. Topups never expire.
What counts as a 'successful run'?
A request that returned a usable model response. Fallbacks that succeed bill once, at the model that actually answered. Failed runs and timeouts don't charge.
How do cache hits price?
Exact-match and semantic hits return in under 10ms and get a flat 50% discount on the uncached price. Cache is on by default; override per-request with the x-cache header.
Can I bring my own provider keys?
Yes, free on every paid account. Route through your Anthropic / OpenAI / Google / etc. key and pay zero per-request fees on that traffic — your existing volume discounts apply.
Sub-accounts and per-user keys?
Yes — one API call mints a scoped key with its own spend cap, rate limit, default tag, and analytics. Ideal for marketplaces or multi-tenant apps that want to meter end users.
How do STT, TTS, embeddings, video, image price?
Every modality uses the same math: underlying model rate plus 5%. Whisper, ElevenLabs, BGE, Flux, Veo — all one simple line item.
Do credits expire?
Topup credits never expire — once you pay, the balance stays. The $5 signup credit is the exception: it expires 7 days after signup. Refunds on unused topup balances within 90 days, no questions.
What's in Enterprise?
Evals, guardrails, replay + shadow A/B, and versioned prompt IDs — the gateway primitives. Plus SSO, 99.95% SLA, DPA, SOC 2, private audit export, dedicated endpoint, and a named engineer on Slack Connect. Details on /enterprise.

* Applicable only on selected models — see the model list for details. Free credit expires 7 days after signup; topups never expire.