Kimi K2.6 (moonshot/kimi-k2.6) is priced at $0.950/M input and $4.00/M output tokens through AIgateway. Cached input reads are $0.160/M tokens. 262,144-token context window. AIgateway charges pass-through on the underlying Moonshot rate plus a 5% platform fee applied at credit top-up — nothing per-call, no monthly minimum. Free tier available.
AIgateway bills at the same per-token rate Moonshot publishes. There's no inflated sticker price; the model-vendor's invoice and your gateway invoice agree on the token count and unit cost. Usage is aggregated to the nearest fraction of a cent in D1 and surfaced live in your dashboard.
The only margin on top is a 5% platform fee, applied at credit top-up — not per-call. A $100 top-up funds $95 of provider calls. Cache hits (when you use prompt caching or semantic cache) bill at 10% of the uncached cost, so long-context agent workloads often run net cheaper than calling Moonshot directly.
No subscription and no monthly minimum. Free tier gives you 100 requests/day on Kimi K2.6. Paid tier starts at a $5 top-up with no auto-renew. BYOK (bring your own Moonshot key) is supported on Enterprise if you already have a direct contract you want to preserve.
# pip install aigateway-py openai
# aigateway-py: sub-accounts, evals, replays, jobs, webhook verify.
# openai SDK: chat/embeddings/images/audio — drop-in compat per our SDK's own guidance.
from openai import OpenAI
client = OpenAI(
base_url="https://api.aigateway.sh/v1",
api_key="sk-aig-...",
)
r = client.chat.completions.create(
model="moonshot/kimi-k2.6",
messages=[{"role": "user", "content": "Explain vector databases in two sentences."}],
)
print(r.choices[0].message.content)// npm i aigateway-js openai
// aigateway-js: sub-accounts, evals, replays, jobs, webhook verify.
// openai SDK: chat/embeddings/images/audio — drop-in compat.
import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.aigateway.sh/v1",
apiKey: process.env.AIGATEWAY_KEY,
});
const r = await client.chat.completions.create({
model: "moonshot/kimi-k2.6",
messages: [{ role: "user", content: "Explain vector databases in two sentences." }],
});
console.log(r.choices[0].message.content);Input is $0.95/M tokens, output is $4/M tokens. A typical 1K-in / 500-out chat turn costs about $0.00295.
Yes — Kimi K2.6 is free on AIgateway through April 30, 2026 at 100 requests/day per account.
Yes. Cached input reads bill at $0.16/M. For long-context agent workloads this routinely cuts bills by 70%+.
Pass-through pricing means you automatically get whatever tier discount Moonshot publishes. AIgateway doesn't mark up; the 5% platform fee applies flat regardless of volume. Enterprise customers can negotiate committed-use discounts on top.
Every call returns usage.prompt_tokens and usage.completion_tokens in the response body, same as OpenAI. We also write the exact cost into x-aigateway-cost response header. Full audit log is in the dashboard.