Flux

audio-stt

Flux is the first conversational speech recognition model built specifically for voice agents.

MODALITIES

audio

RELEASED

2025-09-29

Flux (deepgram/flux) is an audio-stt model from Deepgram, released 2025-09-29. Pricing via AIgateway: $0.0077 per minute. Capabilities: streaming. Call it via https://api.aigateway.sh/v1/audio/transcriptions and set model="deepgram/flux".

model · deepgram/flux

family · FLUX

Use this model

model: deepgram/flux

curl https://api.aigateway.sh/v1/audio/transcriptions \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY" \
  -F model="deepgram/flux" \
  -F file="@audio.mp3"

Async & streaming

Realtime streaming — open a WebSocket, stream audio frames, and receive interim + final transcripts live. Billed per minute at the websocket rate (higher than batch).

// Realtime WebSocket. Browsers pass the key as ?api_key=
// AIGATEWAY_API_KEY must be defined before this runs.
const ws = new WebSocket(
  "wss://api.aigateway.sh/v1/realtime?model=deepgram/flux&encoding=linear16&sample_rate=16000&interim_results=true&api_key=" + encodeURIComponent(AIGATEWAY_API_KEY),
);
ws.onmessage = (e) => {
  const msg = JSON.parse(e.data);
  if (msg.type === "Results") {
    console.log(msg.channel.alternatives[0].transcript, msg.is_final);
  }
};
// send() throws while the socket is still connecting, so wait for onopen.
ws.onopen = () => {
  // stream raw audio frames (linear16 PCM @ 16 kHz):
  // ws.send(pcmChunk)
  // ...then end the stream:
  ws.send(JSON.stringify({ type: "CloseStream" }));
};
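
For non-browser clients, the same connection parameters and audio framing can be prepared ahead of time. The following is a minimal Python sketch, not an official client: the helper names `realtime_url` and `pcm_frames` are hypothetical, and the 20 ms frame duration and mono audio are assumptions, since the page only specifies linear16 PCM at 16 kHz.

```python
from urllib.parse import urlencode

SAMPLE_RATE = 16_000   # Hz, matches sample_rate=16000 in the URL above
BYTES_PER_SAMPLE = 2   # linear16 is 16-bit PCM; mono audio is assumed


def realtime_url(api_key: str, model: str = "deepgram/flux") -> str:
    """Build the realtime WebSocket URL with the query parameters shown above."""
    params = {
        "model": model,
        "encoding": "linear16",
        "sample_rate": SAMPLE_RATE,
        "interim_results": "true",
        "api_key": api_key,
    }
    # safe="/" keeps the slash in the model slug readable.
    return "wss://api.aigateway.sh/v1/realtime?" + urlencode(params, safe="/")


def pcm_frames(pcm: bytes, frame_ms: int = 20):
    """Yield fixed-duration chunks of raw PCM; the last chunk may be shorter."""
    if frame_ms <= 0:
        raise ValueError("frame_ms must be positive")
    frame_bytes = SAMPLE_RATE * BYTES_PER_SAMPLE * frame_ms // 1000
    for start in range(0, len(pcm), frame_bytes):
        yield pcm[start:start + frame_bytes]
```

Each chunk yielded by `pcm_frames` would be sent as one binary WebSocket message, followed by the `CloseStream` message once the audio is exhausted.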

Capabilities

Streaming

Pricing

Per minute: $0.0077

You pay pass-through pricing.
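
As a rough budgeting aid, the per-minute rate can be turned into a cost estimate. This sketch assumes billing is proportional to audio duration; the page does not state the billing granularity or rounding, and the realtime WebSocket rate is described as higher than the rate used here. The function name `estimate_cost` is hypothetical.

```python
from decimal import Decimal

PRICE_PER_MINUTE = Decimal("0.0077")  # USD, the per-minute rate listed above


def estimate_cost(duration_seconds: float) -> Decimal:
    """Estimate the USD cost of transcribing audio of the given length.

    Assumes proportional (per-second) billing, which is not confirmed here.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    # Round to a millionth of a dollar for display purposes.
    return (minutes * PRICE_PER_MINUTE).quantize(Decimal("0.000001"))
```

Under that assumption, ten minutes of audio (`estimate_cost(600)`) comes to $0.077 and one hour (`estimate_cost(3600)`) to $0.462.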

API schema

Call Flux from any OpenAI SDK

POST https://api.aigateway.sh/v1/audio/transcriptions · Content-Type: multipart/form-data · Auth: Bearer sk-aig-...

Request body

# multipart/form-data — use curl -F or SDK file upload
model="deepgram/flux"
file=@audio.mp3
response_format=json    # or "verbose_json", "text", "srt", "vtt"
language=en             # optional

Response

json

{
  "text": "Hello from AIgateway.",
  "language": "en",
  "duration": 1.82
}
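
When calling the endpoint without an SDK, the response body can be parsed into a small typed structure. This is a sketch only: the `Transcription` class and `parse_transcription` function are hypothetical names, and treating `language` and `duration` as optional is an assumption, since only one example response is shown.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class Transcription:
    text: str
    language: Optional[str] = None    # assumed optional
    duration: Optional[float] = None  # seconds; assumed optional


def parse_transcription(body: str) -> Transcription:
    """Parse a JSON transcription response shaped like the example above."""
    data = json.loads(body)
    return Transcription(
        text=data["text"],
        language=data.get("language"),
        duration=data.get("duration"),
    )
```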

Quickstart

from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")

with open("audio.mp3", "rb") as f:
    r = client.audio.transcriptions.create(model="deepgram/flux", file=f)
print(r.text)

Errors

401 · authentication_error · Invalid or missing API key

402 · insufficient_credits · Wallet empty (PAYG only)

404 · not_found · Unknown model or endpoint

429 · rate_limit_error · Over per-minute limit; see Retry-After header

500 · server_error · Upstream provider failed (retryable)

503 · service_unavailable · Upstream saturated (retryable)
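
The retryable statuses above (429, 500, 503) lend themselves to a small retry wrapper. The sketch below is one possible approach, not AIgateway's recommended client logic: `call_with_retry` and the `send` callable are hypothetical, the exponential backoff values are assumptions, and it assumes `Retry-After` carries a number of seconds (it falls back to backoff otherwise).

```python
import time
from typing import Callable, Mapping, Tuple

RETRYABLE = {429, 500, 503}  # statuses marked retryable or rate-limited above

Response = Tuple[int, Mapping[str, str], str]  # (status, headers, body)


def call_with_retry(
    send: Callable[[], Response],
    max_attempts: int = 4,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Response:
    """Call send() until it returns a non-retryable status or attempts run out."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, headers, body = send()
        if status not in RETRYABLE or attempt == max_attempts - 1:
            return status, headers, body
        backoff = base_delay * 2 ** attempt
        # Header lookup is case-sensitive for a plain dict.
        retry_after = headers.get("Retry-After")
        try:
            delay = float(retry_after) if retry_after is not None else backoff
        except ValueError:
            delay = backoff  # e.g. an HTTP-date value; fall back to backoff
        sleep(delay)
    raise AssertionError("unreachable")
```

Here `send` would wrap whatever HTTP client is in use and return the status code, response headers, and body; non-retryable errors such as 401, 402 and 404 are returned to the caller immediately.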

Frequently asked questions

What is Flux?

Flux is the first conversational speech recognition model built specifically for voice agents. It is an audio-stt model from Deepgram, accessible via AIgateway's OpenAI-compatible API at slug deepgram/flux.

How much does Flux cost via AIgateway?

$0.0077 per minute of audio, billed pass-through.

How do I call Flux from my code?

Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "deepgram/flux". The request and response shapes match OpenAI's audio transcriptions endpoint (/v1/audio/transcriptions).

Does Flux support streaming, tool calling, vision, and JSON mode?

Streaming — yes. Tool calling — no. Vision — no. JSON mode — no. Prompt caching — no.

Can I bring my own Deepgram API key (BYOK)?

Yes. Attach a Deepgram key in your AIgateway dashboard and this model flips to pass-through — you pay Deepgram directly and AIgateway adds no platform fee on those calls.