Whisper-Large-V3-Turbo

audio-stt

Compare

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation.

MODALITIES

audio

RELEASED

2024-05-22

Whisper-Large-V3-Turbo (openai/whisper-large-v3-turbo) is a audio-stt model from OpenAI, released 2024-05-22. Pricing via AIgateway: $0.0060 per minute. Capabilities: async. Call it via https://api.aigateway.sh/v1/audio/transcriptions — set model="openai/whisper-large-v3-turbo".

model · openai/whisper-large-v3-turbofamily · Whisper

Use this model

model: openai/whisper-large-v3-turbo

curl https://api.aigateway.sh/v1/audio/transcriptions \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY" \
  -F model="openai/whisper-large-v3-turbo" \
  -F file="@audio.mp3"

curl https://api.aigateway.sh/v1/audio/transcriptions \ -H "Authorization: Bearer $AIGATEWAY_API_KEY" \ -F model="openai/whisper-large-v3-turbo" \ -F file="@audio.mp3"

Async & streaming

Async transcription — submit and poll /v1/jobs/<id>, or have the result pushed to your webhook_url. Best for long files and batch pipelines.

# Submit (returns immediately with a job id)
curl -X POST https://api.aigateway.sh/v1/audio/transcriptions \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"openai/whisper-large-v3-turbo","audio_url":"https://example.com/audio.wav","async":true}'
# -> {"id":"<job_id>","status":"processing"}

# Poll for the transcript
curl https://api.aigateway.sh/v1/jobs/<job_id> \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY"

# ...or skip polling: pass "webhook_url" and we POST the signed result when ready
#   {"model":"openai/whisper-large-v3-turbo","audio_url":"...","webhook_url":"https://you.example.com/hook"}

Capabilities

Async

Pricing

Per minute$0.0060

You pay pass-through pricing.

See API example →Compare API reference See usage ranking →

Collections

More audio models →More from OpenAI →Frontier models →Free-tier models →

API schema

Call Whisper-Large-V3-Turbo from any OpenAI SDK

POST https://api.aigateway.sh/v1/audio/transcriptions·Content-Type: multipart/form-data·Auth: Bearer sk-aig-...

Request body

json

# multipart/form-data — use curl -F or SDK file upload
model="openai/whisper-large-v3-turbo"
file=@audio.mp3
response_format=json    # or "verbose_json", "text", "srt", "vtt"
language=en             # optional

Response

json

{
  "text": "Hello from AIgateway.",
  "language": "en",
  "duration": 1.82
}

Quickstart

from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")

with open("audio.mp3", "rb") as f:
    r = client.audio.transcriptions.create(model="openai/whisper-large-v3-turbo", file=f)
print(r.text)

Errors

401authentication_errorInvalid or missing API key

402insufficient_creditsWallet empty (PAYG only)

404not_foundUnknown model or endpoint

429rate_limit_errorOver per-minute limit — see Retry-After header

500server_errorUpstream provider failed (retryable)

503service_unavailableUpstream saturated (retryable)

Full docs →API reference →OpenAPI spec →llms.txt →

Frequently asked questions

What is Whisper-Large-V3-Turbo?

Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. It is a audio-stt model from OpenAI, accessible via AIgateway's OpenAI-compatible API at slug openai/whisper-large-v3-turbo.

How much does Whisper-Large-V3-Turbo cost via AIgateway?

$0.0060 per minute of audio, billed pass-through.

How do I call Whisper-Large-V3-Turbo from my code?

Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "openai/whisper-large-v3-turbo". The request and response shapes match OpenAI exactly.

Does Whisper-Large-V3-Turbo support streaming, tool calling, vision, and JSON mode?

Streaming — no. Tool calling — no. Vision — no. JSON mode — no. Prompt caching — no.

Can I bring my own OpenAI API key (BYOK)?

Yes. Attach a OpenAI key in your AIgateway dashboard and this model flips to pass-through — you pay OpenAI directly and AIgateway adds no platform fee on those calls.