OpenAI

GPT-4o Transcribe

audio-stt

A speech-to-text model that uses GPT-4o to transcribe audio, with a lower word error rate and better language recognition than the original Whisper models.

MODALITIES · audio
USAGE · 102.9M · 0% market share
RELEASED · 2026-05-22

GPT-4o Transcribe (openai/gpt-4o-transcribe) is an audio-stt model from OpenAI, released 2026-05-22. Pricing via AIgateway: $0.0060 per minute. Call it via https://api.aigateway.sh/v1/audio/transcriptions with model="openai/gpt-4o-transcribe". Best for: meeting transcripts, captions, and voice agents.

model · openai/gpt-4o-transcribe
family · GPT-4

Use this model

model: openai/gpt-4o-transcribe
curl https://api.aigateway.sh/v1/audio/transcriptions \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY" \
  -F model="openai/gpt-4o-transcribe" \
  -F file="@audio.mp3"

Capabilities

Strengths

  • Speech-to-text transcription

Use cases

  • Meeting transcripts
  • Captions
  • Voice agents

Adoption

102.9M tokens
90.6K requests · 0% of tracked market volume
Aggregate usage across the open model ecosystem (as of 2026-05-30).

Pricing

Per minute · $0.0060
You pay the provider's pass-through rate. A 5% platform fee is applied at credit top-up, not per call.
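
As a rough illustration of the per-minute arithmetic, the sketch below estimates the usage charge for a clip of a given length. It assumes billing is proportional to duration with no rounding to whole minutes, and that the 5% fee is added on top of the credit amount purchased. Neither detail is stated on this page, and the function names are hypothetical.

```python
from decimal import Decimal

PRICE_PER_MINUTE = Decimal("0.0060")  # USD per minute, from the pricing line above
TOP_UP_FEE_RATE = Decimal("0.05")     # 5% platform fee, applied at credit top-up


def usage_cost(duration_seconds: float) -> Decimal:
    """Estimated usage charge in USD (assumes no rounding to whole minutes)."""
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    return minutes * PRICE_PER_MINUTE


def top_up_charge(credit_amount: Decimal) -> Decimal:
    """Assumed amount paid to buy `credit_amount` of credit (fee added on top)."""
    return credit_amount * (Decimal(1) + TOP_UP_FEE_RATE)
```

Under these assumptions, usage_cost(3600) gives 0.36 USD for one hour of audio, and top_up_charge(Decimal("10")) gives 10.50 USD.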

API schema

Call GPT-4o Transcribe from any OpenAI SDK

POST https://api.aigateway.sh/v1/audio/transcriptions · Content-Type: multipart/form-data · Auth: Bearer sk-aig-...

Request body

multipart/form-data
# multipart/form-data — use curl -F or SDK file upload
model="openai/gpt-4o-transcribe"
file=@audio.mp3
response_format=json    # or "verbose_json", "text", "srt", "vtt"
language=en             # optional
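
The optional fields above map onto keyword arguments of the SDK call. Here is a minimal sketch, assuming the gateway accepts exactly the response_format values listed in the request body. build_params is a hypothetical helper, not part of any SDK.

```python
from typing import Optional

# Values listed in the request body above; treated here as the full set (an assumption).
RESPONSE_FORMATS = {"json", "verbose_json", "text", "srt", "vtt"}


def build_params(response_format: str = "json", language: Optional[str] = None) -> dict:
    """Build keyword arguments for a transcription request (hypothetical helper)."""
    if response_format not in RESPONSE_FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    params = {"model": "openai/gpt-4o-transcribe", "response_format": response_format}
    if language is not None:
        params["language"] = language  # optional, e.g. "en"
    return params


# Usage with the OpenAI SDK client shown in the Quickstart (not executed here):
#   with open("audio.mp3", "rb") as f:
#       r = client.audio.transcriptions.create(file=f, **build_params(language="en"))
```

Validating the format locally is a design choice: a typo fails before any audio is uploaded.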

Response

json
{
  "text": "Hello from AIgateway.",
  "language": "en",
  "duration": 1.82
}
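
If you call the endpoint without an SDK, the response body can be parsed into a small typed record. The sketch below assumes that only text is always present and that language and duration may be missing for some response formats. The Transcription class and parse_transcription function are illustrative names, not part of the API.

```python
import json
from dataclasses import dataclass
from typing import Optional


@dataclass
class Transcription:
    text: str
    language: Optional[str] = None    # assumed optional
    duration: Optional[float] = None  # assumed to be seconds of audio


def parse_transcription(body: str) -> Transcription:
    """Parse a JSON response body shaped like the sample above."""
    data = json.loads(body)
    return Transcription(
        text=data["text"],
        language=data.get("language"),
        duration=data.get("duration"),
    )


sample = '{"text": "Hello from AIgateway.", "language": "en", "duration": 1.82}'
result = parse_transcription(sample)
```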

Quickstart

from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")

with open("audio.mp3", "rb") as f:
    r = client.audio.transcriptions.create(model="openai/gpt-4o-transcribe", file=f)
print(r.text)

Errors

401 · authentication_error · Invalid or missing API key
402 · insufficient_credits · Wallet empty (PAYG only)
404 · not_found · Unknown model or endpoint
429 · rate_limit_error · Over per-minute limit; see the Retry-After header
500 · server_error · Upstream provider failed (retryable)
503 · service_unavailable · Upstream saturated (retryable)
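
One way to act on the codes above is to retry only 429, 500, and 503, and to honor Retry-After when it is present. The sketch below is one possible policy, not the gateway's documented behavior. It assumes Retry-After carries a delay in seconds rather than an HTTP date, and the backoff defaults and the retry_delay name are illustrative.

```python
import random
from typing import Mapping, Optional

RETRYABLE = {429, 500, 503}  # rate-limited or marked retryable in the list above


def retry_delay(status: int, headers: Mapping[str, str], attempt: int,
                base: float = 1.0, cap: float = 30.0) -> Optional[float]:
    """Return seconds to wait before retrying, or None if the error is not retryable."""
    if status not in RETRYABLE:
        return None
    # Header names are case-insensitive, so normalize before the lookup.
    lowered = {k.lower(): v for k, v in headers.items()}
    retry_after = lowered.get("retry-after")
    if status == 429 and retry_after is not None:
        try:
            return max(0.0, float(retry_after))  # assumes seconds, not an HTTP date
        except ValueError:
            pass  # non-numeric header: fall through to backoff
    # Exponential backoff with full jitter; base and cap are illustrative defaults.
    return random.uniform(0, min(cap, base * 2 ** attempt))
```

Errors such as 401, 402, and 404 return None here because repeating the same request would not change the outcome.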

Frequently asked questions

What is GPT-4o Transcribe?
A speech-to-text model that uses GPT-4o to transcribe audio, with a lower word error rate and better language recognition than the original Whisper models. It is an audio-stt model from OpenAI, accessible via AIgateway's OpenAI-compatible API at slug openai/gpt-4o-transcribe.
How much does GPT-4o Transcribe cost via AIgateway?
$0.0060 per minute of audio. This is the provider's pass-through rate; a 5% platform fee is applied when you top up credits.
How do I call GPT-4o Transcribe from my code?
Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "openai/gpt-4o-transcribe". The request and response shapes match OpenAI exactly.
Does GPT-4o Transcribe support streaming, tool calling, vision, and JSON mode?
No. GPT-4o Transcribe does not support streaming, tool calling, vision, JSON mode, or prompt caching.
What are the best use cases for GPT-4o Transcribe?
Meeting transcripts, captions, and voice agents. Its key strength is speech-to-text transcription.
Can I bring my own OpenAI API key (BYOK)?
Yes. Attach an OpenAI key in your AIgateway dashboard and this model switches to pass-through: you pay OpenAI directly, and AIgateway waives the 5% platform fee on those calls.