Compare

Grok STT — and what?

Pricing per million tokens, context window, capabilities — pulled from each provider's public docs. All 1 are available via the same AIgateway OpenAI-compatible endpoint; flip the model string to switch.

Search1/4
Grok STT
xai/grok-stt
Provider
xAI
Family
Modality
audio-stt
Context window
Max output
Released
2026-06-03
License
Proprietary
Per minute
$0.0017 /min
Tools
Streaming
Vision
JSON mode
Reasoning
Prompt caching
Batch API
Try it
View model →
Grok STT
xai/grok-stt
Full spec →

xAI's Grok speech-to-text model. Transcribes audio files into text across 25 languages with word-level timestamps, multichannel transcription, speaker diarization, and key-term biasing.

Strengths
  • Speech-to-text transcription
Use cases
Meeting transcriptsCaptionsVoice agents