models/Google/Gemini 3.1 Flash TTS
Google

Gemini 3.1 Flash TTS

audio-tts
Compare

Newest audio model from Google introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation.

MODALITIES
audio
USAGE
161.8M
0% market share
RELEASED
2026-05-22

Gemini 3.1 Flash TTS (google/gemini-3.1-flash-tts) is a audio-tts model from Google, released 2026-05-22. Pricing via AIgateway: $0.0000 per 1K chars. Capabilities: tools, vision, json. Call it via https://api.aigateway.sh/v1/audio/speech — set model="google/gemini-3.1-flash-tts". Best for: Voiceovers, IVR, Audiobooks.

model · google/gemini-3.1-flash-ttsfamily · Gemini 3

Use this model

model: google/gemini-3.1-flash-tts
curl https://api.aigateway.sh/v1/audio/speech \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"google/gemini-3.1-flash-tts","voice":"alloy","input":"Hello, world."}'

Capabilities

Tool callingVisionJSON mode

Strengths

  • Natural speech synthesis

Use cases

VoiceoversIVRAudiobooks

Adoption

161.8M tokens
428.2K requests · 0% of tracked market volume
See the full leaderboard →
Aggregate usage across the open model ecosystem (as of 2026-05-22).

Pricing

Per 1K chars$0.0000
You pay pass-through · 5% applied at credit top-up, not per-call.
See API example →CompareAPI referenceSee usage ranking →

Collections

More audio models →More from GoogleFrontier models →Free-tier models →
API schema

Call Gemini 3.1 Flash TTS from any OpenAI SDK

POST https://api.aigateway.sh/v1/audio/speech·Content-Type: application/json·Auth: Bearer sk-aig-...

Request body

json
{
  "model": "google/gemini-3.1-flash-tts",
  "input": "Hello from AIgateway.",
  "voice": "alloy",
  "format": "mp3",        // mp3 | wav | flac | opus
  "speed": 1.0
}

Response

json
// Binary audio stream in the requested format (mp3 by default).
// Content-Type: audio/mpeg  (or audio/wav, audio/flac, audio/opus)
// Read the response body directly to a file.

Quickstart

from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")

r = client.audio.speech.create(model="google/gemini-3.1-flash-tts", voice="alloy", input="Hello world.")
r.stream_to_file("out.mp3")

Errors

401authentication_errorInvalid or missing API key
402insufficient_creditsWallet empty (PAYG only)
404not_foundUnknown model or endpoint
429rate_limit_errorOver per-minute limit — see Retry-After header
500server_errorUpstream provider failed (retryable)
503service_unavailableUpstream saturated (retryable)
Full docs →API reference →OpenAPI spec →llms.txt →

Frequently asked questions

What is Gemini 3.1 Flash TTS?
Newest audio model from Google introduces granular audio tags that give you precise control to direct AI speech for expressive audio generation. It is a audio-tts model from Google, accessible via AIgateway's OpenAI-compatible API at slug google/gemini-3.1-flash-tts.
How much does Gemini 3.1 Flash TTS cost via AIgateway?
Pass-through pricing plus a 5% platform fee applied at top-up. See the pricing panel on this page for exact rates.
How do I call Gemini 3.1 Flash TTS from my code?
Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "google/gemini-3.1-flash-tts". The request and response shapes match OpenAI exactly.
Does Gemini 3.1 Flash TTS support streaming, tool calling, vision, and JSON mode?
Streaming — no. Tool calling — yes. Vision — yes. JSON mode — yes. Prompt caching — no.
What are the best use cases for Gemini 3.1 Flash TTS?
Voiceovers, IVR, Audiobooks. Key strengths: Natural speech synthesis.
Can I bring my own Google API key (BYOK)?
Yes. Attach a Google key in your AIgateway dashboard and this model flips to pass-through — you pay Google directly and AIgateway waives the 5% platform fee on those calls.