Inworld's most powerful and expressive text-to-speech model. Builds on TTS 1.5 with rich expressive speech, real-time latency, natural language steering (e.g. [whisper], [say excitedly]), and stronger multilingual support across 15 production languages plus 90+ experimental languages.
Inworld TTS 2 (inworld/tts-2) is a audio-tts model from Inworld, released 2026-05-05. Pricing via AIgateway: $0.035 per 1K chars. Call it via https://api.aigateway.sh/v1/audio/speech — set model="inworld/tts-2". Best for: Voiceovers, IVR, Audiobooks.
curl https://api.aigateway.sh/v1/audio/speech \
-H "Authorization: Bearer $AIGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"inworld/tts-2","voice":"alloy","input":"Hello, world."}'{
"model": "inworld/tts-2",
"input": "Hello from AIgateway.",
"voice": "alloy",
"format": "mp3", // mp3 | wav | flac | opus
"speed": 1.0
}// Binary audio stream in the requested format (mp3 by default). // Content-Type: audio/mpeg (or audio/wav, audio/flac, audio/opus) // Read the response body directly to a file.
from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")
r = client.audio.speech.create(model="inworld/tts-2", voice="alloy", input="Hello world.")
r.stream_to_file("out.mp3")