Create custom voices using Qwen3-TTS Voice Design model and later use Clone Voice model to create your own voices!
Qwen 3 TTS - Voice Design [1.7B] (alibaba/qwen-3-tts-voice-design-1.7b) is a audio-tts model from Alibaba. Pricing via AIgateway: $90000.00 per 1K chars. Call it via https://api.aigateway.sh/v1/audio/speech — set model="alibaba/qwen-3-tts-voice-design-1.7b". Best for: Voiceovers, Audiobooks.
curl https://api.aigateway.sh/v1/audio/speech \
-H "Authorization: Bearer $AIGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"alibaba/qwen-3-tts-voice-design-1.7b","voice":"alloy","input":"Hello, world."}'{
"model": "alibaba/qwen-3-tts-voice-design-1.7b",
"input": "Hello from AIgateway.",
"voice": "alloy",
"format": "mp3", // mp3 | wav | flac | opus
"speed": 1.0
}// Binary audio stream in the requested format (mp3 by default). // Content-Type: audio/mpeg (or audio/wav, audio/flac, audio/opus) // Read the response body directly to a file.
from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")
r = client.audio.speech.create(model="alibaba/qwen-3-tts-voice-design-1.7b", voice="alloy", input="Hello world.")
r.stream_to_file("out.mp3")