Clone your voices using Qwen3-TTS Clone-Voice model with zero shot cloning capabilities and use it on text-to-speech models to create speeches of yours!
Qwen 3 TTS - Clone Voice [1.7B] (alibaba/qwen-3-tts-clone-voice-1.7b) is a music model from Alibaba. Pricing via AIgateway: $0.0008 per second. Call it via https://api.aigateway.sh/v1/audio/music — set model="alibaba/qwen-3-tts-clone-voice-1.7b". Best for: Background tracks, Game scores, Ad jingles.
curl https://api.aigateway.sh/v1/audio/music \
-H "Authorization: Bearer $AIGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"alibaba/qwen-3-tts-clone-voice-1.7b","prompt":"lo-fi piano with light rain","duration":15}'{
"model": "alibaba/qwen-3-tts-clone-voice-1.7b",
"prompt": "Lo-fi hip-hop with vinyl crackle and warm Rhodes piano",
"lyrics": "Optional — leave empty for instrumental",
"duration": 30, // seconds
"format": "mp3" // mp3 | wav
}
// Response is an async job — poll /v1/jobs/<id> until status === "completed".{
"id": "job_abc123",
"status": "queued",
"model": "alibaba/qwen-3-tts-clone-voice-1.7b",
"created": 1776947082
}
// After completion:
{
"id": "job_abc123",
"status": "completed",
"result": {
"url": "https://media.aigateway.sh/music/abc123.mp3",
"duration": 30
}
}# See docs at https://aigateway.sh/docs