
Qwen 3 TTS - Clone Voice [1.7B]

music

Clone your voice with the Qwen3-TTS Clone-Voice model, which supports zero-shot cloning, and use the cloned voice with text-to-speech models to generate speech in your own voice.

Qwen 3 TTS - Clone Voice [1.7B] (alibaba/qwen-3-tts-clone-voice-1.7b) is a model from Alibaba that AIgateway lists in its music category. Pricing via AIgateway: $0.0008 per second. Call it via https://api.aigateway.sh/v1/audio/music and set model="alibaba/qwen-3-tts-clone-voice-1.7b". Best for: background tracks, game scores, ad jingles.

slug · alibaba/qwen-3-tts-clone-voice-1.7b
provider · Alibaba

Use this model

model: alibaba/qwen-3-tts-clone-voice-1.7b

curl https://api.aigateway.sh/v1/audio/music \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"alibaba/qwen-3-tts-clone-voice-1.7b","prompt":"lo-fi piano with light rain","duration":15}'

POST /v1/audio/music · model: "alibaba/qwen-3-tts-clone-voice-1.7b"

Capabilities

Strengths

Full-track generation
Lyric-aware composition

Use cases

Background tracks
Game scores
Ad jingles

Pricing

Per second · $0.0008

You pay pass-through pricing. A 5% platform fee is applied at credit top-up, not per call.
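As a worked example of the arithmetic, the sketch below multiplies a duration by the listed rate of $0.0008 per second and computes 5% of a top-up amount. The function names are hypothetical, and the page does not say whether the 5% is added on top of a top-up or deducted from it, so the second function returns only the fee amount.

```python
from decimal import Decimal

RATE_PER_SECOND = Decimal("0.0008")   # listed rate, USD per second
TOP_UP_FEE_RATE = Decimal("0.05")     # 5% platform fee, charged at top-up


def usage_cost(seconds):
    """Pass-through cost in USD for a clip of the given length in seconds."""
    return RATE_PER_SECOND * Decimal(seconds)


def top_up_fee(amount_usd):
    """Fee amount in USD for one credit top-up (how it is charged is an assumption)."""
    return Decimal(amount_usd) * TOP_UP_FEE_RATE


if __name__ == "__main__":
    print(usage_cost(30))     # cost of a 30-second clip
    print(top_up_fee("20"))   # fee on a 20 USD top-up
```

Decimal is used instead of float so that small per-second amounts do not pick up binary rounding noise.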


API schema

Call Qwen 3 TTS - Clone Voice [1.7B] from any OpenAI SDK

POST https://api.aigateway.sh/v1/audio/music · Content-Type: application/json · Auth: Bearer sk-aig-...

Request body

json

{
  "model": "alibaba/qwen-3-tts-clone-voice-1.7b",
  "prompt": "Lo-fi hip-hop with vinyl crackle and warm Rhodes piano",
  "lyrics": "Optional — leave empty for instrumental",
  "duration": 30,         // seconds
  "format": "mp3"         // mp3 | wav
}

// Response is an async job — poll /v1/jobs/<id> until status === "completed".

Response

json

{
  "id": "job_abc123",
  "status": "queued",
  "model": "alibaba/qwen-3-tts-clone-voice-1.7b",
  "created": 1776947082
}

// After completion:
{
  "id": "job_abc123",
  "status": "completed",
  "result": {
    "url": "https://media.aigateway.sh/music/abc123.mp3",
    "duration": 30
  }
}
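The polling step described above can be sketched as follows. This is a minimal illustration, not the gateway's official client: it assumes that /v1/jobs/<id> is fetched with a GET request using the same bearer token, and the "failed" status is a hypothetical placeholder, since the page only documents "queued" and "completed". The names fetch_job and wait_for_job are made up for this example.

```python
import json
import time
import urllib.request

BASE_URL = "https://api.aigateway.sh/v1"


def fetch_job(job_id, api_key):
    """Fetch one job record. Assumes GET /v1/jobs/<id> with bearer auth."""
    req = urllib.request.Request(
        f"{BASE_URL}/jobs/{job_id}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return json.load(resp)


def wait_for_job(job_id, api_key, fetch=fetch_job, interval=2.0,
                 max_polls=150, failure_statuses=("failed",), sleep=time.sleep):
    """Poll until status is "completed" and return the job's "result" object.

    failure_statuses is hypothetical: the page does not list failure states.
    """
    for _ in range(max_polls):
        job = fetch(job_id, api_key)
        status = job.get("status")
        if status == "completed":
            return job["result"]
        if status in failure_statuses:
            raise RuntimeError(f"job {job_id} ended with status {status!r}")
        sleep(interval)  # any other status (e.g. "queued") means keep waiting
    raise TimeoutError(f"job {job_id} not completed after {max_polls} polls")
```

The fetch and sleep parameters are injectable so the loop can be exercised without network access; in normal use, wait_for_job("job_abc123", api_key) would return the object holding "url" and "duration".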

Quickstart

# See docs at https://aigateway.sh/docs

Errors

401 · authentication_error · Invalid or missing API key

402 · insufficient_credits · Wallet empty (PAYG only)

404 · not_found · Unknown model or endpoint

429 · rate_limit_error · Over per-minute limit; see Retry-After header

500 · server_error · Upstream provider failed (retryable)

503 · service_unavailable · Upstream saturated (retryable)
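One way to act on the error list above is a small helper that decides whether and how long to wait before retrying. This is a sketch under assumptions: it treats 429, 500, and 503 as retryable, reads Retry-After as a number of seconds (HTTP also allows a date form, which is skipped here), and otherwise falls back to capped exponential backoff. The function name and the backoff constants are illustrative, not taken from the gateway's documentation.

```python
RETRYABLE_STATUSES = {429, 500, 503}


def retry_delay(status, headers, attempt, base=1.0, cap=30.0):
    """Return seconds to wait before retrying, or None if the error is not retryable.

    status:  HTTP status code of the failed response
    headers: mapping of response headers (exact key "Retry-After" is looked up)
    attempt: zero-based count of retries already made
    """
    if status not in RETRYABLE_STATUSES:
        return None  # 401, 402, 404: retrying will not help
    retry_after = headers.get("Retry-After")
    if retry_after is not None:
        try:
            return max(0.0, float(retry_after))
        except ValueError:
            pass  # HTTP-date form; fall through to backoff
    return min(cap, base * 2 ** attempt)
```

A caller would sleep for the returned number of seconds and resend the request, giving up after a fixed number of attempts.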


Frequently asked questions

What is Qwen 3 TTS - Clone Voice [1.7B]?

It clones your voice with the Qwen3-TTS Clone-Voice model, which supports zero-shot cloning, so that text-to-speech models can generate speech in your own voice. It is a model from Alibaba, listed in AIgateway's music category and accessible via AIgateway's OpenAI-compatible API at slug alibaba/qwen-3-tts-clone-voice-1.7b.

How much does Qwen 3 TTS - Clone Voice [1.7B] cost via AIgateway?

Pass-through pricing plus a 5% platform fee applied at top-up. See the pricing panel on this page for exact rates.

How do I call Qwen 3 TTS - Clone Voice [1.7B] from my code?

Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "alibaba/qwen-3-tts-clone-voice-1.7b". The request and response shapes match OpenAI exactly.
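The page does not show which SDK method maps to /v1/audio/music, so the sketch below mirrors the curl example using only the Python standard library. The endpoint, headers, and body fields come from this page; the function names build_request and submit are hypothetical.

```python
import json
import urllib.request

API_URL = "https://api.aigateway.sh/v1/audio/music"
MODEL = "alibaba/qwen-3-tts-clone-voice-1.7b"


def build_request(api_key, prompt, duration, audio_format="mp3"):
    """Build (but do not send) the POST request shown in the curl example."""
    payload = {
        "model": MODEL,
        "prompt": prompt,
        "duration": duration,      # seconds
        "format": audio_format,    # "mp3" or "wav"
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def submit(request):
    """Send the request and return the decoded JSON job record."""
    with urllib.request.urlopen(request, timeout=30) as resp:
        return json.load(resp)
```

Calling submit(build_request(key, "lo-fi piano with light rain", 15)) should return the queued job record described in the API schema section, assuming the key is valid and has credit.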

Does Qwen 3 TTS - Clone Voice [1.7B] support streaming, tool calling, vision, and JSON mode?

Streaming — no. Tool calling — no. Vision — no. JSON mode — no. Prompt caching — no.

What are the best use cases for Qwen 3 TTS - Clone Voice [1.7B]?

Background tracks, game scores, and ad jingles. Key strengths: full-track generation and lyric-aware composition.

Can I bring my own Alibaba API key (BYOK)?

Yes. Attach an Alibaba key in your AIgateway dashboard and this model flips to pass-through: you pay Alibaba directly and AIgateway waives the 5% platform fee on those calls.