models/Microsoft/Vibevoice 0.5b

Vibevoice 0.5b

audio-tts

Vibevoice 0.5b (microsoft/vibevoice-0.5b) is a audio-tts model from Microsoft. Pricing via AIgateway: $20000.00 per 1K chars. Call it via https://api.aigateway.sh/v1/audio/speech — set model="microsoft/vibevoice-0.5b". Best for: Voiceovers, Audiobooks.

slug · microsoft/vibevoice-0.5bprovider · Microsoft

Use this model

model: microsoft/vibevoice-0.5b

curl https://api.aigateway.sh/v1/audio/speech \
  -H "Authorization: Bearer $AIGATEWAY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"microsoft/vibevoice-0.5b","voice":"alloy","input":"Hello, world."}'

curl https://api.aigateway.sh/v1/audio/speech \ -H "Authorization: Bearer $AIGATEWAY_API_KEY" \ -H "Content-Type: application/json" \ -d '{"model":"microsoft/vibevoice-0.5b","voice":"alloy","input":"Hello world."}'

Capabilities

Strengths

Natural speech synthesis

Use cases

VoiceoversAudiobooks

Pricing

Per 1K chars$20000.00

You pay pass-through · 5% applied at credit top-up, not per-call.

Try in playground →Compare API reference See usage ranking →

Collections

More audio models →More from Microsoft →Frontier models →Free-tier models →

API schema

Call Vibevoice 0.5b from any OpenAI SDK

POST https://api.aigateway.sh/v1/audio/speech·Content-Type: application/json·Auth: Bearer sk-aig-...

Request body

json

{
  "model": "microsoft/vibevoice-0.5b",
  "input": "Hello from AIgateway.",
  "voice": "alloy",
  "format": "mp3",        // mp3 | wav | flac | opus
  "speed": 1.0
}

Response

json

// Binary audio stream in the requested format (mp3 by default).
// Content-Type: audio/mpeg  (or audio/wav, audio/flac, audio/opus)
// Read the response body directly to a file.

Quickstart

from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")

r = client.audio.speech.create(model="microsoft/vibevoice-0.5b", voice="alloy", input="Hello world.")
r.stream_to_file("out.mp3")

Errors

401authentication_errorInvalid or missing API key

402insufficient_creditsWallet empty (PAYG only)

404not_foundUnknown model or endpoint

429rate_limit_errorOver per-minute limit — see Retry-After header

500server_errorUpstream provider failed (retryable)

503service_unavailableUpstream saturated (retryable)

Full docs →API reference →OpenAPI spec →llms.txt →

Frequently asked questions

What is Vibevoice 0.5b?

It is a audio-tts model from Microsoft, accessible via AIgateway's OpenAI-compatible API at slug microsoft/vibevoice-0.5b.

How much does Vibevoice 0.5b cost via AIgateway?

Pass-through pricing plus a 5% platform fee applied at top-up. See the pricing panel on this page for exact rates.

How do I call Vibevoice 0.5b from my code?

Point the OpenAI SDK at https://api.aigateway.sh/v1 with your AIgateway key and set model to "microsoft/vibevoice-0.5b". The request and response shapes match OpenAI exactly.

Does Vibevoice 0.5b support streaming, tool calling, vision, and JSON mode?

Streaming — no. Tool calling — no. Vision — no. JSON mode — no. Prompt caching — no.

What are the best use cases for Vibevoice 0.5b?

Voiceovers, Audiobooks. Key strengths: Natural speech synthesis.

Can I bring my own Microsoft API key (BYOK)?

Yes. Attach a Microsoft key in your AIgateway dashboard and this model flips to pass-through — you pay Microsoft directly and AIgateway waives the 5% platform fee on those calls.