Vibevoice 0.5b (microsoft/vibevoice-0.5b) is a audio-tts model from Microsoft. Pricing via AIgateway: $20000.00 per 1K chars. Call it via https://api.aigateway.sh/v1/audio/speech — set model="microsoft/vibevoice-0.5b". Best for: Voiceovers, Audiobooks.
curl https://api.aigateway.sh/v1/audio/speech \
-H "Authorization: Bearer $AIGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"microsoft/vibevoice-0.5b","voice":"alloy","input":"Hello, world."}'{
"model": "microsoft/vibevoice-0.5b",
"input": "Hello from AIgateway.",
"voice": "alloy",
"format": "mp3", // mp3 | wav | flac | opus
"speed": 1.0
}// Binary audio stream in the requested format (mp3 by default). // Content-Type: audio/mpeg (or audio/wav, audio/flac, audio/opus) // Read the response body directly to a file.
from openai import OpenAI
client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...")
r = client.audio.speech.create(model="microsoft/vibevoice-0.5b", voice="alloy", input="Hello world.")
r.stream_to_file("out.mp3")