ByteDance's next-generation video model with a unified multimodal architecture. Generates high-quality video with synchronized audio from text, images, video clips, and audio inputs. Supports multimodal references (up to 9 images, 3 videos, 3 audio files), native audio generation, video editing, video extension, intelligent duration, and adaptive aspect ratio.
Bytedance Seedance 2.0 (Text To Video) (bytedance/seedance-2.0) is a video model from ByteDance, released 2026-05-17. Pricing via AIgateway: $0.014 per second. Call it via https://api.aigateway.sh/v1/videos/generations — set model="bytedance/seedance-2.0". Best for: Ads, Storyboards, Demos.
curl https://api.aigateway.sh/v1/videos/generations \
-H "Authorization: Bearer $AIGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"bytedance/seedance-2.0","prompt":"a slow drone shot over a mountain lake"}'{
"model": "bytedance/seedance-2.0",
"prompt": "A drone shot of a mountain lake at golden hour",
"duration": 5, // seconds
"aspect_ratio": "16:9"
}
// Response is an async job — poll /v1/jobs/<id> until status === "completed".{
"id": "job_abc123",
"status": "queued", // queued | processing | completed | failed
"model": "bytedance/seedance-2.0",
"created": 1776947082
}
// After completion:
{
"id": "job_abc123",
"status": "completed",
"result": {
"url": "https://media.aigateway.sh/video/abc123.mp4",
"duration": 5,
"resolution": "1920x1080"
}
}# See docs at https://aigateway.sh/docs