Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
Multimodal AI video generator with native audio, multi-shot storytelling, and 8+ language lip-sync.
Seedance 2.0 is ByteDance's multimodal AI video generation model, available through the PoYo API. It allows users to create cinematic video from text, image, video, and audio inputs. The model supports native audio-video joint synthesis, multi-shot storytelling, and lip-sync across multiple languages, offering director-level control over camera motion, lighting, and physics.
Seedance 2.0 API pricing on PoYo is credit-based, charged per second of video generated:
Features include pay-per-second pricing, non-expiring credits, a cheaper "fast" mode for drafts, and discounted rates for 480p resolution and video input usage. Pricing starts as low as $0.04 per second.