Elevate your storytelling with 1080P visual fidelity and unified audio synthesis. Create videos up to 15 seconds with built-in sound, first and last frame control, and negative prompts โ powered by Alibaba's 27B MoE architecture.
Your generated video will appear here
The most advanced open-source video model with built-in audio generation and frame-level control.
Unlike models that generate video and audio separately, Wan 2.7 produces both in a single pass. Background music, ambient sound effects, and character dialogue are synthesized together for perfectly synchronized output.
Upload a starting image and optionally an ending image to precisely control your video's narrative arc. Perfect for product demos, scene transitions, and storytelling with guaranteed start and end points.
Powered by a 27-billion parameter Mixture-of-Experts architecture under Apache 2.0 license. Delivers exceptional motion quality, temporal consistency, and detail preservation across the full 15-second duration.
Create stunning videos with audio in three simple steps.
Write a detailed text prompt or upload a starting image. Optionally add a last frame image for controlled animations. Use negative prompts to exclude unwanted elements.
Select resolution (720p or 1080p), duration (2-15 seconds), and aspect ratio (16:9, 9:16, 4:3, 3:4, or 1:1). Audio is generated automatically.
Click generate and get a complete video with synchronized audio. Preview the result and download in your chosen resolution.
Wan 2.7 is Alibaba's latest flagship open-source video model with a 27 billion parameter Mixture-of-Experts (MoE) architecture. It generates HD 1080p videos up to 15 seconds with unified audio synthesis โ background music, ambient sound, and character vocals are generated alongside the visuals. It supports both text-to-video and image-to-video modes with first/last frame control, negative prompts, and automatic prompt expansion for better results.
Multi-modal AI video with reference inputs
Joint audio-video with multilingual lip-sync
Frame to frame control & multi-image reference
Cinematic quality, production-ready output
Cinematic videos with multi-shot control and native audio
Transfer motion from a reference video to any character
Exceptional audio-visual synchronization
High-quality videos with synchronized audio