Wan 2.6 is officially released with major upgrades for professional video creation: roleplay-style generation, multi-shot storytelling, natural audio-visual sync, audio-driven video, and up to 15s duration.
Text-to-image and image generation are upgraded together: stronger style control, more realistic portraits, better layout for posters/infographics, and production-ready consistency for commercial assets.
Deeper understanding of art-style keywords, smoother style blending, more unified aesthetics, and richer texture/color/brushwork details.
More natural expressions and realistic skin/lighting with improved composition—reducing the “AI look” for portraits.
Generate posters, infographics, charts, and illustrated layouts from long-form Chinese or English text with better visual-text alignment.
Generate mixed text-and-image narratives with more coherent structure—great for storybooks, visual explanations, and storyboard-style content.
Combine, replace, or blend multiple reference images to fuse inspirations and create new creative outputs.
Keep characters/styles/elements consistent across variations—ideal for e-commerce, ads, IP characters, and series content production.
Wan 2.6 is the next-generation Wan video model upgraded for professional creation workflows. It introduces roleplay-style generation that can reference a character’s appearance and voice from an input video, enabling more believable and consistent performances.
With multi-shot storytelling, Wan 2.6 can expand a simple prompt into a storyboard and generate a coherent narrative across multiple shots—while keeping key identity and scene details consistent.
Wan 2.6 also improves natural audio-visual sync for more stable dialogue scenes and better music/song quality. It supports up to 15-second generation and can be driven by text plus audio input for expressive performances in more scenarios.
Wan 2.6 brings identity consistency, narrative structure, and audio-visual quality into one streamlined workflow—faster to create and easier to control.
Use clear character setup and camera language to generate production-ready short clips quickly.
Choose image-to-video or text-to-video (optionally audio-driven) depending on your target scene and workflow.
Specify character traits, scene, and camera language (shot type/movement/lighting), plus dialogue or narration tone and rhythm.
Generate and preview results, iterate quickly if needed, then download and use your final clip.
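The steps above can be sketched as a small prompt-assembly helper. This is only an illustration: `ClipRequest`, its field names, and `build_prompt` are hypothetical and are not part of any Wan 2.6 API. Only the two generation modes, the optional audio input, and the 15-second cap come from the text above.

```python
from dataclasses import dataclass
from typing import Optional

MAX_DURATION_S = 15  # upper bound stated in the text above
MODES = {"text-to-video", "image-to-video"}  # modes named in the text above


@dataclass
class ClipRequest:
    """Hypothetical container for one clip; not an official schema."""
    mode: str
    character: str                     # character traits
    scene: str                         # setting and action
    camera: str                        # shot type / movement / lighting
    dialogue: Optional[str] = None     # spoken line or narration
    tone: Optional[str] = None         # tone and rhythm of the delivery
    audio_path: Optional[str] = None   # optional audio-driven input
    duration_s: int = 10


def build_prompt(req: ClipRequest) -> str:
    """Validate the request and assemble a plain-text prompt."""
    if req.mode not in MODES:
        raise ValueError(f"unknown mode: {req.mode!r}")
    if not 1 <= req.duration_s <= MAX_DURATION_S:
        raise ValueError(f"duration must be 1..{MAX_DURATION_S} seconds")

    parts = [
        f"Character: {req.character}",
        f"Scene: {req.scene}",
        f"Camera: {req.camera}",
    ]
    if req.dialogue:
        line = f'Dialogue: "{req.dialogue}"'
        if req.tone:
            line += f" (tone: {req.tone})"
        parts.append(line)
    return "\n".join(parts)


if __name__ == "__main__":
    request = ClipRequest(
        mode="text-to-video",
        character="a courier in a yellow raincoat",
        scene="a rainy night market, neon reflections on wet pavement",
        camera="medium tracking shot, shallow depth of field",
        dialogue="I'm almost there.",
        tone="calm, slightly out of breath",
        duration_s=12,
    )
    print(build_prompt(request))
```

Keeping character, scene, and camera language as separate fields makes it easy to change one element between iterations while holding the others fixed.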
See how professionals across industries use Wan 2.6 (roleplay generation, multi-shot storytelling, natural audio-visual sync, and audio-driven workflows) to turn ideas into production-ready video faster.
Turn a synopsis or script into a multi-shot storyboard structure and generate a coherent sequence—great for pitching and rapid previsualization.
Reference a character’s look and voice from an input video to keep performances consistent across shots.
Generate dialogue, ambience, and music that stay naturally in sync with the visuals, reducing post-production alignment work.
Generating up to 15 seconds per clip allows richer pacing and more complete shot segments.
Try Wan 2.6 today: roleplay generation, multi-shot storytelling, audio sync, and up to 15 seconds per video—built for professional creation.