WAN 2.2

Alibaba's next-generation video generation model featuring 27B parameters with 14B active per step, Mixture-of-Experts architecture, and enhanced visual quality. Achieves breakthrough performance in cinematic video generation with improved motion control, faster generation speeds, and extended training data.

Experience WAN 2.2

Generate professional-quality videos with cinematic motion and enhanced details using Alibaba's most advanced AI video generation model

WAN v2.2 5B: Text to Video
WAN v2.2 A14B: Text to Video
WAN v2.2 A14B I2V: Image to Video

What Is WAN 2.2?

Alibaba's breakthrough Mixture-of-Experts video generation model with revolutionary architecture and cinematic quality

WAN 2.2 is a major step forward in AI video generation, introducing a Mixture-of-Experts (MoE) architecture with 27B total parameters, of which 14B are active per denoising step. Trained on 65.6% more images and 83.2% more videos than its predecessor, WAN 2.1, it delivers improved cinematic quality, motion fidelity, and generation efficiency.

Key Highlights

Revolutionary Mixture-of-Experts Architecture

Features a dual-expert system with specialized high-noise and low-noise experts. Of the model's 27B total parameters, only 14B are active per step, which improves computational efficiency without giving up the capacity of the full model.
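
As a rough illustration of the dual-expert idea above, the sketch below routes each denoising step to one of two experts based on the current noise level, so only one expert is active per step. The `boundary` value, the linear noise schedule, and all function names are assumptions made for illustration; they are not taken from the WAN 2.2 release.

```python
from typing import List


def select_expert(noise_level: float, boundary: float = 0.5) -> str:
    """Pick the expert for one step; `boundary` is a hypothetical switch point."""
    if not 0.0 <= noise_level <= 1.0:
        raise ValueError("noise_level must be in [0, 1]")
    return "high_noise" if noise_level >= boundary else "low_noise"


def routing_plan(num_steps: int, boundary: float = 0.5) -> List[str]:
    """Assume noise falls linearly from 1.0 to 0.0 across the denoising steps."""
    if num_steps < 2:
        raise ValueError("num_steps must be at least 2")
    levels = [1.0 - i / (num_steps - 1) for i in range(num_steps)]
    return [select_expert(level, boundary) for level in levels]


plan = routing_plan(5)
print(plan)
```

In this sketch the early, noisier steps go to the high-noise expert and the later steps go to the low-noise expert; a real implementation would choose the switch point from its own noise schedule.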

Expanded Training Dataset

Trained on a massively expanded dataset with 65.6% more images and 83.2% more videos, including aesthetic data with detailed labels for lighting, composition, and color grading to achieve cinematic-quality outputs.

Enhanced Motion Control & Physics

Delivers exceptional motion consistency and realistic physics simulation with smooth object interactions, complex body movements, and fluid camera motions that accurately reflect real-world dynamics.

Consumer GPU Accessibility

Optimized for consumer hardware with efficient inference on NVIDIA RTX 4090, making professional-grade video generation accessible to creators and researchers with standard gaming equipment.

Technical Specifications

Duration: Up to 5 seconds (129-257 frames)
Resolution: 480p, 720p
Aspect Ratio: 16:9, 9:16, 1:1, 4:3, 3:4
Frame Rate: 8-30 FPS (adjustable)
Audio:
Input Types: Text prompts, Images
Max Prompt Length: 512 tokens
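
The specifications listed above can be turned into a simple pre-submission check. The following is a minimal sketch, not an official client: the `VideoRequest` class, its field names, and its default values are hypothetical, and counting whitespace-separated words only approximates the model's real tokenizer.

```python
from dataclasses import dataclass
from typing import List

# Limits copied from the specifications above.
RESOLUTIONS = {"480p", "720p"}
ASPECT_RATIOS = {"16:9", "9:16", "1:1", "4:3", "3:4"}
FPS_RANGE = (8, 30)
MAX_PROMPT_TOKENS = 512


@dataclass
class VideoRequest:
    """Hypothetical request shape; field names and defaults are illustrative only."""
    prompt: str
    resolution: str = "720p"
    aspect_ratio: str = "16:9"
    fps: int = 24


def validate(req: VideoRequest) -> List[str]:
    """Return a list of human-readable problems; an empty list means valid."""
    problems = []
    if req.resolution not in RESOLUTIONS:
        problems.append(f"unsupported resolution: {req.resolution}")
    if req.aspect_ratio not in ASPECT_RATIOS:
        problems.append(f"unsupported aspect ratio: {req.aspect_ratio}")
    if not FPS_RANGE[0] <= req.fps <= FPS_RANGE[1]:
        problems.append(f"fps must be between {FPS_RANGE[0]} and {FPS_RANGE[1]}")
    # Whitespace splitting is only a rough stand-in for real tokenization.
    if len(req.prompt.split()) > MAX_PROMPT_TOKENS:
        problems.append("prompt is likely longer than 512 tokens")
    return problems


ok = validate(VideoRequest(prompt="a red fox running through snow"))
bad = validate(VideoRequest(prompt="a red fox", resolution="1080p", fps=60))
print(ok, bad)
```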

WAN 2.2's Advanced Features

Explore the cutting-edge capabilities that make WAN 2.2 the most powerful AI video generation model, featuring Mixture-of-Experts architecture and unprecedented cinematic quality

Mixture-of-Experts Architecture

Revolutionary dual-expert system with 27B total parameters and 14B active per step, utilizing specialized high-noise and low-noise experts for optimal computational efficiency and superior video quality.

Massively Expanded Training Data

Trained on 65.6% more images and 83.2% more videos compared to WAN 2.1, including aesthetic data with detailed labels for lighting, composition, and color grading to achieve cinematic outputs.

Enhanced Motion Consistency

Delivers exceptional temporal coherence with smooth object interactions, complex body movements, and fluid camera motions that accurately simulate real-world physics and dynamics.

Multi-Resolution Support

Supports both 480p and 720p video generation with configurable aspect ratios including 16:9, 9:16, 1:1, 4:3, and 3:4 for diverse creative applications and platform requirements.

Flexible Frame Control

Generate videos with 65-257 frames (up to 5 seconds) at 8-30 FPS, providing precise control over video duration and temporal dynamics for various creative needs.
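
Frame count, frame rate, and duration are tied together by duration = frames / FPS. The sketch below checks a combination against the 5-second limit stated above; the function names are illustrative assumptions. Some combinations inside the stated ranges exceed five seconds (257 frames at 30 FPS is roughly 8.6 seconds), so a check like this may be useful before submitting a task.

```python
MAX_SECONDS = 5.0  # duration limit stated in the text


def duration_seconds(frames: int, fps: int) -> float:
    """Clip length in seconds for a given frame count and frame rate."""
    if frames <= 0 or fps <= 0:
        raise ValueError("frames and fps must be positive")
    return frames / fps


def fits_limit(frames: int, fps: int, max_seconds: float = MAX_SECONDS) -> bool:
    """True when the clip is no longer than the duration limit."""
    return duration_seconds(frames, fps) <= max_seconds


print(duration_seconds(120, 24))
print(fits_limit(65, 16), fits_limit(257, 30))
```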

Advanced Text-to-Video Generation

Transform detailed text prompts into high-quality videos with superior understanding of complex descriptions, action sequences, and scene compositions through enhanced language processing.

Premium Image-to-Video Conversion

Convert static images into dynamic videos with the A14B I2V model, maintaining visual consistency while adding realistic motion and temporal depth to still imagery.

Consumer GPU Optimization

Runs efficiently on NVIDIA RTX 4090 and similar consumer hardware, making professional-grade video generation accessible to creators without enterprise-level equipment.

High-Quality Cinematic Output

Produces cinema-quality videos with enhanced visual fidelity, realistic lighting, professional color grading, and natural motion blur that rivals traditional video production.

Intelligent Prompt Processing

Advanced prompt expansion and safety checking capabilities ensure optimal results while maintaining content appropriateness and creative intent through sophisticated language understanding.

Recallable Task System

Built-in task recall functionality allows retrieval of pending or processing video generations, providing seamless workflow continuity and efficient resource management.

Open Source Accessibility

Released under Apache 2.0 license with full model weights and inference code available, enabling researchers and developers to build upon the technology for innovative applications.

WAN 2.2 Frequently Asked Questions

Find answers to the most common questions about WAN 2.2's Mixture-of-Experts architecture, capabilities, and usage

WAN 2.2 uses a dual-expert system with 27B total parameters but only 14B active per denoising step. This includes specialized high-noise and low-noise experts that optimize computational efficiency while maintaining superior video quality. The architecture enables faster inference with better results compared to traditional single-model approaches.

WAN 2.2 generates videos up to 5 seconds (65-257 frames) at 480p and 720p resolutions with configurable aspect ratios (16:9, 9:16, 1:1, 4:3, 3:4). It supports 8-30 FPS frame rates and offers both 5B and A14B variants for text-to-video, plus a premium A14B model for image-to-video conversion with cinema-grade aesthetic controls.

WAN 2.2 uses English prompts and supports multiple prompt formulas: Basic (Subject + Scene + Motion), Advanced (adds Aesthetic Control + Stylization), and Image-to-Video (Motion Description + Camera Movement). Use specific cinematic terms like 'dolly in' and 'static shot', along with lighting descriptions and stylization keywords, for best results. The model supports up to 512 tokens per prompt.

WAN 2.2 includes built-in safety checkers and content filtering systems that can be enabled during generation. The model follows responsible AI practices with content appropriateness checks, though specific safety measures can be configured based on use case requirements. Users are responsible for ensuring ethical content generation under the Apache 2.0 license terms.

WAN 2.2 offers three main variants: 5B text-to-video (efficient, consumer-friendly), A14B text-to-video (premium quality, superior motion control), and A14B image-to-video (specialized for image animation with enhanced visual consistency). The A14B models provide better cinematic quality and motion fidelity but require more computational resources.

WAN 2.2 requires substantial computational resources, with the A14B models recommended for systems with 80GB+ VRAM for optimal performance. Consumer GPUs like the RTX 4090 can run the models, but with longer generation times. Current limitations include a maximum 5-second video duration, English-only prompt support, and dependence on high-quality input prompts for best results.
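
The prompt formulas mentioned in the answers above (Basic, Advanced, and Image-to-Video) can be sketched as small helper functions. This is an illustrative assumption about how the parts might be combined: joining them with commas, and the function names themselves, are not specified by WAN 2.2.

```python
def _join(*parts: str) -> str:
    """Join non-empty prompt parts with commas (the separator is an assumption)."""
    return ", ".join(p.strip() for p in parts if p and p.strip())


def basic_prompt(subject: str, scene: str, motion: str) -> str:
    """Basic formula: Subject + Scene + Motion."""
    return _join(subject, scene, motion)


def advanced_prompt(subject: str, scene: str, motion: str,
                    aesthetic: str, style: str) -> str:
    """Advanced formula: Basic plus Aesthetic Control + Stylization."""
    return _join(subject, scene, motion, aesthetic, style)


def i2v_prompt(motion_description: str, camera_movement: str) -> str:
    """Image-to-Video formula: Motion Description + Camera Movement."""
    return _join(motion_description, camera_movement)


print(basic_prompt("a red fox", "snowy forest at dawn", "running toward the camera"))
print(advanced_prompt("a red fox", "snowy forest at dawn", "running toward the camera",
                      "soft backlight", "film grain"))
print(i2v_prompt("leaves drift slowly", "dolly in"))
```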

How to Use WAN 2.2 for Text-to-Video Generation

Master the art of creating professional-quality videos from text prompts using WAN 2.2's advanced Mixture-of-Experts architecture and cinematic controls

Step 1: Craft Your Text Prompt

Step 2: Configure Video Parameters

Step 3: Generate and Refine

How to Use WAN 2.2 Image-to-Video

Learn how to transform static images into cinematic videos using WAN 2.2's advanced image-to-video generation capabilities

Step 1: Upload Your Image

Step 2: Write Motion Description

Step 3: Generate & Download

Pricing

Choose the plan that's right for you. No hidden fees, no surprises.

Pro (Popular)

Elevate your AI experience

Price (1 Month): 29.99 / 15 USD
Points (1 Month): 800
Videos (1 Month): Up to 80
Images (1 Month): Up to 800
Parallel Tasks: 3
Multi-Model Support
Text to Video
Image to Video
Video to Video
Consistent Character
AI Animation Generator
Templates & Effects
AI Video Enhancers
Interactive Community
Faster Generation Speed
No-watermark Outputs
More Camera Movement
Private Video Visibility
Copy Protection
Priority Support

Max

Unlock more advanced features

Price (1 Month): 99.99 / 50 USD
Points (1 Month): 2800
Videos (1 Month): Up to 280
Images (1 Month): Up to 2800
Parallel Tasks: 3
Multi-Model Support
Text to Video
Image to Video
Video to Video
Consistent Character
AI Animation Generator
Templates & Effects
AI Video Enhancers
Interactive Community
Faster Generation Speed
No-watermark Outputs
More Camera Movement
Private Video Visibility
Copy Protection
Priority Support

Ultra

Powerful support for your team

Price (1 Month): 499.99 / 250 USD
Points (1 Month): 16000
Videos (1 Month): Up to 1600
Images (1 Month): Up to 16000
Parallel Tasks: 3
Multi-Model Support
Text to Video
Image to Video
Video to Video
Consistent Character
AI Animation Generator
Templates & Effects
AI Video Enhancers
Interactive Community
Faster Generation Speed
No-watermark Outputs
More Camera Movement
Private Video Visibility
Copy Protection
Priority Support
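
The plan figures above imply a points cost per item, which can be worked out as a quick arithmetic check. The sketch below assumes that the "up to" video and image counts come from spending the whole monthly points balance on one kind of output; actual credit consumption depends on the task parameters, so treat the results as rough lower bounds.

```python
# Monthly points and "up to" counts copied from the three plans above.
PLANS = {
    "Pro": {"points": 800, "videos": 80, "images": 800},
    "Max": {"points": 2800, "videos": 280, "images": 2800},
    "Ultra": {"points": 16000, "videos": 1600, "images": 16000},
}


def points_per_item(plan: str, kind: str) -> float:
    """Implied minimum points per video or image for a plan."""
    entry = PLANS[plan]
    return entry["points"] / entry[kind]


for name in PLANS:
    print(name, points_per_item(name, "videos"), points_per_item(name, "images"))
```

Under that assumption, every plan works out to the same implied rate per video and per image; the plans differ in total monthly points rather than in cost per item.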
WAN 2.2 - Advanced Mixture-of-Experts AI Video Generation | Dreamega AI