Explore/muapi.ai/wan2.2-text-to-video

muapi/wan2.2-text-to-video

Text to Video

Wan 2.2’s T2V mode transforms descriptive text prompts into high-quality, stylized video sequences. It excels at generating anime-style or cinematic visuals with smooth motion and strong thematic consistency.

Result

🚀Related Models

View all

wan2.2-image-to-video

Wan 2.2’s I2V mode brings static visuals to life with vivid, expressive animations. It interprets motion, emotion, and background dynamics from a single image to generate smooth and cinematic short videos.

Image to Video

wan2.2-animate

Wan2.2 Animate is a video-to-video model for animating a character or replacing a character in existing video clips. It replicates holistic movement and facial expressions from a reference video or pose while preserving the target character’s appearance. You upload both an image (for the character) and a video containing motion/expression, and the model generates a video where the character in your image moves like the reference. Supports 480p or 720p, up to 120 seconds

Video to Video

wan2.2-edit-video

Easily modify existing videos using simple text commands. With Wan 2.2 Video-Edit, you can change attire, character appearance, or other visual elements directly within your video—no need to start from scratch. Works on uploads of 480p or 720p, for up to two minutes.

Video to Video

wan2.2-spicy-image-to-video

Wan2.2-spicy Image-to-Video transforms a single creative image into a short dynamic video with bold motion, stylized effects, high-contrast lighting, and energy-driven animations. The “spicy” variant produces more dramatic movement, more vivid colors, and more expressive visual effects.

Image to Video

wan2.2-5b-fast-t2v

Wan 2.2 Fast is a lightweight, high-speed version of the Wan 2.2 model, optimized for quick text-to-video generation. It trades some cinematic detail for rapid results, making it perfect for prototyping, previews, social media clips, and quick storytelling.

Text to Video

wan2.2-speech-to-video

WAN2.2 Speech-to-Video transforms a static image into a talking video by synchronizing lip movements and facial expressions with an audio input. Simply provide a character image along with a speech dialogue, and the model generates a natural, expressive video where the subject speaks your lines.

Audio to Video

wan2.2-spicy-video-extend

Wan-2.2-spicy Video Extend continues an existing video by generating new frames that match the original style but add stronger motion, bolder effects, and spicier dramatics.

Video to Video

📝

Overview

About this model

Wan 2.2 Text-to-Video is a cutting-edge model that transforms descriptive text prompts into high-quality, stylized video sequences. Leveraging advanced AI algorithms, this model excels in generating anime-style or cinematic visuals characterized by smooth motion and strong thematic consistency. Whether you are aiming to create dynamic storytelling visuals or visually compelling short clips, Wan 2.2 offers a powerful toolset that seamlessly bridges creative intent with technological execution.

Under the hood, Wan 2.2 utilizes state-of-the-art deep learning techniques to interpret and render detailed prompts into video content. The model's technical capabilities include customizable aspect ratios, resolutions, and quality settings, ensuring users can tailor outputs to specific project needs. With its efficient processing and affordable pricing at $0.3 per generation, Wan 2.2 is an excellent choice for content creators, marketers, and developers who require a blend of performance and cost-effectiveness.

1Creating dynamic anime-style storytelling videos

2Generating cinematic trailers for promotional content

3Visualizing creative concepts and storyboards

4Producing social media clips with engaging movement

5Developing video content for digital marketing campaigns

💰

Pricing & Value

Cost analysis

Provider	Cost	Notes
muapiapp	$0.30 per generation	muapiapp offers this service at a significantly lower price, being 20-50% more affordable than its competitors while delivering comparable or superior video quality.
Fal.ai	$0.45 per generation	muapiapp is 20-50% cheaper than Fal.ai, making it a cost-effective choice without compromising on output quality.
Replicate	$0.45 per generation	Similar to Fal.ai, Replicate charges $0.45 per generation, and muapiapp remains 20-50% more affordable while ensuring high-quality video content.

muapiapp$0.30 per generation

muapiapp offers this service at a significantly lower price, being 20-50% more affordable than its competitors while delivering comparable or superior video quality.

Fal.ai$0.45 per generation

muapiapp is 20-50% cheaper than Fal.ai, making it a cost-effective choice without compromising on output quality.

Replicate$0.45 per generation

Similar to Fal.ai, Replicate charges $0.45 per generation, and muapiapp remains 20-50% more affordable while ensuring high-quality video content.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Parameter	Type	Description	Default
Prompt	string	The prompt to generate the video	`A white dragon warrior stands still, eyes filled with determination and strength, as the camera slowly moves closer or gently circles around the warrior, emphasizing the character’s powerful presence and heroic spirit in a focused, intense shot.`
Aspect Ratio	Enum (2 options)	Aspect ratio of the output video.	`16:9`
Resolution	Enum (2 options)	The resolution of the generated video.	`480p`
Quality	Enum (2 options)	The quality of the generated video.	`medium`
Duration	int	The duration of the generated video in seconds.	`5`

Promptstring

The prompt to generate the video

Default Value

A white dragon warrior stands still, eyes filled with determination and strength, as the camera slowly moves closer or gently circles around the warrior, emphasizing the character’s powerful presence and heroic spirit in a focused, intense shot.

Aspect RatioEnum (2 options)

Aspect ratio of the output video.

Default Value16:9

ResolutionEnum (2 options)

The resolution of the generated video.

Default Value480p

QualityEnum (2 options)

The quality of the generated video.

Default Valuemedium

Durationint

The duration of the generated video in seconds.

Default Value5

📖

Implementation Guide

Developer documentation

How to Use Wan 2.2 Text-to-Video

Prepare Your Prompt:
- Write a detailed description of the scene or character you'd like to visualize. For example: "A white dragon warrior stands still, eyes filled with determination and strength, as the camera slowly moves closer, emphasizing the character's heroic spirit."
Configure Settings:
- Choose the appropriate aspect ratio (16:9 or 9:16) based on your output needs.
- Select a resolution (480p or 720p) to balance between quality and performance.
- Decide on the quality setting (medium or high) according to your preference.
- Specify the duration (between 5 to 8 seconds) for the generated video.
Submit Your Request:
- Input your prompt and settings into the model's interface. The system will process the text and generate a video sequence.
Review the Result:
- Once the video has been generated, review it to ensure it meets your expectations. Use the output URL provided to access and share the video.
Refine if Necessary:
- Adjust your prompt or settings for further refinements if the initial output isn't perfectly aligned with your vision.

❓

Common Questions

Frequently asked

What types of videos can Wan 2.2 generate?

Wan 2.2 is designed to generate both anime-style and cinematic visuals, making it ideal for a wide range of creative projects.

How do I control the quality and resolution of the output?

You can customize the output by selecting between 480p or 720p resolutions and choosing the quality setting (medium or high) that best suits your project needs.

What is the duration limit for each generated video?

The model supports video durations between 5 and 8 seconds, with a default duration of 5 seconds for quick and effective rendering.

Is Wan 2.2 cost-effective compared to other providers?

Absolutely. At $0.3 per generation, Wan 2.2 offers a competitively priced solution while delivering high-quality outputs.

Can I use this model for commercial projects?

Yes, the generated videos are suitable for commercial use, including marketing, social media promotions, and digital storytelling.

minimax-hailuo-02-standard-t2v

meshy-6-image-to-3d

pixverse-v5-t2v

veo3-fast-text-to-video

kling-v1-avatar-pro

meshy-6-multi-image-to-3d

ai-product-photography

flux-kontext-dev-i2i

gemini-3-1-pro

gpt-image-1.5

ovi-text-to-video

minimax-hailuo-2.3-pro-i2v

happy-horse-1-text-to-video-720p

kling-v2.1-standard-i2v

pixverse-v6-i2v

wan2.2-image-to-video

veed-lipsync

vidu-v2.0-i2v

minimax-image-01-subject-reference

flux-pulid

latent-sync

infinitetalk-image-to-video

bytedance-seededit-v3

flux-redux

kling-v2.5-turbo-pro-i2v

wan2.2-animate

ai-background-remover

wan2.5-text-to-image

topaz-video-upscale

leonardoai-motion-2.0

ai-object-eraser

ovi-image-to-video

minimax-hailuo-2.3-pro-t2v

mmaudio-v2-text-to-audio

flux-dev-lora

vidu-q2-reference-to-image

minimax-speech-2.6-turbo

veo3.1-4k-video

kling-v3.0-std-motion-control

flux-kontext-pro-i2i

ai-skin-enhancer

suno-generate-lyrics

sd-2-character

ai-product-shot

ai-image-extension

veo3.1-fast-image-to-video

sd-2-image-to-video

wan2.2-edit-video

openai-sora-2-pro-text-to-video

ltx-2-pro-text-to-video

kling-v2-avatar-pro

runway-aleph-v2v

qwen-image-2.0-pro-edit

flux-2-klein-9b-turbo

qwen-image-edit-plus

kling-v2.6-pro-motion-control

pixverse-v6-t2v

flux-schnell

sd-2-video-watermark-remover-pro

wan2.7-image-edit

kling-v2.1-pro-i2v

veo3.1-lite-text-to-video

happy-horse-1-image-to-video-1080p

wan2.2-text-to-video

sd-2-vip-first-last-frame-1080p

kling-o3-image

tripo3d-h31-text-to-3d

veo3-image-to-video

openai-sora-2-text-to-video

kling-o1-text-to-video

kling-o1-edit-image

twitter-fetch-posts

gemini-omni-character

grok-imagine-video-1-5-preview

ai-image-face-swap

nano-banana-pro-edit

facebook-fetch-reels

generate-social-video-script

omnihuman-1-5

hidream-i1-full