Explore/muapi.ai/seedance-2-i2v

muapi/seedance-2-i2v

Image to Video

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Input

Configure the model parameters below.

Prompt* requiredText prompt describing the video animation. Reference uploaded images using @image1, @image2, … @imageN (1-based, matching images_list order). To use a fictional character, reference it with @character:<id> (request_id from a completed Seedance 2 Character generation) — characters are automatically appended to images_list. Multiple characters are supported. Example: '@character:ab539e5f walks through a garden' or 'The cat in @image1 meets @character:ab539e5f'.

Image URLsUpload up to 9 image URLs. Reference them in the prompt using @image1, @image2, … @image9. The aspect ratio of the reference image takes precedence over the aspect_ratio parameter.0/9 items

Drag & drop images here or paste file/image

+Add

Aspect Ratio

Duration

Quality

Result

$0.60 per video— muapiapp offers SD 2.0 Image-to-Video starting at $0.60 per video (5s, basic quality), scaling at $0.12/sec for basic and $0.25/sec for high quality across 5–15 second durations.

🚀Related Models

View all

seedance-2-character

[Beta] Turn fictional character references into reusable video characters. Upload reference images and describe the outfit to get a character_id you can use in SD 2.0 Omni Reference.

Image to Image

seedance-2-t2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Text to Video

seedance-2-watermark-remover

🎉 FREE for a limited time — Remove SD 2.0 watermarks from videos using LaMa AI inpainting. Automatically detects the watermark region, builds a precise mask via Canny edge detection, and inpaints each frame for artifact-free results. No credits deducted — requires a positive balance to access.

Video to Video

seedance-2-video-watermark-remover-pro

SD 2 Video Watermark Remover Pro uses the SD 2 AI model to remove watermarks, logos, and overlaid text from videos with high accuracy. Powered by ByteDance's SD 2 engine, it delivers superior quality compared to traditional inpainting approaches. Pricing: $0.013 per second, minimum charge for 5 seconds ($0.065).

Video to Video

seedance-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Image to Video

seedance-2-omni-reference

SD 2.0 Omni Reference — generate videos with visual consistency using reference images, videos, and audio. Maintain character identity, style, and scene continuity. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video

seedance-2-omni-reference-train

Train a reusable character from a reference photo. Once complete, reference the character in Omni Reference video prompts using @omni-character:<request_id> to generate videos featuring that character consistently.

Training

seedance-2-video-edit

SD 2.0 Video Edit modifies existing videos based on text prompts and optional reference images.

Video to Video

seedance-2-extend

SD 2.0 Extend Video continues an existing SD 2.0 generated video seamlessly. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video

seedance-2-omni-reference-480p

SD 2.0 480p Omni Reference — generate videos with visual consistency using reference images, videos, and audio at 480p resolution. More cost-effective than the 720p variant. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video

seedance-2-t2v-480p

SD 2.0 480p text-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Text to Video

seedance-2-vip-extend

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video

seedance-2-vip-extend-1080p

SD 2.0 VIP Extend Video 1080p continues an existing SD 2.0 generated video seamlessly at 1080p resolution. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video

📝

Overview

About this model

SD 2.0 Image-to-Video animates your still images into cinematic video clips while preserving the original composition, subject identity, lighting, and style. Upload up to 9 reference images and guide the animation with a text prompt — the model applies physics-aware motion, director-level camera movements, and native audio generation to bring your visuals to life. Ideal for turning product shots, illustrations, portraits, or concept art into dynamic video content.

1Product Animation: Turn product photography into dynamic showcase videos.

2Social Media: Animate illustrations, portraits, or brand images into engaging video posts.

3Storytelling: Chain multiple reference images into a consistent multi-shot narrative.

4Advertising: Bring still campaign visuals to life with cinematic camera movements.

5Concept Art: Animate storyboard frames or concept illustrations for pitch decks.

💰

Pricing & Value

Cost analysis

Provider	Cost	Notes
muapiapp	$0.60 per video	muapiapp offers SD 2.0 Image-to-Video starting at $0.60 per video (5s, basic quality), scaling at $0.12/sec for basic and $0.25/sec for high quality across 5–15 second durations.
Fal.ai	$0.3024/sec (high) / $0.2419/sec (basic)	Fal.ai charges $0.3024/sec for high quality and $0.2419/sec for basic. muapiapp is 17% cheaper on high ($0.25/sec) and 50% cheaper on basic ($0.12/sec).
Replicate	$0.3024/sec (high) / $0.2419/sec (basic)	Replicate charges the same as Fal.ai — $0.3024/sec (high), $0.2419/sec (basic). muapiapp saves you 17–50% depending on quality tier.

muapiapp$0.60 per video

muapiapp offers SD 2.0 Image-to-Video starting at $0.60 per video (5s, basic quality), scaling at $0.12/sec for basic and $0.25/sec for high quality across 5–15 second durations.

Fal.ai$0.3024/sec (high) / $0.2419/sec (basic)

Fal.ai charges $0.3024/sec for high quality and $0.2419/sec for basic. muapiapp is 17% cheaper on high ($0.25/sec) and 50% cheaper on basic ($0.12/sec).

Replicate$0.3024/sec (high) / $0.2419/sec (basic)

Replicate charges the same as Fal.ai — $0.3024/sec (high), $0.2419/sec (basic). muapiapp saves you 17–50% depending on quality tier.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Parameter	Type	Description	Default
Prompt	string	Text prompt describing the video animation. Reference uploaded images using @image1, @image2, … @imageN (1-based, matching images_list order). To use a fictional character, reference it with @character:<id> (request_id from a completed Seedance 2 Character generation) — characters are automatically appended to images_list. Multiple characters are supported. Example: '@character:ab539e5f walks through a garden' or 'The cat in @image1 meets @character:ab539e5f'.	`The lightbulb suddenly rockets across the room like a missile, smashing through curtains while water spins violently inside. The fish darts through swirling currents as the bulb ricochets off walls and finally bursts into floating droplets.`
Image URLs	array	Upload up to 9 image URLs. Reference them in the prompt using @image1, @image2, … @image9. The aspect ratio of the reference image takes precedence over the aspect_ratio parameter.	`https://d3adwkbyhxyrtq.cloudfront.net/webassets/videomodels/seedance-v2.0-i2v.jpg`
Aspect Ratio	Enum (4 options)	-	`16:9`
Duration	Enum (3 options)	-	`5`
Quality	Enum (2 options)	-	`basic`

Promptstring

Text prompt describing the video animation. Reference uploaded images using @image1, @image2, … @imageN (1-based, matching images_list order). To use a fictional character, reference it with @character:<id> (request_id from a completed Seedance 2 Character generation) — characters are automatically appended to images_list. Multiple characters are supported. Example: '@character:ab539e5f walks through a garden' or 'The cat in @image1 meets @character:ab539e5f'.

Default Value

The lightbulb suddenly rockets across the room like a missile, smashing through curtains while water spins violently inside. The fish darts through swirling currents as the bulb ricochets off walls and finally bursts into floating droplets.

Image URLsarray

Upload up to 9 image URLs. Reference them in the prompt using @image1, @image2, … @image9. The aspect ratio of the reference image takes precedence over the aspect_ratio parameter.

Default Valuehttps://d3adwkbyhxyrtq.cloudfront.net/webassets/videomodels/seedance-v2.0-i2v.jpg

Aspect RatioEnum (4 options)

Default Value16:9

DurationEnum (3 options)

Default Value5

QualityEnum (2 options)

Default Valuebasic

📖

Implementation Guide

Developer documentation

How to Use SD 2.0 Image-to-Video

Upload Your Image(s): Provide 1–9 high-quality image URLs via the images_list field. The model preserves composition, lighting, and subject identity from your reference images.
Write a Motion Prompt: Describe the animation you want — camera movement, subject action, environmental effects. You can reference specific images in your prompt using @image1, @image2, … @imageN (1-based, matching the order in images_list). Examples:
- Single reference: 'The cat in @image1 walks through a garden'
- Morphing: '@image1 transforms into @image2'
- Multi-subject: 'The whale in @image1 meets the ninja in @image2' The engine automatically converts @imageN to the upstream API format 《@图N》 — you may also use the Chinese format directly if preferred. If you don't use @imageN references, all provided images are used as general style/subject references.
Choose Quality: Select basic ($0.08/sec) for fast generation or high ($0.15/sec) for cinema-grade output with richer motion detail.
Set Duration: Choose 5, 10, or 15 seconds.
Pick Aspect Ratio: Use 16:9 for widescreen, 9:16 for mobile/social, or 4:3/3:4 for other formats. Note: if a reference image is provided, its aspect ratio takes precedence over this setting.
Submit and Poll: You'll receive a request_id immediately. Poll the result endpoint until status is completed.

❓

Common Questions

Frequently asked

How many images can I provide as input?

You can provide up to 9 images via the images_list field. The model uses them as visual references for composition, style, and subject identity throughout the animation.

How do I reference specific images in my prompt?

Use the @imageN placeholder in your prompt, where N is the 1-based position of the image in images_list. For example, if you provide two images, you can write '@image1 transforms into @image2'. The engine automatically converts @imageN to the upstream API format 《@图N》 — you may also use the Chinese format directly if preferred. If you reference @image3 but only supply two images, the request will be rejected with a 400 Bad Request error. You can also use multiple references in a single prompt — for example, 'The whale in @image1 meets the ninja in @image2'. If you don't use @imageN references at all, all provided images are used as general style and subject references.

Will the model preserve my image's style and composition?

Yes. SD 2.0 i2v is specifically designed to preserve the original composition, lighting, subject identity, and aesthetic of your reference images while adding motion.

What's the difference between basic and high quality?

Basic quality uses the fast-i2v model at $0.12/sec — great for quick drafts. High quality uses the standard-i2v model at $0.25/sec for cinema-grade output with smoother motion and finer detail preservation.

Does it generate audio?

Yes, SD 2.0 generates audio natively alongside video, synchronized with the visual content.

What is the maximum resolution?

SD 2.0 supports up to 2K resolution output.

ai-image-face-swap

youtube-fetch-shorts

mmaudio-v2-text-to-audio

perfect-pony-xl

ai-product-shot

omnihuman-1-5

kling-v3-turbo-pro-text-to-video

ai-skin-enhancer

flux-kontext-dev-i2i

veo3-fast-text-to-video

bytedance-seededit-v3

infinitetalk-image-to-video

happy-horse-1.1-text-to-video-1080p

happy-horse-1.1-image-to-video-1080p

flux-2-pro-edit

happy-horse-1.1-text-to-video-720p

flux-dev-lora

ai-product-photography

ai-image-extension

ai-object-eraser

flux-kontext-pro-i2i

happy-horse-1.1-image-to-video-720p

minimax-image-01-subject-reference

veed-lipsync

wan2.2-edit-video

ovi-image-to-video

openai-sora-2-pro-text-to-video

happy-horse-1.1-reference-to-video-1080p

happy-horse-1.1-reference-to-video-720p

vidu-q3-turbo-text-to-video

happy-horse-1.1-video-edit-1080p

nano-banana-pro-edit

qwen-image-edit-2511

happy-horse-1.1-video-edit-720p

gemini-omni-image-to-video

kling-v3.0-std-motion-control

pixverse-v6-t2v

tiktok-fetch-profile

gpt-image-2-text-to-image

wan2.5-text-to-image

topaz-video-upscale

happy-horse-1-reference-to-video-1080p

ai-video-upscaler-pro

happy-horse-1-video-edit-720p

kling-v3.0-omni-standard-text-to-video

leonardoai-lucid-origin

ltx-2-fast-text-to-video

kling-o1-text-to-video

kling-v2.6-pro-motion-control

flux-2-klein-9b

kling-o3-image

meshy-6-image-to-3d

kling-v2.1-standard-i2v

kling-v3.0-standard-image-to-video

ai-captions

flux-2-klein-9b-turbo

suno-generate-sounds

suno-generate-lyrics

seedance-2-character

veo3.1-lite-text-to-video

youtube-publish

seedance-2-mini-image-to-video

gpt-codex

wan2.7-text-to-image-pro

grok-imagine-video-1-5-preview

seedance-2-vip-text-to-video

gemini-3-1-pro

ai-background-remover

tripo3d-h31-text-to-3d

tripo3d-h31-image-to-3d

suno-remix-music

gemini-omni-audio

veo3-image-to-video

kling-v2.1-pro-i2v

flux-schnell

wan2.2-image-to-video

wan2.2-text-to-video

vidu-v2.0-i2v

claude-opus-4-8

qwen-image-edit-plus