Explore/muapi.ai/seedance-2-i2v

muapi/seedance-2-i2v

Image to Video

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Input

Configure the model parameters below.

0/9 items
Drag & drop images here or paste file/image

Result

$0.60 per videomuapiapp offers SD 2.0 Image-to-Video starting at $0.60 per video (5s, basic quality), scaling at $0.12/sec for basic and $0.25/sec for high quality across 5–15 second durations.

🚀Related Models

View all
seedance-2-character

seedance-2-character

[Beta] Turn fictional character references into reusable video characters. Upload reference images and describe the outfit to get a character_id you can use in SD 2.0 Omni Reference.

Image to Image
seedance-2-t2v

seedance-2-t2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Text to Video
seedance-2-watermark-remover

seedance-2-watermark-remover

🎉 FREE for a limited time — Remove SD 2.0 watermarks from videos using LaMa AI inpainting. Automatically detects the watermark region, builds a precise mask via Canny edge detection, and inpaints each frame for artifact-free results. No credits deducted — requires a positive balance to access.

Video to Video
seedance-2-video-watermark-remover-pro

seedance-2-video-watermark-remover-pro

SD 2 Video Watermark Remover Pro uses the SD 2 AI model to remove watermarks, logos, and overlaid text from videos with high accuracy. Powered by ByteDance's SD 2 engine, it delivers superior quality compared to traditional inpainting approaches. Pricing: $0.013 per second, minimum charge for 5 seconds ($0.065).

Video to Video
seedance-2-i2v-480p

seedance-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Image to Video
seedance-2-omni-reference

seedance-2-omni-reference

SD 2.0 Omni Reference — generate videos with visual consistency using reference images, videos, and audio. Maintain character identity, style, and scene continuity. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video
seedance-2-omni-reference-train

seedance-2-omni-reference-train

Train a reusable character from a reference photo. Once complete, reference the character in Omni Reference video prompts using @omni-character:<request_id> to generate videos featuring that character consistently.

Training
seedance-2-video-edit

seedance-2-video-edit

SD 2.0 Video Edit modifies existing videos based on text prompts and optional reference images.

Video to Video
seedance-2-extend

seedance-2-extend

SD 2.0 Extend Video continues an existing SD 2.0 generated video seamlessly. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video
seedance-2-omni-reference-480p

seedance-2-omni-reference-480p

SD 2.0 480p Omni Reference — generate videos with visual consistency using reference images, videos, and audio at 480p resolution. More cost-effective than the 720p variant. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video
seedance-2-t2v-480p

seedance-2-t2v-480p

SD 2.0 480p text-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Text to Video
seedance-2-vip-extend

seedance-2-vip-extend

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video
seedance-2-vip-extend-1080p

seedance-2-vip-extend-1080p

SD 2.0 VIP Extend Video 1080p continues an existing SD 2.0 generated video seamlessly at 1080p resolution. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video
📝

Overview

About this model

SD 2.0 Image-to-Video animates your still images into cinematic video clips while preserving the original composition, subject identity, lighting, and style. Upload up to 9 reference images and guide the animation with a text prompt — the model applies physics-aware motion, director-level camera movements, and native audio generation to bring your visuals to life. Ideal for turning product shots, illustrations, portraits, or concept art into dynamic video content.

1Product Animation: Turn product photography into dynamic showcase videos.
2Social Media: Animate illustrations, portraits, or brand images into engaging video posts.
3Storytelling: Chain multiple reference images into a consistent multi-shot narrative.
4Advertising: Bring still campaign visuals to life with cinematic camera movements.
5Concept Art: Animate storyboard frames or concept illustrations for pitch decks.
💰

Pricing & Value

Cost analysis

muapiapp$0.60 per video

muapiapp offers SD 2.0 Image-to-Video starting at $0.60 per video (5s, basic quality), scaling at $0.12/sec for basic and $0.25/sec for high quality across 5–15 second durations.

Fal.ai$0.3024/sec (high) / $0.2419/sec (basic)

Fal.ai charges $0.3024/sec for high quality and $0.2419/sec for basic. muapiapp is 17% cheaper on high ($0.25/sec) and 50% cheaper on basic ($0.12/sec).

Replicate$0.3024/sec (high) / $0.2419/sec (basic)

Replicate charges the same as Fal.ai — $0.3024/sec (high), $0.2419/sec (basic). muapiapp saves you 17–50% depending on quality tier.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Promptstring

Text prompt describing the video animation. Reference uploaded images using @image1, @image2, … @imageN (1-based, matching images_list order). To use a fictional character, reference it with @character:<id> (request_id from a completed Seedance 2 Character generation) — characters are automatically appended to images_list. Multiple characters are supported. Example: '@character:ab539e5f walks through a garden' or 'The cat in @image1 meets @character:ab539e5f'.

Default ValueThe lightbulb suddenly rockets across the room like a missile, smashing through curtains while water spins violently inside. The fish darts through swirling currents as the bulb ricochets off walls and finally bursts into floating droplets.
Image URLsarray

Upload up to 9 image URLs. Reference them in the prompt using @image1, @image2, … @image9. The aspect ratio of the reference image takes precedence over the aspect_ratio parameter.

Default Valuehttps://d3adwkbyhxyrtq.cloudfront.net/webassets/videomodels/seedance-v2.0-i2v.jpg
Aspect RatioEnum (4 options)

-

Default Value16:9
DurationEnum (3 options)

-

Default Value5
QualityEnum (2 options)

-

Default Valuebasic
📖

Implementation Guide

Developer documentation

How to Use SD 2.0 Image-to-Video

  1. Upload Your Image(s): Provide 1–9 high-quality image URLs via the images_list field. The model preserves composition, lighting, and subject identity from your reference images.

  2. Write a Motion Prompt: Describe the animation you want — camera movement, subject action, environmental effects. You can reference specific images in your prompt using @image1, @image2, … @imageN (1-based, matching the order in images_list). Examples:

    • Single reference: 'The cat in @image1 walks through a garden'
    • Morphing: '@image1 transforms into @image2'
    • Multi-subject: 'The whale in @image1 meets the ninja in @image2' The engine automatically converts @imageN to the upstream API format 《@图N》 — you may also use the Chinese format directly if preferred. If you don't use @imageN references, all provided images are used as general style/subject references.
  3. Choose Quality: Select basic ($0.08/sec) for fast generation or high ($0.15/sec) for cinema-grade output with richer motion detail.

  4. Set Duration: Choose 5, 10, or 15 seconds.

  5. Pick Aspect Ratio: Use 16:9 for widescreen, 9:16 for mobile/social, or 4:3/3:4 for other formats. Note: if a reference image is provided, its aspect ratio takes precedence over this setting.

  6. Submit and Poll: You'll receive a request_id immediately. Poll the result endpoint until status is completed.

Common Questions

Frequently asked

How many images can I provide as input?

You can provide up to 9 images via the images_list field. The model uses them as visual references for composition, style, and subject identity throughout the animation.

How do I reference specific images in my prompt?

Use the @imageN placeholder in your prompt, where N is the 1-based position of the image in images_list. For example, if you provide two images, you can write '@image1 transforms into @image2'. The engine automatically converts @imageN to the upstream API format 《@图N》 — you may also use the Chinese format directly if preferred. If you reference @image3 but only supply two images, the request will be rejected with a 400 Bad Request error. You can also use multiple references in a single prompt — for example, 'The whale in @image1 meets the ninja in @image2'. If you don't use @imageN references at all, all provided images are used as general style and subject references.

Will the model preserve my image's style and composition?

Yes. SD 2.0 i2v is specifically designed to preserve the original composition, lighting, subject identity, and aesthetic of your reference images while adding motion.

What's the difference between basic and high quality?

Basic quality uses the fast-i2v model at $0.12/sec — great for quick drafts. High quality uses the standard-i2v model at $0.25/sec for cinema-grade output with smoother motion and finer detail preservation.

Does it generate audio?

Yes, SD 2.0 generates audio natively alongside video, synchronized with the visual content.

What is the maximum resolution?

SD 2.0 supports up to 2K resolution output.