muapi/seedance-2-vip-extend

Text to Video

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Related Models

seedance-2-character

[Beta] Turn fictional character references into reusable video characters. Upload reference images and describe the outfit to get a character_id you can use in SD 2.0 Omni Reference.

Image to Image

seedance-2-t2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Text to Video

seedance-2-watermark-remover

🎉 FREE for a limited time — Remove SD 2.0 watermarks from videos using LaMa AI inpainting. Automatically detects the watermark region, builds a precise mask via Canny edge detection, and inpaints each frame for artifact-free results. No credits deducted — requires a positive balance to access.

Video to Video

seedance-2-video-watermark-remover-pro

SD 2 Video Watermark Remover Pro uses the SD 2 AI model to remove watermarks, logos, and overlaid text from videos with high accuracy. Powered by ByteDance's SD 2 engine, it delivers superior quality compared to traditional inpainting approaches. Pricing: $0.013 per second, minimum charge for 5 seconds ($0.065).

Video to Video

seedance-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Image to Video

seedance-2-omni-reference

SD 2.0 Omni Reference — generate videos with visual consistency using reference images, videos, and audio. Maintain character identity, style, and scene continuity. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video

seedance-2-omni-reference-train

Train a reusable character from a reference photo. Once complete, reference the character in Omni Reference video prompts using @omni-character:<request_id> to generate videos featuring that character consistently.

Training

seedance-2-i2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Image to Video

seedance-2-video-edit

SD 2.0 Video Edit modifies existing videos based on text prompts and optional reference images.

Video to Video

seedance-2-extend

SD 2.0 Extend Video continues an existing SD 2.0 generated video seamlessly. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video

seedance-2-omni-reference-480p

SD 2.0 480p Omni Reference — generate videos with visual consistency using reference images, videos, and audio at 480p resolution. More cost-effective than the 720p variant. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video

seedance-2-t2v-480p

SD 2.0 480p text-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Text to Video

seedance-2-vip-extend-1080p

SD 2.0 VIP Extend Video 1080p continues an existing SD 2.0 generated video seamlessly at 1080p resolution. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video

Overview

About this model

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the new segment — the model preserves visual style, motion physics, character identity, and native audio across the extension. Ideal for building longer narratives, adding follow-up scenes, or exploring alternate continuations of a generated clip.

  1. Narrative Expansion: Continue a generated scene to build a longer story sequence.
  2. Scene Exploration: Generate alternate endings or continuations from the same starting video.
  3. Social Media: Extend short clips into longer-form content without regenerating from scratch.
  4. Filmmaking: Chain multiple extend operations to build multi-shot sequences with consistent style.
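
The chaining use case can be sketched as a loop that feeds each new request ID into the next extension. This is a minimal sketch under two assumptions that this page does not spell out: that the request ID returned by an extension can itself be extended, and that each extension should be completed before the next one starts. The `extend` callable is hypothetical; it stands for whatever code submits one extension, waits for it, and returns the new request ID.

```python
def chain_extends(source_request_id, prompts, extend):
    """Run one extension per prompt, feeding each new request ID into the next.

    extend(request_id, prompt) is a caller-supplied (hypothetical) function
    that submits one extension, waits for it, and returns the new request ID.
    """
    request_ids = []
    current = source_request_id
    for prompt in prompts:
        current = extend(current, prompt)
        request_ids.append(current)
    return request_ids


# Stand-in for a real submission so the sketch runs offline.
def fake_extend(request_id, prompt):
    return f"{request_id}-ext"


shots = chain_extends(
    "original-id",
    ["The hero enters the hall.", "The doors close behind her."],
    fake_extend,
)
print(shots)  # ['original-id-ext', 'original-id-ext-ext']
```

Keeping the submission logic behind a callable means the loop stays the same whichever HTTP client or SDK is used.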

Pricing & Value

Cost analysis

muapiapp: $1.50 per 5s (high) / $1.05 per 5s (basic)

muapiapp prices SD 2.0 VIP Extend at $0.30/sec for high and $0.21/sec for basic, plus a small surcharge per reference video clip.

Fal.ai: Not available

Fal.ai does not currently expose a managed SD 2.0 video extend endpoint.

Replicate: Not available

Replicate does not currently expose a managed SD 2.0 video extend endpoint.

* Competitor pricing is estimated based on similar model architectures and usage tiers.
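
As a rough illustration of the per-second rates quoted above, the following sketch estimates the cost of one extension. The function name and the `surcharge_per_video` parameter are hypothetical: this page mentions a surcharge per reference video clip but does not state the amount, so it defaults to zero here and must be filled in by the caller.

```python
from decimal import Decimal

# Per-second rates quoted on this page (USD).
RATES_PER_SECOND = {"high": Decimal("0.30"), "basic": Decimal("0.21")}


def estimate_cost(duration_s, quality="basic", reference_videos=0,
                  surcharge_per_video=Decimal("0")):
    """Estimate the price of one extension in USD.

    surcharge_per_video is a placeholder: the per-clip surcharge amount
    is not published on this page.
    """
    if quality not in RATES_PER_SECOND:
        raise ValueError(f"unknown quality: {quality!r}")
    base = RATES_PER_SECOND[quality] * duration_s
    return base + Decimal(surcharge_per_video) * reference_videos


print(estimate_cost(5, "basic"))  # 1.05
print(estimate_cost(5, "high"))   # 1.50
```

`Decimal` is used instead of floats so that the per-second rates multiply out to exact cent values.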

Technical Details

Configuration schema

Request Id (string)

Request ID of the original Seedance 2.0 video generation.

Default value: cab9517f-1818-4910-8d66-292701c78c2d

Prompt (string)

Optional prompt to guide the extension. Reference additional images with @image2…@image9, videos with @video1…@video3, and audio with @audio1…@audio3. The source video's last frame is always @image1.

Default value: (empty)

Image URLs (array)

Up to 8 additional reference image URLs (JPEG/PNG/WebP). The Nth image corresponds to @image(N+1) in the prompt, because the source video's last frame is @image1.

Default value: undefined

Video Reference URLs (array)

Up to 3 reference video clip URLs (MP4, max 15s each). The Nth video corresponds to @videoN in the prompt.

Default value: undefined

Audio Reference URLs (array)

Up to 3 reference audio clip URLs (MP3/WAV, total max 15s). The Nth audio clip corresponds to @audioN in the prompt.

Default value: undefined

Aspect Ratio (enum, 6 options)

Output video aspect ratio (only used when reference images/videos/audio are provided).

Default value: 16:9

Duration (seconds) (int)

Length of the extension clip in seconds.

Default value: 5

Quality (enum, 2 options)

Output quality tier: basic or high.

Default value: basic
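
The reference numbering described in the schema can be sketched as a small helper that maps each supplied URL to the tag it takes in the prompt. The function name is hypothetical and the code only restates the limits and offsets given above; it does not call the API.

```python
def reference_tags(image_urls=(), video_urls=(), audio_urls=()):
    """Map each user-supplied reference URL to its prompt tag."""
    if len(image_urls) > 8:
        raise ValueError("at most 8 additional reference images")
    if len(video_urls) > 3:
        raise ValueError("at most 3 reference video clips")
    if len(audio_urls) > 3:
        raise ValueError("at most 3 reference audio clips")

    tags = {}
    # @image1 is reserved for the source video's last frame,
    # so the Nth supplied image becomes @image(N+1).
    for n, url in enumerate(image_urls, start=1):
        tags[f"@image{n + 1}"] = url
    for n, url in enumerate(video_urls, start=1):
        tags[f"@video{n}"] = url
    for n, url in enumerate(audio_urls, start=1):
        tags[f"@audio{n}"] = url
    return tags


tags = reference_tags(
    image_urls=["https://example.com/hat.png"],
    video_urls=["https://example.com/pan.mp4"],
)
print(sorted(tags))  # ['@image2', '@video1']
```

A prompt written against this example would refer to the hat image as @image2 and to the camera-pan clip as @video1.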

Implementation Guide

Developer documentation

How to Use SD 2.0 VIP Extend Video

  1. Get the Request ID: Generate a video using sd-v2.0-t2v or sd-v2.0-i2v and copy the request_id from the response.

  2. Optionally Add a Prompt: Describe what should happen next in the video. If left empty, the model intelligently continues the existing scene.

  3. Choose Quality: Select basic ($0.21/sec) for drafts or high ($0.30/sec) for cinema-grade output.

  4. Set Duration: Choose 4–15 seconds for the extended segment.

  5. Submit and Poll: You'll receive a new request_id. Poll the result endpoint until status is completed.
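
The steps above can be sketched as follows. This is not the official client: the payload key names (`request_id`, `prompt`, `duration`, `quality`) are assumed from the parameter labels on this page, the endpoint URLs and authentication are not documented here, and `fetch_result` is a hypothetical callable that the caller implements with a real HTTP request. The intermediate status name used by the stand-in transport is also made up; only `completed` comes from this page.

```python
import time


def build_extend_payload(request_id, prompt="", duration=5, quality="basic"):
    """Assemble the extension parameters (key names are assumed)."""
    if not 4 <= duration <= 15:
        raise ValueError("duration must be between 4 and 15 seconds")
    if quality not in ("basic", "high"):
        raise ValueError("quality must be 'basic' or 'high'")
    payload = {"request_id": request_id, "duration": duration, "quality": quality}
    if prompt:  # the prompt is optional, so omit it when empty
        payload["prompt"] = prompt
    return payload


def poll_until_completed(fetch_result, request_id, interval_s=5.0,
                         timeout_s=600.0, sleep=time.sleep):
    """Call fetch_result(request_id) until it reports status 'completed'."""
    deadline = time.monotonic() + timeout_s
    while True:
        result = fetch_result(request_id)
        if result.get("status") == "completed":
            return result
        if time.monotonic() >= deadline:
            raise TimeoutError(f"request {request_id} did not complete in time")
        sleep(interval_s)


# Stand-in transport so the sketch runs offline; swap in real HTTP calls.
_fake_statuses = iter(["processing", "completed"])


def fake_fetch(request_id):
    return {"request_id": request_id, "status": next(_fake_statuses)}


payload = build_extend_payload(
    "cab9517f-1818-4910-8d66-292701c78c2d",
    prompt="The camera pulls back to reveal the city.",
)
result = poll_until_completed(fake_fetch, "new-request-id", sleep=lambda s: None)
print(payload["duration"], result["status"])
```

Passing the fetch function and `sleep` as arguments keeps the polling loop independent of any particular HTTP library and makes it easy to exercise without network access.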

Common Questions

Frequently asked

Which videos can I extend?

You can extend any video generated by SD 2.0 (sd-v2.0-t2v or sd-v2.0-i2v). Provide the request_id returned by the original generation.

Do I need to provide a prompt?

No, the prompt is optional. If omitted, SD 2.0 intelligently continues the scene based on the original video's content, motion, and style.

Will the extended video match the original style?

Yes. The model preserves visual style, character identity, motion physics, camera movement, and native audio consistency across the extension.

What output resolution does this endpoint produce?

720p. For 1080p output, use the `seedance-2-vip-extend-1080p` endpoint.