muapi/seedance-2-vip-extend

Text to Video

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).
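
The reference numbering described above is easy to get off by one, because image tags start at @image2 while video and audio tags start at 1. A minimal Python sketch of that mapping follows. The function name `reference_tags` and its argument names are illustrative and not part of the muapi API; only the numbering rules and the per-type limits come from this page.

```python
from typing import Dict, Sequence

# Limits taken from the parameter descriptions on this page.
MAX_IMAGES = 8
MAX_VIDEOS = 3
MAX_AUDIOS = 3


def reference_tags(
    image_urls: Sequence[str] = (),
    video_urls: Sequence[str] = (),
    audio_urls: Sequence[str] = (),
) -> Dict[str, str]:
    """Map each prompt tag to the URL it refers to.

    Hypothetical helper: it only mirrors the numbering rules in the text.
    @image1 is reserved for the source video's last frame, so the Nth
    user-supplied image becomes @image(N+1).
    """
    if len(image_urls) > MAX_IMAGES:
        raise ValueError(f"at most {MAX_IMAGES} reference images are allowed")
    if len(video_urls) > MAX_VIDEOS:
        raise ValueError(f"at most {MAX_VIDEOS} reference videos are allowed")
    if len(audio_urls) > MAX_AUDIOS:
        raise ValueError(f"at most {MAX_AUDIOS} reference audio clips are allowed")

    tags: Dict[str, str] = {}
    for n, url in enumerate(image_urls, start=1):
        tags[f"@image{n + 1}"] = url  # shifted by one: @image1 is the last frame
    for n, url in enumerate(video_urls, start=1):
        tags[f"@video{n}"] = url
    for n, url in enumerate(audio_urls, start=1):
        tags[f"@audio{n}"] = url
    return tags
```

With a single reference image, for example, the prompt should address it as @image2, not @image1.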

Input

Request Id (required): Request ID of the original Seedance 2.0 video generation.

Prompt: Optional prompt to guide the extension. Reference additional images with @image2…@image9, videos with @video1…@video3, and audio with @audio1…@audio3; the source video's last frame is always @image1.

Image URLs: Up to 8 additional reference image URLs (JPEG/PNG/WebP). Each Nth image corresponds to @image(N+1) in the prompt (the source video's last frame is @image1).

Video Reference URLs: Up to 3 reference video clip URLs (MP4, max 15s each). Each Nth video corresponds to @videoN in the prompt.

Audio Reference URLs: Up to 3 reference audio clip URLs (MP3/WAV, total max 15s). Each Nth audio corresponds to @audioN in the prompt.

Aspect Ratio: Output video aspect ratio (only used when reference images/videos/audio are provided). Default: 16:9.

Duration (seconds): Length of the extension clip in seconds.

Quality: basic or high.

$1.50 per 5s (high) / $1.05 per 5s (basic). muapiapp prices SD 2.0 VIP Extend at $0.30/sec for high and $0.21/sec for basic, plus a small surcharge per reference video clip.

🚀Related Models

seedance-2-character

[Beta] Turn fictional character references into reusable video characters. Upload reference images and describe the outfit to get a character_id you can use in SD 2.0 Omni Reference.

Image to Image

seedance-2-t2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Text to Video

seedance-2-watermark-remover

🎉 FREE for a limited time — Remove SD 2.0 watermarks from videos using LaMa AI inpainting. Automatically detects the watermark region, builds a precise mask via Canny edge detection, and inpaints each frame for artifact-free results. No credits deducted — requires a positive balance to access.

Video to Video

seedance-2-video-watermark-remover-pro

SD 2 Video Watermark Remover Pro uses the SD 2 AI model to remove watermarks, logos, and overlaid text from videos with high accuracy. Powered by ByteDance's SD 2 engine, it delivers superior quality compared to traditional inpainting approaches. Pricing: $0.013 per second, minimum charge for 5 seconds ($0.065).

Video to Video

seedance-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Image to Video

seedance-2-omni-reference

SD 2.0 Omni Reference — generate videos with visual consistency using reference images, videos, and audio. Maintain character identity, style, and scene continuity. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video

seedance-2-omni-reference-train

Train a reusable character from a reference photo. Once complete, reference the character in Omni Reference video prompts using @omni-character:<request_id> to generate videos featuring that character consistently.

Training

seedance-2-i2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Image to Video

seedance-2-video-edit

SD 2.0 Video Edit modifies existing videos based on text prompts and optional reference images.

Video to Video

seedance-2-extend

SD 2.0 Extend Video continues an existing SD 2.0 generated video seamlessly. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video

seedance-2-omni-reference-480p

SD 2.0 480p Omni Reference — generate videos with visual consistency using reference images, videos, and audio at 480p resolution. More cost-effective than the 720p variant. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Image to Video

seedance-2-t2v-480p

SD 2.0 480p text-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Text to Video

seedance-2-vip-extend-1080p

SD 2.0 VIP Extend Video 1080p continues an existing SD 2.0 generated video seamlessly at 1080p resolution. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Text to Video

📝

Overview

About this model

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the new segment — the model preserves visual style, motion physics, character identity, and native audio across the extension. Ideal for building longer narratives, adding follow-up scenes, or exploring alternate continuations of a generated clip.

1. Narrative Expansion: Continue a generated scene to build a longer story sequence.

2. Scene Exploration: Generate alternate endings or continuations from the same starting video.

3. Social Media: Extend short clips into longer-form content without regenerating from scratch.

4. Filmmaking: Chain multiple extend operations to build multi-shot sequences with consistent style.
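
Chaining, as in the filmmaking use case above, can be sketched as a loop in which each extension's request ID becomes the input of the next. This is a sketch under assumptions: `submit_extend` is a hypothetical callable standing in for whatever client code submits an extend job and waits for it to complete, and this page does not explicitly confirm that an extension's request_id is accepted as the source of a further extension.

```python
from typing import Callable, List, Sequence


def chain_extensions(
    first_request_id: str,
    prompts: Sequence[str],
    submit_extend: Callable[[str, str], str],
) -> List[str]:
    """Run one extend job per prompt, feeding each result into the next.

    submit_extend(request_id, prompt) is a hypothetical function that
    submits an extend job, waits for completion, and returns the new
    request_id. Returns the request IDs of all extension segments in order.
    """
    segment_ids: List[str] = []
    current_id = first_request_id
    for prompt in prompts:
        # The output of the previous step is the source of this one.
        current_id = submit_extend(current_id, prompt)
        segment_ids.append(current_id)
    return segment_ids
```

Keeping the submit step as an injected callable means the loop can be exercised with a stub before any credits are spent.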

💰

Pricing & Value

Cost analysis

Provider	Cost	Notes
muapiapp	$1.50 per 5s (high) / $1.05 per 5s (basic)	muapiapp prices SD 2.0 VIP Extend at $0.30/sec for high and $0.21/sec for basic, plus a small surcharge per reference video clip.
Fal.ai	Not available	Fal.ai does not currently expose a managed SD 2.0 video extend endpoint.
Replicate	Not available	Replicate does not currently expose a managed SD 2.0 video extend endpoint.

* Competitor pricing is estimated based on similar model architectures and usage tiers.
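
As a worked example of the per-second rates above, the sketch below estimates the cost of one extension. The rates come from the pricing table. The per-clip surcharge for reference videos is not stated on this page, so it is left as a caller-supplied parameter (`surcharge_per_video`, defaulting to zero); the function name is illustrative.

```python
from decimal import Decimal

# Per-second rates quoted in the pricing table (USD).
RATE_PER_SECOND = {"high": Decimal("0.30"), "basic": Decimal("0.21")}


def estimate_cost(
    duration_s: int,
    quality: str = "basic",
    reference_videos: int = 0,
    surcharge_per_video: Decimal = Decimal("0"),
) -> Decimal:
    """Estimate the USD cost of one extension clip.

    surcharge_per_video is a placeholder: the page mentions a small
    surcharge per reference video clip but does not give the amount.
    """
    if quality not in RATE_PER_SECOND:
        raise ValueError(f"unknown quality: {quality!r}")
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    base = RATE_PER_SECOND[quality] * duration_s
    return base + surcharge_per_video * reference_videos


print(estimate_cost(5, "basic"))  # 1.05
print(estimate_cost(5, "high"))   # 1.50
```

At the quoted rates, a 5-second clip without reference videos comes to $1.05 at basic quality and $1.50 at high quality.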

⚙️

Technical Details

Configuration schema

Parameter	Type	Description	Default
Request Id	string	Request ID of the original Seedance 2.0 video generation.	`cab9517f-1818-4910-8d66-292701c78c2d`
Prompt	string	Optional prompt to guide the extension. Reference additional images with @image2…@image9, videos with @video1…@video3, and audio with @audio1…@audio3; the source video's last frame is always @image1.	(none)
Image URLs	array	Up to 8 additional reference image URLs (JPEG/PNG/WebP). Each Nth image corresponds to @image(N+1) in the prompt (the source video's last frame is @image1).	(none)
Video Reference URLs	array	Up to 3 reference video clip URLs (MP4, max 15s each). Each Nth video corresponds to @videoN in the prompt.	(none)
Audio Reference URLs	array	Up to 3 reference audio clip URLs (MP3/WAV, total max 15s). Each Nth audio corresponds to @audioN in the prompt.	(none)
Aspect Ratio	Enum (6 options)	Output video aspect ratio (only used when reference images/videos/audio are provided).	`16:9`
Duration (seconds)	int	Length of the extension clip in seconds (4–15).	`5`
Quality	Enum (2 options)	Output quality tier: basic or high.	`basic`
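
The limits in the table can be checked on the client before a request is sent. The following is a minimal sketch, not the official client: the dictionary keys (`request_id`, `image_urls`, and so on) are assumed snake_case forms of the display labels above, and the real API field names may differ. The 4–15 second range comes from the usage steps on this page.

```python
from typing import Any, Dict, Optional, Sequence

ALLOWED_QUALITY = ("basic", "high")


def build_extend_payload(
    request_id: str,
    prompt: Optional[str] = None,
    image_urls: Sequence[str] = (),
    video_urls: Sequence[str] = (),
    audio_urls: Sequence[str] = (),
    aspect_ratio: str = "16:9",
    duration: int = 5,
    quality: str = "basic",
) -> Dict[str, Any]:
    """Validate inputs against the documented limits and build a request body.

    The keys are assumptions derived from the display labels in the table;
    check the API reference for the exact field names.
    """
    if not request_id:
        raise ValueError("request_id is required")
    if len(image_urls) > 8:
        raise ValueError("at most 8 reference images are allowed")
    if len(video_urls) > 3:
        raise ValueError("at most 3 reference videos are allowed")
    if len(audio_urls) > 3:
        raise ValueError("at most 3 reference audio clips are allowed")
    if not 4 <= duration <= 15:
        raise ValueError("duration must be between 4 and 15 seconds")
    if quality not in ALLOWED_QUALITY:
        raise ValueError(f"quality must be one of {ALLOWED_QUALITY}")

    payload: Dict[str, Any] = {
        "request_id": request_id,
        "duration": duration,
        "quality": quality,
    }
    if prompt:
        payload["prompt"] = prompt
    if image_urls:
        payload["image_urls"] = list(image_urls)
    if video_urls:
        payload["video_urls"] = list(video_urls)
    if audio_urls:
        payload["audio_urls"] = list(audio_urls)
    # Aspect ratio is only used when references are provided, so omit it otherwise.
    if image_urls or video_urls or audio_urls:
        payload["aspect_ratio"] = aspect_ratio
    return payload
```

The aspect ratio is deliberately not validated here, because the page says the enum has six options but does not list them.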

📖

Implementation Guide

Developer documentation

How to Use SD 2.0 VIP Extend Video

1. Get the Request ID: Generate a video using sd-v2.0-t2v or sd-v2.0-i2v and copy the request_id from the response.
2. Optionally Add a Prompt: Describe what should happen next in the video. If left empty, the model intelligently continues the existing scene.
3. Choose Quality: Select basic ($0.21/sec) for drafts or high ($0.30/sec) for cinema-grade output.
4. Set Duration: Choose 4–15 seconds for the extended segment.
5. Submit and Poll: You'll receive a new request_id. Poll the result endpoint until status is completed.
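
The polling step above can be sketched as a loop with an interval and a timeout. This is a sketch under assumptions: `fetch_status` is a hypothetical callable that performs the HTTP request to the result endpoint and returns the decoded JSON body, and the `failed` status checked below is an assumption, since only `completed` is named on this page.

```python
import time
from typing import Any, Callable, Dict


def wait_for_result(
    request_id: str,
    fetch_status: Callable[[str], Dict[str, Any]],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll until the job reports status "completed" and return its result.

    fetch_status(request_id) is a hypothetical function supplied by the
    caller. The "failed" status is an assumed value, not a documented one.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        result = fetch_status(request_id)
        status = result.get("status")
        if status == "completed":
            return result
        if status == "failed":
            raise RuntimeError(f"extend job {request_id} failed: {result}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"extend job {request_id} did not complete in {timeout_s}s")
        sleep(interval_s)
```

Passing the HTTP call and the sleep function in as arguments keeps the loop independent of any particular HTTP library and lets it be tested without network access.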

❓

Common Questions

Frequently asked

Which videos can I extend?

You can extend any video generated by SD 2.0 (sd-v2.0-t2v or sd-v2.0-i2v). Provide the request_id returned by the original generation.

Do I need to provide a prompt?

No, the prompt is optional. If omitted, SD 2.0 intelligently continues the scene based on the original video's content, motion, and style.

Will the extended video match the original style?

Yes. The model preserves visual style, character identity, motion physics, camera movement, and native audio consistency across the extension.

What output resolution does this endpoint produce?

720p. For 1080p output use the `sd-2-vip-extend-1080p` endpoint.
