muapi/seedance-2-mini-omni-reference

Text to Video

Seedance 2 Mini Omni Reference generates video from a text prompt with optional image, video, and audio references. It is the cost-efficient mini-tier model for reference-driven workflows.

Input

Configure the model parameters below.

Prompt (required): Text prompt. Reference images with @image1..@image9, videos with @video1..@video3, audio with @audio1..@audio3.

Reference Images: Up to 9 reference images (JPEG/PNG/WebP). Referenced in prompt via @image1..@image9.

Reference Videos: Up to 3 reference video clips (MP4, total max 15s). Referenced in prompt via @video1..@video3.

Reference Audio: Up to 3 reference audio files (MP3/WAV, total max 15s). Referenced in prompt via @audio1..@audio3.

Aspect Ratio: Aspect ratio of the output video. (Default: 16:9)

Resolution: Output video resolution. (Default: 720p)

Duration: Video duration in seconds.

Generate Audio: Whether to generate AI audio synchronized with the video.

High Bitrate: Enable high bitrate mode for better visual fidelity. Produces larger files.

Result

Price varies by resolution and duration (video references add a surcharge)

Resolution	Duration	Cost
480p	5s	$0.40
480p	10s	$0.80
720p	5s	$0.75
720p	10s	$1.50

🚀Related Models

seedance-2-mini-text-to-video

Seedance 2.0 Mini Text-to-Video is the fastest and most affordable text-to-video model in the Seedance lineup, generating smooth 720p video clips from text prompts. Designed for rapid iteration and high-volume workflows, it delivers approximately 2x faster generation than Seedance 2.0 Fast at a fraction of the cost.

Text to Video

seedance-2-mini-image-to-video

Seedance 2.0 Mini Image-to-Video is the fastest and most cost-efficient tier in the Seedance 2.0 family, converting still images into smooth cinematic video clips at up to 720p. Roughly 2x faster than Seedance 2.0 Fast, it is purpose-built for high-volume production, rapid iteration, and draft workflows.

Image to Video

📝

Overview

About this model

Seedance 2 Mini Omni Reference is the most flexible mini-tier generation mode, accepting up to 9 reference images, 3 reference video clips, and 3 reference audio files alongside your text prompt. Reference each asset in your prompt with @image1–@image9, @video1–@video3, and @audio1–@audio3 for precise creative control.

1. Character consistency: Keep a character's appearance consistent across multiple shots using reference images.

2. Motion reference: Supply a short video clip to guide the motion style of the generated video.

3. Audio-driven: Sync generated visuals to a reference audio file such as music or speech.

4. Multi-modal storytelling: Combine image, video, and audio references for complex creative workflows.

💰

Pricing & Value

Cost analysis

Provider	Cost	Notes
muapiapp	$0.15/sec at 720p + video surcharge, $0.08/sec at 480p + video surcharge	Video reference surcharge: 30% of rate × total input video seconds.
Fal.ai	Not available	Seedance 2 Mini Omni Reference not available via Fal.ai.
Replicate	Not available	Seedance 2 Mini Omni Reference not available via Replicate.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Parameter	Type	Description	Default
Prompt	string	Text prompt. Reference images with @image1..@image9, videos with @video1..@video3, audio with @audio1..@audio3.	`The character walks forward confidently in a sunny meadow, camera follows from behind.`
Reference Images	array	Up to 9 reference images (JPEG/PNG/WebP). Referenced in prompt via @image1..@image9.	`https://d3adwkbyhxyrtq.cloudfront.net/webassets/videomodels/seedance-v1.5-pro-i2v.jpg`
Reference Videos	array	Up to 3 reference video clips (MP4, total max 15s). Referenced in prompt via @video1..@video3.
Reference Audio	array	Up to 3 reference audio files (MP3/WAV, total max 15s). Referenced in prompt via @audio1..@audio3.
Aspect Ratio	Enum (6 options)	Aspect ratio of the output video.	`16:9`
Duration	int	Video duration in seconds.	`5`
Resolution	Enum (2 options)	Output video resolution.	`720p`
Generate Audio	boolean	Whether to generate AI audio synchronized with the video.	`true`
High Bitrate	boolean	Enable high bitrate mode for better visual fidelity. Produces larger files.	`false`
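
The limits in the table above can be checked on the client before a request is sent. The following is a minimal sketch, not an official client: `build_payload` is a hypothetical helper, the JSON keys are the ones used in this page's implementation guide and curl example (`images_list`, `video_files`, `audio_files`), and the two resolution values are assumed from the pricing table. The Generate Audio and High Bitrate options are left out because their JSON keys are not shown on this page.

```python
from typing import Any, Dict, Sequence

# Limits from the configuration schema above.
MAX_IMAGES = 9
MAX_VIDEOS = 3
MAX_AUDIO = 3
# ASSUMPTION: the two enum options are the resolutions in the pricing table.
RESOLUTIONS = {"480p", "720p"}


def build_payload(
    prompt: str,
    images: Sequence[str] = (),
    videos: Sequence[str] = (),
    audio: Sequence[str] = (),
    aspect_ratio: str = "16:9",
    duration: int = 5,
    resolution: str = "720p",
) -> Dict[str, Any]:
    """Build a request body, rejecting inputs that exceed the documented limits."""
    if not prompt.strip():
        raise ValueError("prompt is required")
    if len(images) > MAX_IMAGES:
        raise ValueError(f"at most {MAX_IMAGES} reference images are allowed")
    if len(videos) > MAX_VIDEOS:
        raise ValueError(f"at most {MAX_VIDEOS} reference videos are allowed")
    if len(audio) > MAX_AUDIO:
        raise ValueError(f"at most {MAX_AUDIO} reference audio files are allowed")
    if resolution not in RESOLUTIONS:
        raise ValueError(f"resolution must be one of {sorted(RESOLUTIONS)}")

    payload: Dict[str, Any] = {
        "prompt": prompt,
        "aspect_ratio": aspect_ratio,
        "duration": duration,
        "resolution": resolution,
    }
    # Reference arrays are optional, so only non-empty ones are included.
    if images:
        payload["images_list"] = list(images)
    if videos:
        payload["video_files"] = list(videos)
    if audio:
        payload["audio_files"] = list(audio)
    return payload
```

This sketch only counts files. The 15-second total limits for video and audio are not checked, because that would require inspecting the media itself.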

📖

Implementation Guide

Developer documentation

How to Use Seedance 2 Mini Omni Reference

1. Write your prompt: Describe the scene and reference any uploaded assets with @image1, @video1, @audio1, etc.
2. Upload reference assets (all optional):
   - images_list: Up to 9 reference images (@image1–@image9)
   - video_files: Up to 3 video clips, total max 15s (@video1–@video3)
   - audio_files: Up to 3 audio files, total max 15s (@audio1–@audio3)
3. Set resolution, duration, and aspect ratio.
4. High bitrate (optional): Enable it for better visual fidelity; these requests are routed to Kinovi.
5. Submit the request, then poll /api/v1/predictions/{id}/result for results.
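
The submit-and-poll flow described above could be sketched in Python roughly as follows. Only the endpoint paths and the `x-api-key` header come from this page. The response shape is an assumption: the `request_id` field, the `status` field and its values, and the use of GET for polling are all placeholders that should be checked against a real response. The helper names are hypothetical.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict, Optional

BASE_URL = "https://api.muapi.ai/api/v1"
MODEL_PATH = "/seedance-2-mini-omni-reference"

# ASSUMPTIONS (not documented on this page): the submit response carries the
# prediction id under "request_id", and the result endpoint is a GET that
# returns a "status" field with these terminal values.
ID_FIELD = "request_id"
DONE_STATES = {"completed", "succeeded"}
FAILED_STATES = {"failed", "error"}


def call_api(url: str, api_key: str, payload: Optional[dict] = None) -> Dict[str, Any]:
    """POST `payload` as JSON, or GET when no payload is given."""
    data = None if payload is None else json.dumps(payload).encode("utf-8")
    request = urllib.request.Request(
        url,
        data=data,
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="GET" if payload is None else "POST",
    )
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)


def submit(api_key: str, payload: dict) -> str:
    """Submit a generation request and return its (assumed) prediction id."""
    response = call_api(BASE_URL + MODEL_PATH, api_key, payload)
    return response[ID_FIELD]


def poll_result(
    prediction_id: str,
    fetch: Callable[[str], Dict[str, Any]],
    interval: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call `fetch` on the result URL until a terminal status is seen."""
    url = f"{BASE_URL}/predictions/{prediction_id}/result"
    for _ in range(max_attempts):
        result = fetch(url)
        status = result.get("status")
        if status in DONE_STATES:
            return result
        if status in FAILED_STATES:
            raise RuntimeError(f"generation failed: {result}")
        sleep(interval)
    raise TimeoutError(f"no result after {max_attempts} attempts")


# Example wiring (performs real network calls, so it is left commented out):
# prediction_id = submit("YOUR_API_KEY", {"prompt": "A sunny meadow", "duration": 5})
# result = poll_result(prediction_id, lambda url: call_api(url, "YOUR_API_KEY"))
```

Passing `fetch` and `sleep` in as parameters keeps the polling loop testable without network access, and lets the interval and attempt limit be tuned once actual generation times are known.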

API Example

curl -X POST https://api.muapi.ai/api/v1/seedance-2-mini-omni-reference \
  -H 'x-api-key: YOUR_API_KEY' \
  -H 'Content-Type: application/json' \
  -d '{
    "prompt": "@image1 is the character. The person walks confidently in a sunny meadow",
    "images_list": ["https://example.com/character.jpg"],
    "aspect_ratio": "16:9",
    "duration": 5,
    "resolution": "720p"
  }'

❓

Common Questions

Frequently asked

What is the video reference duration limit?

The total duration of all reference video clips must not exceed 15 seconds. Each clip should be an MP4 file.

Does using video references change the cost?

Yes. The cost is the base rate × output duration, plus a surcharge of 30% of the base rate for each second of reference video provided (30% × rate × total input video seconds).
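
As a worked example of that formula, the sketch below estimates the price of a single request. The per-second rates are the ones listed on this page ($0.15 at 720p, $0.08 at 480p); `estimate_cost` is a hypothetical helper, and the assumption that partial seconds are billed proportionally is not confirmed by the page.

```python
from decimal import Decimal

# Per-second base rates in USD, as listed in the pricing section of this page.
RATE_PER_SECOND = {"480p": Decimal("0.08"), "720p": Decimal("0.15")}
# Reference-video surcharge: 30% of the base rate per input video second.
VIDEO_SURCHARGE = Decimal("0.30")


def estimate_cost(resolution: str, output_seconds: float, reference_video_seconds: float = 0) -> Decimal:
    """Estimate the USD price of one generation (unofficial sketch)."""
    rate = RATE_PER_SECOND[resolution]
    base = rate * Decimal(str(output_seconds))
    surcharge = VIDEO_SURCHARGE * rate * Decimal(str(reference_video_seconds))
    # Round to whole cents for display.
    return (base + surcharge).quantize(Decimal("0.01"))
```

For a 5-second 720p clip with 10 seconds of reference video, this gives 0.15 × 5 + 0.30 × 0.15 × 10 = 0.75 + 0.45 = $1.20, compared with $0.75 for the same clip without video references.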

How do I reference uploaded assets in the prompt?

Use @image1 through @image9 for images, @video1 through @video3 for videos, and @audio1 through @audio3 for audio files, in the order they were provided in the respective arrays.
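
Because tags are positional, a prompt can point at an asset that was never supplied. A small client-side check along these lines can catch that before submitting. This is a sketch only: `find_bad_tags` is a hypothetical helper, and how the service itself treats an unmatched tag is not documented here.

```python
import re
from typing import List

# Highest tag number allowed per asset kind, from the limits stated above.
MAX_INDEX = {"image": 9, "video": 3, "audio": 3}
TAG_PATTERN = re.compile(r"@(image|video|audio)(\d+)")


def find_bad_tags(prompt: str, n_images: int = 0, n_videos: int = 0, n_audio: int = 0) -> List[str]:
    """Return the tags in `prompt` that do not map to a supplied asset."""
    supplied = {"image": n_images, "video": n_videos, "audio": n_audio}
    bad = []
    for kind, number in TAG_PATTERN.findall(prompt):
        index = int(number)
        # Tags are 1-based: @image1 refers to the first entry of the array.
        if index < 1 or index > min(supplied[kind], MAX_INDEX[kind]):
            bad.append(f"@{kind}{number}")
    return bad
```

For instance, `find_bad_tags("@image1 moves like @video1", n_images=1)` reports `@video1`, since no reference video was supplied.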

What happens if I don't provide any reference assets?

If no images, videos, or audio are provided, the model behaves like text-to-video and generates a clip from the prompt alone.

Can I mix images and videos in the same request?

Yes. You can simultaneously provide images_list, video_files, and audio_files. Reference each in your prompt with @image1, @video1, @audio1 etc.
