ByteDance AI API models

ByteDance AI models on MuAPI

Explore ByteDance models for image and video generation available through MuAPI, including the Seedance video family, the Seedream and SeedEdit image models, and OmniHuman.

77 Models

Video Generation Models

Video

$0.025 / second

seedance-2-watermark-remover

🎉 FREE for a limited time — Remove SD 2.0 watermarks from videos using LaMa AI inpainting. Automatically detects the watermark region, builds a precise mask via Canny edge detection, and inpaints each frame for artifact-free results. No credits deducted — requires a positive balance to access.

Video

$1.250 / second

seedance-2-image-to-video

SD 2 Image-to-Video (Pro) by ByteDance. Animates a start-frame image into a high-quality video with native audio, 4–15 second duration, and 2K resolution.
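The per-second prices in this listing can be turned into a rough per-clip estimate. The sketch below assumes that billing is linear in output duration and that the 4–15 second range stated above is enforced; this page confirms neither, and `estimate_clip_cost` is a hypothetical helper, not part of any MuAPI SDK.

```python
from decimal import Decimal


def estimate_clip_cost(price_per_second: str, duration_s: int,
                       min_s: int = 4, max_s: int = 15) -> Decimal:
    """Estimate the cost of one clip as price_per_second * duration_s.

    Assumption: cost scales linearly with duration. The default bounds
    mirror the 4-15 second range quoted in the listing.
    """
    if not (min_s <= duration_s <= max_s):
        raise ValueError(f"duration must be between {min_s} and {max_s} seconds")
    # Decimal avoids binary floating-point rounding in currency math.
    return Decimal(price_per_second) * duration_s


# seedance-2-image-to-video is listed at $1.250 / second.
print(estimate_clip_cost("1.250", 10))  # 12.500
```

Under these assumptions, the shortest clip (4 seconds) would cost $5.000 and the longest (15 seconds) $18.750.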

Video

$0.200 / second

seedance-2-mini-text-to-video

Seedance 2.0 Mini Text-to-Video is the fastest and most affordable text-to-video model in the Seedance lineup, generating smooth 720p video clips from text prompts. Designed for rapid iteration and high-volume workflows, it delivers approximately 2x faster generation than Seedance 2.0 Fast at a fraction of the cost.

Video

$6.750 / second

seedance-2-vip-image-to-video-4k

SD 2 Image-to-Video VIP 4K by ByteDance. Animates a still image into a 4K ultra-HD video with priority routing and 4–15 second duration.

Video

$0.200 / second

seedance-2-mini-spicy-text-to-video

Seedance 2 Mini Spicy Text-to-Video is the fastest, lowest-cost Spicy-tier text-to-video model, with reduced content-safety filtering on top of Seedance 2 Mini's speed and pricing.

Video

$0.200 / second

seedance-2-mini-spicy-image-to-video

Seedance 2 Mini Spicy Image-to-Video is the fastest, lowest-cost Spicy-tier image-to-video model, with reduced content-safety filtering on top of Seedance 2 Mini's speed and pricing.

Video

$0.260 / second

seedance-v1.5-pro-t2v-fast

Seedance v1.5 Pro Text-to-Video Fast generates short cinematic videos directly from text with an emphasis on speed and stability. It produces coherent scenes with simple camera motion, light environmental animation, and consistent lighting.

Video

$0.200 / second

seedance-2-mini-image-to-video

Seedance 2.0 Mini Image-to-Video is the fastest and most cost-efficient tier in the Seedance 2.0 family, converting still images into smooth cinematic video clips at up to 720p. Roughly 2x faster than Seedance 2.0 Fast, it is purpose-built for high-volume production, rapid iteration, and draft workflows.

Video

$0.013 / second (minimum charge $0.065)

seedance-2-video-watermark-remover-pro

SD 2 Video Watermark Remover Pro uses the SD 2 AI model to remove watermarks, logos, and overlaid text from videos with high accuracy. Powered by ByteDance's SD 2 engine, it delivers superior quality compared to traditional inpainting approaches. Pricing: $0.013 per second, with a minimum charge of 5 seconds ($0.065).
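The minimum-charge rule above can be written out as a small calculation. This is a sketch only: the rate and the 5-second minimum come from the listing, while the function name and the use of whole seconds are assumptions (the page does not say how fractional seconds are rounded).

```python
from decimal import Decimal

RATE_PER_SECOND = Decimal("0.013")  # rate quoted in the listing
MIN_BILLED_SECONDS = 5              # minimum billed duration quoted in the listing


def remover_pro_cost(duration_s: int) -> Decimal:
    """Cost of one job: videos shorter than the minimum are billed as 5 seconds."""
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    billed_seconds = max(duration_s, MIN_BILLED_SECONDS)
    return RATE_PER_SECOND * billed_seconds


print(remover_pro_cost(3))   # 0.065 (minimum charge applies)
print(remover_pro_cost(12))  # 0.156
```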

Video

$1.500 / second

seedance-2-vip-text-to-video

SD 2 Text-to-Video VIP (Pro) by ByteDance. Generates high-quality cinematic video from a text prompt with priority routing, native audio-visual sync, up to 2K resolution, and 4–15 second duration.

Video

$0.060 / second

seedance-pro-i2v-fast

Seedance Pro Fast is the high-speed image-to-video generation variant from ByteDance’s Seedance series. With this model you upload a reference image and—using a text prompt—generate short, dynamic video clips (typically 3-12 seconds) featuring smooth motion, cinematic camera moves, prompt-accurate actions, and high visual fidelity. It supports resolutions up to 1080p, multiple aspect ratios (16:9, 9:16, etc.), and rapid turnaround—ideal for social content, product motion, storytelling from a still, and fast prototyping.

Video

$0.340 / second

seedance-v1.5-pro-t2v

Seedance v1.5 Pro Text-to-Video generates high-quality cinematic videos directly from text prompts. It focuses on smooth motion, rich atmosphere, and coherent scene structure, making it ideal for fantasy worlds, sci-fi environments, surreal visuals, and cinematic storytelling shots with detailed lighting and depth.

Video

$1.250 / second

seedance-2-omni-reference-no-video

SD 2 Omni Reference by ByteDance. Generate videos using up to 9 image references and up to 3 audio references. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Video

$0.340 / second

seedance-v1.5-pro-video-extend

Seedance v1.5 Pro Video Extend continues an existing video by generating additional frames that match the original scene’s style, lighting, motion, and mood. It is designed for smooth temporal consistency, making it ideal for extending cinematic shots, atmospheric scenes, or slow camera moves without introducing visual jumps or style changes.

Video

$0.750 / second

seedance-2-t2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Video

$0.250 / second

omnihuman-1-5

Generate a realistic talking-head video from a portrait image and an audio track using KIE OmniHuman 1.5.

Video

$1.500 / second

seedance-2-omni-reference

SD 2.0 Omni Reference — generate videos with visual consistency using reference images, videos, and audio. Maintain character identity, style, and scene continuity. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.
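Before submitting an Omni Reference request, it can help to check that every `@image`, `@video`, and `@audio` tag in the prompt points at a reference that was actually supplied. The sketch below is a hypothetical client-side check, not part of the MuAPI API. The limits (9 images, 3 videos, 3 audio clips) come from the listing; the assumption that tags are numbered from 1 in upload order is inferred from the `@image1` examples.

```python
import re

# Per-kind limits as stated in the listing.
LIMITS = {"image": 9, "video": 3, "audio": 3}
# Matches tags such as @image1, @video2, @audio3.
TAG = re.compile(r"@(image|video|audio)(\d+)")


def check_references(prompt: str, supplied: dict) -> list:
    """Return a list of problems; an empty list means the prompt looks consistent.

    `supplied` maps a kind ("image", "video", "audio") to the number of
    references of that kind attached to the request.
    """
    problems = []
    for kind, limit in LIMITS.items():
        count = supplied.get(kind, 0)
        if count > limit:
            problems.append(f"too many {kind} references: {count} > {limit}")
    for kind, num in TAG.findall(prompt):
        index = int(num)
        # Assumption: tags are 1-based and follow upload order.
        if index < 1 or index > supplied.get(kind, 0):
            problems.append(f"@{kind}{num} has no matching upload")
    return problems


print(check_references("@image1 dances to @audio1", {"image": 1, "audio": 1}))  # []
print(check_references("@image3 waves", {"image": 2}))  # ['@image3 has no matching upload']
```

Returning a list of problems rather than raising on the first one lets a caller report every mismatch in a single pass.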

Video

$0.600 / second

seedance-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Video

$0.750 / second

seedance-2-image-to-video-fast

SD 2 Image-to-Video (Fast) by ByteDance. Quickly animates a start-frame image into video with 4–15 second duration at reduced cost.

Video

$1.250 / second

seedance-2-first-last-frame

SD 2 First & Last Frame (Pro) by ByteDance. Generate video that transitions between two reference images. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Video

$0.750 / second

seedance-2-omni-reference-no-video-fast

SD 2 Omni Reference (Fast) by ByteDance. Quickly generate videos using up to 9 image references and up to 3 audio references at reduced cost. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Video

$1.050 / second

seedance-2-vip-image-to-video-fast

SD 2 Image-to-Video VIP Fast by ByteDance. Faster animation of a start-frame image with priority routing, 4–15 second duration, and 2K resolution.

Video

$1.050 / second

seedance-2-vip-omni-reference-fast

SD 2 Omni Reference VIP Fast by ByteDance. Faster video generation using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3.

Video

$3.375 / second

seedance-2-vip-image-to-video-1080p

SD 2 Image-to-Video VIP 1080p by ByteDance. Animates a still image into a cinematic 1080p video with priority routing and 4–15 second duration.

Video

$3.375 / second

seedance-2-vip-first-last-frame-1080p

SD 2 First & Last Frame VIP 1080p by ByteDance. Generate 1080p video that transitions between two reference images with priority routing. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Video

$0.400 / second

seedance-2.1-text-to-video

Seedance 2.1 Text-to-Video generates high-quality cinematic videos directly from text prompts. The model excels at complex scene composition, smooth temporal flow, and accurate prompt interpretation — producing vivid motion sequences up to 1080p with enhanced realism over the previous generation.

Video

$0.600 / second

seedance-2.5-text-to-video

Seedance 2.5 Text-to-Video is the most advanced text-to-video model, generating photorealistic 4K cinematic videos from detailed prompts. It offers extended clip durations up to 16 seconds, native audio synthesis, and unprecedented scene coherence — pushing the frontier of AI video creation.

Video

$6.750 / second

seedance-2-vip-first-last-frame-4k

SD 2 First & Last Frame VIP 4K by ByteDance. Generate 4K video that transitions between two reference images with priority routing. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Video

$0.600 / second

seedance-2.5-image-to-video

Seedance 2.5 Image-to-Video is the flagship next-generation model delivering photorealistic video from a single image. It supports native 4K output, advanced audio generation, multi-scene composition, and precise camera trajectory control — setting a new benchmark in AI video quality.

Video

$0.400 / second

seedance-2.1-image-to-video

Seedance 2.1 Image-to-Video converts a single still image into a high-fidelity cinematic video. This next-generation model delivers enhanced temporal consistency, improved motion realism, and smarter prompt adherence — producing smooth, coherent clips up to 1080p.

Video

$6.750 / second

seedance-2-vip-text-to-video-4k

SD 2 Text-to-Video VIP 4K by ByteDance. Generates ultra-high-resolution 4K video from a text prompt with priority routing, native audio-visual sync, and 4–15 second duration.

Video

$6.750 / second

seedance-2-vip-omni-reference-4k

SD 2 Omni Reference VIP 4K by ByteDance. Generate 4K ultra-HD videos using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3.

Video

$1.500 / second

seedance-2-spicy-text-to-video

Seedance 2 Spicy Text-to-Video by ByteDance. Same VIP-tier priority routing, native audio-visual sync, and up to 2K resolution as Seedance 2 VIP, with reduced content-safety filtering for more creative freedom.

Video

$0.150 / second

seedance-2-mini-omni-reference

Seedance 2 Mini Omni Reference generates video from a text prompt with optional image, video, and audio references. Cost-efficient mini-tier model for reference-driven workflows.

Video

$1.050 / second

seedance-2-spicy-text-to-video-fast

Seedance 2 Spicy Text-to-Video Fast by ByteDance. The quickest Spicy-tier text-to-video generation, with reduced content-safety filtering and the same fast queue as Seedance 2 VIP Fast.

Video

$1.500 / second

seedance-2-spicy-image-to-video

Seedance 2 Spicy Image-to-Video by ByteDance. Animates a start frame into cinematic video with VIP-tier priority routing and up to 2K resolution, with reduced content-safety filtering for more creative freedom.

Video

$0.600 / second

seedance-2.5-spicy-image-to-video

Seedance 2.5 Spicy Image-to-Video is the relaxed-moderation variant of the Seedance 2.5 flagship model. It animates a single image into photorealistic 4K video with reduced content-safety filtering and more dramatic, higher-contrast motion than the standard tier — while keeping native audio generation and precise camera trajectory control.

Video

$0.600 / second

seedance-2.5-spicy-text-to-video

Seedance 2.5 Spicy Text-to-Video is the relaxed-moderation variant of the Seedance 2.5 flagship model, generating photorealistic 4K cinematic video directly from a text prompt with reduced content-safety filtering and more dramatic, higher-contrast motion than the standard tier — while retaining native audio synthesis and extended clip durations.

Video

$1.050 / second

seedance-2-spicy-image-to-video-fast

Seedance 2 Spicy Image-to-Video Fast by ByteDance. The quickest Spicy-tier image animation, with reduced content-safety filtering and the same fast queue as Seedance 2 VIP Fast.

Video

$3.375 / second

seedance-2-vip-text-to-video-1080p

SD 2 Text-to-Video VIP 1080p by ByteDance. Generates cinematic 1080p video from a text prompt with priority routing, native audio-visual sync, and 4–15 second duration.

Video

$0.100 / second

seedance-lite-i2v

Seedance Lite I2V animates static images into short videos quickly, focusing on basic motion effects and efficient processing. It is best suited for fast demos or mobile-friendly use.

Video

$0.180 / second

seedance-pro-i2v

Seedance Pro I2V is an advanced model that animates still images into short videos, preserving intricate visual details and applying smooth motion dynamics. It is ideal for high-end visuals and cinematic edits.

Video

$0.100 / second

seedance-lite-t2v

Seedance Lite T2V offers quick video generation from text with decent visual quality and motion. Ideal for fast previews, prototyping, or lightweight use cases where speed matters more than fine detail.

Video

$0.180 / second

seedance-pro-t2v

Seedance Pro delivers high-fidelity video generation from text, producing rich visuals, smooth camera movement, and realistic scenes. Best for storytelling, content creation, and visual production.

Video

$0.100 / second

seedance-lite-reference-video

Seedance Lite's Reference-to-Video feature allows you to supply up to 4 images as reference inputs. The model intelligently blends aspects from these images to generate a cohesive, high-quality video.

Video

$0.060 / second

seedance-pro-t2v-fast

Seedance Pro Fast is ByteDance’s advanced text-to-video model that turns natural-language prompts into short, cinematic video clips with realistic motion, camera dynamics, and consistent scene detail.

Video

$0.340 / second

seedance-v1.5-pro-i2v

Seedance v1.5 Pro Image-to-Video converts a single still image into a smooth cinematic video clip. It preserves the original image’s composition, subject identity, and lighting while adding controlled camera motion, natural parallax, and environmental animation. This mode balances visual quality and motion complexity, making it ideal for cinematic scenes, fantasy worlds, sci-fi environments, and storytelling shots.

Video

$0.260 / second

seedance-v1.5-pro-i2v-fast

Seedance v1.5 Pro Image-to-Video Fast converts a single still image into a short cinematic video with quick generation speed. It preserves the original image’s composition, subject identity, and lighting while adding simple camera motion, light parallax, and subtle environmental animation.

Video

$0.260 / second

seedance-v1.5-pro-video-extend-fast

Seedance v1.5 Pro Video Extend Fast quickly extends an existing video by generating a short continuation that matches the original style, motion, and lighting. This mode prioritizes fast output and smooth continuity with minimal new motion, making it ideal for previews, quick edits, and lightweight shot extensions without complex effects.

Video

$0.750 / second

seedance-2-i2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Video

$1.500 / second

seedance-2-video-edit

SD 2.0 Video Edit modifies existing videos based on text prompts and optional reference images.

Video

$1.050 / second

seedance-2-extend

SD 2.0 Extend Video continues an existing SD 2.0 generated video seamlessly. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).
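The tag numbering for extensions is easy to get wrong because `@image1` is reserved for the source video's last frame. The sketch below shows one way to compute the tag for each user-supplied reference. The function name and the placeholder label are hypothetical, and the limit of 8 user images is inferred from the `@image2…@image9` range rather than stated outright.

```python
def extension_reference_tags(images=(), videos=(), audios=()):
    """Map prompt tags to references for an extend request.

    @image1 always refers to the last frame of the source video, so
    user-supplied images start at @image2.
    """
    # Inferred limits: @image2..@image9 leaves room for 8 user images.
    if len(images) > 8 or len(videos) > 3 or len(audios) > 3:
        raise ValueError("at most 8 images, 3 videos, and 3 audio clips")
    tags = {"@image1": "<last frame of source video>"}
    for i, name in enumerate(images, start=2):
        tags[f"@image{i}"] = name
    for i, name in enumerate(videos, start=1):
        tags[f"@video{i}"] = name
    for i, name in enumerate(audios, start=1):
        tags[f"@audio{i}"] = name
    return tags


tags = extension_reference_tags(images=["hat.png", "coat.png"], audios=["theme.mp3"])
for tag, source in tags.items():
    print(tag, "->", source)
```

With two images and one audio clip, the first user image is addressed as `@image2`, the second as `@image3`, and the audio clip as `@audio1`.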

Video

$1.440 / second

seedance-2-omni-reference-480p

SD 2.0 480p Omni Reference — generate videos with visual consistency using reference images, videos, and audio at 480p resolution. More cost-effective than the 720p variant. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Video

$1.250 / second

seedance-2-text-to-video

SD 2 Text-to-Video (Pro) by ByteDance. Generates high-quality cinematic video from a text prompt with native audio-visual sync, up to 2K resolution, and 4–15 second duration.

Video

$0.750 / second

seedance-2-first-last-frame-fast

SD 2 First & Last Frame (Fast) by ByteDance. Quickly generate video that transitions between reference images at reduced cost. Provide 1 or 2 images.

Video

$0.600 / second

seedance-2-t2v-480p

SD 2.0 480p text-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Video

$1.500 / second

seedance-2-vip-image-to-video

SD 2 Image-to-Video VIP (Pro) by ByteDance. Animates a start-frame image into a high-quality video with priority routing, native audio, 4–15 second duration, and 2K resolution.

Video

$0.750 / second

seedance-2-text-to-video-fast

SD 2 Text-to-Video (Fast) by ByteDance. Generates video from text at faster speeds with 4–15 second duration and 2K resolution.

Video

$1.500 / second

seedance-2-vip-first-last-frame

SD 2 First & Last Frame VIP (Pro) by ByteDance. Generate video that transitions between two reference images with priority routing. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Video

$1.050 / second

seedance-2-vip-first-last-frame-fast

SD 2 First & Last Frame VIP Fast by ByteDance. Faster generation of video transitions between two reference images with priority routing.

Video

$1.050 / second

seedance-2-vip-text-to-video-fast

SD 2 Text-to-Video VIP Fast by ByteDance. Faster generation from a text prompt with priority routing, 4–15 second duration, and 2K resolution.

Video

$1.500 / second

seedance-2-vip-omni-reference

SD 2 Omni Reference VIP (Pro) by ByteDance. Generate videos using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3. Also supports @omni-character:<char_id> for trained characters.

Video

$3.375 / second

seedance-2-vip-omni-reference-1080p

SD 2 Omni Reference VIP 1080p by ByteDance. Generate full HD videos using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3.

Video

$1.050 / second

seedance-2-vip-extend

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Video

$2.362 / second

seedance-2-vip-extend-1080p

SD 2.0 VIP Extend Video 1080p continues an existing SD 2.0 generated video seamlessly at 1080p resolution. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Image Generation Models

Image

$0.030 / generation

bytedance-seededit-v3

SeedEdit allows precise edits to images using masks and prompt guidance. Whether you're replacing backgrounds, changing clothing, or inpainting missing areas, SeedEdit ensures realistic, high-quality results with semantic control.

Image

$0.045 / generation

bytedance-seedream-5.0-pro-edit

Seedream 5.0 Pro Edit is ByteDance's flagship image editing model, extending Seedream 5.0 Lite Edit with higher-fidelity rendering, deeper visual reasoning, and finer typography control. It accepts one or more reference images and a natural-language instruction, targeting up to 2K output with stronger scene composition and richer prompt adherence for professional creative workflows.

Image

$0.040 / 1K tokens

bytedance-seedream-v4

Seedream v4 generates stunning, high-fidelity images from text prompts. It’s designed for creativity with strong support for realism, fantasy, and artistic styles.

Image

$0.033 / generation

bytedance-seedream-v5.0-edit

Seedream 5.0 Lite Edit is an advanced image transformation model by ByteDance, enabling precise, controllable edits using natural language. It specializes in high-fidelity style transfer (Anime, Cyberpunk, Fantasy), background swaps, and object modification while preserving original lighting, color tones, and character consistency for professional-grade creative reworks.

Image

$0.180 / generation

seedance-2-character

[Beta] Turn fictional character references into reusable video characters. Upload reference images and describe the outfit to get a character_id you can use in SD 2.0 Omni Reference.

Image

$0.500 / generation

seedance-2-omni-reference-train

Train a reusable character from a reference photo. Once complete, reference the character in Omni Reference video prompts using @omni-character:<request_id> to generate videos featuring that character consistently.

Image

$0.045 / 1K tokens

bytedance-seedream-5.0-pro

Seedream 5.0 Pro is ByteDance's flagship next-generation text-to-image model, extending Seedream 5.0 Lite with higher-fidelity rendering, deeper visual reasoning, and finer typography control. It targets up to 2K output with stronger scene composition, more consistent multi-character generation, and richer prompt adherence for professional creative workflows.

Image

$0.030 / 1K tokens

bytedance-seedream-v3

Seedream is designed for generating visually rich and artistic images from text prompts. It excels at fantasy, anime, surrealism, and vibrant color compositions — ideal for creative visuals, storyboards, and concept art.

Image

$0.040 / generation

bytedance-seedream-v4-edit

Seedream v4 Edit refines or transforms existing images based on a new prompt and a reference. Instead of masking, you provide a source image and describe how it should be altered — adjusting style, details, or replacing elements while keeping the subject consistent.

Image

$0.050 / generation

bytedance-seedream-v4.5-edit

Seedream-v4.5 Edit allows you to transform an existing image using natural-language instructions. It preserves the core composition, lighting, and style of the original while modifying only the requested elements — perfect for object replacement, environment changes, stylistic adjustments, and high-detail creative reworks.

Image

$0.050 / 1K tokens

bytedance-seedream-v4.5

Seedream-v4.5 is ByteDance’s advanced text-to-image diffusion model designed for generating high-detail, high-contrast, cinematic and stylized images. It excels at surreal fantasy concepts, sci-fi worlds, product visuals, photoreal scenes, and artistic compositions with strong prompt adherence and crisp detail.

Image

$0.033 / 1K tokens

bytedance-seedream-v5.0

Seedream 5.0 Lite is ByteDance’s next-generation text-to-image model, delivering high-fidelity AI art with advanced visual reasoning and precise typography. Supporting up to 4K resolution and cinematic detail, it excels at complex scene construction, consistent character generation, and real-time knowledge integration for accurate, contextually relevant visuals.
