ByteDance AI models on MuAPI
Back to Providers
Explore/ByteDance Models
ByteDance

ByteDance AI API models

ByteDance AI models on MuAPI

Explore ByteDance models for chat, code, image and video generation, including Gemini, Nano Banana and Veo-style workflows available through MuAPI.

All models

55 Models

Video Generation Models

Video

$0.060 / second

seedance-pro-i2v-fast

Seedance Pro Fast is the high-speed image-to-video generation variant from ByteDance’s Seedance series. With this model you upload a reference image and—using a text prompt—generate short, dynamic video clips (typically 3-12 seconds) featuring smooth motion, cinematic camera moves, prompt-accurate actions, and high visual fidelity. It supports resolutions up to 1080p, multiple aspect ratios (16:9, 9:16, etc.), and rapid turnaround—ideal for social content, product motion, storytelling from a still, and fast prototyping.

Video

$0.340 / second

seedance-v1.5-pro-t2v

Seedance v1.5 Pro Text-to-Video generates high-quality cinematic videos directly from text prompts. It focuses on smooth motion, rich atmosphere, and coherent scene structure, making it ideal for fantasy worlds, sci-fi environments, surreal visuals, and cinematic storytelling shots with detailed lighting and depth.

Video

$0.260 / second

seedance-v1.5-pro-t2v-fast

Seedance v1.5 Pro Text-to-Video Fast generates short cinematic videos directly from text with an emphasis on speed and stability. It produces coherent scenes with simple camera motion, light environmental animation, and consistent lighting.

Video

$0.340 / second

seedance-v1.5-pro-video-extend

Seedance v1.5 Pro Video Extend continues an existing video by generating additional frames that match the original scene’s style, lighting, motion, and mood. It is designed for smooth temporal consistency, making it ideal for extending cinematic shots, atmospheric scenes, or slow camera moves without introducing visual jumps or style changes.

Video

$0.065 / second

sd-2-video-watermark-remover-pro

SD 2 Video Watermark Remover Pro uses the SD 2 AI model to remove watermarks, logos, and overlaid text from videos with high accuracy. Powered by ByteDance's SD 2 engine, it delivers superior quality compared to traditional inpainting approaches. Pricing: $0.013 per second, minimum charge for 5 seconds ($0.065).

Video

$1.250 / second

sd-2-first-last-frame

SD 2 First & Last Frame (Pro) by ByteDance. Generate video that transitions between two reference images. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Video

$1.500 / second

sd-2-vip-text-to-video

SD 2 Text-to-Video VIP (Pro) by ByteDance. Generates high-quality cinematic video from a text prompt with priority routing, native audio-visual sync, up to 2K resolution, and 4–15 second duration.

Video

$3.375 / second

sd-2-vip-image-to-video-1080p

SD 2 Image-to-Video VIP 1080p by ByteDance. Animates a still image into a cinematic 1080p video with priority routing, 4–15 second duration.

Video

$1.250 / second

sd-2-omni-reference-no-video

SD 2 Omni Reference by ByteDance. Generate videos using up to 9 image references and up to 3 audio references. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Video

$0.750 / second

sd-2-image-to-video-fast

SD 2 Image-to-Video (Fast) by ByteDance. Quickly animates a start-frame image into video with 4–15 second duration at reduced cost.

Video

$0.750 / second

sd-2-t2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Video

$0.025 / second

sd-2-watermark-remover

🎉 FREE for a limited time — Remove SD 2.0 watermarks from videos using LaMa AI inpainting. Automatically detects the watermark region, builds a precise mask via Canny edge detection, and inpaints each frame for artifact-free results. No credits deducted — requires a positive balance to access.

Video

$1.050 / second

sd-2-vip-image-to-video-fast

SD 2 Image-to-Video VIP Fast by ByteDance. Faster animation of a start-frame image with priority routing, 4–15 second duration, and 2K resolution.

Video

$0.600 / second

sd-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Video

$3.375 / second

sd-2-vip-first-last-frame-1080p

SD 2 First & Last Frame VIP 1080p by ByteDance. Generate 1080p video that transitions between two reference images with priority routing. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Video

$1.500 / second

sd-2-omni-reference

SD 2.0 Omni Reference — generate videos with visual consistency using reference images, videos, and audio. Maintain character identity, style, and scene continuity. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Video

$0.750 / second

sd-2-omni-reference-no-video-fast

SD 2 Omni Reference (Fast) by ByteDance. Quickly generate videos using up to 9 image references and up to 3 audio references at reduced cost. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Video

$1.050 / second

sd-2-vip-omni-reference-fast

SD 2 Omni Reference VIP Fast by ByteDance. Faster video generation using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3.

Video

$3.375 / second

sd-2-vip-text-to-video-1080p

SD 2 Text-to-Video VIP 1080p by ByteDance. Generates cinematic 1080p video from a text prompt with priority routing, native audio-visual sync, and 4–15 second duration.

Video

$0.100 / second

seedance-lite-i2v

Seedance Lite I2V version animates static images into short videos quickly, focusing on basic motion effects and efficient processing—best suited for fast demos or mobile-friendly use.

Video

$0.180 / second

seedance-pro-i2v

Seedance Pro I2V advanced model animates still images into stunning short videos, preserving intricate visual details and applying smooth motion dynamics, ideal for high-end visuals and cinematic edits.

Video

$0.100 / second

seedance-lite-t2v

Seedance Lite T2V offers quick video generation from text with decent visual quality and motion. Ideal for fast previews, prototyping, or lightweight use cases where speed matters more than fine detail.

Video

$0.180 / second

seedance-pro-t2v

Seedance Pro delivers high-fidelity video generation from text, producing rich visuals, smooth camera movement, and realistic scenes. Best for storytelling, content creation, and visual production.

Video

$0.100 / second

seedance-lite-reference-video

Seedance Lite's Reference-to-Video feature allows you to supply up to 4 images as reference inputs. The model intelligently blends aspects from these images to generate a cohesive, high-quality video.

Video

$0.060 / second

seedance-pro-t2v-fast

Seedance Pro Fast is ByteDance’s advanced text-to-video model that turns natural-language prompts into short, cinematic video clips with realistic motion, camera dynamics, and consistent scene detail.

Video

$0.340 / second

seedance-v1.5-pro-i2v

Seedance v1.5 Pro Image-to-Video converts a single still image into a smooth cinematic video clip. It preserves the original image’s composition, subject identity, and lighting while adding controlled camera motion, natural parallax, and environmental animation. This mode balances visual quality and motion complexity, making it ideal for cinematic scenes, fantasy worlds, sci-fi environments, and storytelling shots.

Video

$0.260 / second

seedance-v1.5-pro-i2v-fast

Seedance v1.5 Pro Image-to-Video Fast converts a single still image into a short cinematic video with quick generation speed. It preserves the original image’s composition, subject identity, and lighting while adding simple camera motion, light parallax, and subtle environmental animation.

Video

$0.260 / second

seedance-v1.5-pro-video-extend-fast

Seedance v1.5 Pro Video Extend Fast quickly extends an existing video by generating a short continuation that matches the original style, motion, and lighting. This mode prioritizes fast output and smooth continuity with minimal new motion, making it ideal for previews, quick edits, and lightweight shot extensions without complex effects.

Video

$0.750 / second

sd-2-i2v

SD 2.0 is the latest multimodal video generation model by ByteDance, offering advanced camera control, native audio-video sync, and high-resolution output.

Video

$1.500 / second

sd-2-video-edit

SD 2.0 Video Edit modifies existing videos based on text prompts and optional reference images.

Video

$1.050 / second

sd-2-extend

SD 2.0 Extend Video continues an existing SD 2.0 generated video seamlessly. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Video

$1.440 / second

sd-2-omni-reference-480p

SD 2.0 480p Omni Reference — generate videos with visual consistency using reference images, videos, and audio at 480p resolution. More cost-effective than the 720p variant. Supports up to 9 images, 3 video clips, and 3 audio clips. Use @image1, @video1, @audio1 syntax in your prompt.

Video

$1.250 / second

sd-2-text-to-video

SD 2 Text-to-Video (Pro) by ByteDance. Generates high-quality cinematic video from a text prompt with native audio-visual sync, up to 2K resolution, and 4–15 second duration.

Video

$1.250 / second

sd-2-image-to-video

SD 2 Image-to-Video (Pro) by ByteDance. Animates a start-frame image into a high-quality video with native audio, 4–15 second duration, and 2K resolution.

Video

$0.750 / second

sd-2-first-last-frame-fast

SD 2 First & Last Frame (Fast) by ByteDance. Quickly generate video that transitions between reference images at reduced cost. Provide 1 or 2 images.

Video

$0.600 / second

sd-2-t2v-480p

SD 2.0 480p text-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Video

$1.500 / second

sd-2-vip-image-to-video

SD 2 Image-to-Video VIP (Pro) by ByteDance. Animates a start-frame image into a high-quality video with priority routing, native audio, 4–15 second duration, and 2K resolution.

Video

$0.750 / second

sd-2-text-to-video-fast

SD 2 Text-to-Video (Fast) by ByteDance. Generates video from text at faster speeds with 4–15 second duration and 2K resolution.

Video

$1.500 / second

sd-2-vip-first-last-frame

SD 2 First & Last Frame VIP (Pro) by ByteDance. Generate video that transitions between two reference images with priority routing. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Video

$1.050 / second

sd-2-vip-first-last-frame-fast

SD 2 First & Last Frame VIP Fast by ByteDance. Faster generation of video transitions between two reference images with priority routing.

Video

$1.050 / second

sd-2-vip-text-to-video-fast

SD 2 Text-to-Video VIP Fast by ByteDance. Faster generation with priority routing from a text prompt, 4–15 second duration and 2K resolution.

Video

$1.500 / second

sd-2-vip-omni-reference

SD 2 Omni Reference VIP (Pro) by ByteDance. Generate videos using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3. Also supports @omni-character:<char_id> for trained characters.

Video

$3.375 / second

sd-2-vip-omni-reference-1080p

SD 2 Omni Reference VIP 1080p by ByteDance. Generate full HD videos using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3.

Video

$1.050 / second

sd-2-vip-extend

SD 2.0 VIP Extend Video continues an existing SD 2.0 generated video seamlessly at 720p. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Video

$2.362 / second

sd-2-vip-extend-1080p

SD 2.0 VIP Extend Video 1080p continues an existing SD 2.0 generated video seamlessly at 1080p resolution. Provide the original request ID and an optional prompt to guide the extension — the model preserves visual style, motion, characters, and audio consistency across the new segment. Optional image, video, and audio references can be supplied to steer the extension: user-supplied references map to @image2…@image9, @video1…@video3, @audio1…@audio3 in the prompt (the source video's last frame is always @image1).

Image Generation Models

bytedance-seedream-v4
Image

$0.040 / 1K tokens

bytedance-seedream-v4

Seedream v4 generates stunning, high-fidelity images from text prompts. It’s designed for creativity with strong support for realism, fantasy, and artistic styles.

bytedance-seededit-v3
Image

$0.030 / generation

bytedance-seededit-v3

Seededit allows precise edits to images using masks and prompt guidance. Whether you're replacing backgrounds, changing clothing, or inpainting missing areas, Seededit ensures realistic, high-quality results with semantic control.

bytedance-seedream-v5.0-edit
Image

$0.033 / generation

bytedance-seedream-v5.0-edit

Seedream 5.0 Lite Edit is an advanced image transformation model by ByteDance, enabling precise, controllable edits using natural language. It specializes in high-fidelity style transfer (Anime, Cyberpunk, Fantasy), background swaps, and object modification while preserving original lighting, color tones, and character consistency for professional-grade creative reworks.

sd-2-omni-reference-train
Image

$0.500 / generation

sd-2-omni-reference-train

Train a reusable character from a reference photo. Once complete, reference the character in Omni Reference video prompts using @omni-character:<request_id> to generate videos featuring that character consistently.

bytedance-seedream-v3
Image

$0.030 / 1K tokens

bytedance-seedream-v3

Seedream is designed for generating visually rich and artistic images from text prompts. It excels at fantasy, anime, surrealism, and vibrant color compositions — ideal for creative visuals, storyboards, and concept art.

bytedance-seedream-v4-edit
Image

$0.040 / generation

bytedance-seedream-v4-edit

Seedream v4 Edit refines or transforms existing images based on a new prompt and a reference. Instead of masking, you provide a source image and describe how it should be altered — adjusting style, details, or replacing elements while keeping the subject consistent.

bytedance-seedream-v4.5-edit
Image

$0.050 / generation

bytedance-seedream-v4.5-edit

Seedream-v4.5 Edit allows you to transform an existing image using natural-language instructions. It preserves the core composition, lighting, and style of the original while modifying only the requested elements — perfect for object replacement, environment changes, stylistic adjustments, and high-detail creative reworks.

bytedance-seedream-v4.5
Image

$0.050 / 1K tokens

bytedance-seedream-v4.5

Seedream-v4.5 is ByteDance’s advanced text-to-image diffusion model designed for generating high-detail, high-contrast, cinematic and stylized images. It excels at surreal fantasy concepts, sci-fi worlds, product visuals, photoreal scenes, and artistic compositions with strong prompt adherence and crisp detail.

bytedance-seedream-v5.0
Image

$0.033 / 1K tokens

bytedance-seedream-v5.0

Seedream 5.0 Lite is ByteDance’s next-generation text-to-image model, delivering high-fidelity AI art with advanced visual reasoning and precise typography. Supporting up to 4K resolution and cinematic detail, it excels at complex scene construction, consistent character generation, and real-time knowledge integration for accurate, contextually relevant visuals.

sd-2-character
Image

$0.180 / generation

sd-2-character

[Beta] Turn fictional character references into reusable video characters. Upload reference images and describe the outfit to get a character_id you can use in SD 2.0 Omni Reference.