Kling AI models on MuAPI

Kling AI API models

Explore Kling models for image and video generation, including text-to-video, image-to-video, avatar, video-edit, and motion-control workflows available through MuAPI.

41 Models

Video Generation Models

Video

$0.400 / second

kling-v2.1-pro-i2v

Kling 2.1 Pro is the high-end version of Kuaishou’s video generation model, offering enhanced realism, longer motion sequences, and cinematic quality. In I2V mode, it animates static images with fluid environmental effects.

Video

$0.350 / second

kling-v1-avatar-standard

Kling AI Avatar Standard creates talking avatar videos from a single image + audio input. It supports realistic humans, animals, or stylized characters, producing lip-synced avatar videos easily.

Video

$0.650 / second

kling-v1-avatar-pro

Kling AI Avatar Pro is the premium tier for making high-quality talking avatars. You upload a character image plus an audio file, and the model generates a realistic avatar video with lip-sync.

Video

$0.450 / second

kling-v2.5-turbo-pro-i2v

Kling 2.5 Turbo Pro: Top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Video

$0.225 / second

kling-v2.1-standard-i2v

Kling 2.1 Standard (developed by Kuaishou) brings static images to life by generating smooth, realistic video clips from a single frame. It captures subtle motion, background dynamics, and camera movement to produce professional-looking animations — ideal for portraits, digital art, and cinematic illustrations.

Video

$0.750 / second

kling-v2-avatar-pro

AI-Avatar v2 Pro takes a reference image of a person or character and an audio dialogue clip, then generates a realistic talking-avatar video. It preserves identity, lip-syncs accurately to the audio, and adds natural head movement, eye motion, expressions, and cinematic lighting.

Video

$1.090 / second

kling-o1-standard-video-edit

Kling O1 Standard Video-to-Video Edit modifies an existing video while preserving its original structure, motion, and realism. It is designed for subtle, stable edits such as object replacement, background changes, lighting adjustments, or small visual tweaks. This mode prioritizes temporal consistency and natural motion.

Video

$0.450 / second

kling-v2.5-turbo-pro-t2v

Kling 2.5 Turbo Pro: Top-tier text-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Video

$0.145 / second

kling-v2.6-pro-motion-control

Kling v2.6 Pro Motion Control allows precise control over camera movement, subject motion, and scene dynamics during video generation. Instead of leaving motion fully implicit, this mode lets you explicitly define how the camera moves (pan, tilt, orbit, dolly, zoom) and how objects or characters behave over time.

Video

$0.720 / second

kling-o1-text-to-video

Kling O1 is a unified, multi-modal video generation engine that transforms natural language prompts into short cinematic video clips. It supports text-to-video generation with realistic motion, dynamic camera moves, and coherent scene rendering.

Video

$2.000 / second

kling-v3.0-4k-image-to-video

Kling 3.0 4K Image-to-Video animates a single input image into ultra-high-resolution 3840×2160 video with smooth camera motion, natural physics, and strong temporal consistency. 4K mode delivers the sharpest detail in Kling 3.0 — ideal for cinematic shots, product showcases, and premium content where pixel-level clarity matters.

Video

$0.280 / second

kling-v2.5-turbo-std-i2v

Kling 2.5 Turbo Std: Top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Video

$0.900 / second

kling-v2.6-pro-i2v

Kling-v2.6-Pro Image-to-Video transforms a single creative image into a short cinematic video. It preserves the original style, lighting, and composition while adding smooth camera motion, atmospheric effects, and dynamic environmental animation.

Video

$0.420 / second

kling-v3.0-omni-standard-text-to-video

Kling v3 Omni at 720P. Multi-image reference video generation — supply up to 4 images and reference them in your prompt with <<<image_N>>>. Apimart-backed.

Video

$0.420 / second

kling-v3.0-omni-standard-image-to-video

Kling v3 Omni at 720P. Multi-image reference video generation — supply up to 4 images and reference them in your prompt with <<<image_N>>>. Apimart-backed.
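
The `<<<image_N>>>` placeholder described for the Omni models can be checked before a request is sent. The sketch below is illustrative only: it assumes `N` is a 1-based index into the supplied images (the listing does not say), and the helper names `image_ref` and `check_prompt` are hypothetical, not part of any MuAPI or Kling SDK.

```python
import re

MAX_IMAGES = 4  # the listing says "supply up to 4 images"
_REF = re.compile(r"<<<image_(\d+)>>>")


def image_ref(n: int) -> str:
    """Format a placeholder for image n (assumed to be 1-based)."""
    return f"<<<image_{n}>>>"


def check_prompt(prompt: str, image_count: int) -> list:
    """Return the sorted image indices referenced in the prompt.

    Raises ValueError if more than MAX_IMAGES images are supplied or if
    the prompt references an image that was not supplied.
    """
    if not 0 <= image_count <= MAX_IMAGES:
        raise ValueError(f"expected 0 to {MAX_IMAGES} images, got {image_count}")
    refs = sorted({int(n) for n in _REF.findall(prompt)})
    missing = [n for n in refs if not 1 <= n <= image_count]
    if missing:
        raise ValueError(f"prompt references images that were not supplied: {missing}")
    return refs


prompt = (
    f"The cat from {image_ref(1)} walks down the street shown in {image_ref(2)}."
)
print(check_prompt(prompt, image_count=2))  # [1, 2]
```

Validating locally catches a mismatched reference before any per-second charge is incurred.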

Video

$0.720 / second

kling-o1-reference-to-video

Kling O1’s Reference-to-Video mode generates a dynamic video using one or multiple reference images as the visual foundation. It preserves identity, style, composition, and key visual details from the references while adding realistic camera motion, environment dynamics, and scene animation.

Video

$0.585 / second

kling-o1-video-edit-fast

Video Edit Fast is the lightweight, high-speed editing mode of Kling O1. It performs quick edits on an existing video without heavy processing—ideal for fast object replacements, light enhancements, color tweaks, or simple visual adjustments. This mode focuses on speed over complex reconstruction, making it suitable for rapid iterations, previews, and small edits while preserving the original video’s motion and structure.

Video

$0.720 / second

kling-v3.0-standard-text-to-video

Kling 3.0 Standard Text-to-Video generates smooth, realistic videos from text with stable motion and natural behavior. It works best with clear subjects, simple actions, and one continuous scene, making it ideal for cute animals, small actions, and calm cinematic moments.

Video

$0.900 / second

kling-v2.6-pro-t2v

Kling-v2.6-Pro Text-to-Video generates high-fidelity cinematic videos directly from text prompts. It excels at complex compositions, dramatic lighting, fluid camera motion, and visually rich fantasy or sci-fi sequences.

Video

$0.100 / second

kling-v3.0-std-motion-control

Kling V3.0 Standard Motion Control allows for precise control over the camera and subject movement in generated videos. Powered by the latest Kling V3.0 architecture for improved temporal consistency and quality.

Video

$0.500 / second

kling-o1-standard-image-to-video

Kling O1 Standard Image-to-Video converts a single still image into a short, natural-looking video clip. It preserves the original image’s composition and lighting while adding subtle camera motion, gentle parallax, and light environmental animation. This mode focuses on realism and stability rather than heavy effects, making it ideal for clean cinematic shots, environments, characters, and product visuals.

Video

$0.720 / second

kling-v3.0-pro-text-to-video

Kling 3.0 Pro is a high-end video generation model capable of producing longer, smoother, and more realistic cinematic videos with strong motion consistency. It handles complex scenes, realistic physics, natural camera movement, and detailed environments better than earlier versions.

Video

$0.720 / second

kling-v3.0-standard-image-to-video

Kling 3.0 Standard Image-to-Video animates a single input image into a short, realistic video with smooth, stable motion. It prioritizes temporal consistency, natural physics, and subtle camera movement, making it ideal for everyday scenes, travel moments, people, vehicles, and calm cinematic shots.

Video

$0.560 / second

kling-v3.0-omni-pro-image-to-video

Kling v3 Omni at 1080P. Multi-image reference video generation — supply up to 4 images and reference them in your prompt with <<<image_N>>>. Apimart-backed.

Video

$1.200 / second

kling-v2.1-master-t2v

Kling 2.1 Master’s T2V mode allows users to generate vivid, high-quality videos from detailed text prompts. It supports dynamic scenes, natural motion, and cinematic quality — perfect for storytelling, ads, or content creation from imagination alone.

Video

$0.300 / second

kling-v2.1-master-i2v

Kling 2.1 Master’s I2V animates a still image into a coherent video sequence. It interprets motion, environment, and context to create realistic, visually stunning video outputs — ideal for animating portraits, scenes, or concept art.

Video

$0.720 / second

kling-o1-image-to-video

Kling O1’s Image-to-Video mode transforms one or more reference images into short cinematic video clips by adding natural motion, camera choreography, and scene dynamics while preserving subject identity and visual consistency. It supports start/end frames.

Video

$1.090 / second

kling-o1-video-edit

Kling O1 Video Edit takes an existing video clip plus an instruction prompt and edits or transforms the clip while preserving temporal coherence and subject identity. Typical edits include color grading, background replacement, object removal, slow motion, speed ramps, style transfer, subtle camera stabilization, and short extension/outro generation. Inputs can include the source video, an optional frame mask (for localized edits), a time range, and style/reference images.

Video

$0.350 / second

kling-v2-avatar-standard

AI-Avatar v2 Standard generates a talking-avatar video from a reference image and an audio dialogue. It performs accurate lip-sync, natural facial expressions, subtle head motion, blinking, and light emotional cues based on voice tone. This Standard version focuses on speed and natural realism.

Video

$0.720 / second

kling-o1-standard-reference-to-video

Kling O1 Standard Reference-to-Video generates a smooth, realistic video using one or multiple reference images as visual guidance. It preserves the visual identity, composition, and lighting from the references while adding subtle camera motion, natural parallax, and light environmental animation. This mode prioritizes stability and realism, making it ideal for character shots, environments, product visuals, and calm cinematic scenes.

Video

$0.450 / second

kling-v2.6-std-motion-control

Kling v2.6 Standard Motion Control allows precise control over camera movement, subject motion, and scene dynamics during video generation. Instead of leaving motion fully implicit, this mode lets you explicitly define how the camera moves (pan, tilt, orbit, dolly, zoom) and how objects or characters behave over time.

Video

$0.160 / second

kling-v3.0-pro-motion-control

Kling V3.0 Pro Motion Control provides the highest level of detail and control for video generation. Suitable for professional workflows requiring complex cinematic camera work and subject consistency.

Video

$0.720 / second

kling-v3.0-pro-image-to-video

Kling 3.0 Pro Image-to-Video animates a single input image into a high-quality, realistic video with smooth camera motion, natural physics, and strong temporal consistency. It excels at real-world scenes, human motion, environmental details, and cinematic movement while preserving the original image’s structure and lighting.

Video

$2.000 / second

kling-v3.0-4k-text-to-video

Kling 3.0 4K Text-to-Video generates ultra-high-resolution 3840×2160 cinematic video directly from text prompts with smooth, realistic motion and strong temporal consistency. Choose 4K when you need the sharpest output Kling 3.0 can produce — perfect for high-end advertising, hero shots, and large-screen playback.

Video

$0.560 / second

kling-v3.0-omni-pro-text-to-video

Kling v3 Omni at 1080P. Multi-image reference video generation — supply up to 4 images and reference them in your prompt with <<<image_N>>>. Apimart-backed.

Video

$2.679 / second

kling-v3.0-omni-4k-text-to-video

Kling v3 Omni at 4K. Multi-image reference video generation — supply up to 4 images and reference them in your prompt with <<<image_N>>>. Apimart-backed.

Video

$2.679 / second

kling-v3.0-omni-4k-image-to-video

Kling v3 Omni at 4K. Multi-image reference video generation — supply up to 4 images and reference them in your prompt with <<<image_N>>>. Apimart-backed.
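
Because every model above is priced per second, a rough budget for a clip can be estimated by multiplying the listed rate by the clip length. The sketch below assumes billing is strictly linear in output seconds with no minimum charge or rounding, which the listing does not confirm. The function name `estimate_cost` is hypothetical, and only three prices from the listing are included for illustration.

```python
from decimal import Decimal

# USD per second of video, copied from the listing above (subset only).
PRICE_PER_SECOND = {
    "kling-v2.1-pro-i2v": Decimal("0.400"),
    "kling-v2.6-pro-motion-control": Decimal("0.145"),
    "kling-v3.0-omni-4k-image-to-video": Decimal("2.679"),
}


def estimate_cost(model: str, seconds: int) -> Decimal:
    """Estimate the USD cost of one clip.

    Assumption: cost = listed price * seconds, with no minimum or rounding.
    """
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    if model not in PRICE_PER_SECOND:
        raise KeyError(f"no listed price for model {model!r}")
    return PRICE_PER_SECOND[model] * seconds


for name in PRICE_PER_SECOND:
    print(f"{name}: ${estimate_cost(name, 5)} for a 5-second clip")
```

Decimal is used instead of float so that the three-decimal prices multiply without binary rounding error.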