Explore/muapi.ai/kling-v3.0-omni-standard-text-to-video

muapi/kling-v3.0-omni-standard-text-to-video

Text to Video

Kling v3 Omni at 720P. Multi-image reference video generation — supply up to 4 images and reference them in your prompt with <<<image_N>>>. Apimart-backed.

Input

Configure the model parameters below.

When enabled, generate native audio with the video (adds to cost).

Result

Price varies by duration and audio

DurationAudioCost
5sNo$0.42
5sYes$0.56
10sNo$0.84
10sYes$1.12
📝

Overview

About this model

Kling v3 Omni at Standard 720P (T2V). Multi-image reference video generation — reference up to 4 images in your prompt using <<<image_N>>>. Routes through apimart's managed Kling endpoint.

1Character Consistency: Keep a subject across multiple shots by referencing the same character image.
2Style Transfer: Use one image as the visual style reference and another as the subject.
3Storyboard-to-Video: Animate stills from a storyboard into a coherent clip.
4Product Showcase: Animate a product photo with prompted camera moves.
💰

Pricing & Value

Cost analysis

muapiapp$0.084/sec base / $0.112/sec with audio

Default 5-second clip costs $0.4200. Matches official Kling pricing.

Kling.ai (official)Same

muapiapp routes to apimart's Kling provider; pricing mirrors Kling.ai's published rate.

Replicate / Fal.aiNot available

Kling v3 Omni isn't generally available on Replicate or Fal.ai.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Promptstring

Text prompt. Reference images via <<<image_N>>> (1-indexed). If omitted, <<<image_1>>> is auto-prepended.

Default ValueA cyberpunk samurai crouched on a rooftop edge during a thunderstorm, glowing katana in hand. The samurai instantly launches forward across rooftops at extreme speed. Rain sprays behind each landing while he slices through neon signs and wall-runs across skyscrapers. The camera whips aggressively around every movement.
Aspect RatioEnum (3 options)

Aspect ratio of the output video.

Default Value16:9
DurationEnum (13 options)

Duration of the generated video in seconds.

Default Value5
Generate Audioboolean

When enabled, generate native audio with the video (adds to cost).

Default Valuefalse
📖

Implementation Guide

Developer documentation

How to Use Kling v3 Omni — Standard 720P T2V

  1. Write your prompt: A clear text description of the scene.

  2. Pick aspect ratio and duration: 16:9 / 9:16 / 1:1, 3–15 seconds.

  3. Toggle generate_audio: Disabled by default. Enabling it raises the per-second rate.

  4. Submit and poll: You'll receive a request_id immediately. Poll the result endpoint until status is completed.

Common Questions

Frequently asked

How do I reference multiple images in my prompt?

Use `<<<image_1>>>`, `<<<image_2>>>`, etc. — 1-indexed against the order you submitted in `images_list`. If you skip the references, the model auto-prepends `<<<image_1>>>` to your prompt.

Does audio cost extra?

Yes — base rate is $0.084/sec; with audio it's $0.112/sec.

What's the maximum duration?

15 seconds. Minimum is 3 seconds.