Kling 2.6

Native audio in a single pass: speech, ambience, and effects are generated directly in-frame without separate dubbing. Use it for budget clips and talking heads when multi-scene control is not needed.

Cost

from 3 tokens/s

VideoProvider: Kling
Run generation
curl -X POST https://api.givon.ai/api/v1/generations \
  -H "Authorization: Bearer $GIVON_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"type":"video","model":"kling-2.6","input":{"prompt":"cinematic drone shot over a city at night","aspectRatio":"9:16","resolution":"720p","duration":5,"audioEnabled":true}}'

Input fields

* required
prompt*prompt

Scene / motion description.

Type
string
Default
Allowed
up to 2500 chars
aspectRatioaspectRatio
Type
string
Default
9:16
Allowed
9:16, 16:9, 1:1, 4:3, 3:4
resolutionresolution
Type
string
Default
720p
Allowed
720p, 1080p
durationduration
Type
number
Default
5
Allowed
5, 10
audioEnabledaudioEnabled

Native audio is available only in Kling 2.6 variants where the selected price supports it.

Type
boolean
Default
true
Allowed
boolean
Start imagestartImage

Asset input for the startImage slot.

Type
string
Default
Allowed
image · asset, https, data

Cost

from 3 tokens/s
1080p4.2 tokens
1080p · Audio8.4 tokens
720p3 tokens/s
720p3 tokens
1080pdefault4 tokens/s
1080p · Audio8 tokens/s

The variant is selected automatically from request fields, so you do not need to send it.

Capabilities

Modestext_to_videoimage_to_video
Asset slotsstartImage:image

Run Kling 2.6

Get an API key and the same request shape will work across every model in the catalog.

Kling 2.6 API - video generation · Givon AI