HeyGen Avatar IV

Generates a talking avatar from a single photo. The model reads the vocal tone and rhythm of the speech, then produces matching facial expressions and hand gestures. Speech can be synced from text or from an existing voiceover.

Cost

3 tokens/s
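
As a rough illustration of the rate above, the sketch below estimates the token cost of a clip. It assumes billing is linear in the requested `duration` (in seconds), which this page does not state explicitly; `estimate_tokens` is a hypothetical helper, not part of any SDK.

```python
TOKENS_PER_SECOND = 3  # rate listed under "Cost"


def estimate_tokens(duration_seconds: int) -> int:
    """Estimate cost, ASSUMING tokens are charged per second of requested video."""
    # The duration field accepts whole seconds from 1 to 60.
    if not 1 <= duration_seconds <= 60:
        raise ValueError("duration must be between 1 and 60 seconds")
    return TOKENS_PER_SECOND * duration_seconds


print(estimate_tokens(1))   # 3
print(estimate_tokens(60))  # 180
```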

Video
Provider: HeyGen
Run generation
curl -X POST https://api.givon.ai/api/v1/generations \
  -H "Authorization: Bearer $GIVON_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"type":"video","model":"heygen-photo-avatar","input":{"prompt":"cinematic drone shot over a city at night","aspectRatio":"9:16","resolution":"1080p","duration":1,"avatarPhoto":"asset://asset_..."}}'
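
For readers calling the endpoint from code, the same request can be sketched in Python using only the standard library. The URL, headers, and body fields are taken from the curl example; the function names `build_request` and `run_generation` are hypothetical, and the response is assumed to be JSON because its schema is not documented on this page.

```python
import json
import urllib.request

API_URL = "https://api.givon.ai/api/v1/generations"  # endpoint from the curl example


def build_request(avatar_photo: str, prompt: str, api_key: str,
                  duration: int = 1) -> urllib.request.Request:
    """Build (but do not send) the POST request shown in the curl example."""
    payload = {
        "type": "video",
        "model": "heygen-photo-avatar",
        "input": {
            "prompt": prompt,
            "aspectRatio": "9:16",
            "resolution": "1080p",
            "duration": duration,
            "avatarPhoto": avatar_photo,
        },
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def run_generation(req: urllib.request.Request) -> dict:
    """Send the request. ASSUMPTION: the API replies with a JSON body."""
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Keeping request construction separate from sending makes it possible to inspect or log the payload before spending tokens. A call would look like `run_generation(build_request(photo_ref, "…", os.environ["GIVON_API_KEY"]))`, where `photo_ref` is an asset reference of the form used in the curl example.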

Input fields

* required
prompt*
Scene / motion description.
Type: string
Default: (none)
Allowed: string

aspectRatio
Type: string
Default: 9:16
Allowed: 9:16, 16:9

resolution
Type: string
Default: 1080p
Allowed: 1080p, 720p

duration
Type: number
Default: 1
Allowed: 1 to 60 seconds, in steps of 1

Avatar photo* (avatarPhoto)
Asset input for the avatarPhoto slot.
Type: string
Default: (none)
Allowed: image · asset, https, data

Speech audio (speechAudio)
Asset input for the speechAudio slot.
Type: string
Default: (none)
Allowed: voiceover · asset, https, data
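
The constraints listed for these fields can be checked client-side before a request is sent. The following is a minimal sketch, not the service's own validation: `validate_input` is a hypothetical helper, `duration` is assumed to be a whole number of seconds, and "asset, https, data" is assumed to mean the `asset://`, `https://`, and `data:` URI schemes.

```python
DEFAULTS = {"aspectRatio": "9:16", "resolution": "1080p", "duration": 1}
ALLOWED = {
    "aspectRatio": {"9:16", "16:9"},
    "resolution": {"1080p", "720p"},
}
# ASSUMPTION: "asset, https, data" refers to these URI prefixes.
ASSET_PREFIXES = ("asset://", "https://", "data:")


def validate_input(fields: dict) -> dict:
    """Apply the listed defaults and raise ValueError on any out-of-range value."""
    merged = {**DEFAULTS, **fields}

    # prompt and avatarPhoto are the two fields marked as required.
    for name in ("prompt", "avatarPhoto"):
        if not merged.get(name):
            raise ValueError(f"{name} is required")

    for name, allowed in ALLOWED.items():
        if merged[name] not in allowed:
            raise ValueError(f"{name} must be one of {sorted(allowed)}")

    duration = merged["duration"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(duration, bool) or not isinstance(duration, int):
        raise ValueError("duration must be a whole number of seconds")
    if not 1 <= duration <= 60:
        raise ValueError("duration must be between 1 and 60")

    # speechAudio is optional, so it is only checked when present.
    for slot in ("avatarPhoto", "speechAudio"):
        value = merged.get(slot)
        if value is not None and not value.startswith(ASSET_PREFIXES):
            raise ValueError(f"{slot} must start with one of {ASSET_PREFIXES}")

    return merged
```

The returned dictionary can be used directly as the `input` object of the request body.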

Capabilities

Modes: image_to_video
Asset slots: avatarPhoto (image, required), speechAudio (voiceover)

Run HeyGen Avatar IV

Get an API key and the same request shape will work across every model in the catalog.