HeyGen Avatar IV
Talking avatar from a single photo: the model reads vocal tone and rhythm, then builds lifelike expressions and hand gestures. Sync from text or an existing voiceover.
Cost
3 tokens/s
VideoProvider: HeyGen
Run generation
curl -X POST https://api.givon.ai/api/v1/generations \
-H "Authorization: Bearer $GIVON_API_KEY" \
-H "Content-Type: application/json" \
-d '{"type":"video","model":"heygen-photo-avatar","input":{"prompt":"cinematic drone shot over a city at night","aspectRatio":"9:16","resolution":"1080p","duration":1,"avatarPhoto":"asset://asset_..."}}'Input fields
* required| Field | Type | Default | Allowed |
|---|---|---|---|
prompt* promptScene / motion description. | string | — | string |
aspectRatio aspectRatio | string | 9:16 | 9:16, 16:9 |
resolution resolution | string | 1080p | 1080p, 720p |
duration duration | number | 1 | from 1 · up to 60 · step 1 seconds |
Avatar photo* avatarPhotoAsset input for the avatarPhoto slot. | string | — | image · asset, https, data |
Speech audio speechAudioAsset input for the speechAudio slot. | string | — | voiceover · asset, https, data |
prompt*
promptScene / motion description.
- Type
- string
- Default
- —
- Allowed
- string
aspectRatio
aspectRatio- Type
- string
- Default
- 9:16
- Allowed
- 9:16, 16:9
resolution
resolution- Type
- string
- Default
- 1080p
- Allowed
- 1080p, 720p
duration
duration- Type
- number
- Default
- 1
- Allowed
- from 1 · up to 60 · step 1 seconds
Avatar photo*
avatarPhotoAsset input for the avatarPhoto slot.
- Type
- string
- Default
- —
- Allowed
- image · asset, https, data
Speech audio
speechAudioAsset input for the speechAudio slot.
- Type
- string
- Default
- —
- Allowed
- voiceover · asset, https, data
Cost
3 tokens/sCapabilities
Modes
image_to_videoAsset slots
avatarPhoto:image*speechAudio:voiceoverRun HeyGen Avatar IV
Get an API key and the same request shape will work across every model in the catalog.