Gen-4 Act-Two

Pixio briefing

How to get the best out of Gen-4 Act-Two

Gen-4 Act-Two on Pixio is Runway’s character-driven video model. You provide a reference image of a character (or person) and a text prompt that describes how they should move or act; the model generates video that keeps the character consistent across the clip. Use it when you need a specific character or spokesperson to perform an action or deliver a scene—talking, gesturing, or moving—without character drift.

Use this when

You have a character reference (photo, illustration, or design) and need them to perform in video—talking, gesturing, walking, or acting.

You want character consistency—same face, look, and proportions across the generated clip.

You need motion and expression driven by a text prompt (e.g. “waves at camera”, “explains product with hand gestures”).

You’re building spokesperson, avatar, or character animation content without full lip-sync or voice (pair with Act-One or voice tools for speech).

Modes in Pixio

Mode	Input	Best for
Character to Video	One character reference image + prompt	Character performs the described action; consistency from reference

Options

Option	Values	Notes
Reference	One image (character/person)	Clear face and body; front or three-quarter view works best
Duration	Depends on backend	Check Pixio for limits
Prompt	Action, expression, camera	Describe what the character does, not their appearance
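
The prompt guidance in the table (action, expression, camera; no appearance) can be sketched as a small helper. This is an illustration only: `ActTwoPrompt`, `appearance_terms`, and the word list are hypothetical names invented for this example, not part of Pixio or Runway, and the appearance check is a rough keyword match.

```python
import re
from dataclasses import dataclass


@dataclass
class ActTwoPrompt:
    """Hypothetical container for the three prompt parts named in the table."""
    action: str
    expression: str = ""
    camera: str = ""

    def render(self) -> str:
        # Join only the parts that were provided, in action, expression, camera order.
        parts = [self.action, self.expression, self.camera]
        return ", ".join(p.strip() for p in parts if p.strip())


# Illustrative (not exhaustive) words that describe appearance rather than performance.
APPEARANCE_HINTS = {"hair", "eyes", "wearing", "dressed", "skin"}


def appearance_terms(prompt: str) -> list[str]:
    """Return appearance-related words in the prompt so they can be removed.

    The reference image already defines the look, so these words are redundant.
    """
    words = re.findall(r"[a-z]+", prompt.lower())
    return [w for w in words if w in APPEARANCE_HINTS]


if __name__ == "__main__":
    prompt = ActTwoPrompt(
        action="waves at camera",
        expression="warm smile",
        camera="slow push-in",
    )
    print(prompt.render())
    print(appearance_terms("A woman with red hair waves"))
```

Keeping the three parts separate makes it easy to vary one of them (for example, the camera move) between generations while the action stays fixed.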

Why Act-Two fits character-driven video

Act-Two is built for one character in, one character out: the reference image defines who we see, and the prompt defines what they do. The model keeps the character’s look consistent while animating motion and expression. Use it for spokesperson clips, character moments, or any shot where a specific person or character has to perform an action. For a talking head with lip-sync and voice, combine it with Runway Act-One or other voice/lip-sync tools.

When to use Gen-4 Act-Two vs other models

Scenario	Best choice
Character-driven clip from one reference	Gen-4 Act-Two
Talking head + lip-sync + voice	Fabric, Character 3, OmniHuman, or Act-One + voice
General image-to-video (no character focus)	Gen-4 (Image to Video) or Seedance 2 Pro
Restyle existing video	Gen-4 Aleph
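
The scenario table can also be read as a simple lookup. The sketch below is a hypothetical helper, not a Pixio feature: the dictionary keys, the `recommend` function, and the order in which the questions are asked are all assumptions made for illustration. Only the model names come from the table.

```python
# Scenario -> recommended models, copied from the table above.
# The dictionary keys are made-up labels for this example.
RECOMMENDATIONS = {
    "character_clip": ["Gen-4 Act-Two"],
    "talking_head": ["Fabric", "Character 3", "OmniHuman", "Act-One + voice"],
    "general_image_to_video": ["Gen-4 (Image to Video)", "Seedance 2 Pro"],
    "restyle_video": ["Gen-4 Aleph"],
}


def recommend(
    has_character_reference: bool,
    needs_lip_sync: bool = False,
    restyling_existing_video: bool = False,
) -> list[str]:
    """Pick a row of the table from three yes/no questions.

    The priority (restyle, then lip-sync, then character) is an assumption.
    """
    if restyling_existing_video:
        return RECOMMENDATIONS["restyle_video"]
    if needs_lip_sync:
        return RECOMMENDATIONS["talking_head"]
    if has_character_reference:
        return RECOMMENDATIONS["character_clip"]
    return RECOMMENDATIONS["general_image_to_video"]


if __name__ == "__main__":
    print(recommend(has_character_reference=True))
    print(recommend(has_character_reference=True, needs_lip_sync=True))
```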
