ReplicaTraining

Replica Training

Learn how to create the best personal replica with a high-quality training video.

Training Overview

To clone your personal replica, we first need you to submit a training video. A high-quality training video helps our AI model properly map your face and voice, resulting in a more realistic replica overall. Your training video will be one continuous video, containing the following, in-order:

15 seconds of silence (listening mode)

2 minute of talking

You need to upload a 9:16 or 16:9 video clip that meets the cloning training requirements on our platform, based on your personal needs.

Training typically takes around 4-5 hours to complete. You can check the status of your replica training by:

Enable cloned avatar from [Personal Replicas] when creating conversation You can click on [Create Replica] to create and submit training, and enter [Personal Replicas] to view the training results

How do I record 15 seconds of silence?

Your training footage should begin with 15 seconds of silence. Our model uses your silent footage to create a natural resting position for your replica’s head and improve its listening behavior. During this period:

Pretend that you are “actively listening” to someone
Incorporate small (but non-repetitive) head movements throughout the minute
Ensure that your lips are closed the entire time

How do I record 2 minute of talking?

We do not require a predefined script. You are welcome to discuss anything that showcases your natural speaking style and expertise.

How do I create a high-quality training video?

To ensure your replica is the best possible quality, follow the guidelines below before recording your training footage.

✔️Video Material Requirements:

Framing: The subject must be centered in the video.Both green screen and live-action shooting are acceptable
Hand Movements: Avoid hand gestures during normal speech segments.
Silent Segment: First 15 seconds must show completely silent performance (no speaking) with natural listening expressions:
- Subtle smiling
- Occasional blinking
- Slight nodding
Technical Specs: MP4 format, 2-3 minutes duration

✔️Shooting Guidelines:

Set up environment :

ensure that you are in a quiet, well-lit area without background movement.

Check that your face is evenly lit without any shadows. A large diffuse light works best for neutral lighting.
Avoid environments with background noise or reverb (e.g. air conditioning, construction).
Keep your background clear by removing moving objects and people.

Green Screen Setup:

Fully cover frame with wrinkle-free green screen
Eliminate shadows for optimal keying
Confirm perfect setup before recording

Hair & Accessories:

Avoid dangling earrings (interferes with lip-sync AI)
Use hairspray to control flyaway hairs (critical for clean keying)
Avoid colors close to green/yellow (e.g., yellow-green) when using a green screen.
If possible, avoid beards, glasses, high-collar shirts (e.g. turtlenecks) and accessories (e.g. hats).

Lighting & Positioning:

Use even “three-point lighting.” For shadow-free chin areas, add reflectors or foam boards (“mibo”).
Stand 1-2 meters from the green screen to prevent edge spill or shadows.

Recommended Camera Settings:

Resolution: 4K (3840×2160) preferred; minimum 1080P (1920×1080).
Frame Rate: 25 fps.
Lens: ~50mm (full-frame equivalent); use 85mm for a slimmer face effect.
Aperture: smaller aperture for green screen shoots. Adjust for background blur in real-scene shoots (ensure subject clarity).

Smartphones:

Shoot in highest quality (1080P+). iPhone users: Select “Most Compatible” format.

Record training video

The first 15 seconds of the training video must show the subject in complete silence (no speaking). Natural listening cues like subtle smiles, slow blinking, or slight nodding are allowed to simulate an engaged listening state。Next, read the consent script, followed by 2 minutes of talking

Aim for an engaging tone and a relaxed pace, while maintaining continuous eye contact with the camera.
Minimize body movement, such as hand gestures, head movement, jolts, etc.
Close your lips during pauses and at the end of sentences.
If you stumble, continue speaking. Perfection is not necessary!
Your training video will be one continuous video.You can record your training video in any language you prefer

Submit your training video

After ensuring that your training video fits our quality requirements, submit your video

Next Steps after Training

Upon submission, your replica will immediately begin training in the background. After around 4-6 hours, you can enter [Personal Replicas] to check your personal replica is ready for use. If you’re not happy with the results, be sure to contact us.

Congrats on finishing the training process — now explore generating videos or starting

API Documentation

Replica

Conversational Video Interface

iOS SDK

Android SDK

H5 SDK

Training Overview

How do I record 15 seconds of silence?

How do I record 2 minute of talking?

How do I create a high-quality training video?

✔️Video Material Requirements:

✔️Shooting Guidelines:

Set up environment :

Green Screen Setup:

Hair & Accessories:

Lighting & Positioning:

Recommended Camera Settings:

Smartphones:

Record training video

Submit your training video

Next Steps after Training

API Documentation

Replica

Conversational Video Interface

iOS SDK

Android SDK

H5 SDK

​Training Overview

​How do I record 15 seconds of silence?

​How do I record 2 minute of talking?

​How do I create a high-quality training video?

​✔️Video Material Requirements:

​✔️Shooting Guidelines:

​Set up environment :

​Green Screen Setup:

​Hair & Accessories:

​Lighting & Positioning:

​Recommended Camera Settings:

​Smartphones:

​Record training video

​Submit your training video

​Next Steps after Training

Training Overview

How do I record 15 seconds of silence?

How do I record 2 minute of talking?

How do I create a high-quality training video?

✔️Video Material Requirements:

✔️Shooting Guidelines:

Set up environment :

Green Screen Setup:

Hair & Accessories:

Lighting & Positioning:

Recommended Camera Settings:

Smartphones:

Record training video

Submit your training video

Next Steps after Training