Research Article
Realistic Speech-Driven Talking Video Generation with Personalized Pose
Table 1
Mean Opinion Score (MOS) from 100 participants on four questions. Q1: completeness of the body. Q2: clarity of the face. Q3: correlation of body movement with the audio. Q4: overall quality.