Real-Time Audio-Guided Multi-Face Reenactment

By: Jiangning Zhang; Xianfang Zeng; Chao Xu; Yong Liu

Audio-guided face reenactment aims to generate authentic target faces whose facial expressions match the input audio, and many learning-based methods have achieved this successfully. However, most methods can reenact only one particular person once trained, or they suffer from low-quality generation of the target images. Moreover, almost none of the current reenactment works consider model size and running speed, which are important for practical use. To address these challenges, we propose an efficient Audio-guided Multi-face reenactment model named AMNet, which can reenact target faces among multiple persons, taking the corresponding source faces and drive signals as inputs. Concretely, we design a Geometric Controller (GC) module to inject the drive signals so that the model can be optimized in an end-to-end manner and generate more authentic images. We also adopt a lightweight network for our face reenactor so that the model runs in real time on both CPU and GPU devices. Extensive experiments demonstrate our approach's superiority over existing methods, e.g., decreasing FID by 0.12 and increasing SSIM by 0.031 on average compared with APB2Face, while using 4× fewer parameters and achieving 4× faster CPU speed.

Audio-guided face reenactment aims to generate authentic target faces conditioned on audio information along with auxiliary pose and eye-blink signals, and it has promising applications such as animation production, virtual humans, and games. However, most current methods can reenact only one particular person once the training procedure is finished, or they suffer from low quality in the generated target images. Also, almost none of the current reenactment works take into account the model size and running speed that matter for practical use. This work focuses on solving these problems: we improve the previous APB2Face [1] into an efficient end-to-end model for audio-guided multi-face reenactment, where different target faces among multiple persons can be reenacted by a single unified model.
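To make the pipeline above concrete, the following is a minimal PyTorch-style sketch of the interface the paragraph describes: a Geometric Controller fuses the drive signals (audio features, head pose, eye-blink state) into a geometry code that conditions a lightweight reenactor on a source face, so any identity can be reenacted by a single model. All module names, layer sizes, and signal dimensions here are illustrative assumptions for exposition, not the authors' actual implementation.

# Illustrative sketch only; module names, layer sizes, and signal
# dimensions are assumptions, not the authors' AMNet code.
import torch
import torch.nn as nn

class GeometricController(nn.Module):
    """Fuses the drive signals (audio features, head pose, eye-blink
    state) into one geometry code that conditions the reenactor."""
    def __init__(self, audio_dim=256, pose_dim=3, blink_dim=2, geo_dim=128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(audio_dim + pose_dim + blink_dim, geo_dim),
            nn.ReLU(inplace=True),
            nn.Linear(geo_dim, geo_dim),
        )

    def forward(self, audio_feat, pose, blink):
        return self.mlp(torch.cat([audio_feat, pose, blink], dim=-1))

class AMNetSketch(nn.Module):
    """One unified model: the source face supplies the identity, the
    geometry code drives the expression; trainable end-to-end."""
    def __init__(self, geo_dim=128):
        super().__init__()
        self.gc = GeometricController(geo_dim=geo_dim)
        # A small encoder-decoder stands in for the lightweight
        # reenactor that enables real-time CPU/GPU inference.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, 32, 4, 2, 1), nn.ReLU(True),
            nn.Conv2d(32, 64, 4, 2, 1), nn.ReLU(True),
        )
        self.fuse = nn.Conv2d(64 + geo_dim, 64, 1)
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, 2, 1), nn.ReLU(True),
            nn.ConvTranspose2d(32, 3, 4, 2, 1), nn.Tanh(),
        )

    def forward(self, source_face, audio_feat, pose, blink):
        geo = self.gc(audio_feat, pose, blink)     # (B, geo_dim)
        feat = self.encoder(source_face)           # (B, 64, H/4, W/4)
        # Broadcast the geometry code over the spatial feature map.
        geo_map = geo[:, :, None, None].expand(-1, -1, *feat.shape[2:])
        return self.decoder(self.fuse(torch.cat([feat, geo_map], dim=1)))

# Usage: any identity is reenacted by swapping the source face.
model = AMNetSketch()
face = torch.randn(1, 3, 128, 128)
out = model(face, torch.randn(1, 256), torch.randn(1, 3), torch.randn(1, 2))
print(out.shape)  # torch.Size([1, 3, 128, 128])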
