Kim, SeonghyeonJung, SunjinSeo, KwanggyoonRibera, Roger Blanco iNoh, JunyongZhang, Fang-Lue and Eisemann, Elmar and Singh, Karan2021-10-142021-10-1420211467-8659https://doi.org/10.1111/cgf.14400https://diglib.eg.org:443/handle/10.1111/cgf14400Traditional approaches to retarget existing facial blendshape animations to other characters rely heavily on manually paired data including corresponding anchors, expressions, or semantic parametrizations to preserve the characteristics of the original performance. In this paper, inspired by recent developments in face swapping and reenactment, we propose a novel unsupervised learning method that reformulates the retargeting of 3D facial blendshape-based animations in the image domain. The expressions of a source model is transferred to a target model via the rendered images of the source animation. For this purpose, a reenactment network is trained with the rendered images of various expressions created by the source and target models in a shared latent space. The use of shared latent space enable an automatic cross-mapping obviating the need for manual pairing. Next, a blendshape prediction network is used to extract the blendshape weights from the translated image to complete the retargeting of the animation onto a 3D target model. Our method allows for fully unsupervised retargeting of facial expressions between models of different configurations, and once trained, is suitable for automatic real-time applications.Computing methodologiesAnimationComputer visionDeep Learning-Based Unsupervised Human Facial Retargeting10.1111/cgf.1440045-55