A Data Collection Protocol, Tool and Analysis for the Mapping of Speech Volume to Avatar Facial Animation

Miyawaki, Ryosuke; Perusquia-Hernandez, Monica; Isoyama, Naoya; Uchiyama, Hideaki; Kiyokawa, Kiyoshi

A Data Collection Protocol, Tool and Analysis for the Mapping of Speech Volume to Avatar Facial Animation

dc.contributor.author	Miyawaki, Ryosuke	en_US
dc.contributor.author	Perusquia-Hernandez, Monica	en_US
dc.contributor.author	Isoyama, Naoya	en_US
dc.contributor.author	Uchiyama, Hideaki	en_US
dc.contributor.author	Kiyokawa, Kiyoshi	en_US
dc.contributor.editor	Hideaki Uchiyama	en_US
dc.contributor.editor	Jean-Marie Normand	en_US
dc.date.accessioned	2022-11-29T07:25:17Z
dc.date.available	2022-11-29T07:25:17Z
dc.date.issued	2022
dc.description.abstract	Knowing the relationship between speech-related facial movement and speech is important for avatar animation. Accurate facial displays are necessary to convey perceptual speech characteristics fully. Recently, an effort has been made to infer the relationship between facial movement and speech with data-driven methodologies using computer vision. To this aim, we propose to use blendshape-based facial movement tracking, because it can be easily translated to avatar movement. Furthermore, we present a protocol for audio-visual and behavioral data collection and a tool running on WEB that aids in collecting and synchronizing data. As a start, we provide a database of six Japanese participants reading emotion-related scripts at different volume levels. Using this methodology, we found a relationship between speech volume and facial movement around the nose, cheek, mouth, and head pitch. We hope that our protocols, WEB-based tool, and collected data will be useful for other scientists to derive models for avatar animation.	en_US
dc.description.sectionheaders	Interaction
dc.description.seriesinformation	ICAT-EGVE 2022 - International Conference on Artificial Reality and Telexistence and Eurographics Symposium on Virtual Environments
dc.identifier.doi	10.2312/egve.20221273
dc.identifier.isbn	978-3-03868-179-3
dc.identifier.issn	1727-530X
dc.identifier.pages	27-34
dc.identifier.pages	8 pages
dc.identifier.uri	https://doi.org/10.2312/egve.20221273
dc.identifier.uri	https://diglib.eg.org:443/handle/10.2312/egve20221273
dc.publisher	The Eurographics Association	en_US
dc.rights	Attribution 4.0 International License
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject	CCS Concepts: Human-centered computing -> Visualization toolkits
dc.subject	Human centered computing
dc.subject	Visualization toolkits
dc.title	A Data Collection Protocol, Tool and Analysis for the Mapping of Speech Volume to Avatar Facial Animation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 027-034.pdf
Size:: 1.84 MB
Format:: Adobe Portable Document Format

Download

Collections

ICAT-EGVE2022