TY - GEN
T1 - Integration of Gesture Generation System Using Gesture Library with DIY Robot Design Kit
AU - Teshima, Hitoshi
AU - Wake, Naoki
AU - Thomas, Diego
AU - Nakashima, Yuta
AU - Baumert, David
AU - Kawasaki, Hiroshi
AU - Ikeuchi, Katsushi
N1 - Publisher Copyright: © 2022 IEEE.
PY - 2022
Y1 - 2022
AB - Conversational agents are considered a promising tool and are expected to improve the quality of communication by adding gestures to speech. Recent data-driven methods can attach gestures to arbitrary speech, but their output is still not in line with human intuition. We therefore propose a gesture transformation system that uses gesture types as intermediate information, based on psycholinguistic theory. We employ the gesture-first principle to create gesture clusters based on similarities among imagistic gestures, a type of gesture that depicts image-like motions and is considered to represent important concepts in conversation. Because this system explicitly takes into account both the gesture types recognized by a deep neural network (DNN) and the semantics of the sentence when selecting gestures, it is expected to output gestures that are more in line with human intuition than existing end-to-end systems, which have no such intermediate states. We prepared a DIY robot kit consisting of inexpensive parts so that conversational agents become available to ordinary users at home, and implemented the proposed gesture generation system on this robot. To evaluate the effectiveness of the conversational agent, we evaluated user impressions when using various media for conversation and confirmed the advantage of using our agent.
UR - http://www.scopus.com/inward/record.url?scp=85126258643&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126258643&partnerID=8YFLogxK
U2 - 10.1109/SII52469.2022.9708837
DO - 10.1109/SII52469.2022.9708837
M3 - Conference contribution
AN - SCOPUS:85126258643
T3 - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
SP - 361
EP - 366
BT - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
Y2 - 9 January 2022 through 12 January 2022
ER -