Toward relaying an affective Speech-to-Speech translator: Cross-language perception of emotional state represented by emotion dimensions
- 1 September 2014
Abstract
Affective speech-to-speech translation (S2ST) is to preserve the affective state conveyed in the speaker's message. The ultimate goal of this study is to construct an affective S2ST system that has the ability to transform the emotional states of a spoken utterance from one language to another language. A universal automatic speech-emotion-recognition system is required to detect emotional state regardless of language. Therefore, this study investigates commonalities and differences of emotion perception across multi-languages. Thirty subjects from three countries, Japan, China and Vietnam, evaluate three emotional speech databases, Japanese, Chinese and German, in valence-activation space. The results reveal that directions from neutral to other emotions are similar among subjects groups. However, the estimated degree of emotional state depend on the expressed emotional styles. Moreover, neutral positions were significantly different among subjects groups. Thus, directions and distances from neutral to other emotions could be adopted as features to recognize emotional states for multi-languages.Keywords
This publication has 7 references indexed in Scilit:
- Cross-lingual speech emotion recognition system based on a three-layer model for human perceptionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2013
- Constructing a spoken dialogue corpus for studying paralinguistic information in expressive conversation and analyzing its statistical/acoustic characteristicsSpeech Communication, 2010
- NICT/ATR Chinese-Japanese-English speech-to-speech translation systemTsinghua Science and Technology, 2008
- Comparison of Japanese expressive speech perception by Japanese and Taiwanese listenersThe Journal of the Acoustical Society of America, 2008
- Toward detecting emotions in spoken dialogsIEEE Transactions on Speech and Audio Processing, 2005
- Acoustic profiles in vocal emotion expression.Journal of Personality and Social Psychology, 1996
- A description of the affective quality attributed to environments.Journal of Personality and Social Psychology, 1980