Learning with synthesized speech for automatic emotion recognition

14 years 7 months ago

Download felix.syntheticspeech.de

Data sparseness is an ever dominating problem in automatic emotion recognition. Using artiﬁcially generated speech for training or adapting models could potentially ease this: though less natural than human speech, one could synthesize the exact spoken content in different emotional nuances - of many speakers and even in different languages. To investigate chances, the phonemisation components Txt2Pho and openMary are used with Emoﬁlt and Mbrola for emotional speech synthesis. Analysis is realized with our Munich open Emotion and Affect Recognition toolkit. As test set we gently limit to the acted Berlin and eNTERFACE databases for the moment. In the result synthesized speech can indeed be used for the recognition of human emotional speech.

Bjoern Schuller, Felix Burkhardt

Real-time Traffic

Automatic Emotion Recognition | Emotional Speech | Exact Spoken Content | ICASSP 2010 | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Bjoern Schuller, Felix Burkhardt

Comments (0)

Sciweavers

Learning with synthesized speech for automatic emotion recognition

Automatic Emotion Recognition | Emotional Speech | Exact Spoken Content | ICASSP 2010 | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers