Comparison of approaches for instrumentally predicting the quality of text-to-speech systems


In this paper, we compare and combine different approaches for instrumentally predicting the perceived quality of Text-to-Speech systems. First, a log-likelihood is determined by comparing features extracted from the synthesized speech signal with features trained on natural speech. Second, parameters are extracted which capture quality-relevant degradations of the synthesized speech signal. Both approaches are combined and evaluated on three auditory test databases. The results show that auditory quality judgments can in many cases be predicted with a sufficiently high accuracy and reliability, but that there are considerable differences, mainly between male and female speech samples.
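The first approach in the abstract, scoring synthesized speech against a model of natural speech, can be illustrated with a minimal sketch. The abstract does not say which features or which statistical model the authors used, so the diagonal-covariance Gaussian, the toy two-dimensional feature frames, and the function names below are all assumptions made for illustration only.

```python
import math

def fit_diag_gaussian(frames):
    """Estimate per-dimension mean and variance from feature frames.

    Stand-in (assumption) for "features trained on natural speech".
    """
    n = len(frames)
    dims = len(frames[0])
    means = [sum(f[d] for f in frames) / n for d in range(dims)]
    variances = [
        # Floor the variance so the log-likelihood stays finite.
        max(sum((f[d] - means[d]) ** 2 for f in frames) / n, 1e-6)
        for d in range(dims)
    ]
    return means, variances

def avg_log_likelihood(frames, means, variances):
    """Average per-frame log-likelihood under the diagonal Gaussian."""
    total = 0.0
    for frame in frames:
        for x, m, v in zip(frame, means, variances):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(frames)

# Hypothetical toy feature frames; real systems would extract these from audio.
natural = [[1.0, 2.0], [1.2, 1.8], [0.8, 2.2], [1.0, 2.0]]
close_to_natural = [[1.1, 2.1], [0.9, 1.9]]
far_from_natural = [[3.0, 0.0], [2.5, 0.5]]

means, variances = fit_diag_gaussian(natural)
score_close = avg_log_likelihood(close_to_natural, means, variances)
score_far = avg_log_likelihood(far_from_natural, means, variances)

# Frames resembling the natural-speech model receive the higher score.
print(score_close > score_far)  # prints True
```

In this toy setup, a higher average log-likelihood means the frames look more like the natural-speech reference. How such a score is mapped to auditory quality judgments, and how it is combined with the degradation parameters, is not specified in the abstract.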


Auditory Quality Judgments | Capture Quality-relevant Degradations | INTERSPEECH 2010 | Signal Processing | Speech Signal |


» A Comparison of Fuzzy and CPWL Approximations in the Continuous-time Nonlinear Model-predict...

» A high-throughput de novo sequencing approach for shotgun proteomics using high-resolution t...

» Change Prediction in Object-Oriented Software Systems: A Probabilistic Approach

» Predictable Code and Data Paging for Real Time Systems

» Predicting the protein-protein interactions using primary structures with predicted protein...

» Fidelity and Yield in a Volcano Monitoring Sensor Network

» Comparing SVM ensembles for imbalanced datasets

» ZCURVEV a new self-training system for recognizing protein-coding genes in viral and phage g...

Post Info

Added	18 May 2011
Updated	18 May 2011
Type	Conference
Year	2010
Where	INTERSPEECH
Authors	Sebastian Möller, Florian Hinterleitner, Tiago H. Falk, Tim Polzehl
