Integration of multilayer regression analysis with structure-based pronunciation assessment

14 years 9 months ago

Download www.gavo.t.u-tokyo.ac.jp

Automatic pronunciation assessment has several difficulties. Adequacy in controlling the vocal organs is often estimated from the spectral envelopes of input utterances but the envelope patterns are also affected by other factors such as speaker identity. Recently, a new method of speech representation was proposed where these non-linguistic variations are effectively removed through modeling only the contrastive aspects of speech features. This speech representation is called speech structure. However, the often excessively high dimensionality of the speech structure can degrade the performance of structurebased pronunciation assessment. To deal with this problem, we integrate multilayer regression analysis with the structure-based assessment. The results show higher correlation between human and machine scores and also show much higher robustness to speaker differences compared to widely used GOP-based analysis.

Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keiki

Real-time Traffic

Automatic Pronunciation Assessment | INTERSPEECH 2010 | Pronunciation Assessment | Signal Processing | Speech Structure |

claim paper

Post Info
More Details (n/a)

Added	19 May 2011
Updated	19 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose

Comments (0)

Sciweavers

Integration of multilayer regression analysis with structure-based pronunciation assessment

Automatic Pronunciation Assessment | INTERSPEECH 2010 | Pronunciation Assessment | Signal Processing | Speech Structure |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers