Acoustic modeling using an extended phone set considering cross-lingual pronunciation variations



To address both the data imbalance that arises in multilingual speech recognition and the pronunciation variation that occurs across languages, we propose an approach to clustering context-dependent phones from an extended phone set in an acoustic model trained on a data-unbalanced bilingual corpus. First, we generate an extended phone set using pronunciation modeling with a confidence measure between Mandarin and Taiwanese. Second, we use a two-step agglomerative hierarchical clustering with the delta Bayesian information criterion to automatically generate a merged extended phone set (MEPS). Third, we apply a parametric modeling technique, model complexity selection, to increase the final number of Gaussian components according to the training data available in the data-unbalanced condition. The experimental results show that the proposed automatic extended phone clustering approach reduced the relative syllable error rate by 8.3% over the best result of the decision tree ba...
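The second step of the abstract, agglomerative clustering driven by a delta Bayesian information criterion, can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each phone is modeled by a single full-covariance Gaussian over its feature frames, uses a common textbook form of delta BIC, and performs one greedy merging pass rather than the two-step procedure the abstract mentions. The function names, the `penalty_weight` parameter, and the phone labels are hypothetical.

```python
import numpy as np

def log_det_cov(x):
    """Log-determinant of the ML covariance of frames x with shape (n, d)."""
    d = x.shape[1]
    # A small diagonal floor keeps the determinant finite (an assumption).
    cov = np.cov(x, rowvar=False, bias=True).reshape(d, d) + 1e-6 * np.eye(d)
    _, logdet = np.linalg.slogdet(cov)
    return logdet

def delta_bic(x1, x2, penalty_weight=1.0):
    """Delta BIC for modeling x1 and x2 with two Gaussians versus one.

    A negative value means the pooled Gaussian is preferred, so the two
    clusters are candidates for merging.
    """
    n1, n2 = len(x1), len(x2)
    n, d = n1 + n2, x1.shape[1]
    merged = np.vstack([x1, x2])
    gain = 0.5 * (n * log_det_cov(merged)
                  - n1 * log_det_cov(x1)
                  - n2 * log_det_cov(x2))
    n_params = d + d * (d + 1) / 2  # mean plus full covariance
    penalty = 0.5 * penalty_weight * n_params * np.log(n)
    return gain - penalty

def agglomerate(clusters, penalty_weight=1.0):
    """Greedily merge the pair with the lowest delta BIC until none is negative."""
    clusters = dict(clusters)
    while len(clusters) > 1:
        names = sorted(clusters)
        score, a, b = min(
            (delta_bic(clusters[p], clusters[q], penalty_weight), p, q)
            for i, p in enumerate(names) for q in names[i + 1:]
        )
        if score >= 0:
            break  # no remaining pair is better modeled by a single Gaussian
        clusters[a + "+" + b] = np.vstack([clusters.pop(a), clusters.pop(b)])
    return clusters

# Synthetic frames for three hypothetical phone labels.
rng = np.random.default_rng(0)
frames = {
    "a_mandarin": rng.normal(0.0, 1.0, (200, 2)),
    "a_taiwanese": rng.normal(0.0, 1.0, (200, 2)),
    "o_taiwanese": rng.normal(10.0, 1.0, (200, 2)),
}
merged_set = agglomerate(frames)
print(sorted(merged_set))
```

In this toy setup the two phones drawn from the same distribution are merged into one unit, while the acoustically distant phone stays separate. Raising `penalty_weight` makes merging more aggressive, which is one way such a criterion could trade phone-set size against the amount of data per unit.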

Dau-Cheng Lyu, Ren-Yuan Lyu, Ming-Tat Ko


Data Unbalanced Condition | Extended Phone | ICMCS 2009 | Multimedia | Phone


Post Info

Added	19 Feb 2011
Updated	19 Feb 2011
Type	Journal
Year	2009
Where	ICMCS
Authors	Dau-Cheng Lyu, Ren-Yuan Lyu, Ming-Tat Ko
