Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion

14 years 1 months ago

Download aclweb.org

We present a discriminative structureprediction model for the letter-to-phoneme task, a crucial step in text-to-speech processing. Our method encompasses three tasks that have been previously handled separately: input segmentation, phoneme prediction, and sequence modeling. The key idea is online discriminative training, which updates parameters according to a comparison of the current system output to the desired output, allowing us to train all of our components together. By folding the three steps of a pipeline approach into a unified dynamic programming framework, we are able to achieve substantial performance gains. Our results surpass the current state-of-the-art on six publicly available data sets representing four different languages.

Sittichai Jiampojamarn, Colin Cherry, Grzegorz Kon

Real-time Traffic

ACL 2008 | Computational Linguistics | Discriminative Structureprediction Model | Online Discriminative Training | Phoneme Prediction |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	ACL
Authors	Sittichai Jiampojamarn, Colin Cherry, Grzegorz Kondrak

Comments (0)

Sciweavers

Joint Processing and Discriminative Training for Letter-to-Phoneme Conversion

ACL 2008 | Computational Linguistics | Discriminative Structureprediction Model | Online Discriminative Training | Phoneme Prediction |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers