Experiments on the Construction of a Phonetically Balanced Corpus from the Web

14 years 5 months ago

Download ccc.inaoep.mx

The construction of a speech recognition system requires a recorded set of phrases to compute the pertinent acoustic models. This set of phrases must be phonetically rich and balanced in order to obtain a robust recognizer. By tradition, this set is defined manually implicating a great human effort. In this paper we propose an automated method for assembling a phonetically balanced corpus (set of phrases) from the Web. The proposed method was used to construct a phonetically balanced corpus for the Mexican Spanish language.

Luis Villaseñor Pineda, Manuel Montes-y-G&o

Real-time Traffic

Balanced Corpus | CICLING 2004 | Natural Language Processing | Pertinent Acoustic Models | Phonetically |

claim paper

Post Info
More Details (n/a)

Added	01 Jul 2010
Updated	01 Jul 2010
Type	Conference
Year	2004
Where	CICLING
Authors	Luis Villaseñor Pineda, Manuel Montes-y-Gómez, Dominique Vaufreydaz, Jean-François Serignat

Comments (0)

Sciweavers

Experiments on the Construction of a Phonetically Balanced Corpus from the Web

Balanced Corpus | CICLING 2004 | Natural Language Processing | Pertinent Acoustic Models | Phonetically |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers