Web derived pronunciations for spoken term detection

16 years 1 months ago

Download symptotic.com

Indexing and retrieval of speech content in various forms such as broadcast news, customer care data and on-line media has gained a lot of interest for a wide range of applications, from customer analytics to on-line media search. For most retrieval applications, the speech content is typically ﬁrst converted to a lexical or phonetic representation using automatic speech recognition (ASR). The ﬁrst step in searching through indexes built on these representations is the generation of pronunciations for named entities and foreign language query terms. This paper summarizes the results of the work conducted during the 2008 JHU Summer Workshop by the Multilingual Spoken Term Detection team, on mining the web for pronunciations and analyzing their impact on spoken term detection. We will ﬁrst present methods to use the vast amount of pronunciation information available on the Web, in the form of IPA and ad-hoc transcriptions. We describe techniques for extracting candidate pronunciat...

Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jan

Real-time Traffic

Information Retrieval | IPA Pronunciations | SIGIR 2009 | Speech Content | Spoken Term Detection |

claim paper

» Stochastic pronunciation modelling and soft match for outofvocabulary spoken term detectio...

» A spoken term detection framework for recovering outofvocabulary words using the web

» Unsupervised pronunciation validation

» Augmented set of features for confidence estimation in spoken term detection

» Reversible SoundtoLetterLettertoSound Modeling Based on Syllable Structure

» A fast and robust method for web page template detection and removal

» Anomaly detection of webbased attacks

» Using LatentStructure to Detect Objects on the Web

Post Info
More Details (n/a)

Added	28 May 2010
Updated	28 May 2010
Type	Conference
Year	2009
Where	SIGIR
Authors	Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Bhuvana Ramabhadran, Michael Riley, Murat Saraclar, Abhinav Sethy, Morgan Ulinski, Christopher M. White

Comments (0)

Sciweavers

Web derived pronunciations for spoken term detection

Information Retrieval | IPA Pronunciations | SIGIR 2009 | Speech Content | Spoken Term Detection |

Explore & Download

Productivity Tools

Sciweavers