Sciweavers

56 search results - page 3 / 12
» Finite State Models for the Generation of Large Corpora of N...
Sort
View
EMNLP
2009
13 years 5 months ago
Polylingual Topic Models
Topic models are a useful tool for analyzing large text collections, but have previously been applied in only monolingual, or at most bilingual, contexts. Meanwhile, massive colle...
David M. Mimno, Hanna M. Wallach, Jason Naradowsky...
EMNLP
2008
13 years 9 months ago
Latent-Variable Modeling of String Transductions with Finite-State Methods
String-to-string transduction is a central problem in computational linguistics and natural language processing. It occurs in tasks as diverse as name transliteration, spelling co...
Markus Dreyer, Jason Smith, Jason Eisner
CICLING
2009
Springer
14 years 8 months ago
Guessers for Finite-State Transducer Lexicons
Abstract. Language software applications encounter new words, e.g., acronyms, technical terminology, names or compounds of such words. In order to add new words to a lexicon, we ne...
Krister Lindén
ACL
2008
13 years 9 months ago
Grounded Language Modeling for Automatic Speech Recognition of Sports Video
Grounded language models represent the relationship between words and the non-linguistic context in which they are said. This paper describes how they are learned from large corpo...
Michael Fleischman, Deb Roy
EMNLP
2009
13 years 5 months ago
A Joint Language Model With Fine-grain Syntactic Tags
We present a scalable joint language model designed to utilize fine-grain syntactic tags. We discuss challenges such a design faces and describe our solutions that scale well to l...
Denis Filimonov, Mary P. Harper