Sciweavers

ACL
2011
13 years 3 months ago
Using Derivation Trees for Treebank Error Detection
This work introduces a new approach to checking treebank consistency. Derivation trees based on a variant of Tree Adjoining Grammar are used to compare the annotation of word sequ...
Seth Kulick, Ann Bies, Justin Mott
ICASSP
2011
IEEE
13 years 3 months ago
Automatically finding semantically consistent n-grams to add new words in LVCSR systems
This paper presents a new method to automatically add n-grams containing out-of-vocabulary (OOV) words to a baseline language model (LM), where these n-grams are sought to be gram...
Gwénolé Lecorvé, Guillaume Gr...
ACL
1996
14 years 25 days ago
The Rhythm of Lexical Stress in Prose
"Prose rhythm" is a widely observed but scarcely quantified phenomenon. We describe an information-theoretic model for measuring the regularity of lexical stress in English te...
Doug Beeferman
NIPS
2000
14 years 25 days ago
A Neural Probabilistic Language Model
A goal of statistical language modeling is to learn the joint probability function of sequences of words in a language. This is intrinsically difficult because of the curse of dim...
Yoshua Bengio, Réjean Ducharme, Pascal Vinc...
EACL
2003
ACL Anthology
14 years 26 days ago
Detecting Novel Compounds: The Role of Distributional Evidence
Research on the discovery of terms from corpora has focused on word sequences whose recurrent occurrence in a corpus is indicative of their terminological status, and has not addr...
Mirella Lapata, Alex Lascarides
INEX
2007
Springer
14 years 5 months ago
Phrase Detection in the Wikipedia
The Wikipedia XML collection turned out to be rich in marked-up phrases as we carried out our INEX 2007 experiments. Assuming that a phrase occurs at the inline level of the markup...
Miro Lehtonen, Antoine Doucet