Sciweavers

182 search results - page 12 / 37
» Probabilistic Document Length Priors for Language Models
CICLING
2008
Springer
13 years 9 months ago
A Probabilistic Model for Guessing Base Forms of New Words by Analogy
Language software applications encounter new words, e.g., acronyms, technical terminology, loan words, names or compounds of such words. Looking at English, one might assume that t...
Krister Lindén
EMNLP
2009
13 years 5 months ago
Multilingual Spectral Clustering Using Document Similarity Propagation
We present a novel approach for multilingual document clustering using only comparable corpora to achieve cross-lingual semantic interoperability. The method models document colle...
Dani Yogatama, Kumiko Tanaka-Ishii
CLEF
2010
Springer
13 years 8 months ago
Automatic Prior Art Searching and Patent Encoding at CLEF-IP '10
In the intellectual property field two tasks are of high relevance: prior art searching and patent classification. Prior art search is fundamental for many strategic issues such as...
Douglas Teodoro, Julien Gobeill, Emilie Pasche, Di...
TREC
2007
13 years 8 months ago
The Open University at TREC 2007 Enterprise Track
The Multimedia and Information Systems group at the Knowledge Media Institute of the Open University participated in the Expert Search and Document Search tasks of the Enterprise ...
Jianhan Zhu, Dawei Song, Stefan M. Rüger
EMNLP
2010
13 years 5 months ago
Crouching Dirichlet, Hidden Markov Model: Unsupervised POS Tagging with Context Local Tag Generation
We define the crouching Dirichlet, hidden Markov model (CDHMM), an HMM for part-of-speech tagging which draws state prior distributions for each local document context. This simple...
Taesun Moon, Katrin Erk, Jason Baldridge