Sciweavers

182 search results - page 9 / 37
» Probabilistic Document Length Priors for Language Models
Sort
View
ACL
2010
13 years 5 months ago
Learning Common Grammar from Multilingual Corpus
We propose a corpus-based probabilistic framework to extract hidden common syntax across languages from non-parallel multilingual corpora in an unsupervised fashion. For this purp...
Tomoharu Iwata, Daichi Mochihashi, Hiroshi Sawada
ECIR
2008
Springer
13 years 9 months ago
Modeling Documents as Mixtures of Persons for Expert Finding
Abstract. In this paper we address the problem of searching for knowledgeable persons within the enterprise, known as the expert finding (or expert search) task. We present a proba...
Pavel Serdyukov, Djoerd Hiemstra
COLING
2002
13 years 7 months ago
A New Probabilistic Model for Title Generation
Title generation is a complex task involving both natural language understanding and natural language synthesis. In this paper, we propose a new probabilistic model for title gene...
Rong Jin, Alexander G. Hauptmann
NAACL
2010
13 years 5 months ago
Language Identification: The Long and the Short of the Matter
Language identification is the task of identifying the language a given document is written in. This paper describes a detailed examination of what models perform best under diffe...
Timothy Baldwin, Marco Lui
KDD
1998
ACM
101views Data Mining» more  KDD 1998»
13 years 12 months ago
Probabilistic Modeling for Information Retrieval with Unsupervised Training Data
We apply a well-known Bayesian probabilistic model to textual information retrieval: the classification of documents based on their relevance to a query. This model was previously...
Ernest P. Chan, Santiago Garcia, Salim Roukos