Sciweavers

73 search results - page 6 / 15
» Compression-based document length prior for language models
Sort
View
PKDD
2010
Springer
154views Data Mining» more  PKDD 2010»
13 years 6 months ago
Topic Models Conditioned on Relations
Latent Dirichlet allocation is a fully generative statistical language model that has been proven to be successful in capturing both the content and the topics of a corpus of docum...
Mirwaes Wahabzada, Zhao Xu, Kristian Kersting
IHI
2012
266views Healthcare» more  IHI 2012»
12 years 3 months ago
A software tool for large-scale sharing and querying of clinical documents modeled using HL7 version 3 standard
We present a novel software tool called CDN (Collaborative Data Network) for large-scale sharing and querying of clinical documents modeled using HL7 v3 standard (e.g., Clinical D...
Praveen R. Rao, Tivakar Komara Swami, Deepthi S. R...
SIGIR
2002
ACM
13 years 7 months ago
Two-stage language models for information retrieval
The optimal settings of retrieval parameters often depend on both the document collection and the query, and are usually found through empirical tuning. In this paper, we propose ...
ChengXiang Zhai, John D. Lafferty
EMNLP
2009
13 years 5 months ago
Multilingual Spectral Clustering Using Document Similarity Propagation
We present a novel approach for multilingual document clustering using only comparable corpora to achieve cross-lingual semantic interoperability. The method models document colle...
Dani Yogatama, Kumiko Tanaka-Ishii
CLEF
2010
Springer
13 years 8 months ago
Automatic Prior Art Searching and Patent Encoding at CLEF-IP '10
In the intellectual property field two tasks are of high relevance: prior art searching and patent classification. Prior art search is fundamental for many strategic issues such as...
Douglas Teodoro, Julien Gobeill, Emilie Pasche, Di...