Sciweavers

403 search results - page 9 / 81
» Testing the cluster hypothesis in distributed information re...
EMNLP
2010
13 years 5 months ago
Positional Language Models for Clinical Information Retrieval
The PECO framework is a knowledge representation for formulating clinical questions. Queries are decomposed into four aspects, which are Patient-Problem (P), Exposure (E), Compari...
Florian Boudin, Jian-Yun Nie, Martin Dawes
NIPS
2003
13 years 9 months ago
Learning the k in k-means
When clustering a dataset, the right number k of clusters to use is often not obvious, and choosing k automatically is a hard algorithmic problem. In this paper we present an impr...
Greg Hamerly, Charles Elkan
BMCBI
2007
13 years 7 months ago
GOSim - an R-package for computation of information theoretic GO similarities between terms and gene products
Background: With the increased availability of high throughput data, such as DNA microarray data, researchers are capable of producing large amounts of biological data. During the...
Holger Fröhlich, Nora Speer, Annemarie Poustk...
SIGIR
2004
ACM
14 years 1 month ago
Corpus structure, language models, and ad hoc information retrieval
Most previous work on the recently developed language-modeling approach to information retrieval focuses on document-specific characteristics, and therefore does not take into acc...
Oren Kurland, Lillian Lee
AIRS
2004
Springer
14 years 1 month ago
Automatic Word Clustering for Text Categorization Using Global Information
This paper presents a cluster-based text categorization system which uses class distributional clustering of words. We propose a new clustering model which considers the global in...
Wenliang Chen, Xingzhi Chang, Huizhen Wang, Jingbo...