Sciweavers

328 search results - page 59 / 66
» A Multi-level Approach for Document Clustering
ICDM
2003
IEEE
14 years 24 days ago
Probabilistic User Behavior Models
We present a mixture model based approach for learning individualized behavior models for the Web users. We investigate the use of maximum entropy and Markov mixture models for ge...
Eren Manavoglu, Dmitry Pavlov, C. Lee Giles
ISMB
2000
13 years 8 months ago
Genes, Themes, and Microarrays: Using Information Retrieval for Large-Scale Gene Analysis
The immense volume of data resulting from DNA microarray experiments, accompanied by an increase in the number of publications discussing gene-related discoveries, presents a major data...
Hagit Shatkay, Stephen Edwards, W. John Wilbur, Ma...
WSDM
2010
ACM
14 years 2 months ago
Learning URL patterns for webpage de-duplication
Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...
Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...
KDD
2009
ACM
14 years 8 months ago
Applying syntactic similarity algorithms for enterprise information management
Applying Syntactic Similarity Algorithms for Enterprise Information Management. Ludmila Cherkasova, Kave Eshghi, Charles B. Morrey III, Joseph Tucek, Alistair Veitch. HP Laborato...
Ludmila Cherkasova, Kave Eshghi, Charles B. Morrey...
KDD
2007
ACM
14 years 8 months ago
Exploiting underrepresented query aspects for automatic query expansion
Users attempt to express their search goals through web search queries. When a search goal has multiple components or aspects, documents that represent all the aspects are likely ...
Daniel Crabtree, Peter Andreae, Xiaoying Gao