Sciweavers

359 search results - page 31 / 72
» Document clustering using word clusters via the information ...
Sort
View
BMCBI
2010
121views more  BMCBI 2010»
13 years 6 months ago
A grammar-based distance metric enables fast and accurate clustering of large sets of 16S sequences
Background: We propose a sequence clustering algorithm and compare the partition quality and execution time of the proposed algorithm with those of a popular existing algorithm. T...
David J. Russell, Samuel F. Way, Andrew K. Benson,...
ASC
2004
13 years 8 months ago
Clustering terms in the Bayesian network retrieval model: a new approach with two term-layers
The retrieval performance of an information retrieval system usually increases when it uses the relationships among the terms contained in a given document collection. However, th...
Luis M. de Campos, Juan M. Fernández-Luna, ...
CIKM
2003
Springer
14 years 1 months ago
Tracking changes in user interests with a few relevance judgments
Keeping track of changes in user interests from a document stream with a few relevance judgments is not an easy task. To tackle this problem, we propose a novel method that integr...
Dwi H. Widyantoro, Thomas R. Ioerger, John Yen
EMNLP
2004
13 years 10 months ago
Trained Named Entity Recognition using Distributional Clusters
This work applies boosted wrapper induction (BWI), a machine learning algorithm for information extraction from semi-structured documents, to the problem of named entity recogniti...
Dayne Freitag
ECIR
2004
Springer
13 years 10 months ago
Performance Analysis of Distributed Architectures to Index One Terabyte of Text
We simulate different architectures of a distributed Information Retrieval system on a very large Web collection, in order to work out the optimal setting for a particular set of r...
Fidel Cacheda, Vassilis Plachouras, Iadh Ounis