Search Sciweavers | Sciweavers

280 search results - page 11 / 56

» A Semi-Supervised Document Clustering Algorithm Based on EM

185

click to vote

JMLR
2002

111views more JMLR 2002»

The Learning-Curve Sampling Method Applied to Model-Based Clustering

15 years 5 months ago

Download jmlr.csail.mit.edu

We examine the learning-curve sampling method, an approach for applying machinelearning algorithms to large data sets. The approach is based on the observation that the computatio...

Christopher Meek, Bo Thiesson, David Heckerman

claim paper

Read More »

162

click to vote

DATAMINE
2006

166views more DATAMINE 2006»

Accelerated EM-based clustering of large data sets

15 years 6 months ago

Download www.dpem.tuc.gr

Motivated by the poor performance (linear complexity) of the EM algorithm in clustering large data sets, and inspired by the successful accelerated versions of related algorithms l...

Jakob J. Verbeek, Jan Nunnink, Nikos A. Vlassis

claim paper

Read More »

156

click to vote

SIGIR
1998
ACM

129views Information Technology» more SIGIR 1998»

Web Document Clustering: A Feasibility Demonstration

15 years 10 months ago

Download www.cs.washington.edu

Users of Web search engines are often forced to sift through the long ordered list of document “snippets” returned by the engines. The IR community has explored document cluste...

Oren Zamir, Oren Etzioni

claim paper

Read More »

150

click to vote

WWW
2007
ACM

157views Internet Technology» more WWW 2007»

A new suffix tree similarity measure for document clustering

16 years 6 months ago

Download www2007.org

In this paper, we propose a new similarity measure to compute the pairwise similarity of text-based documents based on suffix tree document model. By applying the new suffix tree ...

Hung Chim, Xiaotie Deng

claim paper

Read More »

175

click to vote

ACL
2008

173views Computational Linguistics» more ACL 2008»

Inducing Gazetteers for Named Entity Recognition by Large-Scale Clustering of Dependency Relations

15 years 7 months ago

Download www.aclweb.org

We propose using large-scale clustering of dependency relations between verbs and multiword nouns (MNs) to construct a gazetteer for named entity recognition (NER). Since dependen...

Jun'ichi Kazama, Kentaro Torisawa

claim paper

Read More »

« Prev « First page 11 / 56 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers