Search Sciweavers | Sciweavers

832 search results - page 24 / 167

» Document clustering with committees

210

click to vote

WEBI
2005
Springer

216views Internet Technology» more WEBI 2005»

A Semi-Supervised Document Clustering Algorithm Based on EM

15 years 11 months ago

Download www.dii.unisi.it

Document clustering is a very hard task in Automatic Text Processing since it requires to extract regular patterns from a document collection without a priori knowledge on the cat...

Leonardo Rigutini, Marco Maggini

claim paper

Read More »

163

click to vote

ACMSE
2007
ACM

162views Theoretical Computer Science» more ACMSE 2007»

Enhancing clustering blog documents by utilizing author/reader comments

15 years 10 months ago

Download www.cs.uky.edu

Blogs are a new form of internet phenomenon and a vast everincreasing information resource. Mining blog files for information is a very new research direction in data mining. We p...

Beibei Li, Shuting Xu, Jun Zhang

claim paper

Read More »

171

click to vote

WWW
2004
ACM

180views Internet Technology» more WWW 2004»

A hierarchical monothetic document clustering algorithm for summarization and browsing search results

16 years 6 months ago

Download www.iw3c2.org

Organizing Web search results into a hierarchy of topics and subtopics facilitates browsing the collection and locating results of interest. In this paper, we propose a new hierar...

Krishna Kummamuru, Rohit Lotlikar, Shourya Roy, Ka...

claim paper

Read More »

165

click to vote

ACL
2009

151views Computational Linguistics» more ACL 2009»

Creating a Gold Standard for Sentence Clustering in Multi-Document Summarization

15 years 4 months ago

Download 140.116.245.248

Sentence Clustering is often used as a first step in Multi-Document Summarization (MDS) to find redundant information. All the same there is no gold standard available. This paper...

Johanna Geiss

claim paper

Read More »

148

click to vote

SIGIR
2002
ACM

152views Information Technology» more SIGIR 2002»

Unsupervised document classification using sequential information maximization

15 years 5 months ago

Download www.cs.huji.ac.il

We present a novel sequential clustering algorithm which is motivated by the Information Bottleneck (IB) method. In contrast to the agglomerative IB algorithm, the new sequential ...

Noam Slonim, Nir Friedman, Naftali Tishby

claim paper

Read More »

« Prev « First page 24 / 167 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers