Search Sciweavers | Sciweavers

280 search results - page 7 / 56

» Comparison of Cluster Algorithms for the Analysis of Text Da...

KDD
2002
ACM

170 views, Data Mining

Enhanced word clustering for hierarchical text classification

14 years 8 months ago

Download www.cs.utexas.edu

In this paper we propose a new information-theoretic divisive algorithm for word clustering applied to text classification. In previous work, such "distributional clustering"...

Inderjit S. Dhillon, Subramanyam Mallela, Rahul Ku...

SIGMOD
2009
ACM

136 views, Database

A comparison of approaches to large-scale data analysis

14 years 7 months ago

Download database.cs.brown.edu

There is currently considerable enthusiasm around the MapReduce (MR) paradigm for large-scale data analysis [17]. Although the basic control flow of this framework has existed in ...

Andrew Pavlo, Erik Paulson, Alexander Rasin, Danie...

AUSAI
2005
Springer

139 views, Artificial Intelligence

Semantic Correlation Network Based Text Clustering

14 years 1 months ago

Download www.cs.ust.hk

Abstract. Text documents have sparse data spaces, and nearest neighbors may belong to different classes when using current existing proximity measures to describe the correlation ...

Shaoxu Song, Chunping Li

JCDL
2011
ACM

374 views, Education

Comparative evaluation of text- and citation-based plagiarism detection approaches using GuttenPlag

12 years 10 months ago

Download gipp.com

Various approaches for plagiarism detection exist. All are based on more or less sophisticated text analysis methods such as string matching, fingerprinting or style comparison. I...

Bela Gipp, Norman Meuschke, Jöran Beel

IPM
2006

151 views

Document clustering using nonnegative matrix factorization

13 years 7 months ago

Download www.math.wfu.edu

A methodology for automatically identifying and clustering semantic features or topics in a heterogeneous text collection is presented. Textual data is encoded using a low rank no...

Farial Shahnaz, Michael W. Berry, V. Paul Pauca, R...
