Search Sciweavers | Sciweavers

57 search results - page 4 / 12

» Evaluation of Text Clustering Algorithms with N-Gram-Based D...

click to vote

IPM
2006

151views more IPM 2006»

Document clustering using nonnegative matrix factorization

13 years 7 months ago

Download www.math.wfu.edu

A methodology for automatically identifying and clustering semantic features or topics in a heterogeneous text collection is presented. Textual data is encoded using a low rank no...

Farial Shahnaz, Michael W. Berry, V. Paul Pauca, R...

claim paper

Read More »

click to vote

AIRWEB
2006
Springer

136views Internet Technology» more AIRWEB 2006»

Tracking Web Spam with Hidden Style Similarity

13 years 11 months ago

Download airweb.cse.lehigh.edu

Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...

Tanguy Urvoy, Thomas Lavergne, Pascal Filoche

claim paper

Read More »

click to vote

LWA
2008

176views Software Engineering» more LWA 2008»

Labeling Clusters - Tagging Resources

13 years 9 months ago

Download wwwiti.cs.uni-magdeburg.de

In order to support the navigation in huge document collections efficiently, tagged hierarchical structures can be used. Often, multiple tags are used to describe resources. For u...

Korinna Bade, Andreas Nürnberger

claim paper

Read More »

click to vote

ICDM
2009
IEEE

176views Data Mining» more ICDM 2009»

SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering

13 years 5 months ago

Download www.utdallas.edu

Text classification poses some specific challenges. One such challenge is its high dimensionality where each document (data point) contains only a small subset of them. In this pap...

Mohammad Salim Ahmed, Latifur Khan

claim paper

Read More »

click to vote

COLING
2008

116views Computational Linguistics» more COLING 2008»

A Framework for Identifying Textual Redundancy

13 years 9 months ago

Download www.aclweb.org

The task of identifying redundant information in documents that are generated from multiple sources provides a significant challenge for summarization and QA systems. Traditional ...

Kapil Thadani, Kathleen McKeown

claim paper

Read More »

« Prev « First page 4 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers