Sciweavers

150 search results - page 8 / 30
» A neighborhood-based approach for clustering of linked docum...
Sort
View
IRCDL
2007
13 years 8 months ago
An Hybrid Approach for Improving Word Sense Disambiguation and Text Clustering
Abstract— In this paper we suggest a new approach to represent text document collections, integrating background knowledge to improve clustering effectiveness. Background knowled...
Paolo Casoto, Carlo Tasso
ICCS
2009
Springer
14 years 1 months ago
Frequent Itemset Mining for Clustering Near Duplicate Web Documents
A vast amount of documents in the Web have duplicates, which is a challenge for developing efficient methods that would compute clusters of similar documents. In this paper we use ...
Dmitry I. Ignatov, Sergei O. Kuznetsov
LREC
2008
112views Education» more  LREC 2008»
13 years 8 months ago
Modeling Document Dynamics: an Evolutionary Approach
News articles about the same event published over time have properties that challenge NLP and IR applications. A cluster of such texts typically exhibits instances of paraphrase a...
Jahna Otterbacher, Dragomir R. Radev
ICPR
2008
IEEE
14 years 8 months ago
Clustering of short commercial documents for the web
Document clustering techniques have been applied in several areas, with the web as one of the most recent and influent. Both general-purpose and text-oriented techniques exist and...
Elisabetta Binaghi, Ignazio Gallo, Moreno Carullo,...
GRC
2005
IEEE
14 years 29 days ago
Semantic based clustering of Web documents
Abstract. A new methodology that structures the semantics of a collection of documents into the geometry of a simplicial complex is developed. A simplicial complex is topologically...
Tsau Young Lin, I-Jen Chiang