Sciweavers

700 search results - page 126 / 140
» Clustering XML Documents by Structure
Sort
View
CIKM
2009
Springer
14 years 4 months ago
Completing wikipedia's hyperlink structure through dimensionality reduction
Wikipedia is the largest monolithic repository of human knowledge. In addition to its sheer size, it represents a new encyclopedic paradigm by interconnecting articles through hyp...
Robert West, Doina Precup, Joelle Pineau
ICWE
2003
Springer
14 years 3 months ago
Genre and Domain Processing in an Information Retrieval Perspective
Abstract. The massive amount of textual data on the Web raises numerous classification problems. Although the notion of domain is widely acknowledged in the IR field, the applica...
Céline Poudat, Guillaume Cleuziou
KES
2004
Springer
14 years 3 months ago
Analyzing the Temporal Sequences for Text Categorization
– This paper describes a text categorization approach that is based on a combination of a newly designed text representation with a kNN classifier. The new text document represen...
Xiao Luo, A. Nur Zincir-Heywood
JACM
2010
208views more  JACM 2010»
13 years 8 months ago
The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies
clustering of documents according to sharing of topics at multiple levels of abstraction. Given a corpus of documents, a posterior inference algorithm finds an approximation to a ...
David M. Blei, Thomas L. Griffiths, Michael I. Jor...
WWW
2008
ACM
14 years 10 months ago
Topigraphy: visualization for large-scale tag clouds
This paper proposes a new method for displaying large-scale tag clouds. We use a topographical image that helps users to grasp the relationship among tags intuitively as a backgro...
Ko Fujimura, Shigeru Fujimura, Tatsushi Matsubayas...