Sciweavers

878 search results - page 23 / 176
» Clustering the Chilean Web
Sort
View
JMLR
2010
175views more  JMLR 2010»
13 years 2 months ago
Hierarchical Convex NMF for Clustering Massive Data
We present an extension of convex-hull non-negative matrix factorization (CH-NMF) which was recently proposed as a large scale variant of convex non-negative matrix factorization ...
Kristian Kersting, Mirwaes Wahabzada, Christian Th...
WSDM
2012
ACM
329views Data Mining» more  WSDM 2012»
12 years 3 months ago
Beyond 100 million entities: large-scale blocking-based resolution for heterogeneous data
A prerequisite for leveraging the vast amount of data available on the Web is Entity Resolution, i.e., the process of identifying and linking data that describe the same real-worl...
George Papadakis, Ekaterini Ioannou, Claudia Niede...
WWW
2003
ACM
14 years 8 months ago
Web Sessions Clustering with Artificial Ants Colonies
In this paper, we apply AntClust, an ant based clustering algorithm, to the Web usage-mining problem. We define a Web session as a weighted multi-modal vector and we propose an ad...
Gilles Venturini, Nicolas Labroche, Nicolas Monmar...
PVLDB
2010
146views more  PVLDB 2010»
13 years 2 months ago
HaLoop: Efficient Iterative Data Processing on Large Clusters
The growing demand for large-scale data mining and data analysis applications has led both industry and academia to design new types of highly scalable data-intensive computing pl...
Yingyi Bu, Bill Howe, Magdalena Balazinska, Michae...
WISE
2002
Springer
14 years 21 days ago
A Unified Framework for Clustering Heterogeneous Web Objects
In this paper, we introduce a novel framework for clustering web data which is often heterogeneous in nature. As most existing methods often integrate heterogeneous data into a un...
Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma