Sciweavers

878 search results - page 79 / 176
» Clustering the Chilean Web
Sort
View
KDD
2002
ACM
138views Data Mining» more  KDD 2002»
14 years 9 months ago
Learning to match and cluster large high-dimensional data sets for data integration
Part of the process of data integration is determining which sets of identifiers refer to the same real-world entities. In integrating databases found on the Web or obtained by us...
William W. Cohen, Jacob Richman
WISE
2009
Springer
14 years 6 months ago
Clustering of Social Tagging System Users: A Topic and Time Based Approach
Abstract. Under Social Tagging Systems, a typical Web 2.0 application, users label digital data sources by using freely chosen textual descriptions (tags). Mining tag information r...
Vassiliki A. Koutsonikola, Athena Vakali, Eirini G...
SIGIR
2009
ACM
14 years 3 months ago
A comparison of retrieval-based hierarchical clustering approaches to person name disambiguation
This paper describes a simple clustering approach to person name disambiguation of retrieved documents. The methods are based on standard IR concepts and do not require any task-s...
Christof Monz, Wouter Weerkamp
CINQ
2004
Springer
133views Database» more  CINQ 2004»
14 years 27 days ago
Inductive Querying for Discovering Subgroups and Clusters
We introduce the problem of cluster-grouping and show that it integrates several important data mining tasks, i.e. subgroup discovery, mining correlated patterns and aspects from c...
Albrecht Zimmermann, Luc De Raedt
PDP
2008
IEEE
14 years 3 months ago
Bulk-Synchronous On-Line Crawling on Clusters of Computers
This paper describes the design of a crawler devised to perform the periodic retrieval of Web documents for a search engine able to accept on-line updates in a concurrent manner. ...
Mauricio Marín, Carolina Bonacic