Sciweavers

106 search results - page 12 / 22
» kdd 2002
Sort
View
KDD
2002
ACM
197views Data Mining» more  KDD 2002»
14 years 10 months ago
SimRank: a measure of structural-context similarity
The problem of measuring "similarity" of objects arises in many applications, and many domain-specific measures have been developed, e.g., matching text across documents...
Glen Jeh, Jennifer Widom
KDD
2002
ACM
93views Data Mining» more  KDD 2002»
14 years 10 months ago
Interactive deduplication using active learning
Deduplication is a key operation in integrating data from multiple sources. The main challenge in this task is designing a function that can resolve when a pair of records refer t...
Sunita Sarawagi, Anuradha Bhamidipaty
KDD
2002
ACM
193views Data Mining» more  KDD 2002»
14 years 10 months ago
Query, analysis, and visualization of hierarchically structured data using Polaris
In the last several years, large OLAP databases have become common in a variety of applications such as corporate data warehouses and scientific computing. To support interactive ...
Chris Stolte, Diane Tang, Pat Hanrahan
KDD
2002
ACM
144views Data Mining» more  KDD 2002»
14 years 10 months ago
Efficiently mining frequent trees in a forest
Mining frequent trees is very useful in domains like bioinformatics, web mining, mining semi-structured data, and so on. We formulate the problem of mining (embedded) subtrees in ...
Mohammed Javeed Zaki
KDD
2002
ACM
157views Data Mining» more  KDD 2002»
14 years 10 months ago
Exploiting unlabeled data in ensemble methods
An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that constructs classification ensembles based on both labeled and unlabeled data. ASSEMBLE alternates between a...
Kristin P. Bennett, Ayhan Demiriz, Richard Maclin