Sciweavers

1126 search results - page 69 / 226
» Similarity Evaluation on Tree-structured Data
Sort
View
96
Voted
JMLR
2007
58views more  JMLR 2007»
15 years 4 months ago
Distances between Data Sets Based on Summary Statistics
The concepts of similarity and distance are crucial in data mining. We consider the problem of defining the distance between two data sets by comparing summary statistics compute...
Nikolaj Tatti
PAKDD
2010
ACM
178views Data Mining» more  PAKDD 2010»
15 years 8 months ago
SkyDist: Data Mining on Skyline Objects
The skyline operator is a well established database primitive which is traditionally applied in a way that only a single skyline is computed. In this paper we use multiple skylines...
Christian Böhm, Annahita Oswald, Claudia Plan...
ICDM
2007
IEEE
103views Data Mining» more  ICDM 2007»
15 years 10 months ago
An Examination of Experimental Methodology for Classifiers of Relational Data
Experimental methodology for evaluating classification algorithms in relational (i.e., networked) data is complicated by dependencies between related data instances. We survey the...
Brian Gallagher, Tina Eliassi-Rad
VLDB
2002
ACM
110views Database» more  VLDB 2002»
15 years 3 months ago
Eliminating Fuzzy Duplicates in Data Warehouses
The duplicate elimination problem of detecting multiple tuples, which describe the same real world entity, is an important data cleaning problem. Previous domain independent solut...
Rohit Ananthakrishna, Surajit Chaudhuri, Venkatesh...
IGARSS
2009
15 years 1 months ago
Reducing the Dimensionality of Hyperspectral Data using Diffusion Maps
We examine the analysis of hyperspectral data produced by the Hyperspectral Core Imager of AngloGold Ashanti. The dimension of the data is reduced using diffusion maps and the dat...
Luis du Plessis, Steven Damelin, Michael Sears