Sciweavers

310 search results - page 14 / 62
» High-Dimensional Similarity Joins
Sort
View
DMIN
2009
185views Data Mining» more  DMIN 2009»
13 years 6 months ago
A Sparse Coding Based Similarity Measure
In high dimensional data sets not all dimensions contain an equal amount of information and most of the time global features are more important than local differences. This makes ...
Sebastian Klenk, Gunther Heidemann
SSDBM
2005
IEEE
184views Database» more  SSDBM 2005»
14 years 2 months ago
Optimizing Multiple Top-K Queries over Joins
Advanced Data Mining applications require more and more support from relational database engines. Especially clustering applications in high dimensional features space demand a pr...
Dirk Habich, Wolfgang Lehner, Alexander Hinneburg
SIGMOD
2012
ACM
288views Database» more  SIGMOD 2012»
11 years 11 months ago
Exploiting MapReduce-based similarity joins
Cloud enabled systems have become a crucial component to efficiently process and analyze massive amounts of data. One of the key data processing and analysis operations is the Sim...
Yasin N. Silva, Jason M. Reed
CORR
2011
Springer
186views Education» more  CORR 2011»
13 years 3 months ago
Similarity Join Size Estimation using Locality Sensitive Hashing
Similarity joins are important operations with a broad range of applications. In this paper, we study the problem of vector similarity join size estimation (VSJ). It is a generali...
Hongrae Lee, Raymond T. Ng, Kyuseok Shim
ICDE
2006
IEEE
161views Database» more  ICDE 2006»
14 years 10 months ago
A Primitive Operator for Similarity Joins in Data Cleaning
Data cleaning based on similarities involves identification of "close" tuples, where closeness is evaluated using a variety of similarity functions chosen to suit the do...
Surajit Chaudhuri, Venkatesh Ganti, Raghav Kaushik