Sciweavers

54 search results - page 1 / 11
» Efficient parallel set-similarity joins using MapReduce
Sort
View
SIGMOD
2010
ACM
208views Database» more  SIGMOD 2010»
14 years 7 days ago
Efficient parallel set-similarity joins using MapReduce
Rares Vernica, Michael J. Carey, Chen Li
SIGMOD
2011
ACM
299views Database» more  SIGMOD 2011»
12 years 10 months ago
Processing theta-joins using MapReduce
Joins are essential for many data analysis tasks, but are not supported directly by the MapReduce paradigm. While there has been progress on equi-joins, implementation of join alg...
Alper Okcan, Mirek Riedewald
CORR
2010
Springer
205views Education» more  CORR 2010»
13 years 7 months ago
Behavioral Simulations in MapReduce
In many scientific domains, researchers are turning to large-scale behavioral simulations to better understand real-world phenomena. While there has been a great deal of work on s...
Guozhang Wang, Marcos Antonio Vaz Salles, Benjamin...
ADBIS
2009
Springer
162views Database» more  ADBIS 2009»
13 years 11 months ago
Efficient Set Similarity Joins Using Min-prefixes
Identification of all objects in a dataset whose similarity is not less than a specified threshold is of major importance for management, search, and analysis of data. Set similari...
Leonardo Ribeiro, Theo Härder
ICDE
2009
IEEE
194views Database» more  ICDE 2009»
14 years 9 months ago
Top-k Set Similarity Joins
Abstract-- Similarity join is a useful primitive operation underlying many applications, such as near duplicate Web page detection, data integration, and pattern recognition. Tradi...
Chuan Xiao, Wei Wang 0011, Xuemin Lin, Haichuan Sh...