Sciweavers

SIGMOD
2010
ACM
364views Database» more  SIGMOD 2010»
14 years 16 days ago
Optimizing schema-last tuple-store queries in graphd
Current relational databases require that a database schema exist prior to data entry and require manual optimization for best performance. We describe the query optimization tech...
Scott M. Meyer, Jutta Degener, John Giannandrea, B...
SIGMOD
2010
ACM
312views Database» more  SIGMOD 2010»
14 years 16 days ago
TwitterMonitor: trend detection over the twitter stream
We present TwitterMonitor, a system that performs trend detection over the Twitter stream. The system identifies emerging topics (i.e. ‘trends’) on Twitter in real time and p...
Michael Mathioudakis, Nick Koudas
SIGMOD
2010
ACM
321views Database» more  SIGMOD 2010»
14 years 16 days ago
HadoopDB in action: building real world applications
HadoopDB is a hybrid of MapReduce and DBMS technologies, designed to meet the growing demand of analyzing massive datasets on very large clusters of machines. Our previous work ha...
Azza Abouzied, Kamil Bajda-Pawlikowski, Jiewen Hua...
SIGMOD
2010
ACM
208views Database» more  SIGMOD 2010»
14 years 16 days ago
Efficient parallel set-similarity joins using MapReduce
Rares Vernica, Michael J. Carey, Chen Li
SIGMOD
2010
ACM
377views Database» more  SIGMOD 2010»
14 years 16 days ago
Online aggregation and continuous query support in MapReduce
MapReduce is a popular framework for data-intensive distributed computing of batch jobs. To simplify fault tolerance, the output of each MapReduce task and job is materialized to ...
Tyson Condie, Neil Conway, Peter Alvaro, Joseph M....
SIGMOD
2010
ACM
236views Database» more  SIGMOD 2010»
14 years 16 days ago
Differentially private aggregation of distributed time-series with transformation and encryption
We propose PASTE, the first differentially private aggregation algorithms for distributed time-series data that offer good practical utility without any trusted server. PASTE add...
Vibhor Rastogi, Suman Nath
SIGMOD
2010
ACM
231views Database» more  SIGMOD 2010»
14 years 16 days ago
Automatically incorporating new sources in keyword search-based data integration
Scientific data offers some of the most interesting challenges in data integration today. Scientific fields evolve rapidly and accumulate masses of observational and experiment...
Partha Pratim Talukdar, Zachary G. Ives, Fernando ...
SIGMOD
2010
ACM
153views Database» more  SIGMOD 2010»
14 years 16 days ago
Threshold query optimization for uncertain data
The probabilistic threshold query (PTQ) is one of the most common queries in uncertain databases, where all results satisfying the query with probabilities that meet the threshold...
Yinian Qi, Rohit Jain, Sarvjeet Singh, Sunil Prabh...
SIGMOD
2010
ACM
197views Database» more  SIGMOD 2010»
14 years 16 days ago
TACO: tunable approximate computation of outliers in wireless sensor networks
Nikos Giatrakos, Yannis Kotidis, Antonios Deligian...
SIGMOD
2010
ACM
223views Database» more  SIGMOD 2010»
14 years 16 days ago
Finding maximal cliques in massive networks by H*-graph
Maximal clique enumeration (MCE) is a fundamental problem in graph theory and has important applications in many areas such as social network analysis and bioinformatics. The prob...
James Cheng, Yiping Ke, Ada Wai-Chee Fu, Jeffrey X...