Sciweavers

ICDE
2010
IEEE
408views Database» more  ICDE 2010»
14 years 7 months ago
Hive - a petabyte scale data warehouse using Hadoop
— The size of data sets being collected and analyzed in the industry for business intelligence is growing rapidly, making traditional warehousing solutions prohibitively expensiv...
Ashish Thusoo, Joydeep Sen Sarma, Namit Jain, Zhen...
ICDE
2010
IEEE
331views Database» more  ICDE 2010»
14 years 7 months ago
Navigation system for product search
Abstract— We demonstrate Product EntityCube, a product recommendation and navigation system. While the unprecedented scale of a product search portal enables to satisfy users wit...
Jongwuk Lee, Seung-won Hwang, Zaiqing Nie, Ji-Rong...
ICDE
2010
IEEE
173views Database» more  ICDE 2010»
14 years 7 months ago
Progressive clustering of networks using Structure-Connected Order of Traversal
— Network clustering enables us to view a complex network at the macro level, by grouping its nodes into units whose characteristics and interrelationships are easier to analyze ...
Dustin Bortner, Jiawei Han
ICDE
2010
IEEE
209views Database» more  ICDE 2010»
14 years 7 months ago
Optimal tree node ordering for child/descendant navigations
Abstract— There are many applications in which users interactively access huge tree data by repeating set-based navigations. In this paper, we focus on label-specific/wildcard c...
Atsuyuki Morishima, Keishi Tajima, Masateru Tadais...
ICDE
2010
IEEE
183views Database» more  ICDE 2010»
14 years 7 months ago
Estimating the compression fraction of an index using sampling
—Data compression techniques such as null suppression and dictionary compression are commonly used in today’s database systems. In order to effectively leverage compression, it...
Stratos Idreos, Raghav Kaushik, Vivek R. Narasayya...
ICDE
2010
IEEE
185views Database» more  ICDE 2010»
14 years 7 months ago
A tuple space for social networking on mobile phones
Abstract— Social networking is increasingly becoming a popular means of communication for online users. The trend is also true for offline scenarios where people use their mobil...
Emre Sarigöl, Oriana Riva, Gustavo Alonso
ICDE
2010
IEEE
231views Database» more  ICDE 2010»
14 years 7 months ago
Estimating the progress of MapReduce pipelines
Abstract— In parallel query-processing environments, accurate, time-oriented progress indicators could provide much utility given that inter- and intra-query execution times can ...
Kristi Morton, Abram Friesen, Magdalena Balazinska...
ICDE
2010
IEEE
224views Database» more  ICDE 2010»
14 years 7 months ago
Inconsistency resolution in online databases
Yannis Katsis, Alin Deutsch, Yannis Papakonstantin...
ICDE
2010
IEEE
238views Database» more  ICDE 2010»
14 years 7 months ago
Correlation hiding by independence masking
— Extracting useful correlation from a dataset has been extensively studied. In this paper, we deal with the opposite, namely, a problem we call correlation hiding (CH), which is...
Yufei Tao, Jian Pei, Jiexing Li, Xiaokui Xiao, Ke ...
ICDE
2010
IEEE
258views Database» more  ICDE 2010»
14 years 7 months ago
Anonymized Data: Generation, models, usage
Data anonymization techniques have been the subject of intense investigation in recent years, for many kinds of structured data, including tabular, item set and graph data. They e...
Graham Cormode, Divesh Srivastava