Sciweavers

SSDBM
2010
IEEE
220views Database» more  SSDBM 2010»
14 years 18 hour ago
Prefix Tree Indexing for Similarity Search and Similarity Joins on Genomic Data
Similarity search and similarity join on strings are important for applications such as duplicate detection, error detection, data cleansing, or comparison of biological sequences....
Astrid Rheinländer, Martin Knobloch, Nicky Ho...
SIGMOD
2010
ACM
144views Database» more  SIGMOD 2010»
14 years 22 hour ago
Interactive visual exploration of neighbor-based patterns in data streams
We will demonstrate our system, called V iStream, supporting interactive visual exploration of neighbor-based patterns [7] in data streams. V istream does not only apply innovativ...
Di Yang, Zhenyu Guo, Zaixian Xie, Elke A. Rundenst...
SIGMOD
2010
ACM
310views Database» more  SIGMOD 2010»
14 years 22 hour ago
The DataPath system: a data-centric analytic processing engine for large data warehouses
Since the 1970’s, database systems have been “compute-centric”. When a computation needs the data, it requests the data, and the data are pulled through the system. We belie...
Subi Arumugam, Alin Dobra, Christopher M. Jermaine...
ICDE
2010
IEEE
184views Database» more  ICDE 2010»
14 years 22 hour ago
On optimal anonymization for l+-diversity
-- Publishing person specific data while protecting privacy is an important problem. Existing algorithms that enforce the privacy principle called l-diversity are heuristic based d...
Junqiang Liu, Ke Wang
ICDE
2010
IEEE
203views Database» more  ICDE 2010»
14 years 22 hour ago
Optimizing ETL workflows for fault-tolerance
Extract-Transform-Load (ETL) processes play an important role in data warehousing. Typically, design work on ETL has focused on performance as the sole metric to make sure that the...
Alkis Simitsis, Kevin Wilkinson, Umeshwar Dayal, M...
ICDE
2010
IEEE
176views Database» more  ICDE 2010»
14 years 22 hour ago
Efficient fuzzy type-ahead search in TASTIER
TASTIER is a research project on the new information-access paradigm called type-ahead search, in which systems find answers to a keyword query on-the-fly as users type in the quer...
Guoliang Li, Shengyue Ji, Chen Li, Jiannan Wang, J...
ICDE
2010
IEEE
145views Database» more  ICDE 2010»
14 years 22 hour ago
MASS: a multi-facet domain-specific influential blogger mining system
With rapid development of web 2.0 technology and e-business, bloggers play significant roles in the blogosphere as well as the external world. In particular, influential bloggers c...
Yichuan Cai, Yi Chen
ICDE
2010
IEEE
248views Database» more  ICDE 2010»
14 years 22 hour ago
MashRank: Towards uncertainty-aware and rank-aware mashups
Mashups are situational applications that build data flows to link the contents of multiple Web sources. Often times, ranking the results of a mashup is handled in a materializethe...
Mohamed A. Soliman, Mina Saleeb, Ihab F. Ilyas
ICDE
2010
IEEE
750views Database» more  ICDE 2010»
14 years 22 hour ago
Efficient and accurate discovery of patterns in sequence datasets
Existing sequence mining algorithms mostly focus on mining for subsequences. However, a large class of applications, such as biological DNA and protein motif mining, require effici...
Avrilia Floratou, Sandeep Tata, Jignesh M. Patel