Sciweavers

1260 search results - page 146 / 252
» Data Quality in Genome Databases
Sort
View
ICDE
2007
IEEE
148views Database» more  ICDE 2007»
14 years 10 months ago
Conquering the Divide: Continuous Clustering of Distributed Data Streams
Data is often collected over a distributed network, but in many cases, is so voluminous that it is impractical and undesirable to collect it in a central location. Instead, we mus...
Graham Cormode, S. Muthukrishnan, Wei Zhuang
ICDE
2007
IEEE
146views Database» more  ICDE 2007»
14 years 10 months ago
Conditional Functional Dependencies for Data Cleaning
We propose a class of constraints, referred to as conditional functional dependencies (CFDs), and study their applications in data cleaning. In contrast to traditional functional ...
Philip Bohannon, Wenfei Fan, Floris Geerts, Xibei ...
ICDE
2002
IEEE
128views Database» more  ICDE 2002»
14 years 10 months ago
Efficiently Ordering Query Plans for Data Integration
The goal of a data integration system is to provide a uniform interface to a multitude of data sources. Given a user query formulated in this interface, the system translates it i...
AnHai Doan, Alon Y. Halevy
ADBIS
2008
Springer
219views Database» more  ADBIS 2008»
14 years 3 months ago
Data Stream Analysis for Location-Aware Collaborative Information Retrieval
Abstract. We propose a new approach for enhancing collaborative information retrieval by means of incorporating positional data for a location-aware personalized retrieval process....
Andreas Behrend, Frank Reichartz, Christian Dorau,...
ICDM
2002
IEEE
163views Data Mining» more  ICDM 2002»
14 years 2 months ago
High Performance Data Mining Using the Nearest Neighbor Join
The similarity join has become an important database primitive to support similarity search and data mining. A similarity join combines two sets of complex objects such that the r...
Christian Böhm, Florian Krebs