VLDB
2001
ACM

Potter's Wheel: An Interactive Data Cleaning System

Cleaning data of errors in structure and content is important for data warehousing and integration. Current solutions for data cleaning involve many iterations of data “auditing” to find errors, and long-running transformations to fix them. Users need to endure long waits, and often write complex transformation scripts. We present Potter’s Wheel, an interactive data cleaning system that tightly integrates transformation and discrepancy detection. Users gradually build transformations to clean the data by adding or undoing transforms on a spreadsheet-like interface; the effect of a transform is shown at once on records visible on screen. These transforms are specified either through simple graphical operations, or by showing the desired effects on example data values. In the background, Potter’s Wheel automatically infers structures for data values in terms of user-defined domains, and accordingly checks for constraint violations. Thus users can gradually build a transformatio...
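The interaction described in the abstract (adding or undoing transforms, then checking values against a domain) can be illustrated with a minimal Python sketch. This is not the Potter's Wheel implementation: the names `TransformLog`, `split_column`, `find_discrepancies`, and `DATE_DOMAIN` are hypothetical, the sample rows are invented for illustration, and the domain here is a fixed user-supplied pattern rather than a structure inferred automatically as in the paper.

```python
import re
from typing import Callable, List, Pattern

Row = List[str]
Transform = Callable[[Row], Row]


class TransformLog:
    """Hypothetical stack of row transforms that supports add and undo."""

    def __init__(self) -> None:
        self._transforms: List[Transform] = []

    def add(self, transform: Transform) -> None:
        self._transforms.append(transform)

    def undo(self) -> None:
        # Drop the most recently added transform, if there is one.
        if self._transforms:
            self._transforms.pop()

    def apply(self, rows: List[Row]) -> List[Row]:
        # Re-apply every transform in order; the input rows are never mutated,
        # so undo is just "pop and apply again".
        out = []
        for row in rows:
            for transform in self._transforms:
                row = transform(row)
            out.append(row)
        return out


def split_column(index: int, sep: str) -> Transform:
    """Build a transform that splits one column into two at the first `sep`."""

    def transform(row: Row) -> Row:
        parts = row[index].split(sep, 1)
        if len(parts) == 1:
            parts.append("")  # keep the column count consistent across rows
        return row[:index] + [p.strip() for p in parts] + row[index + 1:]

    return transform


# Assumed user-defined domain: ISO-style dates (illustrative only).
DATE_DOMAIN = re.compile(r"\d{4}-\d{2}-\d{2}")


def find_discrepancies(rows: List[Row], index: int, domain: Pattern[str]) -> List[int]:
    """Return the positions of rows whose value at `index` does not fit the domain."""
    return [i for i, row in enumerate(rows) if not domain.fullmatch(row[index])]


# Invented sample data, not taken from the paper.
rows = [["Smith, Ann", "2001-09-11"], ["Jones, Bob", "Sept 2001"]]

log = TransformLog()
log.add(split_column(0, ","))
visible = log.apply(rows)                             # effect of the transform
flagged = find_discrepancies(visible, 2, DATE_DOMAIN) # rows violating the domain

log.undo()
restored = log.apply(rows)                            # back to the original rows
```

Because transforms are kept as a log and re-applied to untouched input rows, undoing one is cheap, and only the rows currently on screen would need to be recomputed in an interactive setting.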
Vijayshankar Raman, Joseph M. Hellerstein
Added 30 Jul 2010
Updated 30 Jul 2010
Type Conference
Year 2001
Where VLDB
Authors Vijayshankar Raman, Joseph M. Hellerstein