Genome databases store data about molecular biological entities such as genes, proteins, diseases, etc. The main purpose of creating and maintaining such databases in commercial ...
What modes and domains of knowledge about data production processes are most critical for producing high-quality data? This study provides an answer to this question. Data are coll...
Previous research on the skills needed by data quality professionals has focused on Information Systems (IS) curriculum standards, survey input from Information Quality (IQ) profe...
We introduce ClueMaker, the first language designed specifically for approximate record matching. Clues written in ClueMaker predict whether two records denote the same thing based...
Martin Buechi, Andrew Borthwick, Adam Winkel, Arth...