What modes and domains of knowledge about data production processes are most critical for producing high-quality data? This study provides an answer to this question. Data are collected via questionnaire and analyzed using linear regression. The results show some similarities and differences in which knowledge variables are significant for various data quality dimensions. Three results are of particular interest to data quality managers and researchers. The first is the complexity and mix of knowledge associated with producing accurate data. The second is the significant results overall for knowledge about the data collection process as compared to data storage and utilization processes. The third is the negative associations of the knowing-why mode of knowledge as compared to the positive associations for knowing-what and knowing-how. Each of these results has managerial implications and generates avenues for further research.
Yang W. Lee, Diane M. Strong