ABSTRACT

This entry presents a definition of data and their quality dimensions as the basis for a survey of data quality management and improvement techniques, with special attention to data published on the Internet. In recent years, several factors have made improved data quality an urgent priority. These factors include “data quality disasters” in areas such as the 2000 U.S. presidential election, corporate reporting, and homeland security; the stunning growth in data volumes; and the recognition of the importance of data to the modern organization. This entry does not treat issues of database system quality, such as system reliability, accessibility, and usability, or related issues such as data security.