Dirty data

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Dirty data is inaccurate, incomplete or erroneous data, especially in a computer system or database.[1]

In reference to databases, this is data that contain errors. Unclean data can contain such mistakes as spelling or punctuation errors, incorrect data associated with a field, incomplete or outdated data, or even data that has been duplicated in the database.

See also[edit]


  1. ^ Margaret Chu (2004), "What Are Dirty Data?", Blissful Data, p. 71 et seq., ISBN 9780814407806