Quasi-identifiers are pieces of information that are not of themselves unique identifiers, but are sufficiently well correlated with an entity that they can be combined with other quasi-identifiers to create a unique identifier.
Quasi-identifiers can thus, when combined, become personally identifying information. This process is called re-identification. Motwani and Ying warn about potential privacy breaches being enabled by publication of large volumes of government and business data containing quasi-identifiers. As an example, neither gender, birth dates nor postal codes uniquely identify an individual, but the combination of all three is sufficient to identify 87% of individuals in the United States.
- "Glossary of Statistical Terms: Quasi-identifier". OECD. November 10, 2005. Retrieved 29 September 2013.
- Rajeev Motwani and Ying Xu (2007). "Efficient Algorithms for Masking and Finding Quasi-Identifiers". Proceedings of the Conference on Very Large Data Bases (VLDB).
|This statistics-related article is a stub. You can help Wikipedia by expanding it.|