Metadata removal tool

From Wikipedia, the free encyclopedia

A metadata removal tool or metadata scrubber is a type of privacy software that protects its users by removing potentially privacy-compromising metadata from files before they are shared with others, e.g., by sending them as e-mail attachments or posting them on the Web.[1][2]


Metadata removal tools are also commonly used to reduce file sizes, particularly of image files posted on the Web. For example, a small image on a website may carry as much metadata (including an embedded thumbnail image) as image data, so removing that metadata can halve the file size.
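As a rough illustration of the size effect, the following Python sketch copies a JPEG byte stream while skipping the segments that usually carry metadata. The function name `strip_jpeg_metadata` and the choice of dropped segments are assumptions made for this example. It does not describe how any particular tool works, and it ignores side effects such as losing the Exif orientation tag.

```python
import struct

# Segment types dropped by this sketch (an illustrative choice, not a standard list):
# APP1 usually holds Exif/XMP (including the embedded thumbnail),
# APP13 usually holds IPTC/Photoshop data, COM holds free-text comments.
DROPPED_MARKERS = {0xE1, 0xED, 0xFE}


def strip_jpeg_metadata(data: bytes) -> bytes:
    """Return a copy of a JPEG byte string without the DROPPED_MARKERS segments."""
    if data[:2] != b"\xff\xd8":
        raise ValueError("not a JPEG stream (missing SOI marker)")
    out = bytearray(data[:2])
    pos = 2
    while pos + 4 <= len(data):
        if data[pos] != 0xFF:
            raise ValueError(f"expected a marker at offset {pos}")
        marker = data[pos + 1]
        if marker == 0xFF:  # optional fill byte in front of a marker
            pos += 1
            continue
        if marker == 0xDA:  # start of scan: the rest is image data, copy it as-is
            out += data[pos:]
            return bytes(out)
        # The two-byte big-endian length counts itself but not the marker bytes.
        (length,) = struct.unpack(">H", data[pos + 2:pos + 4])
        end = pos + 2 + length
        if length < 2 or end > len(data):
            raise ValueError("truncated or malformed segment")
        if marker not in DROPPED_MARKERS:
            out += data[pos:end]
        pos = end
    raise ValueError("no start-of-scan marker found")
```

Comparing `len(data)` with `len(strip_jpeg_metadata(data))` shows how many bytes the dropped segments occupied. The sketch keeps APP0 (JFIF) and APP2 (typically an ICC colour profile) so that the image still renders the same way.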

Metadata can be found in many types of files, such as documents, spreadsheets, presentations, images, and audio files. It can include details about the file's authors, creation and modification dates, geographic location (GPS coordinates), document revision history, thumbnail images, and comments.[3]
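To make this concrete for one file type: an Office Open XML document (for example a .docx file) is a ZIP package whose core properties, such as author and modification date, are stored in the part docProps/core.xml. The Python sketch below lists those properties using only the standard library. The function name `read_core_properties` is hypothetical, and the sketch only inspects metadata; it removes nothing.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def read_core_properties(package_bytes: bytes) -> dict:
    """Return the core properties of an Office Open XML package as a dict.

    Keys are element names without their XML namespace (e.g. "creator",
    "lastModifiedBy", "created", "modified"); empty elements are skipped.
    """
    with zipfile.ZipFile(io.BytesIO(package_bytes)) as package:
        try:
            xml_data = package.read("docProps/core.xml")
        except KeyError:
            return {}  # the package carries no core-properties part
    properties = {}
    for element in ET.fromstring(xml_data):
        # Tags look like "{namespace-uri}creator"; keep only the local name.
        local_name = element.tag.rsplit("}", 1)[-1]
        if element.text:
            properties[local_name] = element.text
    return properties
```

Calling `read_core_properties(open("report.docx", "rb").read())` on a hypothetical file `report.docx` would show what a recipient could learn from the core properties alone. Other parts of the package, such as comments and revision history, are not covered by this sketch.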

Since metadata is sometimes not clearly visible in authoring applications (depending on the application and its settings), there is a risk that the user will be unaware of its existence or will forget about it and, if the file is shared, private or confidential information will inadvertently be exposed. The purpose of metadata removal tools is to minimize the risk of such data leakage.[4]

The metadata removal tools that exist today can be divided into four groups:

  • Integral metadata removal tools, which are included in some applications, like the Document Inspector in Microsoft Office.
  • Batch metadata removal tools, which can process multiple files.
  • E-mail client add-ins, which are designed to remove metadata from e-mail attachments just before they are sent.
  • Server-based systems, which are designed to automatically remove metadata at the network gateway.

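To securely delete metadata from a PDF file, it is important to linearize the file afterwards. Some tools, such as ExifTool, remove metadata by appending an incremental update rather than erasing the original data, so without this step the changes are reversible and the metadata can be recovered.[5][6]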
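The two-step procedure can be sketched in Python as follows, assuming that the command-line tools exiftool and qpdf are installed and on the PATH. The function names `build_pdf_scrub_commands` and `scrub_pdf` are hypothetical, and the sketch should be checked against the documentation of both tools before being relied on for sensitive files.

```python
import shutil
import subprocess
import tempfile
from pathlib import Path


def build_pdf_scrub_commands(working_copy, output):
    """Return the two external commands used by this sketch, in order."""
    return [
        # Step 1: ask ExifTool to delete all writable metadata in place.
        ["exiftool", "-all=", "-overwrite_original", str(working_copy)],
        # Step 2: have qpdf rewrite the file in linearized form, so the data
        # superseded by ExifTool's update is not carried over to the output.
        ["qpdf", "--linearize", str(working_copy), str(output)],
    ]


def scrub_pdf(source, output):
    """Scrub a copy of `source` and write the linearized result to `output`."""
    with tempfile.TemporaryDirectory() as tmp:
        # Work on a temporary copy so the original file is never modified.
        working_copy = Path(tmp) / "working.pdf"
        shutil.copyfile(source, working_copy)
        for command in build_pdf_scrub_commands(working_copy, output):
            subprocess.run(command, check=True)  # raises if a tool fails
```

Keeping the command construction separate from the execution makes it possible to review or log the exact commands before anything is run.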

References


  1. Hassan, Nihad, and Hijazi, Rami. Digital Privacy and Security Using Windows: A Practical Guide. Apress, 2017, pp. 56-59.
  2. The Many Faces of Fraud, LAWPRO Magazine, June 2004.
  3. A Guardian Guide to Your Metadata, Interactive Graphic.
  4. Dennis O'Reilly. Remove metadata from Office files, PDFs, and images, CNET, May 16, 2014.
  5. "PDF Tags".
  6. "exiftool Application Documentation".