From Wikipedia, the free encyclopedia
Original author(s): Didier Stevens
Initial release: May 2, 2008
Written in: Python
Operating system: Multiplatform, including smartphones
Type: PDF software
License: Public domain

pdf-parser is a command-line program that parses and analyses PDF documents. It can extract raw data from a PDF document, such as compressed images. pdf-parser can handle malicious PDF documents that use the obfuscation features of the PDF language.[1] The tool can also be used to extract data from damaged or corrupt PDF documents.[2]
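
To illustrate what extracting raw data from a PDF involves, the following Python sketch locates stream objects in a file's bytes and decompresses those that declare the FlateDecode (zlib) filter. It is not pdf-parser's own code: the names STREAM_RE and extract_streams are hypothetical, and the sketch assumes simple, well-formed objects with no chained filters, encryption, or object streams.

```python
import re
import zlib

# Simplified pattern for "N G obj <dictionary> stream <data> endstream".
# The tempered dot keeps the dictionary match from crossing an "endobj",
# so an object without a stream is never merged with the next object.
STREAM_RE = re.compile(
    rb"(\d+)\s+(\d+)\s+obj((?:(?!endobj).)*?)stream\r?\n(.*?)endstream",
    re.DOTALL,
)


def extract_streams(pdf_bytes):
    """Return a list of (object_number, data) for each stream object found.

    Hypothetical helper for illustration only. Streams marked /FlateDecode
    are decompressed; other streams are returned as stored, including any
    end-of-line bytes that precede the "endstream" keyword.
    """
    results = []
    for match in STREAM_RE.finditer(pdf_bytes):
        obj_num = int(match.group(1))
        dictionary = match.group(3)
        raw = match.group(4)
        if b"/FlateDecode" in dictionary:
            try:
                # decompressobj stops at the end of the zlib data, so the
                # end-of-line bytes before "endstream" are ignored.
                raw = zlib.decompressobj().decompress(raw)
            except zlib.error:
                pass  # leave damaged data as stored
        results.append((obj_num, raw))
    return results
```

A regular expression is used here only for brevity. A tool intended for obfuscated or damaged documents needs a more tolerant tokenizer, since names and filters in the PDF language can be written in several equivalent encoded forms.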


  1. Bojan Zdrnja, "PDF Babushka", Internet Storm Center, January 14, 2010
  2. Online PDF Translator