Natural Language Toolkit

Original author(s): Steven Bird, Edward Loper, Ewan Klein
Developer(s): Team NLTK
Initial release: 2001[1]
Stable release: 3.1 / 15 October 2015[2]
Preview release: 3.0b2 / 21 August 2014[3]
Written in: Python
Type: Natural language processing
License: Apache 2.0[4]
Parse tree generated with NLTK

The Natural Language Toolkit, more commonly known as NLTK, is a suite of libraries and programs for symbolic and statistical natural language processing (NLP), written in the Python programming language. NLTK includes graphical demonstrations and sample data. It is accompanied by a book that explains the concepts behind the language processing tasks supported by the toolkit,[5] plus a cookbook.[6]
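
As a minimal sketch of the symbolic side of the toolkit, the following example builds a parse tree from a small context-free grammar. The grammar and the sentence are hypothetical and chosen only for illustration; the example assumes NLTK 3.x is installed and uses `nltk.CFG.fromstring` and `nltk.ChartParser`, which need no additional corpus downloads.

```python
import nltk

# Toy grammar, invented for this illustration; it is not part of NLTK's data.
grammar = nltk.CFG.fromstring("""
S -> NP VP
NP -> Det N
VP -> V NP
Det -> 'the'
N -> 'dog' | 'cat'
V -> 'chased'
""")

# A chart parser enumerates every parse tree the grammar licenses.
parser = nltk.ChartParser(grammar)

# Whitespace tokenization is enough for this toy sentence.
tokens = "the dog chased the cat".split()
trees = list(parser.parse(tokens))

# Print each tree in bracketed notation.
for tree in trees:
    print(tree)
```

Each result is an `nltk.Tree` object, so its root label and its words can be read back with `tree.label()` and `tree.leaves()`.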

NLTK is intended to support research and teaching in NLP or closely related areas, including empirical linguistics, cognitive science, artificial intelligence, information retrieval, and machine learning.[7] NLTK has been used successfully as a teaching tool, as an individual study tool, and as a platform for prototyping and building research systems.

Library highlights

See also


  1. Project site on SourceForge. Registered 2001-07-09.
  2. "NLTK ChangeLog". Retrieved 2015-10-16.
  3. "NLTK News". Retrieved 2014-07-25.
  4. "NLTK License". NLTK Project. Retrieved 2015-02-14.
  5. Bird, Steven; Klein, Ewan; Loper, Edward (2009). Natural Language Processing with Python. O'Reilly Media Inc. ISBN 0-596-51649-5.
  6. Perkins, Jacob (2010). Python Text Processing with NLTK 2.0 Cookbook. Packt Publishing. ISBN 1849513600.
  7. Bird, Steven; Klein, Ewan; Loper, Edward; Baldridge, Jason (2008). "Multidisciplinary instruction with the Natural Language Toolkit" (PDF). Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics, ACL.

External links