Natural Language Toolkit
|Original author(s)||Steven Bird, Edward Loper, Ewan Klein|
3.2.3 / 16 May 2017
3.0b2 / 21 August 2014
|Type||Natural language processing|
The Natural Language Toolkit, or more commonly NLTK, is a suite of libraries and programs for symbolic and statistical natural language processing (NLP) for English written in the Python programming language. It was developed by Steven Bird and Edward Loper in the Department of Computer and Information Science at the University of Pennsylvania. NLTK includes graphical demonstrations and sample data. It is accompanied by a book that explains the underlying concepts behind the language processing tasks supported by the toolkit, plus a cookbook.
NLTK is intended to support research and teaching in NLP or closely related areas, including empirical linguistics, cognitive science, artificial intelligence, information retrieval, and machine learning. NLTK has been used successfully as a teaching tool, as an individual study tool, and as a platform for prototyping and building research systems. There are 32 universities in the US and 25 countries using NLTK in their courses. NLTK supports classification, tokenization, stemming, tagging, parsing, and semantic reasoning functionalities.
- Lexical analysis: Word and text tokenizer
- n-gram and collocations
- Part-of-speech tagger
- Tree model and Text chunker for capturing
- Named-entity recognition
- "Project site on SourceForge". 9 July 2001.
- "NLTK ChangeLog". nltk.org. Retrieved 2016-05-17.
- "NLTK News". nltk.org. Retrieved 2014-07-25.
- "NLTK License". NLTK Project. Retrieved 2015-02-14.
- "Preface". www.nltk.org. Retrieved 2016-06-15.
- Bird, Steven; Klein, Ewan; Loper, Edward (2009). Natural Language Processing with Python. O'Reilly Media Inc. ISBN 0-596-51649-5.
- Perkins, Jacob (2010). Python Text Processing with NLTK 2.0 Cookbook. Packt Publishing. ISBN 1849513600.
- Bird, Steven; Klein, Ewan; Loper, Edward; Baldridge, Jason (2008). "Multidisciplinary instruction with the Natural Language Toolkit" (PDF). Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics, ACL.
- "NLTK Courses". Google Docs. Retrieved 2016-06-15.
- Official website
- NLTK book online
- NLTK development at GitHub
- API documentation
- on YouTube. Presented at the Google Campus on 12 June 2007.
|This programming-language-related article is a stub. You can help Wikipedia by expanding it.|