Natural Language Toolkit

Original authors: Steven Bird, Edward Loper, Ewan Klein
Developer: Team NLTK
Initial release: 2001
Stable release: 3.9.1 / 19 August 2024
Written in: Python
Type: Natural language processing
License: Apache 2.0
Website: www.nltk.org

The Natural Language Toolkit, more commonly known as NLTK, is a suite of libraries and programs, written in the Python programming language, for symbolic and statistical natural language processing (NLP) of English text. It supports classification, tokenization, stemming, tagging, parsing, and semantic reasoning. It was developed by Steven Bird and Edward Loper in the Department of Computer and Information Science at the University of Pennsylvania. NLTK includes graphical demonstrations and sample data. It is accompanied by a book that explains the concepts underlying the language processing tasks the toolkit supports, and by a cookbook.
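As an illustration of two of the tasks listed above, the following minimal sketch tokenizes a sentence and stems the resulting tokens. It assumes NLTK is installed; the sample sentence and variable names are made up for this example. It deliberately uses only components that need no additional corpus or model downloads (RegexpTokenizer, PorterStemmer, FreqDist); tagging and parsing typically require extra data packages and are not shown.

```python
from nltk.tokenize import RegexpTokenizer
from nltk.stem import PorterStemmer
from nltk.probability import FreqDist

# Illustrative input sentence (not taken from any NLTK corpus).
text = "The cats chased the dogs, and the dogs chased the cats back."

# Tokenization: keep runs of word characters, dropping punctuation.
tokenizer = RegexpTokenizer(r"\w+")
tokens = tokenizer.tokenize(text.lower())

# Stemming: reduce each token to a stem with the Porter algorithm.
stemmer = PorterStemmer()
stems = [stemmer.stem(token) for token in tokens]

# Count how often each stem occurs.
stem_counts = FreqDist(stems)

print(tokens)
print(stem_counts.most_common(3))
```

Here the plural forms "cats" and "dogs" are reduced to the stems "cat" and "dog", so both spellings of a word are counted together. A regular-expression tokenizer is used only to keep the example free of downloads; other NLTK tokenizers may split text differently.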

NLTK is intended to support research and teaching in NLP or closely related areas, including empirical linguistics, cognitive science, artificial intelligence, information retrieval, and machine learning. NLTK has been used successfully as a teaching tool, as an individual study tool, and as a platform for prototyping and building research systems.