Apache Nutch

Apache Nutch
Original authorsDoug Cutting, Mike Cafarella
DeveloperApache Software Foundation
Stable release
1.x1.21 / 20 July 2025 (2025-07-20)
2.x2.4 / 11 October 2019 (2019-10-11)
Written inJava
Operating systemCross-platform
TypeWeb crawler
LicenseApache License 2.0
Websitenutch.apache.org
RepositoryNutch Github Repository

Apache Nutch is a highly extensible and scalable open source web crawler software project.