Apache Tika
| Tika | |
|---|---|
| Developer | Apache Software Foundation |
| Stable release | 3.2.3
/ 9 September 2025 |
| Written in | Java |
| Operating system | Cross-platform |
| Type | Search and index API |
| License | Apache License 2.0 |
| Website | tika |
| Repository | Tika Repository |
Apache Tika is a content detection and analysis framework, written in Java, stewarded at the Apache Software Foundation. It detects and extracts metadata and text from over a thousand different file types, and as well as providing a Java library, has server and command-line editions suitable for use from other programming languages.