Claimed by
Apache Software Foundation
Analyzed 3 months ago
The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.
Tika is a project of the Apache Software Foundation, and was formerly a subproject of Apache Lucene.
399K
lines of code
19
current contributors
4 months
since last commit
23
users on Open Hub