Apache Tika

Version 1.10, Apache License
An open source toolkit for parsing, analyzing, and extracting metadata and content from files, with support for a broad range of file types

Apache Tika was developed as a low-level toolkit for finding and extracting the content and metadata stored inside files.

As a library, Tika does little on its own, but it can be integrated into larger tools such as search engines, digital asset management systems, or CMSs to provide a fully functional in-file search system.

The library can read just a file's header to get quick, general information about the file, or it can parse the file's body to search for various types of data, in text or binary format.
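To illustrate the header-only approach, here is a minimal sketch that guesses a file's type from its first few bytes. It uses only the Java standard library and is not Tika's own implementation: the class name `HeaderSniffer`, the `detect` method, and the small signature table are hypothetical and chosen purely for illustration. A real toolkit would recognize far more formats and combine several kinds of hints.

```java
import java.nio.charset.StandardCharsets;

public class HeaderSniffer {
    // Hypothetical signature table: well-known leading bytes ("magic numbers").
    private static final byte[][] MAGIC = {
        "%PDF-".getBytes(StandardCharsets.US_ASCII),      // PDF
        {(byte) 0x89, 'P', 'N', 'G'},                      // PNG
        {(byte) 0xFF, (byte) 0xD8, (byte) 0xFF},           // JPEG
        {'P', 'K', 0x03, 0x04},                            // ZIP container
    };

    // Media type reported for the signature at the same index in MAGIC.
    private static final String[] TYPES = {
        "application/pdf",
        "image/png",
        "image/jpeg",
        "application/zip",
    };

    /** Returns a media type guessed from the leading bytes of a file. */
    public static String detect(byte[] header) {
        for (int i = 0; i < MAGIC.length; i++) {
            if (startsWith(header, MAGIC[i])) {
                return TYPES[i];
            }
        }
        // Generic fallback when no signature matches.
        return "application/octet-stream";
    }

    // True if data begins with every byte of prefix, in order.
    private static boolean startsWith(byte[] data, byte[] prefix) {
        if (data.length < prefix.length) {
            return false;
        }
        for (int i = 0; i < prefix.length; i++) {
            if (data[i] != prefix[i]) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        byte[] sample = "%PDF-1.4\n".getBytes(StandardCharsets.US_ASCII);
        System.out.println(detect(sample)); // prints "application/pdf"
    }
}
```

Only the first handful of bytes is inspected, which is why this kind of check is fast. Going deeper, for example to pull text out of the body, requires a format-specific parser.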

A wide range of file types is supported, and Tika can also be used from other programming languages thanks to a series of third-party bindings and wrappers.
Last updated on August 13th, 2015
