Hello,

As far as I understand, Tika is a complete package: it was designed to work as a standalone application, and it also has features that allow a developer to integrate it into a larger process. There are some references that could help.

From my first look at Tika, it seems that the parser will run on zip files when invoked through the main class "Tika". There are also some authentication issues regarding password-protected files.

Regards.

On Sat, Dec 8, 2012 at 7:33 PM, Lewis John Mcgibbney <[email protected]> wrote:
> Hi All,
>
> Currently we maintain a parser plugin over in Nutch [0], which is not
> only outdated but also contains some problems.
>
> We use Tika 1.2 over in Nutch, I wonder what kind of support Tika has
> for parsing .zip files and whether someone can comment on whether I
> can work towards dropping the legacy parser for Nutch?
>
> I apologise as this is intentionally a rather vague question. I'm
> unfamiliar with .zip parsing in Tika so any information or guidance
> here is greatly appreciated.
>
> Thank you in advance
>
> Lewis
>
> [0] http://svn.apache.org/repos/asf/nutch/trunk/src/plugin/parse-zip/
>
> --
> Lewis
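
To make the .zip question above a bit more concrete, here is a minimal sketch of what any zip parser has to do at the container level: walk the archive entry by entry and hand each entry's bytes to a content parser. This is not Tika's implementation and it does not call Tika at all; it uses only the JDK's java.util.zip classes. The class and method names (ZipWalkSketch, readEntries, sampleZip) are made up for illustration, and decoding each entry as UTF-8 text is only a stand-in for real per-entry parsing. As far as I know, java.util.zip cannot read encrypted entries, so password-protected archives are out of scope for this sketch.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical illustration class; not part of Tika or Nutch.
public class ZipWalkSketch {

    /** Reads every non-directory entry and returns entry name -> entry bytes decoded as UTF-8. */
    public static Map<String, String> readEntries(InputStream in) throws IOException {
        Map<String, String> result = new LinkedHashMap<>();
        try (ZipInputStream zip = new ZipInputStream(in)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (entry.isDirectory()) {
                    continue; // directories carry no content to parse
                }
                // Drain the current entry; read() returns -1 at the end of the entry.
                ByteArrayOutputStream buf = new ByteArrayOutputStream();
                byte[] chunk = new byte[4096];
                int n;
                while ((n = zip.read(chunk)) != -1) {
                    buf.write(chunk, 0, n);
                }
                // Stand-in for a real content parser: treat the bytes as UTF-8 text.
                result.put(entry.getName(),
                        new String(buf.toByteArray(), StandardCharsets.UTF_8));
            }
        }
        return result;
    }

    /** Builds a small two-entry zip in memory so the sketch runs without any files. */
    public static byte[] sampleZip() throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry("a.txt"));
            zip.write("hello".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
            zip.putNextEntry(new ZipEntry("docs/b.txt"));
            zip.write("world".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        Map<String, String> entries = readEntries(new ByteArrayInputStream(sampleZip()));
        for (Map.Entry<String, String> e : entries.entrySet()) {
            System.out.println(e.getKey() + ": " + e.getValue());
        }
    }
}
```

A library such as Tika would, in place of the UTF-8 decoding step, detect each entry's type and delegate to a matching parser; the entry loop itself is the part a legacy zip plugin and a general-purpose parser have in common.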
