On Sat, Dec 8, 2012 at 10:11 PM, Jose Carlos Canova <[email protected]> wrote: > Hello, > > Tika as far i understood is an "entire package" which means, was designed to > work as a standalone application with features that allows a developer to > integrate it into a "major process", there are some references that could > help.
No. Tika is a library with an API. > > On my first "view" of the "world of tika" seems that the parser will run on > zip files calling the main class "Tika", there is also some authentication > issues regarding "password protected files". You will need to be much more specific to get help with this. > > regards. > > On Sat, Dec 8, 2012 at 7:33 PM, Lewis John Mcgibbney > <[email protected]> wrote: >> >> Hi All, >> >> Currently we maintain a parser plugin over in Nutch [0], which is not >> only outdated but also contains some problems. >> >> We use Tika 1.2 over in Nutch, I wonder what kind of support Tika has >> for parsing .zip files and whether someone can comment on whether I >> can work towards dropping the legacy parser for Nutch? >> >> I apologise as this is intentionally a rather vague question. I'm >> unfamiliar with .zip parsing in Tika so any information or guidance >> here is greatly appreciated. >> >> Thank you in advance >> >> Lewis >> >> [0] http://svn.apache.org/repos/asf/nutch/trunk/src/plugin/parse-zip/ >> >> -- >> Lewis > >
