On Mon, 7 Jan 2013, Maciej Liżewski wrote:
Could you provide an example of how to use it to recursively index the files in an archive? Let's say I have archive.zip with 3 files: file.txt, file.doc, file.pdf. I would like the output to contain the text content of all those files.
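Just set an AutoDetectParser as the Parser.class entry in the ParseContext, and you should be sorted. That entry is the parser Tika hands embedded documents to, so each file inside the archive gets detected and parsed in turn.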
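As a rough sketch of that suggestion (not taken from the Tika sources), something like the following should work, assuming tika-core and the standard parsers are on the classpath. The class name RecursiveArchiveText and the helper extractText are made up for illustration, and the archive path is taken from the first command-line argument.

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.Parser;
import org.apache.tika.sax.BodyContentHandler;

// Hypothetical example class; the name is not part of Tika.
public class RecursiveArchiveText {

    // Returns the text of the document on the stream, including the text of
    // any embedded entries (for example the files inside a zip archive).
    static String extractText(InputStream in) throws Exception {
        Parser parser = new AutoDetectParser();

        // Register the parser under Parser.class so that embedded documents
        // are passed back to it instead of being skipped.
        ParseContext context = new ParseContext();
        context.set(Parser.class, parser);

        // A write limit of -1 disables the default character limit.
        BodyContentHandler handler = new BodyContentHandler(-1);
        parser.parse(in, handler, new Metadata(), context);
        return handler.toString();
    }

    public static void main(String[] args) throws Exception {
        // args[0] is assumed to be the path to the archive, e.g. archive.zip
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(extractText(in));
        }
    }
}
```

Run against archive.zip, this should print the text of file.txt, file.doc and file.pdf as one string, provided a parser for each of those formats is available.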
It's probably worth getting the Tika source code, including the unit tests, and looking for places where a Parser.class is set on the ParseContext. That'll give you several examples to compare.
Nick
