On Tue, 10 Jun 2014, Carlos Scheidecker wrote:
I wonder if I have to have some logic to call the proper parser based on the metadata information. Such as, if it is a PDF then call the pdfparser, DOC docparser and so on.
As long as you have the Tika Parser jar on your classpath, along with its dependencies, AutoDetectParser will do this selection for you.
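A minimal sketch of what that looks like, assuming the Tika core and parser jars (plus their dependencies) are on the classpath. The class name AutoDetectExample and the helper extract() are made up for illustration; the Tika classes and the parse() call are the standard ones:

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.sax.BodyContentHandler;

public class AutoDetectExample {
    // Hypothetical helper: detects the type, delegates to the matching
    // parser, and returns the extracted body text.
    static String extract(InputStream in, Metadata metadata) throws Exception {
        AutoDetectParser parser = new AutoDetectParser();
        // -1 disables the default limit on how much text is collected
        BodyContentHandler handler = new BodyContentHandler(-1);
        parser.parse(in, handler, metadata, new ParseContext());
        return handler.toString();
    }

    public static void main(String[] args) throws Exception {
        Metadata metadata = new Metadata();
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            String text = extract(in, metadata);
            // The detected media type is recorded in the metadata
            System.out.println("Type: " + metadata.get(Metadata.CONTENT_TYPE));
            System.out.println(text);
        }
    }
}
```

No per-format branching is needed in your own code. If the text comes back empty for a PDF or DOC file, that usually points at missing parser jars rather than at the detection step.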
Otherwise, are you sure you have the tika parser jar on your classpath, along with all of the dependencies? Try asking DefaultParser what Parsers it knows about, and ensure you've not lost any. The standalone tika-app can tell you which ones to expect.
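One way to ask DefaultParser, as a rough sketch: the class name ListParsers and the helper knownParsers() are made up for illustration, and it assumes the same classpath as your application so the output reflects what your code actually sees:

```java
import java.util.List;

import org.apache.tika.parser.DefaultParser;
import org.apache.tika.parser.Parser;

public class ListParsers {
    // Hypothetical helper: every parser DefaultParser managed to load
    // from the current classpath.
    static List<Parser> knownParsers() {
        return new DefaultParser().getAllComponentParsers();
    }

    public static void main(String[] args) {
        for (Parser p : knownParsers()) {
            System.out.println(p.getClass().getName());
        }
    }
}
```

Compare that list with what the tika-app jar prints for --list-parsers. Anything present there but absent from your own output suggests a parser jar or one of its dependencies did not make it onto your classpath.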
I can still only suggest you try this.

Nick
