On Tue, 19 May 2015, Andrea Asta wrote:
> I was wondering if I could customize the AutoDetectParser without changing the Tika jar files.
Just add your own parser to the classpath, along with a service file (META-INF/services/org.apache.tika.parser.Parser) that lists its class name.
> I am following the Parser 5 min quick start but can't figure out where to add my new Parser.
Anywhere on your classpath. It can be in a new jar or just a lone directory on the classpath, whichever works better for you.
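
If the parser doesn't seem to get picked up, it can help to check what is actually registered. As a rough sketch: the service file is a plain text file named META-INF/services/org.apache.tika.parser.Parser holding one fully qualified class name per line (com.example.MyParser would be a made-up example). The helper below is not part of Tika; the class name ServiceFileCheck and the method registeredParsers are hypothetical, and it only uses the JDK to list every name found in such files on a given class loader.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Enumeration;
import java.util.List;

// Hypothetical debugging helper, not a Tika class.
class ServiceFileCheck {
    // Resource name that java.util.ServiceLoader looks up for Tika's Parser interface.
    static final String SERVICE_FILE = "META-INF/services/org.apache.tika.parser.Parser";

    /** Returns every class name listed in the service files visible to the given loader. */
    static List<String> registeredParsers(ClassLoader loader) throws IOException {
        List<String> names = new ArrayList<>();
        Enumeration<URL> files = loader.getResources(SERVICE_FILE);
        while (files.hasMoreElements()) {
            URL file = files.nextElement();
            try (BufferedReader in = new BufferedReader(
                    new InputStreamReader(file.openStream(), StandardCharsets.UTF_8))) {
                String line;
                while ((line = in.readLine()) != null) {
                    // Service files allow '#' comments; drop them and surrounding whitespace.
                    int hash = line.indexOf('#');
                    if (hash >= 0) {
                        line = line.substring(0, hash);
                    }
                    line = line.trim();
                    if (!line.isEmpty()) {
                        names.add(line);
                    }
                }
            }
        }
        return names;
    }

    public static void main(String[] args) throws IOException {
        // Lists the parsers registered on the application classpath, one per line.
        for (String name : registeredParsers(ClassLoader.getSystemClassLoader())) {
            System.out.println(name);
        }
    }
}
```

Run it with the same classpath as your application. If your own class name is missing from the output, the service file is probably not where the class loader expects it.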
> Is there any programmatic way to alter the AutoDetectParser (and the Tika facade) behaviour?
Currently, the default setup is that non-Tika parsers win over Tika ones when two parsers handle the same mime type. If that isn't enough, you can supply a Tika config XML file that overrides things and forces different parsers, optionally while keeping everything else the same. See
http://wiki.apache.org/tika/CompositeParserDiscussion

Nick
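
To illustrate the default precedence rule mentioned above, here is a toy model in plain Java. It is only a sketch of the rule as stated (non-Tika parsers win for a shared mime type), not Tika's actual implementation. The names ParserPrecedenceSketch, ParserInfo and resolve are hypothetical, com.example.MyPdfParser is a made-up custom parser, and "Tika parser" is assumed here to mean a class under the org.apache.tika package.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Toy model of the precedence rule; none of these types exist in Tika.
class ParserPrecedenceSketch {
    /** Stand-in for a discovered parser: its class name and the mime types it claims. */
    static final class ParserInfo {
        final String className;
        final List<String> mimeTypes;

        ParserInfo(String className, String... mimeTypes) {
            this.className = className;
            this.mimeTypes = Arrays.asList(mimeTypes);
        }
    }

    // Assumption for this sketch: Tika's own parsers live under org.apache.tika.
    static boolean isTikaParser(ParserInfo p) {
        return p.className.startsWith("org.apache.tika.");
    }

    /** Maps each mime type to the class name of the parser that wins it. */
    static Map<String, String> resolve(List<ParserInfo> discovered) {
        List<ParserInfo> ordered = new ArrayList<>(discovered);
        // Stable sort: Tika's own parsers first, everything else after them.
        ordered.sort(Comparator.comparing((ParserInfo p) -> !isTikaParser(p)));
        Map<String, String> byType = new TreeMap<>();
        for (ParserInfo p : ordered) {
            for (String type : p.mimeTypes) {
                // Later (non-Tika) entries overwrite earlier (Tika) ones.
                byType.put(type, p.className);
            }
        }
        return byType;
    }

    public static void main(String[] args) {
        Map<String, String> winners = resolve(Arrays.asList(
                new ParserInfo("com.example.MyPdfParser", "application/pdf"),
                new ParserInfo("org.apache.tika.parser.pdf.PDFParser", "application/pdf"),
                new ParserInfo("org.apache.tika.parser.txt.TXTParser", "text/plain")));
        System.out.println(winners);
    }
}
```

In this model the custom parser ends up handling application/pdf, while text/plain stays with the Tika parser because nothing else claims it.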
