On Tue, 19 May 2015, Andrea Asta wrote:
I was wondering if I could customize the AutoDetectParser without changing the Tika jar files.

Just add your own parser to the classpath, along with a service file that lists it.

I am following the Parser 5 min quick start but can't figure out where to
add my new Parser.

Anywhere on your classpath. It can be in a new jar or just a lone directory on the classpath, whichever works better for you.
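As a rough sketch of the "lone directory" option: the service file is a plain-text file named after the Parser interface, placed under META-INF/services, with the fully qualified name of your parser class as its content. The small helper below only writes that file, using nothing beyond the JDK. The directory name "extra-parsers" and the class name com.example.MyParser are made up for illustration; substitute your own, and make sure the compiled parser class sits under the same classpath root.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class ParserServiceFile {
    // Name of the interface that the service file advertises implementations of.
    static final String SERVICE = "org.apache.tika.parser.Parser";

    /** Writes META-INF/services/<SERVICE> under classpathRoot and returns its path. */
    static Path write(Path classpathRoot, String parserClassName) throws IOException {
        Path file = classpathRoot.resolve("META-INF").resolve("services").resolve(SERVICE);
        Files.createDirectories(file.getParent());
        // One fully qualified class name per line.
        Files.write(file, (parserClassName + "\n").getBytes(StandardCharsets.UTF_8));
        return file;
    }

    public static void main(String[] args) throws IOException {
        // "extra-parsers" and com.example.MyParser are hypothetical placeholders.
        Path root = Paths.get(args.length > 0 ? args[0] : "extra-parsers");
        Path file = write(root, "com.example.MyParser");
        System.out.println("Wrote " + file);
    }
}
```

With the directory (or a jar built from it) added to the classpath next to the unmodified Tika jars, the parser should be picked up by the usual service loading, without touching the Tika jars themselves.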

Is there any programmatic way to alter the AutoDetectParser (and the Tika
facade) behaviour?

By default, when two parsers handle the same MIME type, non-Tika parsers currently win over Tika ones. If that isn't what you want, you can supply a Tika config XML file that overrides the defaults and forces different parsers, optionally keeping everything else the same. See
http://wiki.apache.org/tika/CompositeParserDiscussion
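
For a rough idea of what such a config file might look like, here is a sketch that generates one with plain JDK code. It keeps the default parsers for everything except one MIME type and routes that type to a custom parser. The element names (mime-exclude, mime) are my recollection of the config format and depend on your Tika version, so treat them as assumptions and check them against the page above. The class name com.example.MyParser is a made-up placeholder.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class TikaConfigSketch {
    /** Builds a config that sends mimeType to parserClassName and leaves the rest alone. */
    static String configXml(String parserClassName, String mimeType) {
        return String.join("\n",
            "<?xml version=\"1.0\" encoding=\"UTF-8\"?>",
            "<properties>",
            "  <parsers>",
            "    <!-- Keep the default parsers, except for the overridden type -->",
            "    <parser class=\"org.apache.tika.parser.DefaultParser\">",
            "      <mime-exclude>" + mimeType + "</mime-exclude>",
            "    </parser>",
            "    <!-- Route that type to the custom parser instead -->",
            "    <parser class=\"" + parserClassName + "\">",
            "      <mime>" + mimeType + "</mime>",
            "    </parser>",
            "  </parsers>",
            "</properties>",
            "");
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical parser class and MIME type, for illustration only.
        String xml = configXml("com.example.MyParser", "application/pdf");
        Path out = Paths.get("tika-config.xml");
        Files.write(out, xml.getBytes(StandardCharsets.UTF_8));
        System.out.println("Wrote " + out.toAbsolutePath());
    }
}
```

On the programmatic side, the resulting file would typically be loaded into a TikaConfig, which can then be passed to the AutoDetectParser or Tika facade constructors. Check which constructors your Tika version offers.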

Nick
