On Fri, 8 Oct 2010, Jan Høydahl / Cominvent wrote:
My question was for a very specific usecase which is easy to do by a small source code modification but perhaps harder to do with configuration only.

Looking at the AutoDetectParser source code, the last parser registered for a given mime type wins. So, if you have your custom word parser register after the built in one, then your custom one gets used.

You might find that if you're using the service file method of listing the parsers to load, then you just need to get your custom parser jar file to sort lexographically after the main tika parsers jar, but that's one to test. (The service registry we use is the javax.imageio one)

Nick

Reply via email to