On Fri, 3 Nov 2017, Markus Jelsma wrote:
This is how Nutch gets the parser:
Parser parser = tikaConfig.getParser(MediaType.parse(mimeType));
When no custom config is specified, the config is:
new TikaConfig(this.getClass().getClassLoader());
When I specify a custom config, it is:
tikaConfig = new TikaConfig(conf.getResource(customConfFile));
I think you need to give both the classloader and the config file for your
setup.
Can you try this constructor:
https://tika.apache.org/1.16/api/org/apache/tika/config/TikaConfig.html#TikaConfig-java.net.URL-java.lang.ClassLoader-
With something like:
new TikaConfig(conf.getResource(customConfFile),
this.getClass().getClassLoader());
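
For context on why the loader matters: a config file only names classes, and
something still has to resolve those names. Below is a minimal stdlib-only
Java sketch of that idea. It is not Tika's or Nutch's actual code, and
ClassLoaderSketch, PluginParser, instantiate and canLoad are hypothetical
names made up for illustration. It assumes the parser classes are visible to
the plugin's classloader but not to some other loader, in which case the
same class name resolves through one loader and fails through the other.

```java
import java.net.URL;
import java.net.URLClassLoader;

class ClassLoaderSketch {

    /** Hypothetical stand-in for a parser class that ships inside a plugin. */
    public static class PluginParser {
        public PluginParser() {}
    }

    /** Resolve a class name (as a config file might list it) through the given loader. */
    static Object instantiate(String className, ClassLoader loader)
            throws ReflectiveOperationException {
        Class<?> cls = Class.forName(className, true, loader);
        return cls.getDeclaredConstructor().newInstance();
    }

    /** Report whether the named class can be loaded and instantiated via the loader. */
    static boolean canLoad(String className, ClassLoader loader) {
        try {
            instantiate(className, loader);
            return true;
        } catch (ReflectiveOperationException e) {
            return false;
        }
    }

    public static void main(String[] args) throws Exception {
        String name = PluginParser.class.getName();

        // The loader that loaded this class can also see PluginParser.
        ClassLoader pluginLoader = ClassLoaderSketch.class.getClassLoader();

        // A loader with no URLs and no application parent sees only core JDK classes.
        try (URLClassLoader isolated = new URLClassLoader(new URL[0], null)) {
            System.out.println("plugin loader:   " + canLoad(name, pluginLoader));
            System.out.println("isolated loader: " + canLoad(name, isolated));
        }
    }
}
```

Passing this.getClass().getClassLoader() alongside the config URL is meant to
put you in the first case rather than the second.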
Nick