[ 
https://issues.apache.org/jira/browse/TIKA-2491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16238177#comment-16238177
 ] 

Tim Allison commented on TIKA-2491:
-----------------------------------

[~gagravarr] solved this:
bq. I think you need to give both the classloader and the config file for your 
setup
bq. Can you try this 
constructor:https://tika.apache.org/1.16/api/org/apache/tika/config/TikaConfig.html#TikaConfig-java.net.URL-java.lang.ClassLoader-

bq. With something like new TikaConfig(conf.getResource(customConfFile), 
this.getClass().getClassLoader());

Nick, this seems strange that we allow for not including the classloader with 
regular TikaConfig(), but we require it if specifying a config file.  Should we 
do something like this:

{noformat}
        if (serviceLoaderElement != null) {
             ...some stuff...
+           if (loader == null) {
+                loader = ServiceLoader.getContextClassLoader();
+            }
            serviceLoader = new ServiceLoader(loader, loadErrorHandler, 
initializableProblemHandler, dynamic);
        } else if(loader != null) {
            serviceLoader = new ServiceLoader(loader);
        } else {
            serviceLoader = new ServiceLoader();
        }
{noformat}


> Cannot use TikaConfig
> ---------------------
>
>                 Key: TIKA-2491
>                 URL: https://issues.apache.org/jira/browse/TIKA-2491
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.16
>            Reporter: Markus Jelsma
>             Fix For: 1.17
>
>         Attachments: tika-config.xml
>
>
> I need to use a custom tika-config.xml in Nutch, which has support for it but 
> i can't get it to work. 
> This is how Nutch gets the parser:
> Parser parser = tikaConfig.getParser(MediaType.parse(mimeType));
> When no custom config is specified config is:
> new TikaConfig(this.getClass().getClassLoader());
> When i specify a custom config, it is:
> tikaConfig = new TikaConfig(conf.getResource(customConfFile));
> getParser always returns null with a custom config file. There are no errors 
> or exceptions. The config is fine, it fixed the encoding problem in a parser 
> outside of Nutch (thanks again Timothy) but i need to get it to work in Nutch 
> too.
> Our external project does:
> AutoDetectParser parser = new AutoDetectParser(tikaConfig); parser.parse(..);
> and it just works! If i do this in Nutch, however, nothing is passed through 
> the content handlers, the parser result is completely empty?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to