[
https://issues.apache.org/jira/browse/TIKA-1867?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16005013#comment-16005013
]
Daniel Conn commented on TIKA-1867:
-----------------------------------
Hi [~gagravarr],
After following the link earlier, I tried to exclude this parser, but Tika still
seems to call it, which in turn tries to call ffmpeg and exiftool because of
the tika-external-parsers.xml file. I too am looking for a solution to this.
Perhaps Tika could check once at startup whether these programs exist and cache
the result, instead of checking for the same program many times? Or a TikaConfig
constructor that allows you to explicitly remove parsers? These are just ideas
though!
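
To make the "check once and cache" idea concrete, here is a minimal sketch in
plain Java. ExternalCommandCache is a hypothetical helper, not an existing Tika
class, and this is not how Tika currently performs its checks; it only shows
that each probe command would be run at most once per JVM. It assumes Java 9 or
later for ProcessBuilder.Redirect.DISCARD.

```java
import java.io.IOException;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.TimeUnit;

/** Hypothetical helper (not part of Tika): probes each external command at most once. */
final class ExternalCommandCache {
    // Maps the full probe command line to the result of its first probe.
    private static final Map<String, Boolean> CACHE = new ConcurrentHashMap<>();

    private ExternalCommandCache() {}

    /** Returns the cached answer; the process is only started on the first call per command. */
    static boolean isAvailable(String... probeCommand) {
        String key = String.join(" ", probeCommand);
        return CACHE.computeIfAbsent(key, k -> probe(probeCommand));
    }

    private static boolean probe(String... command) {
        try {
            Process p = new ProcessBuilder(command)
                    .redirectErrorStream(true)
                    .redirectOutput(ProcessBuilder.Redirect.DISCARD)
                    .start();
            if (!p.waitFor(5, TimeUnit.SECONDS)) {
                p.destroyForcibly(); // treat a hanging probe as "not available"
                return false;
            }
            return true; // the process started and exited, so the binary exists
        } catch (IOException e) {
            return false; // command not found or not executable
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }
}
```

With something like this, repeated calls such as
ExternalCommandCache.isAvailable("ffmpeg", "-version") or
ExternalCommandCache.isAvailable("exiftool", "-ver") would only pay the
process start-up cost the first time.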
Just in case I've got the wrong end of the stick, here is what I put in the
config file. Could you kindly confirm whether this is correct, or tell me where
I'm going wrong:
<?xml version="1.0" encoding="UTF-8"?>
<properties>
  <parsers>
    <parser class="org.apache.tika.parser.DefaultParser">
      <parser-exclude class="org.apache.tika.parser.external.CompositeExternalParser"/>
    </parser>
  </parsers>
</properties>
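
As a quick sanity check independent of Tika, a config like the one above can be
parsed with the JDK's own XML parser to confirm that it is well-formed and that
the parser-exclude entry is read as intended. This is only a sketch:
ConfigExcludeCheck is a hypothetical name, and the check says nothing about
whether Tika actually honours the exclusion.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

/** Hypothetical helper: lists the class names found in parser-exclude elements. */
final class ConfigExcludeCheck {
    private ConfigExcludeCheck() {}

    static List<String> excludedParsers(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
            NodeList nodes = doc.getElementsByTagName("parser-exclude");
            List<String> classes = new ArrayList<>();
            for (int i = 0; i < nodes.getLength(); i++) {
                classes.add(((Element) nodes.item(i)).getAttribute("class"));
            }
            return classes;
        } catch (Exception e) {
            // Malformed XML (or a parser setup problem) ends up here.
            throw new IllegalArgumentException("Config is not well-formed XML", e);
        }
    }
}
```

For the config above, this should report a single excluded class,
org.apache.tika.parser.external.CompositeExternalParser.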
Thanks and Kind Regards
Dan
> Tika external parsers cannot be turned off without patching the
> tika-app-XX.jar
> -------------------------------------------------------------------------------
>
> Key: TIKA-1867
> URL: https://issues.apache.org/jira/browse/TIKA-1867
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.11
> Reporter: Roman Kratochvil
>
> The CompositeExternalParser calls ExternalParsersFactory.create() which
> always uses configuration from
> org/apache/tika/parser/external/tika-external-parsers.xml. The issue is that
> this introduces a performance regression, as the parser initialization checks
> for the presence of external commands (ffmpeg, exiftool), and that takes time.
> Unfortunately, there is no way to turn off this functionality without
> patching the tika-app JAR -- one has to either change the
> tika-external-parsers.xml or remove the whole CompositeExternalParser from
> list of services in /META-INF/services/org.apache.tika.parser.Parser.