Karl Buchta created TIKA-2433:
---------------------------------
Summary: Tika 1.16 - Nullpointer Exception after update - Asking
for help
Key: TIKA-2433
URL: https://issues.apache.org/jira/browse/TIKA-2433
Project: Tika
Issue Type: Bug
Components: cli
Affects Versions: 1.16
Environment: Docker - Debian Stretch - Oracle Java
+Installation in Dockerfile+
{noformat}
ENV TIKA_VERSION 1.16
# also see https://github.com/LogicalSpark/docker-tikaserver/blob/master/Dockerfile
RUN mkdir -p /opt/tika && cd /opt/tika \
    && curl --fail http://www-eu.apache.org/dist/tika/tika-app-${TIKA_VERSION}.jar -o tika.jar \
    && curl --fail http://www-eu.apache.org/dist/tika/tika-server-${TIKA_VERSION}.jar -o tika-server.jar \
    && apt-get install -y tesseract-ocr tesseract-ocr-eng tesseract-ocr-ita \
       tesseract-ocr-fra tesseract-ocr-spa tesseract-ocr-deu gdal-bin
{noformat}
+Tika.xml+
{noformat}
<?xml version="1.0" encoding="UTF-8"?>
<properties>
  <parsers>
    <parser class="org.apache.tika.parser.DefaultParser">
      <parser-exclude class="org.apache.tika.parser.ocr.TesseractOCRParser"/>
    </parser>
  </parsers>
</properties>
{noformat}
Reporter: Karl Buchta
Hi,
I would like to kindly ask for help. We had to update to the latest Tika
release, 1.16. I have no experience with Tika so far; I am just maintaining
the configuration and application from another developer.
Version 1.15 worked fine for us, but now I see the following error
(office is the name of our Docker container, hence the prefix in this output):
https://github.com/apache/tika/blob/1.16/tika-app/src/main/java/org/apache/tika/cli/TikaCLI.java#L202
{noformat}
office | java.lang.NullPointerException
office |     at org.apache.tika.cli.TikaCLI$OutputType.process(TikaCLI.java:202)
office |     at org.apache.tika.cli.TikaCLI$TikaServer$1.run(TikaCLI.java:1153)
{noformat}
I checked the source on GitHub and saw that this part of the code was changed
in one of the last commits before the 1.16 release (see link above).
I also checked the change log at https://tika.apache.org/1.16/index.html. As I
have not used Tika before and cannot see from the release notes that the CLI
requirements changed, I would like to ask whether they did anyway.
Do you have any hints on where to start? Could this be due to improper CLI
usage, or do you think a Java package or dependency is missing?
It is hard for me to say, as the CLI commands are automated and distributed
over several layers and configuration files in the application stack, hence I
am asking for a hint.
Thanks for any advice. Best, Karl.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)