Bharath created TIKA-4528:
-----------------------------
Summary: Animated AVIF images are detected as video/quicktime
Key: TIKA-4528
URL: https://issues.apache.org/jira/browse/TIKA-4528
Project: Tika
Issue Type: Bug
Affects Versions: 3.2.3
Environment: Java version:
{code:java}
openjdk version "24.0.1" 2025-04-15
OpenJDK Runtime Environment Temurin-24.0.1+9 (build 24.0.1+9)
OpenJDK 64-Bit Server VM Temurin-24.0.1+9 (build 24.0.1+9, mixed mode, sharing)
{code}
macOS 15.6.1
Reporter: Bharath
Hi,
It seems the Tika detector returns {{video/quicktime}} for animated AVIF
images.
This is the output using Tika App:
{code:java}
Content-Length: 6848
Content-Type: video/quicktime
X-TIKA:EXCEPTION:warn: Box size too small.
X-TIKA:EXCEPTION:warn: Unable to skip. Requested 1751411818 bytes but only 6780 remained.
X-TIKA:Parsed-By: org.apache.tika.parser.DefaultParser
X-TIKA:Parsed-By: org.apache.tika.parser.mp4.MP4Parser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.DefaultParser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.mp4.MP4Parser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.EmptyParser
X-TIKA:digest:MD5: e2da1cfa3a1e34a70ea43ad70fa8b58f
X-TIKA:digest:SHA256: b1e67d07a042376be37147051b3440ba1a545e433873a2a055dc894736411ec1
resourceName: animation2.avif
{code}
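One possible reading of the "Unable to skip" warning, offered as a guess rather than a confirmed diagnosis: the requested size plus 8 (the length of a basic ISO BMFF box header) is 0x68646C72, which spells the ASCII four-character code {{hdlr}}. That would be consistent with a box type being read as a box size somewhere in the MP4 parsing path. The small check below only verifies the arithmetic. The class and method names are made up for this illustration and are not part of Tika.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

public class SkipSizeCheck {

    // Interprets a 32-bit big-endian integer as four ASCII characters.
    static String asFourCc(int value) {
        byte[] bytes = ByteBuffer.allocate(4).putInt(value).array();
        return new String(bytes, StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) {
        int requested = 1751411818; // value taken from the warning above
        int boxHeaderLength = 8;    // assumption: 4-byte size + 4-byte type
        System.out.println(asFourCc(requested + boxHeaderLength)); // prints "hdlr"
    }
}
```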
I would expect it to return {{image/avif}} instead.
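To illustrate the expected behaviour, here is a minimal standalone sketch (not Tika code) that reads the major brand from the leading {{ftyp}} box and maps the AVIF brands to {{image/avif}}. It assumes the file starts with an {{ftyp}} box and that animated AVIF files carry the {{avis}} (image sequence) major brand. I have not confirmed which brand animation2.avif actually uses. The class and method names are hypothetical.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

public class FtypBrand {

    // Returns the major brand of a leading ftyp box, or null if the
    // stream is too short or does not start with an ftyp box.
    static String majorBrand(InputStream in) throws IOException {
        byte[] header = in.readNBytes(12); // size (4) + type (4) + major brand (4)
        if (header.length < 12) {
            return null;
        }
        String type = new String(header, 4, 4, StandardCharsets.US_ASCII);
        if (!"ftyp".equals(type)) {
            return null;
        }
        return new String(header, 8, 4, StandardCharsets.US_ASCII);
    }

    // Assumption: both the still-image brand (avif) and the
    // image-sequence brand (avis) should map to image/avif.
    static String guessMediaType(String brand) {
        if ("avif".equals(brand) || "avis".equals(brand)) {
            return "image/avif";
        }
        return null; // unknown brand: leave the decision to other detectors
    }

    public static void main(String[] args) throws IOException {
        try (InputStream in = Files.newInputStream(Path.of(args[0]))) {
            String brand = majorBrand(in);
            System.out.println(brand + " -> " + guessMediaType(brand));
        }
    }
}
```

Running it as {{java FtypBrand.java animation2.avif}} would show which major brand the file declares, which may help decide whether the detection rules need an extra brand entry.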
--
This message was sent by Atlassian Jira
(v8.20.10#820010)