[ https://issues.apache.org/jira/browse/TIKA-4528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Bharath updated TIKA-4528:
--------------------------
    Description: 
Hi Team,

There seems to be an issue with the latest version of Apache Tika, *3.2.3*, when 
parsing older PDF versions (below 1.5). The same file parses correctly in 
version {*}3.2.1{*}.

Thanks

  was:
Hi,

It seems the Tika detector is returning {{video/quicktime}} for animated AVIF 
images.

This is the output using Tika App:

 
{code:java}
Content-Length: 6848
Content-Type: video/quicktime
X-TIKA:EXCEPTION:warn: Box size too small.
X-TIKA:EXCEPTION:warn: Unable to skip. Requested 1751411818 bytes but only 6780 remained.
X-TIKA:Parsed-By: org.apache.tika.parser.DefaultParser
X-TIKA:Parsed-By: org.apache.tika.parser.mp4.MP4Parser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.DefaultParser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.mp4.MP4Parser
X-TIKA:Parsed-By-Full-Set: org.apache.tika.parser.EmptyParser
X-TIKA:digest:MD5: e2da1cfa3a1e34a70ea43ad70fa8b58f
X-TIKA:digest:SHA256: b1e67d07a042376be37147051b3440ba1a545e433873a2a055dc894736411ec1
resourceName: animation2.avif {code}
 

I would expect it to return {{image/avif}} instead.


> Older version PDF parsing is failing
> ------------------------------------
>
>                 Key: TIKA-4528
>                 URL: https://issues.apache.org/jira/browse/TIKA-4528
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 3.2.3
>         Environment: Java version:
> {code:java}
> openjdk version "24.0.1" 2025-04-15
> OpenJDK Runtime Environment Temurin-24.0.1+9 (build 24.0.1+9)
> OpenJDK 64-Bit Server VM Temurin-24.0.1+9 (build 24.0.1+9, mixed mode, sharing)
> {code}
> macOS 15.6.1
>            Reporter: Bharath
>            Priority: Major
>
> Hi Team,
> There seems to be an issue with the latest version of Apache Tika, *3.2.3*, when 
> parsing older PDF versions (below 1.5). The same file parses correctly in 
> version {*}3.2.1{*}.
> Thanks



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
