[ 
https://issues.apache.org/jira/browse/TIKA-3255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Peter Kronenberg updated TIKA-3255:
-----------------------------------
    Description: 
I got the following exception with the attached MP3 file:

 
{code:java}
Exception in thread "main" org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.mp3.Mp3Parser@152aa092
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:159)
        at org.torchai.ReadFile.autoDetect(ReadFile.java:33)
        at org.torchai.ReadFile.main(ReadFile.java:40)
Caused by: java.io.IOException: Record size (2790678 bytes) is larger than the allowed record size: 1000000
        at org.apache.tika.parser.mp3.ID3v2Frame.readFully(ID3v2Frame.java:186)
        at org.apache.tika.parser.mp3.ID3v2Frame.<init>(ID3v2Frame.java:138)
        at org.apache.tika.parser.mp3.ID3v2Frame.createFrameIfPresent(ID3v2Frame.java:91)
        at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:188)
        at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:70)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        ... 5 more
{code}
This is a perfectly valid MP3 file. It seems that the code has a hard-coded 
limit of 1000000 bytes on the record size, and this file contains a record of 
2790678 bytes.
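
To illustrate the kind of check that appears to be failing, here is a minimal sketch. It is not Tika's actual code: `BoundedRecordReader`, `readRecord`, and `MAX_RECORD_SIZE` are hypothetical names, and only the message text and the two sizes are taken from the exception above. It assumes the limit is a simple comparison made before the record is read.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

class BoundedRecordReader {
    // Hypothetical fixed cap, set to the value reported in the exception message.
    static final int MAX_RECORD_SIZE = 1_000_000;

    // Reads exactly `length` bytes, refusing any record larger than `maxSize`.
    static byte[] readRecord(InputStream in, int length, int maxSize) throws IOException {
        if (length > maxSize) {
            throw new IOException("Record size (" + length
                    + " bytes) is larger than the allowed record size: " + maxSize);
        }
        byte[] buffer = new byte[length];
        int offset = 0;
        while (offset < length) {
            // read() may return fewer bytes than requested, so loop until done.
            int read = in.read(buffer, offset, length - offset);
            if (read == -1) {
                throw new IOException("Stream ended after " + offset + " of " + length + " bytes");
            }
            offset += read;
        }
        return buffer;
    }

    public static void main(String[] args) throws IOException {
        byte[] data = new byte[2_790_678]; // same size as the record in this report

        // With the fixed cap, the record is rejected before any bytes are read.
        try {
            readRecord(new ByteArrayInputStream(data), data.length, MAX_RECORD_SIZE);
        } catch (IOException e) {
            System.out.println("Fixed cap: " + e.getMessage());
        }

        // With a caller-supplied, larger cap the same record is read in full.
        byte[] record = readRecord(new ByteArrayInputStream(data), data.length, 10_000_000);
        System.out.println("Raised cap: read " + record.length + " bytes");
    }
}
```

Passing the cap as a parameter instead of a constant is one possible way to let callers handle files with large records, such as embedded artwork, while still guarding against corrupt size fields.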

 

Here is the code I'm running
{code:java}
Exception in thread "main" org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.mp3.Mp3Parser@152aa092
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
        at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:159)
        at org.torchai.ReadFile.autoDetect(ReadFile.java:33)
        at org.torchai.ReadFile.main(ReadFile.java:40)
Caused by: java.io.IOException: Record size (2790678 bytes) is larger than the allowed record size: 1000000
        at org.apache.tika.parser.mp3.ID3v2Frame.readFully(ID3v2Frame.java:186)
        at org.apache.tika.parser.mp3.ID3v2Frame.<init>(ID3v2Frame.java:138)
        at org.apache.tika.parser.mp3.ID3v2Frame.createFrameIfPresent(ID3v2Frame.java:91)
        at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:188)
        at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:70)
        at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
        ... 5 more
{code}
 

> Parsing MP3 file with record > 100000
> -------------------------------------
>
>                 Key: TIKA-3255
>                 URL: https://issues.apache.org/jira/browse/TIKA-3255
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Peter Kronenberg
>            Priority: Major
>         Attachments: sample-a.mp3



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
