Let’s roll it….
From: Tim Allison <[email protected]> Reply-To: "[email protected]" <[email protected]> Date: Wednesday, September 19, 2018 at 12:14 PM To: "[email protected]" <[email protected]> Subject: 1.19.1? The mp3 regression is bad. In hindsight, the Tika-eval reports were fairly clear on this but I did some self-hand-waving to excuse away the numbers...I shouldn’t have. I want to add some new reports to tika-eval so that this never happens again. How long should we wait for 1.19.1 or 1.20? Best, Tim On Wed, Sep 19, 2018 at 2:29 PM Hudson (JIRA) <[email protected]> wrote: [ https://issues.apache.org/jira/browse/TIKA-2730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16621008#comment-16621008 ] Hudson commented on TIKA-2730: ------------------------------ SUCCESS: Integrated in Jenkins build tika-branch-1x #94 (See [ https://builds.apache.org/job/tika-branch-1x/94/]) TIKA-2730 -- allow last frame to be truncated w/o throwing an EOF (tallison: [ https://github.com/apache/tika/commit/80cfd6d4a4270f8f3697c6dc083b3dedfc36c86a ]) * (edit) tika-parsers/src/main/java/org/apache/tika/parser/mp3/MpegStream.java * (edit) tika-parsers/src/test/java/org/apache/tika/parser/mp3/Mp3ParserTest.java * (add) tika-parsers/src/test/resources/test-documents/testMP3i18n_truncated.mp3 * (edit) tika-parsers/src/main/java/org/apache/tika/parser/mp3/Mp3Parser.java > parseToString fails for a simple mp3 > ------------------------------------ > > Key: TIKA-2730 > URL: https://issues.apache.org/jira/browse/TIKA-2730 > Project: Tika > Issue Type: Bug > Affects Versions: 1.19 > Reporter: Boris Petrov > Assignee: Tim Allison > Priority: Major > Fix For: 2.0.0, 1.20 > > Attachments: demo.mp3 > > > This is a regression from 1.18. I've attached the mp3 that fails. The exception I get is: > {noformat} > org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.mp3.Mp3Parser@cefe6c6 > at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:286) > at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) > at org.apache.tika.Tika.parseToString(Tika.java:527) > at com.company.TextExtractor.getText(TextExtractor.java:39) > Caused by: > java.io.EOFException: EOF: tried to skip 361 but could only skip 247 > at org.apache.tika.parser.mp3.MpegStream.skipFrame(MpegStream.java:166) > at org.apache.tika.parser.mp3.Mp3Parser.getAllTagHandlers(Mp3Parser.java:204) > at org.apache.tika.parser.mp3.Mp3Parser.parse(Mp3Parser.java:71) > at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) > ... 5 more{noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)
