[ https://issues.apache.org/jira/browse/TIKA-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16534858#comment-16534858 ]
Hudson commented on TIKA-2675:
------------------------------
SUCCESS: Integrated in Jenkins build Tika-trunk #1516 (See [https://builds.apache.org/job/Tika-trunk/1516/])
TIKA-2675 -- OpenDocumentParser should fail on invalid zip via Sebastian (tallison: [https://github.com/apache/tika/commit/c9a81a400ee10e9342bbfe718d62f0b0d6c7944f])
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/odf/OpenDocumentParser.java
* (add) tika-parsers/src/test/resources/test-documents/testODTnotaZipFile.odt
* (edit) tika-parsers/src/test/java/org/apache/tika/parser/odf/ODFParserTest.java
> OpenDocumentParser should fail on invalid zip files
> ---------------------------------------------------
>
> Key: TIKA-2675
> URL: https://issues.apache.org/jira/browse/TIKA-2675
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.18
> Reporter: Sebastian Nagel
> Assignee: Tim Allison
> Priority: Major
> Fix For: 1.19, 2.0.0
>
>
> The OpenDocumentParser assumes that its input is a zip container. However, if it is
> called on an invalid zip stream from a remote URL (see NUTCH-2603), the
> parser signals success and returns a document with empty content. The
> behavior is different when called on a local file: while the
> [constructor of ZipFile|https://docs.oracle.com/javase/8/docs/api/java/util/zip/ZipFile.html#ZipFile-java.io.File-]
> fails on invalid input, the
> [constructor of ZipInputStream|https://docs.oracle.com/javase/8/docs/api/java/util/zip/ZipInputStream.html#ZipInputStream-java.io.InputStream-]
> accepts any stream without validating it, so the invalid input is silently ignored.
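
The difference described above can be reproduced with the JDK alone. The following is a minimal sketch, not the change committed for TIKA-2675: the class name ZipValidationSketch and both helper methods are hypothetical and exist only for illustration. It feeds the same non-zip bytes to ZipFile (through a temporary file) and to ZipInputStream.

```java
import java.io.ByteArrayInputStream;
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.zip.ZipException;
import java.util.zip.ZipFile;
import java.util.zip.ZipInputStream;

// Hypothetical demo class; not part of Tika.
class ZipValidationSketch {

    /** Counts the entries ZipInputStream can read from the given bytes. */
    static int countStreamEntries(byte[] data) {
        // The constructor itself never inspects the stream.
        try (ZipInputStream zis = new ZipInputStream(new ByteArrayInputStream(data))) {
            int entries = 0;
            // getNextEntry() returns null as soon as no entry header is found.
            while (zis.getNextEntry() != null) {
                entries++;
            }
            return entries;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Returns true if the ZipFile constructor rejects the given bytes. */
    static boolean zipFileRejects(byte[] data) {
        File tmp = null;
        try {
            tmp = File.createTempFile("not-a-zip", ".odt");
            Files.write(tmp.toPath(), data);
            try (ZipFile zf = new ZipFile(tmp)) {
                return false; // opened successfully, so the input is a valid zip
            } catch (ZipException e) {
                return true;  // invalid container detected at construction time
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } finally {
            if (tmp != null) {
                tmp.delete();
            }
        }
    }

    public static void main(String[] args) {
        byte[] notAZip = "this is not a zip file".getBytes(StandardCharsets.UTF_8);
        System.out.println("ZipFile rejects input: " + zipFileRejects(notAZip));
        System.out.println("ZipInputStream entries: " + countStreamEntries(notAZip));
    }
}
```

A stream-based parser that wants to fail on such input could, for example, treat an entry count of zero as an error; whether that matches the committed fix is not stated in this message.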