[ https://issues.apache.org/jira/browse/TIKA-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14279600#comment-14279600 ]

Hudson commented on TIKA-1028:
------------------------------

SUCCESS: Integrated in tika-trunk-jdk1.7 #434 (See [https://builds.apache.org/job/tika-trunk-jdk1.7/434/])

Partial unit test for TIKA-1028 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1652318)
* /tika/trunk/tika-parsers/src/test/java/org/apache/tika/parser/mail/RFC822ParserTest.java

TIKA-1028 If an encrypted attachment is found in a RFC822 email, silently skip it and carry on, so the rest of the email can be processed (may need more work!) (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1652317)
* /tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/mail/MailContentHandler.java

Test rfc822 file with an encrypted zip file attached from TIKA-1028 (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1652316)
* /tika/trunk/tika-parsers/src/test/resources/test-documents/testRFC822_encrypted_zip

TIKA-1028 Have PackageParser report encrypted zips via EncryptedDocumentException rather than commons compress UnsupportedZipFeatureException (nick: http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1652315)
* /tika/trunk/tika-parsers/src/main/java/org/apache/tika/parser/pkg/PackageParser.java
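
For readers who have not followed the commits above, the two behavioural changes can be illustrated roughly as follows. This is only a minimal sketch of the general pattern, not the code that was committed: every class and method name in it (EncryptedAttachmentSketch, UnsupportedFeatureException, EncryptedAttachmentException, readArchive, parseAttachment, parseMessage) is a hypothetical stand-in, and attachments are modelled as plain strings, with an "ENC:" prefix assumed to mark an encrypted archive. It does not use the Tika or commons compress APIs.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class EncryptedAttachmentSketch {

    /** Hypothetical stand-in for a library-level "unsupported feature" error. */
    static class UnsupportedFeatureException extends IOException {
        UnsupportedFeatureException(String message) {
            super(message);
        }
    }

    /** Hypothetical stand-in for a parser-level "document is encrypted" error. */
    static class EncryptedAttachmentException extends Exception {
        EncryptedAttachmentException(Throwable cause) {
            super("attachment is encrypted", cause);
        }
    }

    /** Pretend archive reader: content starting with "ENC:" counts as encrypted (assumption). */
    static String readArchive(String content) throws UnsupportedFeatureException {
        if (content.startsWith("ENC:")) {
            throw new UnsupportedFeatureException("unsupported feature encryption used in entry");
        }
        return "parsed:" + content;
    }

    /** Translate the library-specific error into the parser-level one. */
    static String parseAttachment(String content) throws EncryptedAttachmentException {
        try {
            return readArchive(content);
        } catch (UnsupportedFeatureException e) {
            throw new EncryptedAttachmentException(e);
        }
    }

    /** Skip encrypted attachments and carry on, recording the names that were skipped. */
    static List<String> parseMessage(Map<String, String> attachments, List<String> skipped) {
        List<String> parsed = new ArrayList<>();
        for (Map.Entry<String, String> entry : attachments.entrySet()) {
            try {
                parsed.add(parseAttachment(entry.getValue()));
            } catch (EncryptedAttachmentException e) {
                skipped.add(entry.getKey());
            }
        }
        return parsed;
    }

    public static void main(String[] args) {
        Map<String, String> attachments = new LinkedHashMap<>();
        attachments.put("notes.txt", "hello");
        attachments.put("secret.zip", "ENC:payload");
        attachments.put("more.txt", "world");

        List<String> skipped = new ArrayList<>();
        List<String> parsed = parseMessage(attachments, skipped);
        System.out.println("parsed=" + parsed + " skipped=" + skipped);
    }
}
```

The point of the translation step is that the caller only has to recognise one parser-level exception type instead of a library-specific one; the point of the loop is that a single encrypted attachment no longer aborts processing of the rest of the message.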


> Tika-server quits parsing of rfc-822 document prematurely when it encounters 
> encrypted zip file as attachment.
> --------------------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-1028
>                 URL: https://issues.apache.org/jira/browse/TIKA-1028
>             Project: Tika
>          Issue Type: Bug
>          Components: mime, parser, server
>    Affects Versions: 1.2, 1.3, 1.4, 1.5, 1.6, 1.7
>            Reporter: Juha Haaga
>         Attachments: encrypted-zip.msg
>
>
> The Zip parser in tika-server does not allow a password to be passed in for
> decrypting the zip file, and it does not handle the unsupported feature
> gracefully. The problem occurs when the zip file is attached as part of an
> email document being parsed; the parser gives up and throws an exception:
>
> WARNING: all: Unpacker failed
> org.apache.tika.exception.TikaException: TIKA-198: Illegal IOException from org.apache.tika.parser.pkg.PackageParser@10fcc945
> Caused by: org.apache.commons.compress.archivers.zip.UnsupportedZipFeatureException: unsupported feature encryption used in entry
>
> Instead of returning the successfully parsed components, Tika-server returns
> nothing.
>
> It would be better to return the rest of the parsed document contents, along
> with the untouched offending zip file, in the archive that Tika-server returns
> as its result. Until zip file decryption is added, this would always return
> the untouched zip file; after it is implemented, the untouched zip file should
> still be returned in cases where the wrong password was provided.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
