[ https://issues.apache.org/jira/browse/TIKA-935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Mastarone updated TIKA-935:
--------------------------------
Attachment: ArParserTest.java
TIKA-935.patch
I have uploaded a patch that corrects the error in *.ar file detection, along
with a new unit test class that uses the existing .ar files in the
test-documents folder. With this patch applied, parsing succeeds against the
latest build, and the unit tests pass.
> TikaException thrown when trying to parse archive (*.ar) files
> --------------------------------------------------------------
>
> Key: TIKA-935
> URL: https://issues.apache.org/jira/browse/TIKA-935
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.2
> Environment: Windows 7
> Reporter: John Mastarone
> Attachments: ArParserTest.java, TIKA-935.patch
>
>
> A TikaException is thrown when dropping either of the two .ar files from
> the parsers' test-documents folder into Tika-GUI. Judging from the magic
> entries listed here:
> http://stuff.mit.edu/afs/athena/software/cygwin/cygwin_v1.3.2/usr/share/magic.mime
> archive detection is not done correctly for these files in the
> PackageExtractor class, so a TarArchiveInputStream is incorrectly chosen
> by default.
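
For illustration only, here is a minimal sketch of the kind of signature check
that tells an ar archive apart from a tar archive. It relies on one known fact:
an ar file begins with the eight ASCII bytes "!<arch>" followed by a newline.
The class name ArMagicSketch and the method looksLikeAr are hypothetical, and
this is not the code in TIKA-935.patch or in PackageExtractor; it only shows
how the first bytes could be inspected before falling back to tar.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

public class ArMagicSketch {
    // Global header that starts every ar archive: "!<arch>" plus a newline.
    private static final byte[] AR_MAGIC =
            "!<arch>\n".getBytes(StandardCharsets.US_ASCII);

    /**
     * Hypothetical helper: returns true if the stream starts with the ar
     * signature. The stream must support mark/reset; it is rewound before
     * returning so a real parser can still read it from the beginning.
     */
    public static boolean looksLikeAr(InputStream in) throws IOException {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        in.mark(AR_MAGIC.length);
        try {
            byte[] head = new byte[AR_MAGIC.length];
            int total = 0;
            // read() may return fewer bytes than requested, so loop until
            // the buffer is full or the stream ends.
            while (total < head.length) {
                int n = in.read(head, total, head.length - total);
                if (n < 0) {
                    break;
                }
                total += n;
            }
            return total == AR_MAGIC.length && Arrays.equals(head, AR_MAGIC);
        } finally {
            in.reset();
        }
    }

    public static void main(String[] args) throws IOException {
        // Synthetic inputs, not the real test-documents files.
        byte[] arLike = "!<arch>\nmember data".getBytes(StandardCharsets.US_ASCII);
        byte[] other = "hello.txt".getBytes(StandardCharsets.US_ASCII);

        System.out.println(looksLikeAr(new ByteArrayInputStream(arLike)));
        System.out.println(looksLikeAr(new ByteArrayInputStream(other)));
    }
}
```

A caller would run a check like this first and only treat the input as tar when
it fails, rather than defaulting to TarArchiveInputStream.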