[ https://issues.apache.org/jira/browse/TIKA-1966?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15270658#comment-15270658 ]
Hudson commented on TIKA-1966:
------------------------------
SUCCESS: Integrated in tika-trunk-jdk1.7 #977 (See
[https://builds.apache.org/job/tika-trunk-jdk1.7/977/])
TIKA-1966 Converted versions of test iWorks files from latest iWorks for (nick:
rev c93ff3e1cc8d1af1e925a3911949405d0084a507)
* tika-parsers/src/test/resources/test-documents/testKeynoteNew.key
* tika-parsers/src/test/resources/test-documents/testNumbersNew.numbers
* tika-parsers/src/test/resources/test-documents/testPagesNew.pages
> Issue in parsing iWorksDocument with Apache Tika
> ------------------------------------------------
>
> Key: TIKA-1966
> URL: https://issues.apache.org/jira/browse/TIKA-1966
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.12
> Environment: Ubuntu 15
> Reporter: Sachin Shaju
> Attachments: budget.numbers, connors_20040127.key, pages.pages,
> sample code
>
>
> I was trying to parse an iWork document with Apache Tika, but instead of the
> document's parsed content I am getting some other output from the content
> handler. The code snippet that I used is attached.
> Output:
> Contents of the file :
> Index/Document.iwa
> Index/ViewState.iwa
> Index/CalculationEngine.iwa
> Index/Tables/HeaderStorageBucket-2.iwa
> Index/Tables/Tile.iwa
> Index/Metadata.iwa
> Metadata/Properties.plist
> I'm able to detect the file type correctly using the Detector API, but I'm not
> getting the useful content out of the document.
> I'm attaching the iWork documents that I tested with (made with the latest
> version of iOS). Parsing worked when I tested with older versions. Thanks
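
The entry names in the reported output (Index/*.iwa plus Metadata/Properties.plist) suggest that these files use a different package layout from the older iWork files that parsed correctly. The following is a minimal sketch, using only java.util.zip, of how one might tell the two layouts apart before handing a file to a parser. The class name IWorkLayoutCheck, its methods, and the Layout enum are hypothetical and are not part of Tika. The legacy entry names index.xml and index.apxl are an assumption about the older format, which this issue does not describe.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, not a Tika class: classifies an iWork zip package
// by looking only at its entry names.
final class IWorkLayoutCheck {

    enum Layout { LEGACY_XML, IWA, UNKNOWN }

    // Scans the zip entry names and reports which layout the package appears to use.
    static Layout classify(byte[] zipBytes) {
        boolean sawIwa = false;
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                // Assumption: older iWork packages keep their content in one of these entries.
                if (name.equals("index.xml") || name.equals("index.apxl")) {
                    return Layout.LEGACY_XML;
                }
                // Entries such as Index/Document.iwa, as listed in the reported output.
                if (name.startsWith("Index/") && name.endsWith(".iwa")) {
                    sawIwa = true;
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sawIwa ? Layout.IWA : Layout.UNKNOWN;
    }

    // Builds an in-memory zip with empty entries, so the sketch runs without sample files.
    static byte[] zipOf(String... names) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (String name : names) {
                zip.putNextEntry(new ZipEntry(name));
                zip.closeEntry();
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        // Entry names taken from the output quoted in the issue.
        byte[] reported = zipOf(
                "Index/Document.iwa",
                "Index/ViewState.iwa",
                "Index/Metadata.iwa",
                "Metadata/Properties.plist");
        System.out.println(classify(reported));
    }
}
```

A check like this only inspects entry names. It does not decode the .iwa content itself, so it can explain why a parser falls back to listing entries but does not extract any text.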