[ 
https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17351067#comment-17351067
 ] 

Tim Allison commented on TIKA-1358:
-----------------------------------

If we find an iWorks library (even an external one, like, you'd have to run it 
on the commandline), and it is Apache 2.0 friendly, I'd be happy to integrate 
that.  Also, if you want iWorks files, I can try to find some for you in 
CommonCrawl.

> Add support for newer iWork file formats
> ----------------------------------------
>
>                 Key: TIKA-1358
>                 URL: https://issues.apache.org/jira/browse/TIKA-1358
>             Project: Tika
>          Issue Type: Wish
>          Components: parser
>    Affects Versions: 1.5
>            Reporter: Jelle Kastelein
>            Priority: Major
>              Labels: new-parser, newbie
>         Attachments: 666.pages, budget.txt, connors_20040127.txt, 
> iwork13-testdocs-zips.zip, iwork13-testfiles-2014-11.zip, pages.txt
>
>
> IWork 2013 uses a revised file format which replaces the xml files that hold 
> the content by .iwa files (a binary format). This file format is becoming 
> increasingly relevant as more and more people are using apple products. 
> However, it does not appear to work with the current IWorkPackageParser 
> (tested with several of the example .pages files one can get from the 
> iCloud). 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to