[
https://issues.apache.org/jira/browse/TIKA-1358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15345031#comment-15345031
]
Tim Allison commented on TIKA-1358:
-----------------------------------
Before I forget...I figured out that we can tell the difference between a
Numbers file and a Pages file by examining the message types in the
Document.iwa. If there's a 2 (TN::SheetArchive), it is a Numbers file; if
there's a 10000 (TP::DocumentArchive), it is a Pages file. Yet again,
[obriensp|https://github.com/obriensp/iWorkFileFormat/blob/master/iWorkFileInspector/iWorkFileInspector/Persistence/IWMessageTypeRegistry.mm]
is extremely helpful.
I'll add code for this in our detector once we integrate iwana fully.
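Roughly, the check could look like the sketch below. It assumes the message type ids have already been read out of Document.iwa (that parsing is not shown), and the class, enum, and method names are made up for illustration; none of this is existing Tika or iwana API. Only the two ids, 2 and 10000, come from the registry linked above.

```java
import java.util.Set;

// Hypothetical helper, not part of Tika: classifies an iWork 2013+ document
// from the message type ids found in its Document.iwa.
public class IWorkTypeSketch {

    public enum IWorkKind { NUMBERS, PAGES, UNKNOWN }

    // Message type ids as listed in obriensp's IWMessageTypeRegistry.
    public static final int TN_SHEET_ARCHIVE = 2;        // TN::SheetArchive
    public static final int TP_DOCUMENT_ARCHIVE = 10000; // TP::DocumentArchive

    // messageTypes is assumed to hold every message type id seen in Document.iwa.
    public static IWorkKind classify(Set<Integer> messageTypes) {
        if (messageTypes.contains(TN_SHEET_ARCHIVE)) {
            return IWorkKind.NUMBERS;
        }
        if (messageTypes.contains(TP_DOCUMENT_ARCHIVE)) {
            return IWorkKind.PAGES;
        }
        // Neither marker present: leave the decision to other detection logic.
        return IWorkKind.UNKNOWN;
    }
}
```

Anything without either marker comes back as UNKNOWN, so the detector can fall through to whatever it does today.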
> Add support for newer iWork file formats
> ----------------------------------------
>
> Key: TIKA-1358
> URL: https://issues.apache.org/jira/browse/TIKA-1358
> Project: Tika
> Issue Type: Wish
> Components: parser
> Affects Versions: 1.5
> Reporter: Jelle Kastelein
> Labels: new-parser, newbie
> Attachments: budget.txt, connors_20040127.txt,
> iwork13-testdocs-zips.zip, iwork13-testfiles-2014-11.zip, pages.txt
>
>
> iWork 2013 uses a revised file format that replaces the XML files holding
> the content with .iwa files (a binary format). This file format is becoming
> increasingly relevant as more and more people use Apple products.
> However, it does not appear to work with the current IWorkPackageParser
> (tested with several of the example .pages files one can get from
> iCloud).