Tim Allison created TIKA-2310:
---------------------------------
Summary: Try to order chapters in epub correctly
Key: TIKA-2310
URL: https://issues.apache.org/jira/browse/TIKA-2310
Project: Tika
Issue Type: Bug
Reporter: Tim Allison
Priority: Minor
[~johanvanderknijff] recently pointed out on twitter that our Epub parser
doesn't handle chapters in the right order. We should try to fix our parser so
that the output is in the correct order.
Epub is new to me, but it looks like we can scrape the order out of
content.opf(?).
This would require dumping the stream to a ZipFile for direct access to zip
entries, but we require that of ooxml...
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)