Mike Rodent created TIKA-2265:
---------------------------------
Summary: Problem with footnotes/endnotes in Tika.parseToString
with MS Word (.docx) files
Key: TIKA-2265
URL: https://issues.apache.org/jira/browse/TIKA-2265
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 1.14
Environment: N/A
Reporter: Mike Rodent
Priority: Minor
It seems to be the case that a footnote numbered "1" in the real document will
be outputted by Tika.parseToString() as "2" in the footnote reference, and "2"
in the corresponding footnote body text.... real footnote "2" becomes "3", "3"
becomes "4", etc. Have not yet looked at source code ... I can't imagine it
would be difficult to correct this.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)