Mike Rodent created TIKA-2265:
---------------------------------

             Summary: Problem with footnotes/endnotes in Tika.parseToString 
with MS Word (.docx) files
                 Key: TIKA-2265
                 URL: https://issues.apache.org/jira/browse/TIKA-2265
             Project: Tika
          Issue Type: Improvement
          Components: parser
    Affects Versions: 1.14
         Environment: N/A
            Reporter: Mike Rodent
            Priority: Minor


It appears that a footnote numbered "1" in the original document is output by 
Tika.parseToString() as "2", both in the footnote reference and in the 
corresponding footnote body text. Likewise, real footnote "2" becomes "3", "3" 
becomes "4", and so on. I have not yet looked at the source code, but I can't 
imagine this would be difficult to correct.
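
A minimal sketch of how the off-by-one numbering could be checked once the 
text has been extracted. It uses only the Java standard library. The marker 
format "[footnoteRef:N]", the class name FootnoteNumberCheck, and the sample 
string are assumptions made for illustration, not confirmed Tika output; in a 
real check the input would be the string returned by Tika.parseToString().

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class FootnoteNumberCheck {
    // ASSUMPTION: footnote references show up in the extracted text as
    // "[footnoteRef:N]". Adjust the pattern to match what your Tika version emits.
    private static final Pattern MARKER = Pattern.compile("\\[footnoteRef:(\\d+)\\]");

    /** Returns the footnote numbers in the order they appear in the text. */
    static List<Integer> markerNumbers(String extractedText) {
        List<Integer> numbers = new ArrayList<>();
        Matcher m = MARKER.matcher(extractedText);
        while (m.find()) {
            numbers.add(Integer.parseInt(m.group(1)));
        }
        return numbers;
    }

    /**
     * True when the markers are numbered 1, 2, 3, ... in order of appearance.
     * Text without any markers also returns true.
     */
    static boolean numberedFromOne(String extractedText) {
        List<Integer> numbers = markerNumbers(extractedText);
        for (int i = 0; i < numbers.size(); i++) {
            if (numbers.get(i) != i + 1) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        // Hypothetical sample mimicking the reported behaviour (numbers shifted by one).
        // In practice this string would come from Tika.parseToString(...).
        String reported = "First claim [footnoteRef:2] and second claim [footnoteRef:3].";
        System.out.println(markerNumbers(reported));
        System.out.println(numberedFromOne(reported));
    }
}
```

For the sample string, which mimics the shifted numbering described above, 
numberedFromOne returns false; the same text with markers 1 and 2 would 
return true.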



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
