[ 
https://issues.apache.org/jira/browse/TIKA-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15866182#comment-15866182
 ] 

Mike Rodent commented on TIKA-2264:
-----------------------------------

Thanks for the reply.  I'm going upload the unit test class for this and also 
the .ODT file I was testing with (although not used with the *unit* tests).

Please bear in mind that I'm also something of a newb when it comes to testing 
and mocking! This makes no claim to be good or comprehensive or meeting 
professional standards!  But maybe you can use my offerings to produce 
something "proper".

> Better handling of footnotes/endnotes for ODF files
> ---------------------------------------------------
>
>                 Key: TIKA-2264
>                 URL: https://issues.apache.org/jira/browse/TIKA-2264
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.14
>         Environment: N/A
>            Reporter: Mike Rodent
>            Priority: Minor
>              Labels: newbie
>         Attachments: ImprovedODFContentParser.java, 
> _ImprovedODFContentParserUTest.java, test.odt
>
>
> Springs from my question here 
> (http://stackoverflow.com/questions/42031237/modify-apache-tika-parsing-of-old-1997-2003-ms-word-docs)
>  ... I have improved the class OpenDocumentContentParser so that it puts 
> footnotes/endnotes at the end of the line to which they belong and doesn't 
> break up the line in question.  As with .docx parsing the notes can be linked 
> to the reference easily.  The respondee in Stack Overflow suggested I open an 
> issue here... 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to