[
https://issues.apache.org/jira/browse/TIKA-2264?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15862815#comment-15862815
]
Mike Rodent commented on TIKA-2264:
-----------------------------------
Just one more point:
my inner static class, ImprovedODEMCHandler, to replace final static private
inner class OpenDocumentElementMappingContentHandler, is neither final nor
private. At the moment it is package-private, by default really.
But in fact I can't see why people shouldn't be permitted to use this to
improve things further, so perhaps it should at least be protected. If so, it
would be necessary to change the method parseInternal in some way, so that the
DefaultHandler dh could somehow be set with a different ImprovedODEMCHandler
subclass...
> Better handling of footnotes/endnotes for ODF files
> ---------------------------------------------------
>
> Key: TIKA-2264
> URL: https://issues.apache.org/jira/browse/TIKA-2264
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.14
> Environment: N/A
> Reporter: Mike Rodent
> Priority: Minor
> Labels: newbie
> Attachments: ImprovedODFContentParser.java
>
>
> Springs from my question here
> (http://stackoverflow.com/questions/42031237/modify-apache-tika-parsing-of-old-1997-2003-ms-word-docs)
> ... I have improved the class OpenDocumentContentParser so that it puts
> footnotes/endnotes at the end of the line to which they belong and doesn't
> break up the line in question. As with .docx parsing the notes can be linked
> to the reference easily. The respondee in Stack Overflow suggested I open an
> issue here...
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)