[
https://issues.apache.org/jira/browse/TIKA-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17393200#comment-17393200
]
Tim Allison edited comment on TIKA-2242 at 8/4/21, 2:20 PM:
------------------------------------------------------------
Annotations are now fixed.
I added a test to run the xhtml output of the ODTParser through an XML parser.
This turned up two other bugs: we weren't closing style tags at the end of
headers or <a> elements.
This is all fixed now locally. When I get a clean build, I'll push to main.
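
For anyone following along, here is a rough sketch of that kind of well-formedness check. It is not the actual test in Tika: the class and method names (WellFormedCheck, isWellFormed) are hypothetical, and it takes the XHTML as a plain string rather than calling the ODT parser, so it only uses the JDK's built-in SAX parser.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParser;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of Tika: reports whether a string is well-formed XML.
class WellFormedCheck {

    static boolean isWellFormed(String xhtml) {
        try {
            SAXParserFactory factory = SAXParserFactory.newInstance();
            factory.setNamespaceAware(true);
            SAXParser parser = factory.newSAXParser();
            // DefaultHandler ignores content events and rethrows fatal errors,
            // so an unclosed or mismatched tag surfaces as a SAXException.
            parser.parse(new InputSource(new StringReader(xhtml)), new DefaultHandler());
            return true;
        } catch (SAXException e) {
            // The input is not well-formed XML.
            return false;
        } catch (ParserConfigurationException | IOException e) {
            // Environment problem rather than a problem with the input.
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // Style tag closed before the header ends: well-formed.
        System.out.println(isWellFormed("<h1><b>Title</b></h1>"));
        // Style tag left open at the end of a header: malformed.
        System.out.println(isWellFormed("<h1><b>Title</h1>"));
    }
}
```

In a real test the string would presumably be the XHTML that the parser emits for each test .odt file, and the test would fail on any document for which the check returns false.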
was (Author: [email protected]):
K. That's fixed now.
I added a test to run the xhtml output of the ODTParser through an XML parser.
This turned up two other bugs: we weren't closing style tags at the end of
headers or <a> elements.
This is all fixed now locally. When I get a clean build, I'll push to main.
> opendocument parsing produces malformed xml
> -------------------------------------------
>
> Key: TIKA-2242
> URL: https://issues.apache.org/jira/browse/TIKA-2242
> Project: Tika
> Issue Type: Bug
> Components: handler, parser
> Affects Versions: 1.13, 1.14
> Reporter: Jan Van Raemdonck
> Assignee: Tim Allison
> Priority: Major
> Fix For: 1.15, 2.0.0
>
> Attachments: 2017-01-02-16B833-16B833VANCAUTEREN.odt,
> 2017-02-01-15B96Ghijsens-17B96GHIJSENS.odt, cor-S0BAC21-specimen2.odt,
> out.xhtml
>
>
> For some ODT documents, parsing produces malformed XML.