[ 
https://issues.apache.org/jira/browse/TIKA-2242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17393200#comment-17393200
 ] 

Tim Allison edited comment on TIKA-2242 at 8/4/21, 2:20 PM:
------------------------------------------------------------

Annotations are now fixed.

I added a test to run the xhtml output of the ODTParser through an XML parser.  
This turned up two other bugs: we weren't closing style tags at the end of 
headers or <a> elements.

This is all fixed now locally.  When I get a clean build, I'll push to main.


was (Author: [email protected]):
K. That's fixed now.

I added a test to run the xhtml output of the ODTParser thrown an XML parser.  
This turned up two other bugs: we weren't closing style tags at the end of 
headers or <a> elements.

This is all fixed now locally.  When I get a clean build, I'll push to main.

> opendocument parsing produces malformed xml
> -------------------------------------------
>
>                 Key: TIKA-2242
>                 URL: https://issues.apache.org/jira/browse/TIKA-2242
>             Project: Tika
>          Issue Type: Bug
>          Components: handler, parser
>    Affects Versions: 1.13, 1.14
>            Reporter: Jan Van Raemdonck
>            Assignee: Tim Allison
>            Priority: Major
>             Fix For: 1.15, 2.0.0
>
>         Attachments: 2017-01-02-16B833-16B833VANCAUTEREN.odt, 
> 2017-02-01-15B96Ghijsens-17B96GHIJSENS.odt, cor-S0BAC21-specimen2.odt, 
> out.xhtml
>
>
> For some odt documents, a malformed xml is produced when parsing. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to