Tim Allison created TIKA-4728:
---------------------------------

             Summary: Validate xhtml output, generally
                 Key: TIKA-4728
                 URL: https://issues.apache.org/jira/browse/TIKA-4728
             Project: Tika
          Issue Type: Improvement
            Reporter: Tim Allison


There's a bug in the xml output that we're writing for specific js attached in 
a specific way in PDFs. We should fix that, but we should add more general, 
more robust testing that we can actually parse our xhtml.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to