Walter Kasper created STANBOL-770:
-------------------------------------

             Summary: Wrong changes of  the structure of HTML5 docs by Tidy 
based HtmlParser
                 Key: STANBOL-770
                 URL: https://issues.apache.org/jira/browse/STANBOL-770
             Project: Stanbol
          Issue Type: Bug
          Components: Engine - HtmlExtractor
            Reporter: Walter Kasper
            Assignee: Walter Kasper


JTidy handles HTML5 docs wrong:
- it eliminates new, "unknown" HTML5 elements
- the changed behaviour of META tags is handled incorrectly by moving them to 
HEAD section, creating wrong scopes for Microdata elements

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to