Walter Kasper created STANBOL-770:
-------------------------------------
Summary: Wrong changes of the structure of HTML5 docs by Tidy
based HtmlParser
Key: STANBOL-770
URL: https://issues.apache.org/jira/browse/STANBOL-770
Project: Stanbol
Issue Type: Bug
Components: Engine - HtmlExtractor
Reporter: Walter Kasper
Assignee: Walter Kasper
JTidy handles HTML5 docs wrong:
- it eliminates new, "unknown" HTML5 elements
- the changed behaviour of META tags is handled incorrectly by moving them to
HEAD section, creating wrong scopes for Microdata elements
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira