[
https://issues.apache.org/jira/browse/STANBOL-770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Walter Kasper resolved STANBOL-770.
-----------------------------------
Resolution: Fixed
Resolved by replacing JTidy Html parser by JSoup parser
> Wrong changes of the structure of HTML5 docs by Tidy based HtmlParser
> ----------------------------------------------------------------------
>
> Key: STANBOL-770
> URL: https://issues.apache.org/jira/browse/STANBOL-770
> Project: Stanbol
> Issue Type: Bug
> Components: Engine - HtmlExtractor
> Reporter: Walter Kasper
> Assignee: Walter Kasper
>
> JTidy handles HTML5 docs wrong:
> - it eliminates new, "unknown" HTML5 elements
> - the changed behaviour of META tags is handled incorrectly by moving them to
> HEAD section, creating wrong scopes for Microdata elements
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira