[ 
https://issues.apache.org/jira/browse/STANBOL-770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Walter Kasper resolved STANBOL-770.
-----------------------------------

    Resolution: Fixed

Resolved by replacing JTidy Html parser by JSoup parser
                
> Wrong changes of  the structure of HTML5 docs by Tidy based HtmlParser
> ----------------------------------------------------------------------
>
>                 Key: STANBOL-770
>                 URL: https://issues.apache.org/jira/browse/STANBOL-770
>             Project: Stanbol
>          Issue Type: Bug
>          Components: Engine - HtmlExtractor
>            Reporter: Walter Kasper
>            Assignee: Walter Kasper
>
> JTidy handles HTML5 docs wrong:
> - it eliminates new, "unknown" HTML5 elements
> - the changed behaviour of META tags is handled incorrectly by moving them to 
> HEAD section, creating wrong scopes for Microdata elements

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to