[ https://issues.apache.org/jira/browse/SLING-6783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15976967#comment-15976967 ]
ASF GitHub Bot commented on SLING-6783: --------------------------------------- GitHub user JEBailey opened a pull request: https://github.com/apache/sling/pull/216 SLING-6783 updates for org.apache.commons.html - tagsoup jar update - updated annotations - added readme for available configuration options - updated tagsoup homepage - added ability to add configurations parameters/features to parser instance this is groundwork for html5 support You can merge this pull request into a Git repository by running: $ git pull https://github.com/JEBailey/sling local-dev Alternatively you can review and apply these changes as the patch at: https://github.com/apache/sling/pull/216.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #216 ---- commit b4defc1e67752d0b809c02000a815df688f12a48 Author: jabail <jab...@d10a352.na.sas.com> Date: 2017-04-20T15:54:15Z SLING-6783 updates for org.apache.commons.html ---- > update for org.apache.commons.html > ---------------------------------- > > Key: SLING-6783 > URL: https://issues.apache.org/jira/browse/SLING-6783 > Project: Sling > Issue Type: Improvement > Components: Commons > Reporter: Jason E Bailey > Priority: Minor > Attachments: sling.patch > > > Following updates: > Updated tagsoup lib to 1.2.1 which has the following modifications > * DOCTYPE is now recognized even in lower case. > * We make sure to buffer the reader, eliminating a long-standing bug that > would crash on certain inputs, such as & followed by CR+LF. > * The HTML scanner's table is precompiled at run time for efficiency, causing > a 4x speedup on large input documents. > * ]] within a CDATA section no longer causes input to be discarded. > * Remove bogus newline after printing children of the root element. > * Allow the noscript element anywhere, the same as the script element. > * Updated to the 2011 edition of the W3C character entity list. > Additionally: > Updated license with new home page for tagsoup > Updated annotations to OSGi annotations > Added the ability to specify additional features/properties for the parser > Documented available settings -- This message was sent by Atlassian JIRA (v6.3.15#6346)