[
https://issues.apache.org/jira/browse/KNOX-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16636003#comment-16636003
]
Kevin Risden commented on KNOX-1489:
------------------------------------
Could be backwards incompatible changes highlighted below in "CHANGES THAT
COULD AFFECT THE BEHABIOUR OF EXISTING PROGRAMS".
{noformat}
3.4 (2015-10-24)
- Bug Fixes:
- [62] Fixed GC performance problem in StreamedSource.
- [71] Renderer.setHRLineLength(0) doesn't completely disable
rendering of HR element.
- [72] Fixed performance problem in Attributes.
- [80] Fixed position discarded exception in StreamedSource.
- [81] Limited left margin in Renderer based on MaxLineLength.
- Little-endian BOM encoding detection broken.
- HTML5 elements with forbidden end tags weren't present in
HTMLElements.getEndTagForbiddenElementNames()
- CHANGES THAT COULD AFFECT THE BEHAVIOUR OF EXISTING PROGRAMS:
- Changed default character reference encoding behaviour.
(see Config.DEFAULT_CHARACTER_REFERENCE_ENCODING_BEHAVIOUR)
- Changed the the ordering of OutputSegments for more intuitive
behaviour, but still backward compatible with the old API contract.
- Added Apache License as an option for licensing.
- Added Config.CurrentCharacterReferenceEncodingBehaviour parameter.
- Performance improvements in name and attribute based searches after
full sequential parse.
- Performance improvement in CharacterReference.decode methods.
- Added LoggerProvider.getSourceLogger() for better performance of
highly concurrent applications.
- Performance improvement in StreamedSource by avoiding exception
at end of stream.
- Compiling to target Java 1.7 instead of Java 1.5.
(source code is however still compatible with Java 1.6)
- Removed all raw type references from source code.
- Improved documentation of TagType.isValidPosition to include mention
of potential problems with Microsoft downlevel-revealed conditional
comment tags.
- INPUT elements missing a name attribute no longer result in an
error message being logged.
- INPUT elements with type attribute values of date, datetime,
datetime-local, month, time, week, number, range, email, url, search,
tel, and color are now recognised as text controls without warnings
appearing in the log.
- HTMLSanitiser.stripInvalidMarkup sample now removes content from
<script> tags, not just the tags.
- Upgraded to the following logger APIs:
slf4j-api-1.7.12, log4j-2.4.1
3.3 (2012-10-31)
- Bug Fixes:
- [3581664] CharacterReference.decode() does not decode entities
containing digits - ½ ¼ ¾ ¹ ² ³
∴
- [3311286] SourceCompactor does not respect TEXTAREA
- [3519131] Renderer output incorrect when constructed with an
Element object.
- [3538829] Renderer output of font decoration on block boundaries
incorrect.
- Segment.getAllStartTags(name) and Segment.getFirstElement(name)
do not work if the argument contains upper case characters.
- The end delimiter of a common server tag inside an escaped server
tag is falsely recognised as the end delimiter of the escaped tag.
- CHANGES THAT COULD AFFECT THE BEHAVIOUR OF EXISTING PROGRAMS:
- [3427073] Segment.getStyleURISegments() now includes style element
content as well as style attribute values.
- [3427927] Segment.getURIAttributes() now includes the archive
attributes of object and applet elements.
- Comments no longer recognised inside script elements during full
sequential parse. Previously they were recognised for compatibility
with major browsers but modern browser behaviour has changed.
- Changed the log level of all parsing errors from INFO to ERROR, and
the log level of the Source.fullSequentialParse() advisory message
from WARN to INFO. The previous levels gave the advisory message a
higher severity than the parsing errors, preventing logging systems
from hiding the advisory message while showing parsing errors.
Character encoding warnings remain unchanged at WARN level.
- Changed the behaviour of the Renderer.renderHyperlinkURL(StartTag)
method so that relative URLs are not rendered.
- Changed the behaviour of the Renderer so that hyperlink element
content is not rendered if it is the same as the hyperlink URL,
ignoring any http:// prefix or / suffix.
- EndTag.tidy() now removes whitespace before the closing bracket.
- Added Source(File) constructor.
- Added OutputDocument.getSegment() method.
- Added OutputDocument.remove(int begin, int end) method.
- Added Renderer.setHRLineLength() method.
- Added RenderToText.jsp webapp sample.
- Added Segment.getRowColumnVector() method.
- Encoding detection now ignores common encodings specified in meta tags
that have a code unit size incompatible with the preliminary encoding.
- Upgraded to the following logger APIs:
slf4j-api-1.7.2, log4j-1.2.17{noformat}
>From http://jericho.htmlparser.net/release.txt
> Upgrade jericho-html to 3.4
> ---------------------------
>
> Key: KNOX-1489
> URL: https://issues.apache.org/jira/browse/KNOX-1489
> Project: Apache Knox
> Issue Type: Sub-task
> Reporter: Kevin Risden
> Assignee: Kevin Risden
> Priority: Major
> Fix For: 1.2.0
>
> Attachments: KNOX-1489.patch
>
>
> jericho-html 3.2 to 3.4
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)