Francesco Chicchiriccò commented on COCOON-2352:

I have reworked your patch so that it also applies to the 
org.apache.cocoon:cocoon-serializers-charsets Maven artifact (used by Cocoon 
2.2 and Cocoon 3.0).

I don't know when we will be able to officially release your fix there; in the 
meantime, however, you can use the SNAPSHOT artifact by declaring the 
following dependency:


and adding the following repository to your pom:

      <name>Apache Snapshot Repository</name>

Alternatively, you can download the updated SNAPSHOT artifact from


> XMLEncoder doesn't support Unicode surrogate pairs
> --------------------------------------------------
>                 Key: COCOON-2352
>                 URL: https://issues.apache.org/jira/browse/COCOON-2352
>             Project: Cocoon
>          Issue Type: Bug
>          Components: * Cocoon Core, Blocks: Serializers
>    Affects Versions: 2.1.12
>            Reporter: Ben Fortuna
>            Assignee: Francesco Chicchiriccò
>             Fix For: 2.1.13
> Whilst investigating an issue with the Sling project and support for emoji 
> characters, I've come to notice that the XMLEncoder used by HTMLSerializer 
> doesn't support the Unicode surrogate pairs used to represent higher-order 
> Unicode characters.
> A simple unit test that demonstrates this issue is here:
> https://github.com/micronode/whistlepost/blob/master/whistlepost-rewrite-lib/src/test/groovy/org/apache/cocoon/components/serializers/encoding/XMLEncoderTest.groovy
> More background info here also: SLING-5973
> This seems to have been identified/addressed in other Apache projects also:
> https://issues.apache.org/jira/browse/THRIFT-3403?jql=text%20~%20%22surrogate%20pairs%22
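
For readers unfamiliar with the problem described above, here is a minimal 
sketch of surrogate-aware encoding. It is not Cocoon's XMLEncoder and not the 
patch attached to this issue; the class and method names are hypothetical. It 
assumes that non-ASCII characters are written as numeric character references 
and that a surrogate pair must be combined into one code point first. Markup 
escaping of characters such as < and & is left out.

```java
public class SurrogateEncodeSketch {

    // Hypothetical helper for illustration only; not part of any Cocoon API.
    // Writes ASCII through unchanged and every other code point as a single
    // hexadecimal character reference.
    static String encode(String text) {
        StringBuilder out = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            // codePointAt joins a high/low surrogate pair into one code point.
            int codePoint = text.codePointAt(i);
            if (codePoint < 0x80) {
                out.append((char) codePoint);
            } else {
                out.append("&#x")
                   .append(Integer.toHexString(codePoint).toUpperCase())
                   .append(';');
            }
            // Advance by 2 chars for a surrogate pair, 1 otherwise.
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+1F600 is stored in a Java String as the pair D83D DE00.
        System.out.println(encode("ok \uD83D\uDE00")); // prints "ok &#x1F600;"
    }
}
```

An encoder that instead handles each char on its own would emit two 
references, one per surrogate half, and neither is a valid XML character 
reference.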

This message was sent by Atlassian JIRA