Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-31 Thread Christopher Schultz
Greg, On 3/31/22 12:17, Christopher Schultz wrote: Greg, On 3/29/22 13:41, gelo1234 wrote: Have you also tried HTMLT or XHTMLT Serializers? Default HTMLSerializer cannot handle some unicode characters: https://issues.apache.org/jira/browse/SLING-5973?attachmentOrder=asc Hmm. Are the HTMLT

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-31 Thread Cédric Damioli
Hi, To help isolate the issue, could you test with a simpler pipeline with only generator/single simple XSLT/xml serializer ? Cédric Le 31/03/2022 à 17:54, Christopher Schultz a écrit : Cédric, On 3/29/22 12:52, Cédric Damioli wrote: Do you use Xalan as XSLT Processor ? If so, I remember

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-31 Thread Christopher Schultz
Greg, On 3/31/22 12:13, Christopher Schultz wrote: On 3/29/22 13:37, gelo1234 wrote: Hello Chris, I think you will not get any icon-type character on output without using proper font rendering - like Emoji support? Emoji might not be supported by default in Cocoon. This isn't a

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-31 Thread Christopher Schultz
Greg, On 3/29/22 13:41, gelo1234 wrote: Have you also tried HTMLT or XHTMLT Serializers? Default HTMLSerializer cannot handle some unicode characters: https://issues.apache.org/jira/browse/SLING-5973?attachmentOrder=asc Hmm. Are the HTMLT / XHTMLT serializers built-in? I have disabled all

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-31 Thread Christopher Schultz
Greg, On 3/29/22 13:37, gelo1234 wrote: Hello Chris, I think you will not get any icon-type character on output without using proper font rendering - like Emoji support? Emoji might not be supported by default in Cocoon. This isn't a font-rendering issue; it's just ... wrong. Either the

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-31 Thread Christopher Schultz
Cédric, On 3/29/22 12:52, Cédric Damioli wrote: Do you use Xalan as XSLT Processor ? If so, I remember https://issues.apache.org/jira/browse/XALANJ-2617 which could be a cause of your issue. I resolved it on my side years ago by compiling my own patched version > of Xalan. I'm using

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-29 Thread gelo1234
Chris, Have you also tried HTMLT or XHTMLT Serializers? Default HTMLSerializer cannot handle some unicode characters: https://issues.apache.org/jira/browse/SLING-5973?attachmentOrder=asc Greetings, Greg wt., 29 mar 2022 o 19:37 gelo1234 napisał(a): > Hello Chris, > > I think you will not get

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-29 Thread gelo1234
Hello Chris, I think you will not get any icon-type character on output without using proper font rendering - like Emoji support? Emoji might not be supported by default in Cocoon. So this might be the reason why you get HTML entities instead of Emoji-icons. Also notice:

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-29 Thread Cédric Damioli
Do you use Xalan as XSLT Processor ? If so, I remember https://issues.apache.org/jira/browse/XALANJ-2617 which could be a cause of your issue. I resolved it on my side years ago by compiling my own patched version of Xalan. For "markers", you may use labels on your sitemap steps associated

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-29 Thread Christopher Schultz
Cédric, On 3/29/22 12:06, Cédric Damioli wrote: Could you provide more details ? How is your XML processed before outputting the wrong UTF-8 sequence ? It's somewhat straightforward: https://source/; />

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-29 Thread Cédric Damioli
Hi Christopher, Could you provide more details ? How is your XML processed before outputting the wrong UTF-8 sequence ? Regards, Cédric Le 29/03/2022 à 17:48, Christopher Schultz a écrit : All, I'm still struggling with this. I have upgraded to 2.1.13 which includes the fix for

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2022-03-29 Thread Christopher Schultz
All, I'm still struggling with this. I have upgraded to 2.1.13 which includes the fix for https://issues.apache.org/jira/browse/COCOON-2352 but I'm still getting that American flag converted into those 4 HTML entities: I would expect there to be a single (multibyte) character in the output

Re: Getting UTF-16 encoding on dynamic content regardless of output content type

2018-10-30 Thread Christopher Schultz
-BEGIN PGP SIGNED MESSAGE- Hash: SHA256 All, Some additional information at the end. On 10/30/18 11:58, Christopher Schultz wrote: > All, > > I'm attempting to do everything with UTF-8 in Cocoon 2.1.11. I have > a servlet generating XML in UTF-8 encoding and I have a pipeline > with a

Getting UTF-16 encoding on dynamic content regardless of output content type

2018-10-30 Thread Christopher Schultz
-BEGIN PGP SIGNED MESSAGE- Hash: SHA256 All, I'm attempting to do everything with UTF-8 in Cocoon 2.1.11. I have a servlet generating XML in UTF-8 encoding and I have a pipeline with a few transforms in it, ultimately serializing to XHTML. If I have a Unicode character in the XML which