[ 
https://issues.apache.org/jira/browse/COCOON-2297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nico Verwer updated COCOON-2297:
--------------------------------
    Affects Version/s: 2.1.13

> Character encoding does not follow JTidy properties
> ---------------------------------------------------
>
>                 Key: COCOON-2297
>                 URL: https://issues.apache.org/jira/browse/COCOON-2297
>             Project: Cocoon
>          Issue Type: Bug
>          Components: Blocks: HTML
>    Affects Versions: 2.1.11, 2.1.13
>            Reporter: Nico Verwer
>            Priority: Major
>         Attachments: HTMLTransformer.patch
>
>
> The text that HTMLTransformer sends to JTidy is always encoded according tot 
> the platform default encoding, by calling text.getBytes() without an encoding 
> parameter. JTidy does not follow the platform default encoding, but has its 
> own default. It is possible to change JTidy's input encoding in the 
> properties file.
> The patch uses the encoding specified by JTidy's configuration.
> The result is that HTMLTransformer handles UTF-8 or other encodings 
> correctly, so you don't get Chinese characters where you expected a 
> diacritical mark.
> While I was changing the code, I also changed the logging settings. They now 
> take the settings in the JTidy configuration into account.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to