[ 
https://issues.apache.org/jira/browse/COCOON-2297?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12902345#action_12902345
 ] 

Nico Verwer commented on COCOON-2297:
-------------------------------------

I am afraid the patch is not correct. The line that now reads
tidy.setShowWarnings(getLogger().isWarnEnabled() && ((wChar == 'f') || (wChar 
== 'n') || (wChar == '0')));
should have been
tidy.setShowWarnings(getLogger().isWarnEnabled() && ((wChar == 't') || (wChar 
== 'y') || (wChar == '1')));

Otherwise, the meaning of show-warnings is inverted. Sorry for the confusion.

Are there still committers for Cocoon 2.1 who are willing to pick this up?

> Character encoding does not follow JTidy properties
> ---------------------------------------------------
>
>                 Key: COCOON-2297
>                 URL: https://issues.apache.org/jira/browse/COCOON-2297
>             Project: Cocoon
>          Issue Type: Bug
>          Components: Blocks: HTML
>    Affects Versions: 2.1.11
>            Reporter: Nico Verwer
>         Attachments: HTMLTransformer.patch
>
>
> The text that HTMLTransformer sends to JTidy is always encoded according tot 
> the platform default encoding, by calling text.getBytes() without an encoding 
> parameter. JTidy does not follow the platform default encoding, but has its 
> own default. It is possible to change JTidy's input encoding in the 
> properties file.
> The patch uses the encoding specified by JTidy's configuration.
> The result is that HTMLTransformer handles UTF-8 or other encodings 
> correctly, so you don't get Chinese characters where you expected a 
> diacritical mark.
> While I was changing the code, I also changed the logging settings. They now 
> take the settings in the JTidy configuration into account.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to