Great! According to the links, I have an easy solution.
The only thing that needs to be done, to get HTML in UTF-8 encoding processed right is:
edit <forrest>/main/webapp/WEB-INF/jtidy.properties
replace
char-encoding=latin1
with
char-encoding=utf8

see http://tidy.sourceforge.net/docs/quickref.html#EncodingHeader for additional info

There is no need to play with java's file.encoding.
Maybe this solves FOR-668?

Mirek

Ross Gardler wrote:

Miroslav Mocek wrote:

Ok.
I added line <echo message="${file.encoding}"/> into forrest.build.xml.
I started
forrest run -Dfile.encoding=UTF-8
in console is written:
init:
UTF-8

So I assume, the java default encoding is now UTF-8. (it was other, previously)
Still getting the same wierd characters. (THE SAME)

my env: java1.5, forrest0.8-dev,windows.

Ok, I need solution. I've looked at sources, haven't found html reader you are reporting to. Where is it, please?


http://cocoon.apache.org/2.1/userdocs/generators/html-generator.html

http://cocoon.apache.org/2.1/apidocs/org/apache/cocoon/generation/HTMLGenerator.html

Ross