Hello,

[I tried to Google, but could not find any references to the above; accept my apologies if this has been discussed before, and send me a pointer to that discussion]

I have the opposite problem of what Upayavira described a few days ago: all my XML documents are completely in UTF-8, but Cocoon outputs entities for many non-ascii characters. Although this does not create any badly formatted pages, it does increase the size of the output html file (most such utf-8 characters will take 2 bytes, whereas the entities regularly take 7 or more bytes), and seems both unneccessary and some extra work in an all-utf-8 context, both for the server and the client. As my site contains a lot of these characters, I would like to turn it off. But it doesn't seem to be possible:

After some searching I hunted down the following paragraph in the description for XalanJ 2.6.0 (http://xml.apache.org/xalan-j/readme.html):

Other points of interest

<<inline: void.gif>>

...
• For HTML output, Xalan-Java 2 outputs character entity references (&copy; etc.) for the special characters designated in Appendix A. DTDs of the XHTML 1.0: The Extensible HyperText Markup Language. Xalan-Java 1.x, on the other hand, outputs literal characters for some of these special characters.

That is, it seems default behaviour, and I have found no Cocoon or other documentiation or tips to change it. Anyone can help me with this?


Sjur

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to