Christian Lohmaier wrote:
Hi John, *,

On Tue, Feb 05, 2008 at 07:34:50AM -0000, John McCreesh wrote:
On the Marketing Project planet, http://marketing.openoffice.org/planet
there are posts appearing with "non-displayable" characters. There's a
good example in Ben Horsts's post of Feb 4th.which has lots of these
"problem" characters.

Any suggestions please as to how I might correct them?

Define the charset in the generater to be UTF-8/set it to don't touch
and ask your peers to provide the feed in utf-8 only.

As there are various charsets, there is only hope when the charset is
specified somewhere, just guessing can break seriously. You cannot take
the page as it is and pass it through a converter. You would have to
pass the individual feeds through an appropriate filter/recoder.

As Ben's own site (and feed) is already UTF-8 and displays properly,
there is a misconfiguration in the aggregator "planetplanet" used (or a
bug), or the page is corrupted somewhere else in between.

Hmm, as you say, Ben's feed is clean, it passes through the Planet aggregator cleanly, it looks ok on my server, but once it gets into cvs on the site it's corrupt.

Is there something in cvs that could mangle characters? Should I be saving the files as binary in cvs? (maybe a question for the native-lang people...)?

Thanks - John


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to