Ok,

I think this encoding stuff with Ceki's name is a bug in the OSX JVM that
I'm using, so we may have to revert back to using "u" for a bit until the
OSX GM is out and I can test/use that. :-( The weird thing is that I'm not
convinced that it is the OSX JVM though...here is why:

Here is the input file:

<p>
<b>Ceki G&#252;lc&#252;</b> (ceki at apache.org)
<br/>
Ceki is the founder of the log4j project. Time permitting, he also does
custom development for clients. See <a
href="http://www.qos.ch">www.qos.ch</a> for more info.
</p>

Here is the little test program:

import java.io.*;
import java.lang.*;
import java.util.*;
import org.jdom.*;
import org.jdom.input.*;
import org.jdom.output.*;

public class Test
{
    public static void main (String[] args)
    {
        try
        {
            Document d = new SAXBuilder().build(args[0]);
            XMLOutputter outp = new XMLOutputter("", false);
            outp.setEncoding("ISO-8859-1");
            FileWriter fw = new FileWriter("test.html");
            outp.output(d, fw);
            fw.close();
        }
        catch (Exception e)
        {
        }
    }
}

java Test input.txt

produces:

<?xml version="1.0" encoding="ISO-8859-1"?>
<p>
<b>Ceki Gülcü</b> (ceki at apache.org)
<br />
Ceki is the founder of the
log4j project. Time permitting, he also does
custom development for clients.
See <a href="http://www.qos.ch">www.qos.ch</a> for more info.
</p>

As you can see, Ceki's name is correctly shown. The weird thing is that if I
take the file and load it in my browser, the weird characters show up for
the "u"'s, not the correct ones...

So, I'm at a loss here...can anyone help out?

thanks,

-jon


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to