My DOMWriter is transcoding from one of the native Macintosh code pages to UTF-8.  I'm 
exporting XML from a document layout program.  My specific issue is that high byte 
"smart quotes" characters are being serialized as three unrelated garbage characters 
(which I don't assume are wrong, since I'm looking at UTF-8), depending on the kind of 
quotes.  After transcoding by my SAX parser, these characters turn into three 
different garbage characters, which then promptly choke the target application (on 
Windows).  In the short-term, I'm probably going to just switch over the character and 
convert back to regular quotes.  In the long-term, I'd like to do away with this hack, 
as I expect it's just a matter of time before I hit more code page problems.

Sorry for being cranky about Mac code pages; I should have said unfamiliar instead of 
non-standard.  I'm no Windows zealot.

        Adam Heinz
        Development Consultant

        Exstream Software
        2424 Harrodsburg Road, Suite 200
        Lexington, KY 40503
        (317) 879-2831
        [EMAIL PROTECTED]

        connecting with the eGeneration 
        www.exstream.com

-----Original Message-----
From: James Berry [mailto:[EMAIL PROTECTED]
Sent: Wednesday, September 17, 2003 8:13 PM
To: Xerces C Dev
Cc: Adam Heinz
Subject: Re: native macintosh code page


Hi Adam,

Well, the Mac's native character encoding is no more non-standard than any
other, it's just different! (And it has the moral advantage of having been
defined before the Window's code page).

But to answer your question may require more information about what you're
really trying to do, and what's not working. You can set the output encoding
by using the setEncoding method on DOMWriter, for instance.

Does that help at all?

-jdb

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to