My DOMWriter is transcoding from one of the native Macintosh code pages to UTF-8. I'm
exporting XML from a document layout program. My specific issue is that high-byte
"smart quote" characters are each being serialized as three seemingly unrelated garbage
characters, which vary with the kind of quote. I don't assume these are wrong, since
I'm looking at UTF-8 output. After transcoding by my SAX parser, however, they turn into
three different garbage characters, which then promptly choke the target application (on
Windows). In the short term, I'm probably going to just switch on the character and
convert smart quotes back to regular quotes. In the long term, I'd like to do away with
this hack, as I expect it's just a matter of time before I hit more code page problems.
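
To make the "three garbage characters" concrete, here is a minimal sketch, assuming the
source code page is Mac OS Roman, where the smart quotes sit at bytes 0xD2 through 0xD5.
The function names (macRomanQuoteToCodePoint, encodeUtf8, toHex) are made up for this
illustration and are not part of Xerces. Each quote maps to a code point in the U+2018
to U+201D range, which UTF-8 encodes as three bytes.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>

// Map the four Mac OS Roman smart-quote bytes to Unicode code points.
// Illustrative only: any other byte returns 0 rather than being transcoded.
std::uint32_t macRomanQuoteToCodePoint(unsigned char b) {
    switch (b) {
        case 0xD2: return 0x201C; // left double quotation mark
        case 0xD3: return 0x201D; // right double quotation mark
        case 0xD4: return 0x2018; // left single quotation mark
        case 0xD5: return 0x2019; // right single quotation mark
        default:   return 0;
    }
}

// Encode a code point from the Basic Multilingual Plane as UTF-8 (1 to 3 bytes).
std::string encodeUtf8(std::uint32_t cp) {
    assert(cp <= 0xFFFF); // this sketch does not handle 4-byte sequences
    std::string out;
    if (cp < 0x80) {
        out += static_cast<char>(cp);
    } else if (cp < 0x800) {
        out += static_cast<char>(0xC0 | (cp >> 6));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else {
        out += static_cast<char>(0xE0 | (cp >> 12));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    }
    return out;
}

// Render raw bytes as space-separated uppercase hex, e.g. "E2 80 9C".
std::string toHex(const std::string& bytes) {
    static const char digits[] = "0123456789ABCDEF";
    std::string out;
    for (std::size_t i = 0; i < bytes.size(); ++i) {
        unsigned char b = static_cast<unsigned char>(bytes[i]);
        if (i != 0) out += ' ';
        out += digits[b >> 4];
        out += digits[b & 0x0F];
    }
    return out;
}
```

For example, toHex(encodeUtf8(macRomanQuoteToCodePoint(0xD2))) gives "E2 80 9C". A
viewer that reads those bytes as Windows-1252 shows them as "â€œ", which may well be the
garbage I'm seeing, so the DOMWriter output could be correct and the damage could be
happening in a later step that treats the UTF-8 bytes as a single-byte code page.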
Sorry for being cranky about Mac code pages; I should have said unfamiliar instead of
non-standard. I'm no Windows zealot.
Adam Heinz
Development Consultant
Exstream Software
2424 Harrodsburg Road, Suite 200
Lexington, KY 40503
(317) 879-2831
[EMAIL PROTECTED]
connecting with the eGeneration
www.exstream.com
-----Original Message-----
From: James Berry [mailto:[EMAIL PROTECTED]
Sent: Wednesday, September 17, 2003 8:13 PM
To: Xerces C Dev
Cc: Adam Heinz
Subject: Re: native macintosh code page
Hi Adam,
Well, the Mac's native character encoding is no more non-standard than any
other; it's just different! (And it has the moral advantage of having been
defined before the Windows code page.)
But to answer your question may require more information about what you're
really trying to do, and what's not working. You can set the output encoding
by using the setEncoding method on DOMWriter, for instance.
Does that help at all?
-jdb