Re: default encoding

Michael Rafael Glavassevich 20 May 2003 18:36:49 -0000

Hi Sudha,

Internally, Java stores characters (or chars) in memory as 16-bit Unicode.
This has nothing to do with the input encoding of your document, so no
characters are not stored as UTF-8 in a DOM Document. If you don't specify your 
own character reader, Xerces will try to translate your
stream of bytes from whatever encoding it is to 16-bit Unicode chars, and
if you specify your own Reader, it's responsible for translation.


Hope that helps.

-----------------------------
Michael Glavassevich
[EMAIL PROTECTED]
4B Computer Engineering
University of Waterloo

On Tue, 20 May 2003, Sudha Sathiaseelan wrote:

> Hi,
>
> Is UTF-8 the default encoding for the Document returned by 
> org.apache.xerces.jaxp.DocumentBuilderImpl's parse?
>
> Thanks,
> Sudha
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
>
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: default encoding

Reply via email to