Hi Sudha,

Internally, Java stores characters (chars) in memory as 16-bit Unicode (UTF-16 code units). This has nothing to do with the input encoding of your document, so no, characters are not stored as UTF-8 in a DOM Document. If you don't supply your own character Reader, Xerces will translate your stream of bytes from whatever encoding it is in to 16-bit Unicode chars; if you do supply your own Reader, it is responsible for that translation.
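To make that concrete, here is a small sketch. It assumes the JAXP parser bundled with the JDK (obtained through DocumentBuilderFactory) rather than a specific Xerces release, and the class and helper names (EncodingDemo, rootText, fromBytes, fromReader) are made up for illustration. It parses the same text from ISO-8859-1 bytes, from UTF-8 bytes, and from a Reader, and compares what ends up in the DOM.

```java
import java.io.ByteArrayInputStream;
import java.io.StringReader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.xml.sax.InputSource;

class EncodingDemo {

    // Parse the source and return the root element's text.
    // Checked exceptions are wrapped only to keep the sketch short.
    static String rootText(InputSource source) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(source);
            return doc.getDocumentElement().getTextContent();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        }
    }

    // Byte stream: the parser decodes the bytes itself,
    // guided by the encoding named in the XML declaration.
    static String fromBytes(String xml, Charset charset) {
        return rootText(new InputSource(new ByteArrayInputStream(xml.getBytes(charset))));
    }

    // Character stream: the Reader hands over chars that are already decoded,
    // so the parser does no byte-to-char translation.
    static String fromReader(String xml) {
        return rootText(new InputSource(new StringReader(xml)));
    }

    public static void main(String[] args) {
        String latin1 = "<?xml version=\"1.0\" encoding=\"ISO-8859-1\"?><a>caf\u00e9</a>";
        String utf8 = "<?xml version=\"1.0\" encoding=\"UTF-8\"?><a>caf\u00e9</a>";

        String a = fromBytes(latin1, StandardCharsets.ISO_8859_1);
        String b = fromBytes(utf8, StandardCharsets.UTF_8);
        String c = fromReader(utf8);

        // All three are the same Java String, whatever the input encoding was.
        System.out.println(a.equals(b) && b.equals(c)); // prints "true"
        System.out.println(a.length());                 // prints "4"
    }
}
```

The accented character takes one byte in ISO-8859-1 and two in UTF-8, but in the Document it is a single char either way. The input encoding only matters while the bytes are being decoded.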
Hope that helps.

-----------------------------
Michael Glavassevich
[EMAIL PROTECTED]
4B Computer Engineering
University of Waterloo

On Tue, 20 May 2003, Sudha Sathiaseelan wrote:

> Hi,
>
> Is UTF-8 the default encoding for the Document returned by
> org.apache.xerces.jaxp.DocumentBuilderImpl's parse?
>
> Thanks,
> Sudha
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
