Mohamed,
I think that Ravi was using FileInputStream from the following line when he sets his InpuSource
> InputSource ipSource = new InputSource();
> ipSource.setEncoding("UTF-8");
> ipSource.setByteStream( new FileInputStream( new File(inputFile) )
The problem I believe is on the assumption which in my opinion is incorrect, that the char[]
returned by the character method in handler returns UTF8 data.
By reading the UTF8 reader which is an internal class in Xerces I see that the parser reads UTF8
and converts data in a fly to UTF 16 in the read method and stuff data in to char[].
I would imagine that all processing is done in UTF16, and that is what is returned.
I maybe wrong since I worked in the parser many years ago and many classes have changed, and
my memory has gotten worse with the years.
Cheers,
Jeffrey Rodriguez
Silicon Valley_________________________________________________________________
Fast, faster, fastest: Upgrade to Cable or DSL today! https://broadband.msn.com
--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
