Sereena <serena_a <at> mindtree.com> writes:

> I am trying to parse an XML document with Korean characters in it, but when
> some of the Korean characters are encountered, the parsing stops. If I remove
> the characters causing the problem, the rest of the XML is parsed as well.
> Could anyone help me solve this so that I can parse the whole XML with any
> Korean character in it?
> Please note that I am not getting any exception here; the parsing just stops.
>

The problem was with the encoding. In the doc2bytes method, setting the 
encoding to "EUC-KR" (an encoding for Korean) in the Transformer's output 
properties, as below, gives the proper result:

Transformer transformer = factory.newTransformer();
Properties properties = transformer.getOutputProperties();
properties.setProperty(OutputKeys.ENCODING, "EUC-KR");
transformer.setOutputProperties(properties);
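
For anyone who wants to try this end to end, here is a minimal sketch of the round trip. The body and signature of doc2bytes are my assumption (the original method is not shown in this thread), and bytes2doc, the class name and the sample element are hypothetical names used only for illustration. It serializes a DOM Document to bytes with the given encoding and parses the bytes back:

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

public class EucKrRoundTrip {

    // Assumed shape of doc2bytes: serialize a DOM tree to bytes in the given encoding.
    public static byte[] doc2bytes(Document doc, String encoding) throws Exception {
        Transformer transformer = TransformerFactory.newInstance().newTransformer();
        // The serializer writes the bytes in this encoding and names it
        // in the XML declaration it emits.
        transformer.setOutputProperty(OutputKeys.ENCODING, encoding);
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        transformer.transform(new DOMSource(doc), new StreamResult(out));
        return out.toByteArray();
    }

    // Hypothetical counterpart: parse from a byte stream so the parser picks
    // the encoding up from the XML declaration instead of a platform default.
    public static Document bytes2doc(byte[] bytes) throws Exception {
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        return builder.parse(new ByteArrayInputStream(bytes));
    }

    public static void main(String[] args) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().newDocument();
        Element root = doc.createElement("greeting");
        // Korean sample text, written as Unicode escapes so the source file
        // encoding does not matter.
        String korean = "\uC548\uB155\uD558\uC138\uC694";
        root.setTextContent(korean);
        doc.appendChild(root);

        byte[] bytes = doc2bytes(doc, "EUC-KR");
        Document parsed = bytes2doc(bytes);
        // Reports whether the Korean text survived the round trip.
        System.out.println(korean.equals(parsed.getDocumentElement().getTextContent()));
    }
}
```

This assumes the JDK in use ships the EUC-KR charset. Passing the parser an InputStream rather than a Reader or a String built with the platform default charset is a deliberate choice here: it lets the encoding declared in the XML declaration decide how the bytes are decoded.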



