Sereena <serena_a <at> mindtree.com> writes: > > I am trying to parse an xml with Korean characters in it, but when some of the > korean characters are encountered, the parsing stops. If I remove the > characters causing problem, the rest of the xml is also parsed. Could anyone > help me to get this solved so that I can parse the whole xml with any korean > character in it? > Please note that I am not getting any exception here, but the parsing stops. >
The problem was with the encoding format. So in the method doc2bytes if we set the encoding format as "EUC-KR"(which is the encoding for Korean)to the Transformer properties as below we get the proper result: Transformer transformer = factory.newTransformer(); Properties properties = transformer.getOutputProperties(); properties.setProperty(OutputKeys.ENCODING, "EUC-KR"); transformer.setOutputProperties(properties); --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
