Is there any way the Xerces parser can identify Unicode characters in an XML document that are not valid for that document's encoding? (Is there some feature that needs to be set?)
Thanks in advance,
Rohith

-----Original Message-----
From: Joseph Kesselman [mailto:[EMAIL PROTECTED]
Sent: Thursday, 7 August 2003 10:49 PM
To: '[EMAIL PROTECTED]'
Subject: need help in entity resolution

On Thursday, 08/07/2003 at 06:36ZE10, "Raveendranath, Rohith (LNG - AUS)" wrote:
> ″ ′ and say one more entity —

These are numeric character references, not entity references. They don't have to be defined (and in fact can't be defined); they map directly into the Unicode characters with those numbers. During output, the serializer maps these Unicode characters to their corresponding characters in the selected output encoding, or outputs them as numeric references if the encoding does not have a corresponding character.

> When the document is serialized, — is replaced with hyphen

Check the details of your encoding; that may be correct. Check the file at the byte level; it's also possible that what you're seeing isn't actually a hyphen but a character which appears similar when displayed by your system/software.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
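To make the serializer behaviour described above concrete, here is a minimal sketch in plain Java. It is not Xerces code and does not use any Xerces API; the class name EncodingCheck and the methods escapeUnencodable and hexBytes are made up for illustration. It only uses java.nio.charset to ask whether a character exists in a target encoding, falls back to a decimal numeric character reference when it does not, and dumps bytes so that a real hyphen can be told apart from a look-alike character. It does not escape markup characters such as & or <.

```java
import java.nio.charset.Charset;
import java.nio.charset.CharsetEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only (not part of Xerces).
public class EncodingCheck {

    // Replace every code point the target encoding cannot represent
    // with a decimal numeric character reference such as &#8212;.
    public static String escapeUnencodable(String text, String encodingName) {
        CharsetEncoder encoder = Charset.forName(encodingName).newEncoder();
        StringBuilder out = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            String chunk = new String(Character.toChars(cp));
            if (encoder.canEncode(chunk)) {
                out.append(chunk);                        // representable: keep as-is
            } else {
                out.append("&#").append(cp).append(';');  // not representable: numeric reference
            }
            i += Character.charCount(cp);
        }
        return out.toString();
    }

    // Show the bytes a string produces in a given encoding, as hex pairs.
    public static String hexBytes(String text, Charset charset) {
        StringBuilder out = new StringBuilder();
        for (byte b : text.getBytes(charset)) {
            if (out.length() > 0) {
                out.append(' ');
            }
            out.append(String.format("%02X", b & 0xFF));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // U+2032 PRIME, U+2033 DOUBLE PRIME, U+2014 EM DASH
        String sample = "5\u2032 10\u2033 \u2014 done";

        // ISO-8859-1 has none of these characters, so all three are escaped.
        System.out.println(escapeUnencodable(sample, "ISO-8859-1"));
        // prints: 5&#8242; 10&#8243; &#8212; done

        // An em dash and an ASCII hyphen are different bytes on disk.
        System.out.println(hexBytes("\u2014", StandardCharsets.UTF_8)); // prints: E2 80 94
        System.out.println(hexBytes("-", StandardCharsets.UTF_8));      // prints: 2D
    }
}
```

With UTF-8 as the target, escapeUnencodable returns the sample unchanged, since every character is representable. Which encoding your serializer is actually using is something you would need to confirm from your own output settings.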
