> I am writing xml files which contain physical units. > Some units contain the greek mucro.
Do you mean U+03BC which is "GREEK SMALL LETTER MU" or U+00B5, which is "MICRO SIGN"? There is no Greek "mucro" character. > Xerces writes it as two characters, I remember thats > because it cannot be displayed as UTF-8. Well, if you are using UTF-8 as the encoding, then yes, that character requires two bytes (not characters). I'm not sure what you mean by "thats because it cannot be displayed as UTF-8." > My customer claims that this is wrong. Perhaps your customer is expecting you to use an encoding that represents that character in one byte. If the character in question is U+00B5, you can use ISO-8859-1. If it's U+03BC, you can use ISO-8859-7. However, using UTF-8 is a much better option, and your customer can simply get an editor that can display text encoded in UTF-8. Dave --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]