you need to read the XML 1.0 specification.

http://www.w3.org/TR/REC-xml/#charsets, quoted below.

You can't fix this by any .NET transformation. Some Unicode code
points are simple forbidden.




2.2 Characters

[Definition: A parsed entity contains text, a sequence of characters,
which may represent markup or character data.] [Definition: A
character is an atomic unit of text as specified by ISO/IEC 10646:2000
[ISO/IEC 10646]. Legal characters are tab, carriage return, line feed,
and the legal characters of Unicode and ISO/IEC 10646. The versions of
these standards cited in A.1 Normative References were current at the
time this document was prepared. New characters may be added to these
standards by amendments or new editions. Consequently, XML processors
must accept any character in the range specified for Char. ]

Character Range

[2]     Char       ::=          #x9 | #xA | #xD | [#x20-#xD7FF] |
[#xE000-#xFFFD] | [#x10000-#x10FFFF]    /* any Unicode character,
excluding the surrogate blocks, FFFE, and FFFF. */

On Mon, Oct 24, 2011 at 3:15 PM, jp_listero <[email protected]> wrote:
> what

Reply via email to