https://bugzilla.novell.com/show_bug.cgi?id=362740


           Summary: Support supplementary chars in C# \U form
           Product: Mono: Compilers
           Version: 1.9.0
          Platform: Other
        OS/Version: Other
            Status: NEW
          Severity: Enhancement
          Priority: P5 - None
         Component: C#
        AssignedTo: [email protected]
        ReportedBy: [EMAIL PROTECTED]
         QAContact: [email protected]
          Found By: ---


Created an attachment (id=195451)
 --> (https://bugzilla.novell.com/attachment.cgi?id=195451)
Test cases

In C# the \Uxxxxxxxx escape sequence can be used to enter supplementary
codepoints (those in the range U+10000 to U+10FFFF).  In the UTF-16 encoding,
as used by .NET, codepoints in that range are represented in as a pair of chars
(codeunits) called a "surrogate pair".  The conversion is a simple arithmetic
conversion, see http://www.unicode.org/faq/utf_bom.html#UTF16  I thought I
would refer here to the code in the UTF32Encoding class, but it appears to no
support such codepoints either.  I will open a separate bug for that issue.

Mono appears not to support such usage, for instance in the first unit-test
attached the string contains a single char \x0041 rather that the surrogate
pair \xD800\xDC41.


-- 
Configure bugmail: https://bugzilla.novell.com/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
You are the assignee for the bug.
_______________________________________________
mono-bugs maillist  -  [email protected]
http://lists.ximian.com/mailman/listinfo/mono-bugs

Reply via email to