https://bugzilla.novell.com/show_bug.cgi?id=362740
Summary: Support supplementary chars in C# \U form
Product: Mono: Compilers
Version: 1.9.0
Platform: Other
OS/Version: Other
Status: NEW
Severity: Enhancement
Priority: P5 - None
Component: C#
AssignedTo: [email protected]
ReportedBy: [EMAIL PROTECTED]
QAContact: [email protected]
Found By: ---
Created an attachment (id=195451)
--> (https://bugzilla.novell.com/attachment.cgi?id=195451)
Test cases
In C# the \Uxxxxxxxx escape sequence can be used to enter supplementary
codepoints (those in the range U+10000 to U+10FFFF). In the UTF-16 encoding,
as used by .NET, codepoints in that range are represented as a pair of chars
(codeunits) called a "surrogate pair". The conversion is simple arithmetic;
see http://www.unicode.org/faq/utf_bom.html#UTF16. I thought I
would refer here to the code in the UTF32Encoding class, but it appears not to
support such codepoints either. I will open a separate bug for that issue.
Mono appears not to support such usage. For instance, in the first unit test
attached, the string contains a single char \x0041 rather than the surrogate
pair \xD800\xDC41.
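
For reference, the arithmetic described in the Unicode FAQ can be sketched as
follows. This is only an illustration, not code from the Mono sources: the
class and method names are hypothetical, and the input U+10041 is an
assumption inferred from the expected pair \xD800\xDC41 quoted above.

```csharp
using System;

public static class SurrogateSketch
{
    // Hypothetical helper (not part of Mono or .NET): splits a supplementary
    // codepoint (U+10000..U+10FFFF) into its two UTF-16 codeunits.
    public static char[] ToSurrogatePair(int codePoint)
    {
        if (codePoint < 0x10000 || codePoint > 0x10FFFF)
            throw new ArgumentOutOfRangeException("codePoint");

        int offset = codePoint - 0x10000;              // 20-bit value
        char high = (char)(0xD800 + (offset >> 10));   // top 10 bits
        char low = (char)(0xDC00 + (offset & 0x3FF));  // bottom 10 bits
        return new char[] { high, low };
    }

    public static void Main()
    {
        // U+10041 is assumed to be the codepoint used in the first test case.
        char[] pair = ToSurrogatePair(0x10041);
        Console.WriteLine("{0:X4} {1:X4}", (int)pair[0], (int)pair[1]);
        // prints "D800 DC41"
    }
}
```

The .NET 2.0 method char.ConvertFromUtf32 performs the same conversion and
returns the result as a string, so a compiler written in C# could probably
rely on it instead of a hand-written helper like the one above.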