Re: validity of lone surrogates (was Re: Unicode surroga tes: just say no!)

2001-06-27 Thread Gaute B Strokkenes
On Wed, 27 Jun 2001, [EMAIL PROTECTED] wrote: [earlier correspondents] Personally, I think that the codecs should report an error in the appropriate fashion when presented with a python unicode string which contains values that are not allowed, such as lone surrogates. Other people have

RE: validity of lone surrogates (was Re: Unicode surroga tes: just say no!)

2001-06-27 Thread Carl W. Brown
Mark, Your are correct in that the text is not nearly as clear as it should be, and is open to different interpretations. My view of the status in Unicode 3.1 is represented on http://www.macchiato.com/utc/utf_comparison.htm. Corresponding computations are on

Re: [I18n-sig] Re: Unicode surrogates: just say no!

2001-06-27 Thread Rick McGowan
Martin v. Loewis [EMAIL PROTECTED] wrote: It seems to be unclear to many, including myself, what exactly was clarified with Unicode 3.1. Where exactly does it say that processing a six-byte two-surrogates sequence as a single character is non-conforming? It's not non-conforming, it's

Re: [I18n-sig] Re: Unicode surrogates: just say no!

2001-06-27 Thread Peter_Constable
If you still find the definitions and discussion in the technical report to be unclear, then the Unicode editorial committee would undoubtedly like to hear about it. There is no question that there are still things that are unclear and things that are anachronistic in the definitions. I have

Unicode 3.1.1 Beta

2001-06-27 Thread Mark Davis
The next version of the Unicode Standard will be Version 3.1.1, due for release in August, 2001. The beta period for this version will be until July 31, 2001. During this beta period, updated Unicode Character Database files are available for public comment. We strongly encourage implementers to