At 08:12 06/08/16, Lisa Dusseault wrote:

>On Aug 15, 2006, at 3:29 PM, Spencer Dawkins wrote:
>>>
>>>OK, this may be inadvertantly funny - are "naive" and "Llwyd" supposed to 
>>>include a non-ascii character, or is that sentence saying something else? 
>>>(Welcome to the world of the RFC Editor)

>Without the ability to put \x81or \xFDinto Internet Drafts, we could 
>probably still make this more readable for people unfamiliar with i18n and 
>sorting issues.  Here's a stab:
>
>Use with natural language  is often inappropriate: even though the collation 
>apparently supports  languages such as Italian and English,

It apparently supports English, but Italian comes with accents
(a bit less than French, but e.g. both "is" and "and" are written
as "e", the former with an accent grave), so that's a bad
example. I think the examples here are usually languages like
Hawai'ian or Swahili.

Regards,    Martin.

>in real-world use it tends to mis-sort a number of types of string:
>    * words such as "naive" (if spelled with an accent, the accented character 
> could push the word to the wrong spot in a sorted list), 
>   * names such as "Llwyd" (which in Wales/Welsh or in Spanish should sort 
> after single-L names like Lyza), 
>   * people and  place names containing non-ASCII, 
>   * strings containing euro and pound sterling symbols, quotation marks, 
> dashes/hyphens, etc.
>
>Lisa


#-#-#  Martin J. Du"rst, Assoc. Professor, Aoyama Gakuin University
#-#-#  http://www.sw.it.aoyama.ac.jp       mailto:[EMAIL PROTECTED]     


_______________________________________________
Gen-art mailing list
[email protected]
https://www1.ietf.org/mailman/listinfo/gen-art

Reply via email to