On Aug 15, 2006, at 3:29 PM, Spencer Dawkins wrote:
Not sure there's a speeding bullet here. I happened to know what Arnt meant by this stuff -- my own favourite example is sorting "Canada" followed by "canal" then "cantor", *then* followed by "caña" and finally "cañada" (last two with tildes above the n's if you can't see them), if you're using Spanish. Without the ability to put ñ or ï into Internet Drafts, we could probably still make this more readable for people unfamiliar with i18n and sorting issues. Here's a stab: Use with natural language is often inappropriate: even though the collation apparently supports languages such as Italian and English, in real-world use it tends to mis-sort a number of types of string: * words such as "naive" (if spelled with an accent, the accented character could push the word to the wrong spot in a sorted list), * names such as "Llwyd" (which in Wales/Welsh or in Spanish should sort after single-L names like Lyza), * people and place names containing non-ASCII, * strings containing euro and pound sterling symbols, quotation marks, dashes/hyphens, etc. |
_______________________________________________ Gen-art mailing list [email protected] https://www1.ietf.org/mailman/listinfo/gen-art
