Re: [rust-dev] How to find Unicode string length in rustlang

Huon Wilson Wed, 28 May 2014 15:25:03 -0700

On 29/05/14 06:38, Kevin Ballard wrote:

On May 28, 2014, at 1:26 PM, Benjamin Striegel <ben.strie...@gmail.com<mailto:ben.strie...@gmail.com>> wrote:
> Unicode is not a simple concept. UTF-8 on the other hand is apretty simple concept.
I don't think we can fully divorce these two ideas. UnderstandingUTF-8 still implies understanding the difference between code points,code units, and grapheme clusters. If we have a single unadorned`len` function, that implies the existence of a "default" length to aUTF-8 string, which is a lie. It also *fails* to suggest theexistence of alternative measures of length of a UTF-8 string.Finally, the choice of byte length as the default length metricencourages the horrid status quo, which is the perpetuation of codethat is tested and works in ASCII environments but barfs as soon asanyone from a sufficiently-foreign culture tries to use it.Dedicating ourselves to Unicode support does us no good if theremainder of our API encourages the depressingly-typical ASCII-ismthat pervades nearly every other language.
Do you honestly believe that calling it .byte_len() will do anythingbesides confusing anyone who expects .len() to work, and resulting incode that looks any different than just using .byte_len() everywherepeople use .len() today?
Forcing more verbose, annoying, unconventional names on people won'tactually change how they process strings. It will just confuse andannoy them.
-Kevin


_______________________________________________
Rust-dev mailing list
Rust-dev@mozilla.org
https://mail.mozilla.org/listinfo/rust-dev

Changing the names of methods on strings seems very similar how Pathdoes not implement Show (except with even stronger motivation, becausestrings have at least 3 sensible interpretations of what the lengthcould be).



Huon

_______________________________________________
Rust-dev mailing list
Rust-dev@mozilla.org
https://mail.mozilla.org/listinfo/rust-dev

Re: [rust-dev] How to find Unicode string length in rustlang

Reply via email to