On Sunday, 9 March 2014 at 21:38:06 UTC, Nick Sabalausky wrote:
> On 3/9/2014 7:47 AM, w0rp wrote:
>> My knowledge of Unicode pretty much just comes from having to deal with foreign language customers and discovering the problems with the code unit abstraction most languages seem to use. (Java and Python suffer from similar issues, but they don't really have algorithms in the way that we do.)
>
> Python 2 or 3 (out of curiosity)? If you're including Python3, then that somewhat surprises me as I thought greatly improved Unicode was one of the biggest reasons for the jump from 2 to 3. (Although it isn't *completely* surprising since, as we all know far too well here, fully correct Unicode is *not* easy.)
Late reply here. Python 3 is a lot better in terms of Unicode support than 2. The situation in Python 2 was this:
1. The default string type is 'str', an immutable array of bytes.
2. A 'str' could hold text in any of many encodings, including UTF-16, etc.
3. There is a separate 'unicode' type for when you want a true Unicode string.
4. Python implicitly converts between the two, often in wrong ways, causing exceptions to appear where you didn't expect them.
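To make that concrete, here's roughly the kind of thing that used to bite people (Python 2; the variable names are just for illustration):

    # Python 2: mixing 'str' and 'unicode' makes Python implicitly
    # decode the byte string as ASCII, which blows up the moment it
    # contains non-ASCII bytes.
    name = '\xc3\xa9'            # UTF-8 bytes for U+00E9, type 'str'
    greeting = u'Hello ' + name  # implicitly does name.decode('ascii')
    # UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 0

The nasty part was that this only failed at runtime, and only for non-ASCII input, so it slipped through testing all the time.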
In 3, this changed to...
1. The default string type is still named 'str', only now it's like the 'unicode' of olde.
2. 'bytes' is a new immutable array-of-bytes type, like the Python 2 'str'.
3. Conversion between 'str' and 'bytes' is always explicit.
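So the same mistake now fails loudly and predictably, regardless of the data. A quick sketch (Python 3):

    # Python 3: 'str' and 'bytes' never mix implicitly.
    data = 'hello'.encode('utf-8')  # str -> bytes, explicit
    text = data.decode('utf-8')     # bytes -> str, explicit
    broken = 'hello' + data         # TypeError, even for pure ASCII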
However, Python 3 works at the code point level (on narrow builds before 3.3 it was actually the UTF-16 code unit level), and you don't see very many algorithms which take, say, combining characters into account. So Python suffers from similar issues.
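For example (Python 3; anything that counts code points behaves this way):

    import unicodedata

    s1 = 'e\u0301'  # 'e' + U+0301 COMBINING ACUTE ACCENT (decomposed)
    s2 = '\u00e9'   # U+00E9, the precomposed form

    print(len(s1), len(s2))  # 2 1 -- one visible character either way
    print(s1 == s2)          # False
    print(unicodedata.normalize('NFC', s1) == s2)  # True

You have to reach for unicodedata.normalize() yourself; none of the built-in string operations do it for you.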