Re: unicode() vs. s.decode()

Mark Lawrence Fri, 07 Aug 2009 00:06:54 -0700

Michael Ströder wrote:

Thorsten Kampe wrote:

* Michael Ströder (Thu, 06 Aug 2009 18:26:09 +0200)

timeit.Timer("unicode('äöüÄÖÜß','utf-8')").timeit(10000000)

17.23644495010376

timeit.Timer("'äöüÄÖÜß'.decode('utf8')").timeit(10000000)

72.087096929550171


That is significant! So the winner is:

unicode('äöüÄÖÜß','utf-8')

Unless you are planning to write a loop that decodes "äöüÄÖÜß" one million times, these benchmarks are meaningless.


Well, I can tell you I would not have posted this here and checked it if it
were meaningless to me. You don't have to read and answer this thread if
it's meaningless to you.

Ciao, Michael.

I believe that the comment "these benchmarks are meaningless" refers to the length of the strings being used in the tests. Surely something involving thousands or millions of characters is more meaningful? Or to go the other way, you are unlikely to write

for c in 'äöüÄÖÜß':
    u = unicode(c, 'utf-8')
    ...
Yes?
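
For what it's worth, here is a rough sketch of how the comparison could be rerun on longer inputs. The helper name compare() and the sizes chosen are made up purely for illustration, and the script falls back to str on Python 3, where unicode() no longer exists. Both spellings are wrapped in a lambda, so the numbers include the same small call overhead on each side and should be read as relative, not absolute.

```python
# -*- coding: utf-8 -*-
import timeit

try:
    text_type = unicode  # Python 2
except NameError:
    text_type = str      # Python 3: str(bytes, encoding) plays the same role

SAMPLE = u'äöüÄÖÜß'.encode('utf-8')  # 7 characters, 14 bytes in UTF-8


def compare(copies, number):
    """Return (constructor_seconds, method_seconds) for SAMPLE * copies.

    Hypothetical helper for illustration; not taken from the thread.
    """
    data = SAMPLE * copies
    # Both spellings must produce the same text, or the timing is moot.
    assert text_type(data, 'utf-8') == data.decode('utf-8')
    t_ctor = timeit.timeit(lambda: text_type(data, 'utf-8'), number=number)
    t_method = timeit.timeit(lambda: data.decode('utf-8'), number=number)
    return t_ctor, t_method


if __name__ == '__main__':
    # Arbitrary sizes: the original 7 characters, then 7,000 and 700,000.
    for copies, number in [(1, 100000), (1000, 1000), (100000, 10)]:
        t_ctor, t_method = compare(copies, number)
        print('%7d chars: constructor %.4fs, .decode() %.4fs'
              % (7 * copies, t_ctor, t_method))
```

If the gap between the two columns shrinks as the input grows, that would suggest the difference measured on the short string is mostly fixed per-call overhead and not decoding work.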

--
Kindest regards.

Mark Lawrence.

--
http://mail.python.org/mailman/listinfo/python-list
