Richard D. Moores wrote:
On Tue, Nov 9, 2010 at 12:54, Steven D'Aprano <st...@pearwood.info> wrote:
Richard D. Moores wrote:

See <http://tutoree7.pastebin.com/R82876Eg> for a speed test with n =
100,000 and 100,000 loops.
As a general rule, you shouldn't try to roll your own speed tests. There are
various subtleties that can throw your results right out. Timing small code
snippets on modern multi-tasking computers is fraught with difficulties.

Yes, but I thought that if I ran the tests the way I did, several
times and then reversing the order of the functions, and seeing that
the results were very similar, I could be pretty sure of my findings.
Could you point out where taking even that amount of care could lead
me astray?

Well, the biggest problem is that you're re-inventing the wheel. But in truth, what you did was not terrible, particularly if you're using Python 3 where range() is quite lightweight rather than a heavy operation that creates a big list.

There are various traps when timing code:

* There are commonly two functions you use for getting the time: time.time() and time.clock(). On Windows, the best one to use is clock(), but on other operating systems, you should probably use time(). The timeit module automatically chooses the right one for you.
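In recent Python 3 releases the platform-appropriate clock is exposed as timeit.default_timer (an alias for time.perf_counter), and the timeit functions use it for you. A minimal sketch; the statement being timed is just an arbitrary example:

```python
import timeit

# timeit picks the best available clock for the platform itself;
# it is exposed as timeit.default_timer if you need it directly.
print(timeit.default_timer)

# Time an arbitrary snippet 10,000 times; the result is total seconds.
elapsed = timeit.timeit("sum(range(100))", number=10_000)
print(elapsed)
```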

* Comparing apples and oranges. There's not a lot that timeit can do there, since it requires you *understand* what the code is that you are timing, but it helps by allowing you to split initialisation away from the code you are timing.
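For instance, timeit's setup argument keeps initialisation out of the measurement, so you time only the operation you care about. Sorting a random list here is just an illustrative workload:

```python
import timeit

# The setup string runs untimed: building the data is initialisation.
# Only the stmt ("sorted(data)") is actually measured.
t = timeit.timeit(
    stmt="sorted(data)",
    setup="import random; data = [random.random() for _ in range(1000)]",
    number=1_000,
)
print(t)
```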

* Timing too much. One trap is to do this:

import time

t = time.time()
for _ in range(1000000):
    do_something()
t = time.time() - t

This is especially bad in Python 2.x!

The problem is that you are including the time it takes to create a list of one million integers in your timing code. That's like starting the stop watch at a race while the athletes are still in the changing rooms, getting dressed, and yet you'd be amazed how often people do it! If the time taken by each call to do_something() is very large, then the extra time will be negligible, but if do_something is a fast snippet, it might take you nearly as long to build the list as it does to run the code you're trying to time!

timeit ensures that as little overhead as possible is included in the execution being timed.
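One way to see this: hand timeit the iteration count and let it run the loop internally, rather than timing your own for-loop. do_something here is a stand-in for whatever snippet you want to measure:

```python
import timeit

def do_something():
    # Stand-in for the snippet being measured.
    return 2 ** 10

# timeit runs the loop inside a precompiled template, so the loop
# bookkeeping (and, in Python 2, the cost of range() building a list)
# stays out of the reported time as much as possible.
total = timeit.timeit(do_something, number=100_000)
per_call = total / 100_000
print(per_call)
```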

* Forgetting that computers are multi-tasking machines these days. At every instant, the operating system is giving tiny time-slices to your program and potentially dozens of others. If you're busy loading a dozen web pages, playing a DVD, compiling a large program and trying to time a code snippet, it will be obvious that everything is going to run slowly. But if you time something *once*, what's not obvious is that just by chance you might get a number that is much slower than normal, because the OS *just happened* to call your anti-virus during the test. timeit fights against that by running your code in a loop and working out the average time per loop, *and* then repeating the whole loop multiple times (three by default) and picking the lowest figure: the higher figures are likely inflated by external factors, not by your code.
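That behaviour is available directly as timeit.repeat(); taking the min of the repeats is the usual way to discard runs that were slowed down by other processes:

```python
import timeit

# Three timing runs of 10,000 loops each (matching timeit's classic
# default of three repeats); each entry is total seconds for one run.
times = timeit.repeat("min(range(100))", number=10_000, repeat=3)

# The minimum is the best estimate: slower runs were held up by the
# OS, other processes, etc., not by the code itself.
best = min(times)
print(times, best)
```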

* Forgetting CPU cache effects. Modern CPUs do all sorts of amazing wizardry to make code run fast (well, sometimes slow, but hopefully more often fast). The first time you run a piece of code, chances are it will be slower than normal, because the hardware caches will have other stuff in them. But the second time, the code and data will already be lined up in the cache and pipelines, ready to go. This is why, if you time something a dozen times, the *first* run could be two or three times slower than the rest.


I've probably forgotten some... but anyway, the important thing is that there are lots of subtleties to timing code, and Tim Peters has already thought of them so you don't have to :)



--
Steven
_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
http://mail.python.org/mailman/listinfo/tutor
