On 19Oct2018 23:07, Alan Gauld <alan.ga...@yahoo.co.uk> wrote:
On 19/10/18 17:12, Pat Martin wrote:
TLDR; How do you figure out if code is inefficient (if it isn't necessarily obvious) and how do you find a more efficient solution?

Others have addressed most of the issues but I'd just add the caveat
that you can spend forever trying to make your code "more efficient".
Usually, in the real world, you don't need to.

So how do you figure out if your code is inefficient?
Run it and if it doesn't work fast enough for your purpose then
try to speed it up. If it does then move onto the next project.

how do I go about finding a more efficient solution?

Google, a good reference book and experience.

Particularly, get some knowledge of the cost of various algorithms and data structures. Some things are outright inefficient, but many others involve trade-offs: fast in time but costly in space (and vice versa), or efficient for some kinds of data but known to be bad for others.

Have a quick read of this:

 https://en.wikipedia.org/wiki/Big_O_notation

which talks about a common notation for costs. Skip the formal section with the math and go to the Orders of Common Functions section.

 https://en.wikipedia.org/wiki/Big_O_notation#Orders_of_common_functions

You'll find people talk about big O notation a lot with algorithms and their efficiency. For example, a dict _lookup_ is O(1): constant time per lookup. But that doesn't mean always use a dict, because there's a cost to creating one in the first place. What you use it for matters.
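To make that concrete, here's a small sketch using the standard library's timeit module. It compares membership testing in a list (O(n) per lookup) against a set, which like a dict offers O(1) lookups, while noting the one-off construction cost the paragraph above mentions. The sizes and names here are arbitrary, just something to measure.

```python
import timeit

# A list membership test scans the list: O(n) per lookup.
# A set (or dict) lookup is hash-based: O(1) per lookup,
# but building the set in the first place is a one-off O(n) cost.
data = list(range(10_000))
as_set = set(data)  # the construction cost you pay up front

# Time 1000 lookups of a value near the end of the list.
list_time = timeit.timeit(lambda: 9_999 in data, number=1_000)
set_time = timeit.timeit(lambda: 9_999 in as_set, number=1_000)

print(f"list: {list_time:.5f}s  set: {set_time:.5f}s")
# Normally list_time is far larger -- but if you only ever do one
# lookup, building the set may not pay for itself.
```

So "use a dict/set" is the right call when you do many lookups; for a single lookup the construction cost can dominate, which is the "what you use it for matters" point above.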

But always, always, measure because there has been more
time wasted on unnecessary optimisations than on any other
feature of programming.

I just want to add a little nuance to all of this, because to me it reads a little like (a) if it works and you get things done in a timely manner it is efficient and (b) if it isn't fast enough you can always make it faster. Alan hasn't actually said that, but one could get that impression.

To my mind:

Alan's point about "fast enough" is perfectly valid: in the real world, if your problem has been solved then further work on efficiency might cost more to implement than it saves.

His other point about measurement is also very important. It is possible to guess incorrectly about where performance problems lie, particularly with higher level languages, because their internals have costs not apparent to the eye (because you can't see the internals). So from an engineering point of view, it is important to measure where a problematic programme spends its time and devote effort to those parts consuming the most time.

The standard example is the tight loop:

 for a in sequence1:
     for b in sequence2:
         do_thing_A()   # runs len(sequence1) * len(sequence2) times
 do_thing_B()           # runs once, after both loops

The cost of "do thing A" is more important than the cost of "do thing B" because likely A runs many many times more often than B. So even if B has some known inefficiency you can see, which will take some work to improve, effort spent on A will probably be more useful (if that's feasible).
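The disparity is easy to demonstrate by counting calls. A minimal runnable sketch (the sequences and counters are made up purely for illustration):

```python
# Count how often each "thing" runs in the nested-loop shape above.
calls_a = 0
calls_b = 0
sequence1 = range(100)
sequence2 = range(100)

for a in sequence1:
    for b in sequence2:
        calls_a += 1   # "do thing A": inner loop body
calls_b += 1           # "do thing B": after both loops

print(calls_a, calls_b)  # 10000 1
```

With just 100-element sequences, A runs ten thousand times for B's once, so a tiny saving in A can outweigh a large saving in B.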

With a complex programme this isn't always obvious; in the example above the two pieces of code happen to sit side by side.

The point about measurement is that profiling tools are useful here: when you _don't_ have an obvious primary target to improve but do need to improve efficiency, a profiling tool can measure where a programme spends its time in aggregate. That can point the way to the code most worth inspecting.

It at least gets you objective information about where your programme spends its time. It is limited by the data you give your programme: toy example input data are not as good as real world data.
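For a concrete starting point, Python's standard library ships a profiler, cProfile. Here's a hedged sketch of using it; the two functions are invented purely so there is something lopsided to measure:

```python
import cProfile
import io
import pstats

# Hypothetical workload: one deliberately expensive function, one cheap one.
def slow_part():
    return sum(i * i for i in range(200_000))

def fast_part():
    return sum(range(100))

def main():
    for _ in range(10):
        slow_part()
    fast_part()

profiler = cProfile.Profile()
profiler.enable()
main()
profiler.disable()

# Report the functions with the largest cumulative time.
out = io.StringIO()
pstats.Stats(profiler, stream=out).sort_stats("cumulative").print_stats(5)
print(out.getvalue())  # slow_part should dominate the report
```

The report ranks functions by where the time actually went, which is exactly the objective information described above, so you spend your optimisation effort on slow_part rather than guessing.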

Finally, some things are as efficient as they get. You _can't_ always make things even more efficient.

Cheers,
Cameron Simpson <c...@cskk.id.au>
_______________________________________________
Tutor maillist  -  Tutor@python.org
To unsubscribe or change subscription options:
https://mail.python.org/mailman/listinfo/tutor