[Drizzle-discuss] Regression, Fixes

Brian Aker Fri, 08 May 2009 17:13:22 -0700

Hi!

So how is the regression issue coming along?


Glad you asked!

Earlier in the week Jay found that the bitset patch gave us a significant regression. Since then we have been able to find issues with it, but have reached no conclusion on how to solve it via a new design/container/etc.


So what are we doing?

Earlier this week I refactored the interface to the Table objects to create, well, an interface!

It is not perfect but it encapsulates a large chunk of the code. Monty today put the original MY_BITMAP back in, but did so behind this interface. Jay has run the numbers and declared that the regression can no longer be found.


What will happen from here?

We are going to work on the interface some more. Basically make it so that we can change the back end to the bitmap without changing a lot of code.
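To make the idea concrete, here is a rough sketch of what "change the back end without changing a lot of code" could look like. This is not the actual Drizzle Table interface: ColumnSet, VectorBoolBackend, PackedWordBackend and every method name below are made up for illustration. The only point is that callers talk to one small interface while the storage behind it can be swapped.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical back end 1: one flag per column stored in std::vector<bool>.
class VectorBoolBackend {
public:
  explicit VectorBoolBackend(std::size_t n) : bits_(n, false) {}
  void set(std::size_t i) { assert(i < bits_.size()); bits_[i] = true; }
  void reset(std::size_t i) { assert(i < bits_.size()); bits_[i] = false; }
  bool test(std::size_t i) const { assert(i < bits_.size()); return bits_[i]; }
  std::size_t size() const { return bits_.size(); }

private:
  std::vector<bool> bits_;
};

// Hypothetical back end 2: bits packed by hand into 64-bit words.
class PackedWordBackend {
public:
  explicit PackedWordBackend(std::size_t n)
      : size_(n), words_((n + 63) / 64, 0) {}
  void set(std::size_t i) {
    assert(i < size_);
    words_[i / 64] |= (std::uint64_t{1} << (i % 64));
  }
  void reset(std::size_t i) {
    assert(i < size_);
    words_[i / 64] &= ~(std::uint64_t{1} << (i % 64));
  }
  bool test(std::size_t i) const {
    assert(i < size_);
    return ((words_[i / 64] >> (i % 64)) & 1u) != 0;
  }
  std::size_t size() const { return size_; }

private:
  std::size_t size_;
  std::vector<std::uint64_t> words_;
};

// The interface callers see. Swapping the back end means changing the
// template argument, not the calling code.
template <typename Backend>
class ColumnSet {
public:
  explicit ColumnSet(std::size_t column_count) : backend_(column_count) {}
  void markColumn(std::size_t column) { backend_.set(column); }
  void unmarkColumn(std::size_t column) { backend_.reset(column); }
  bool isColumnMarked(std::size_t column) const { return backend_.test(column); }

  // Number of columns currently marked.
  std::size_t countMarked() const {
    std::size_t marked = 0;
    for (std::size_t i = 0; i < backend_.size(); ++i) {
      if (backend_.test(i)) ++marked;
    }
    return marked;
  }

private:
  Backend backend_;
};
```

With something like this, each candidate back end can be benchmarked by changing a single type alias, which is the kind of room the encapsulation is meant to give us.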

Right now we have a couple of ideas on how to solve the problem (I favor a bool in Field object, Monty wants to look at vector<bool>, Mats has a bitvector). We will test each of these and find a solution that gives us, in the end, better code with no regression.

To solve this issue we had to look at a number of things: our methods, the outcome of a tree rollback, and whether performance in this case mattered. The problem wasn't simple, and all solutions had drawbacks. The main thing we were not going to do was push some code which caused regression that we would then "find a solution for in the future". That was not acceptable. Rolling back the tree? We could do this, and I favored it if we had no other solution, but we determined that we could patch up the tree without causing this sort of disruption.


And?

Encapsulating the interface gives us room to find a new solution.

So what came out of all of this?

We are moving to a staging tree.

As of now we have an lp:drizzle/staging tree. This tree should not be pulled from. We will send code here before it is sent to trunk. If code fails the performance regression testing then we will pull it back.

So what does this mean? No more pushes from main that bypass staging. We have pointed the automatic regression testing at this tree. I am going to be suggesting that we point Hudson and Buildbot at it as well. If a tree can pass here then it will be moved to the main tree.


And what are the thoughts on regression for the future?

Jay asked me today "what do we mean by regression?". To me we can calculate regression pretty easily. We look at the standard deviation of all previous runs and apply it to the current tree. If we find that we are within norms then the new code is fine (and I suspect we will refine this formula in the future). This was my suggestion.
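As a rough sketch of that suggestion (not the formula the regression tests actually use): take the results of previous runs, compute their mean and sample standard deviation, and call the current run "within norms" if it lands within some number of standard deviations of the mean. The names summarize and withinNorms, the two-sided check, and the default tolerance of 2 standard deviations are all assumptions made for this example.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

struct RunStats {
  double mean;
  double stddev;  // sample standard deviation (divides by n - 1)
};

// Summarize earlier benchmark results, e.g. transactions per second.
// Needs at least two runs, otherwise a standard deviation is meaningless.
RunStats summarize(const std::vector<double>& previous_runs) {
  assert(previous_runs.size() >= 2);
  const double n = static_cast<double>(previous_runs.size());

  double sum = 0.0;
  for (double value : previous_runs) sum += value;
  const double mean = sum / n;

  double squared_deviation = 0.0;
  for (double value : previous_runs) {
    squared_deviation += (value - mean) * (value - mean);
  }
  return RunStats{mean, std::sqrt(squared_deviation / (n - 1.0))};
}

// True if the current result is no further from the historical mean than
// tolerance_in_stddevs standard deviations. The tolerance is an assumed
// knob, not something the post specifies.
bool withinNorms(const std::vector<double>& previous_runs,
                 double current_run,
                 double tolerance_in_stddevs = 2.0) {
  const RunStats stats = summarize(previous_runs);
  return std::fabs(current_run - stats.mean) <=
         tolerance_in_stddevs * stats.stddev;
}
```

For example, with previous runs of 100, 102, 98, 101 and 99 the mean is 100 and the sample standard deviation is about 1.58, so a new run of 101 is within norms while a run of 90 is not. A one-sided check that only flags slower runs would be an equally reasonable reading of "within norms".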

But what if regression happens and there is an argument for letting it happen?

Then we talk about it on the mailing list. Right now most of us have seen the numbers showing that 5.4 is faster than Drizzle at 16 concurrent connections. We have been looking into this, but we may find that some of the decisions that let us scale out to more connections/processors contributed to this. That is ok. Our target is not the 16 connections sites, it is the sites that need mass numbers of connections/threads/processors. If we find a change that hurts us at 1-N and N is a small number that may be ok.

What will we do when we are confronted by this? We talk about it on IRC and we will send the information to the mailing list. More eyeballs is a good thing.


Thanks to everyone who has been working on this!

    -Brian


