Re: [Drizzle-discuss] Performance Schema

Paul McCullagh Tue, 11 Aug 2009 06:53:56 -0700


On Aug 7, 2009, at 3:10 PM, Jay Pipes wrote:

To avoid locking, each thread needs a complete set of trackingvariables (counters) as part of its THD structure.
s/THD/Session


Oops, sorry, how very old school of me! ;)

Also, you must understand that there is no one-to-one thread-to-Session guarantee.
Because Sessions may be executed in a thread pool, there must be away of either:

Yes, true! I did not mention the need to "merge" statistics when asession is closed.

The current value of the statistics is derived from the global sum,plus the sum of all running sessions.

a) Merging Session-local stats into the global system variablesstructure upon Session destruction or rescheduling via a schedulingthread. Currently this operation does not acquire a lock around theglobal systems variables in the Session destructor:
Session::~Session()
{
...
 add_to_status(&global_status_var, &status_var);
...
}

void add_to_status(STATUS_VAR *to_var, STATUS_VAR *from_var)
{
 ulong *end= (ulong*) ((unsigned char*) to_var +
                       offsetof(STATUS_VAR, last_system_status_var) +
                        sizeof(ulong));
 ulong *to= (ulong*) to_var, *from= (ulong*) from_var;

 while (to != end)
   *(to++)+= *(from++);
}
I don't know if this critical section was deliberately leftunprotected by LOCK_status or not...still looking into this. Also,MontyT is completely redesigning the system variables system, so theabove "bookmarking" code will not likely look the same in a few weeks.

Either a lock or atomic op would be required here. In fact a spinlockwould probably be the best because the lock is only held for a shorttime.

Either way, you incur locking and instruction costs. These costshave been deemed too high by MySQL engineering for the hundreds(thousands?) of metrics that the MySQL performance schema monitors(or is able to monitor). This is likely because the frequency ofcertain events in the performance schema is quite high?

I agree that if you have 1000's of metrics that this method becomestoo expensive. But I think what is missing is a little thought aboutwhich metrics make sense, and which do not.

The profiling code pays the price for this. In order to get thecurrent state of all counters it goes through the list of THDs andaccumulates the THD related counters.But, this is OK, because this price is only paid when you areactually profiling.
Agreed in principle, yes.
This method not only works for things like "number of byteswritten", but can also be used to measure time. There is a littletrick involved here, but the result is that you can see, forexample, if the server is hanging in a fsync() call in realtime.Then we should create a kind of "drizzlestat" program which SELECTsthe current counter values, and displays the statistics in columns.
Before this is possible, an API into the performance data countersmust be written. I don't want programs willy-nilly accessinginternal kernel and storage engine data without going through aproper interface...we're trying to move away from that sort ofthing :)

I'm not sure what you mean year, but why not use an information schematable? It returns one row for each counter. The row has an ID (whichidentifies the counter) and a value.

So the performance counters are never written to a table, the currentvalue of each counter is just returned dynamically when a select isdone on the table.

This is much better then dumping loads of performance schema tableson a user and saying, the data is there if you need it.
Agreed.
I am also not a believer in gathering statistics on everything (forexample, every semaphore), and letting the user figure out what isimportant.
OK, sure, but what if you don't already know the cause of yourslowdown is a mutex or semaphore and want to find this out?


Yup, good question!

For me this is a matter of whether the tool is created for DBAs/Consultants or for the developers of the database.

I think such a tool should be useful to DBAs/Consultants and avaluable _support_ tool for the developers.

So in the case you mention, the main thing is that we notice that therelevant counter is missing from the statistics.

We would notice this when the transactions per second go down, butnone of the counters we have go up.

Then it is time to pull out other tools to look for the bottleneck(such as http://mituzas.lt/2009/02/15/poor-mans-contention-profiling).

The funny thing is: the goal will then be to remove this bottleneck,which means removing the semaphore (at least in the current form),which will mean removing the statistic.

So in a correctly optimized server you only have statistics onsemaphores that are _not_ bottlenecks. Which means you don't need thestatistics!

So my thinking is that, in the long run, we should only havestatistics that tend to come and go as problems depending on yourhardware setup etc.

A drizzlestat tool is then a help to developers in the sense that itquickly enables us to eliminate the mundane reasons for badperformance. And also to monitor the general performancecharacteristics at runtime.

As the developers we need to decide what are the performancecritical parameters, and just provide those statistics. Of course,statistics can be added later if we see we have missed something.But rather that then a whole bunch of irrelevant values that makefinding a problem like looking for a needle in a haystack.
Agreed, but see point above...
Marc Alff took an approach that causes almost no overhead if theperformance schema is not *compiled in*. There is an overhead ifthe performance schema is compiled in and the DBA is not careful tospecify only those things she is interested in.

One problem with "compiled in" statistics is that they are often notthere when you need them.

I'd love to find a perfect medium between Marc's approach (whichnicely NOOPs the performance schema code behind #define templateswhen it is not compiled in) and your discussion above of non-storageof all data pieces automatically.

Does Marc Alff's approach using the same method that Stewart proposed,i.e. "if(profiling_enabled)" when compiled in and NOPS where possibleto remove this overhead when not required?



--
Paul McCullagh
PrimeBase Technologies
www.primebase.org
www.blobstreaming.org
pbxt.blogspot.com




_______________________________________________
Mailing list: https://launchpad.net/~drizzle-discuss
Post to     : [email protected]
Unsubscribe : https://launchpad.net/~drizzle-discuss
More help   : https://help.launchpad.net/ListHelp

Re: [Drizzle-discuss] Performance Schema

Reply via email to