I've started the script running.

Below is the short version, plus a trilogy of gory details. I wanted to write up the details so that if this ever happens again to us (or to someone else), there is a record of what we did to fix it.


The Short Version:
------------------
The Slowness(tm) was caused by the recent shifting of data in the database to resolve the partition table problems seen earlier this month.

The bad news is that it will take about 14 hours to finish.

The good news is that I confirmed that this will fix the performance problem we are seeing. In a small trial run, this technique reduced the '24 hour' query execution time from ~40 sec back down to ~8 sec.

This may slow down client submits this evening, but it should not prevent them from submitting. The 'DELETE' operations do not require an exclusive lock, so the 'INSERT' operations should proceed fine concurrently. The 'INSERT' operations will need to be blocked while a 'VACUUM FULL' operation is in progress, since it *does* require an exclusive lock. The 'INSERT' operations will proceed normally once this lock is released, so clients that submit during these windows (about 20 min or so) will see only a temporary slowdown.
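
If you want to watch the lock traffic while this runs, a quick peek at the catalog will show who holds what on test_run. A minimal sketch, assuming a psql session with suitable privileges:

    -- Show the locks currently held or awaited on test_run.
    SELECT locktype, mode, granted, pid
      FROM pg_locks
     WHERE relation = 'test_run'::regclass;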



The Details: Part 1: What I did earlier this week:
(more than you wanted to know, for posterity purposes)
--------------------------------------------------
The original problem was that the master partition tables accidentally started storing data because I forgot to load the 2008 partition tables into the database before the first of the year. :( So we loaded the partition tables, but we still needed to move the misplaced data.

To move the misplaced data we have to duplicate each row (so it is stored properly this time), but we also need to take care in assigning row IDs to the duplicate rows. We cannot give the dup'ed rows the same IDs, or we will be unable to differentiate the original rows from the dup'ed rows. So I created a dummy table for each of mpi_install/test_build/test_run to translate between the original row ID and the dup'ed row ID. I used nextval() on the sequence to populate the values for the dup'ed rows in the dummy table.
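
For posterity, here is roughly what that looked like for mpi_install. This is a sketch, and the sequence name 'mpi_install_id_seq' is an assumption about our schema:

    -- Translation table: original row ID -> freshly allocated dup'ed ID.
    CREATE TABLE mpi_install_dummy (
        orig_id INTEGER NOT NULL,
        dup_id  INTEGER NOT NULL
    );

    -- ONLY restricts the SELECT to rows physically stored in the master
    -- table, i.e., exactly the misplaced rows.
    INSERT INTO mpi_install_dummy (orig_id, dup_id)
        SELECT mpi_install_id, nextval('mpi_install_id_seq')
          FROM ONLY mpi_install;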

Now that I had the translation, I joined each dummy table with its corresponding master table (e.g., "mpi_install join mpi_install_dummy on mpi_install.mpi_install_id = mpi_install_dummy.orig_id"), and instead of selecting the original ID from the dummy table I selected the new dup'ed ID. I inserted this selection back into the mpi_install table. (A cool little trick that PostgreSQL lets you get away with sometimes.)
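
In rough form (a sketch; 'col_a'/'col_b' are stand-ins for the real non-ID columns):

    -- Re-insert each misplaced row under its new ID. With the 2008
    -- partition tables now loaded, the rows are routed to the proper
    -- partition this time.
    INSERT INTO mpi_install
        SELECT d.dup_id,          -- the new ID instead of the original
               m.col_a, m.col_b   -- ...the remaining columns...
          FROM ONLY mpi_install m
          JOIN mpi_install_dummy d
            ON m.mpi_install_id = d.orig_id;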

Once I had duplicated all of the affected rows, I updated all references to the original IDs in the test_build/test_run tables to point at the duplicated IDs. This removed all internal references to the original IDs and replaced them with the duplicates, so we retain the integrity of the data.
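
The reference updates were along these lines (a sketch; the foreign key column name is an assumption):

    -- Repoint test_build references from the original mpi_install IDs to
    -- the dup'ed IDs. test_run references got the same treatment.
    UPDATE test_build tb
       SET mpi_install_id = d.dup_id
      FROM mpi_install_dummy d
     WHERE tb.mpi_install_id = d.orig_id;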

Once I had verified that no tables reference the original rows, I deleted those rows from the mpi_install/test_build/test_run tables.
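
That is, something like the following (again a sketch):

    -- Nothing references the originals any more, so drop the misplaced
    -- rows from the master table only (ONLY spares the partitions).
    DELETE FROM ONLY mpi_install
     WHERE mpi_install_id IN (SELECT orig_id FROM mpi_install_dummy);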



The Details: Part 2: What I forgot to do:
-----------------------------------------
When rows are deleted in PostgreSQL, the disk space they used continues to be reserved for the table, and it is not reclaimed unless you run 'VACUUM FULL' on that table. PostgreSQL does this for many good reasons, which are described in its documentation. However, in the case of the master partition tables we want them to release all of their disk space, since we should never be storing data in those particular tables.
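
Concretely, the reclaim step is just the following. Note that with our inheritance-based partitioning it vacuums only the named table, not its children:

    -- Compacts the table and returns the freed space to the OS; holds an
    -- exclusive lock on the table for the duration.
    VACUUM FULL test_run;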

I did a 'VACUUM FULL' on the mpi_install and test_build tables originally, but not on the test_run table, since this operation requires an exclusive lock on the table and can take a long time to finish. Further, I had only completed about 1% of the deletions for test_run before stopping that operation, choosing to wait for the weekend since it will take a long time to complete.

Deleting only part of the test_run master table (which contained about 1.2 million rows) caused the queries on this table to slow down considerably. The query planner estimated the cost of the '24 hour' query at 322,924, and it completed in about 40 seconds. I ran 'VACUUM FULL test_run' (which only vacuums the master table) and then re-ran the query. This time the query planner estimated the cost at 151,430, and the query completed in about 8 seconds.
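
Those numbers come from EXPLAIN; the check looks something like this (an abbreviated sketch -- the real '24 hour' query is more involved, and the timestamp column name is an assumption):

    -- The cost figures quoted above are the planner's estimates from the
    -- top line of the EXPLAIN output.
    EXPLAIN
    SELECT count(*)
      FROM test_run
     WHERE start_timestamp > now() - interval '24 hours';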



The Details: Part 3: What I am doing now:
-----------------------------------------
Currently I am deleting the rest of the old rows from test_run. There are approx. 1.2 million rows, and this should complete in about 13 hours.

After every 100 K deletions I'm running a 'VACUUM FULL' on test_run. My hope is that vacuuming incrementally, instead of just once after all 1.2 M deletions, will make each 'VACUUM FULL' take less time, limiting the interruptions seen by the MTT clients submitting results this evening.
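
Each batch in the script is essentially the pair below, repeated (about 12 times) until the master table is empty. This is a sketch of the approach rather than the exact script; the ctid-based batching is one way to limit a DELETE to 100 K rows at a time:

    -- One batch: delete 100 K of the misplaced rows, then compact.
    DELETE FROM ONLY test_run
     WHERE ctid IN (SELECT ctid FROM ONLY test_run LIMIT 100000);
    VACUUM FULL test_run;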

I'll send email once the script is complete and things seem back to normal.

Cheers,
Josh

On Jan 30, 2008, at 4:12 PM, Jeff Squyres wrote:

I'd go ahead and do it now.

On Jan 30, 2008, at 4:04 PM, Josh Hursey wrote:

It seems the reporter has gotten slower :( Now it is working in the
range of 40-50 seconds for the 24 hour query, which is not
reasonable. This should be much lower.

Looking at the explain of the query I have some ideas on how to make
things better, but this will slow things down for a while as I do
this work (maybe a day or two, can't say for sure).

The question is should I wait until Friday COB to start this or should
I do it immediately?

Let me know,
Josh
_______________________________________________
mtt-devel mailing list
mtt-de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/mtt-devel


--
Jeff Squyres
Cisco Systems

