Re: [Pvfs2-developers] server crash on startup with millions of files

Sam Lang Thu, 01 Mar 2007 06:22:17 -0800


On Feb 28, 2007, at 6:54 AM, Phil Carns wrote:

I know that you guys still have some ongoing discussion about the long
range design for tracking handles, but I have another item about the
current implementation that might be of interest.

Most of the remaining startup performance problem (after Sam's
optimization patches) appears to be a result of how the db is ordered.
If I modify the attr db's comparison function so that it has a "<"
rather than ">", then all of the preads during startup go in order
through the db rather than backwards. This takes the startup timeon acold db down to just 34 seconds. Previously it was 2 minutes 22seconds.
It still could be faster, but that seems to be the biggest part of the
time. I imagine the rest of it is just the access size (4 KB at atime) that might be tunable through some berkeley db settings.
The downside of making that particular change to the comparisonmethod is that it breaks storage space compatibility.
I wonder if it might be possible to accomplish the same thing in the
current db format by modifying iterate_handles() to just run thecursor
backwards (using DB_PREV instead of DB_NEXT)?  That wouldn't hurt
storage space compability (if it works), but I don't know if itmakes any difference to callers of that function what order thehandles come out in.

It doesn't matter to the caller. You'll also need to set the cursorto the last position in the db with DB_LAST. Does DB_PREV work withDB_MULTIPLE though? Its not clear from the above, does theimprovement to 34 seconds occur with MULTIPLE or without?

I mentioned previously that the dspace db gets opened with the RECNUMflag. I don't think that's necessary, and removing it willinvariably improve performance, but we need a way to return theposition for iterate_handles. The easiest thing to do is turnPVFS_ds_position into a uint64_t (currently its only uint32_t). Thatbreaks interfaces and protocols though.


-sam

-Phil


Phil Carns wrote:
Phil Carns wrote:
Yeah that is odd. Setting the cursor for each call toiterate_handles may be the reason for it starting over. Do youknow how many times it starts over? The number of timesiterate_handles is called will be (# of files / 4096).
It only goes through the file twice if I am looking at the logcorrectly. Also, I just realized that on both passes (the onejumping backwards 40KB at a time and the one jumping backwards4KB at a time) it is only reading 4KB per pread. I don't knowwhat it is doing from a db point of view, but from an accesspoint of view it looks like it goes backwards with a stridedpattern and then goes backwards reading the entire thing. Thereare some other reads scattered here and there, but those twocycles represent the overwhelming majority of the total preads inthe strace file. By spot checking I don't really see anysignificant divergence from the patterns.
It also just occurred to me that maybe I should repeat the straceand try to capture it with timestamps; I'm not really sure ifboth of these pread cycles are actually during the scan or not.
I just double checked- both of those big pread cycles arehappening after this message is logged:[D 13:06:53.916769] dbpf collection 752900094 - Setting collectionhandle ranges to 4-536870914,4294967292-4831838202... but before the next message. So they do appear to both be aresult of the handle scanning on startup.
-Phil


_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers

Re: [Pvfs2-developers] server crash on startup with millions of files

Reply via email to