Re: [Pvfs2-developers] duplicate entries in directory listing

Phil Carns Mon, 09 Oct 2006 13:18:55 -0700

Phil Carns wrote:

I started thinking about some more possible ideas, but I realizedafter looking closer at the code that I don't actually see whyduplicates would occur in the first place with the algorithm that isbeing used :) I apologize if this has been discussed a few timesalready, but could we walk through it one more time?
I know that the request protocol uses a token (integer based) to keeptrack of position. However, the pcache converts this into aparticular key based on where the last iteration left off. This keycontains the handle as well as the alphanumeric name of the entry.
Trove then does a c_get on that key with the DB_SET flag, which shouldput the cursor at the proper position. If the entry has been deleted(which is not happening in my case- I am only creating files), then itretries the c_get with the DB_SET_RANGE flag which should set thecursor at the next position. "next" in this case is defined by thecomparison function, PINT_trove_dbpf_keyval_compare().
The keyval_comare() function sorts the keys based on handle value,then key length, then stncmp of the key name.
This means that essentially we are indexing off of the name of theentry rather than a position in the database.
So how could inserting a new entry between readdir requests cause aduplicate? The old entry that is stored in the pcache should still bevalid. If the newly inserted entry comes after it (according to thekeyval_comare() sort order), then we should see it as we continueiterating. If the new entry comes before it, then it should not showup (we don't back up in the directory listing). It doesn't seem likethere should be any combination that causes it to show up twice.
Is c_get() not traversing the db in the order defined by thekeyval_comare() function?
The only other danger that I see is that if the pcache_lookup() fails,the code falls back to stepping linearly through the db to the tokenposition which I could imagine might have ordering implications.However, I am only talking to the server from a single client, so Idon't see why it would ever miss the pcache lookup.
I just want to confirm that there is actually an algorithm problemhere rather than just a bug in the code somewhere.
Oh, or is the problem in how the end of the directory is detected? Doesthe client do something like issuing a readdir until it gets a responsewith zero entries? I haven't looked at how this works yet, but Iimagine that could throw a wrench into things if the directory getsadditional entries between when the server first indicates that it hasreached the end and when the client gives up on asking for more.

I just tried repeating the test a few times, replacing the "ls" in mytest script with either "pvfs2-ls" or "pvfs2-ls -al". I cannot triggerthe problem when using pvfs2-ls.


If I switch back to "ls" or "/bin/ls" the problem shows up reliably.

Is there anything fundamentally different between how pvfs2-ls works andhow the vfs readdir path works, or is pvfs2-ls somehow getting luckierwith the timing?


-Phil
_______________________________________________
Pvfs2-developers mailing list
Pvfs2-developers@beowulf-underground.org
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers

Re: [Pvfs2-developers] duplicate entries in directory listing

Reply via email to