Hi,

There are patches submitted for review to deny access to objects which have been marked as bad by the scrubber (i.e. objects whose data might have been corrupted in the backend):

http://review.gluster.org/#/c/11126/10
http://review.gluster.org/#/c/11389/4

The above 2 patch sets solve the problem of denying access to bad objects (they have passed regression and received a +1 from Venky). But in our testing we found a race window (which can be larger depending upon the scrub frequency) in which the self-heal daemon may heal the contents of the bad file before the scrubber can mark it as bad.

I am not sure whether this issue would actually be hit when the data truly gets corrupted in the backend. But in our testing, to simulate backend corruption, we modify the contents of the file directly in the backend. In this case, before the scrubber can mark the object as bad, the self-heal daemon kicks in and heals the contents of the bad file onto the good copy. Alternatively, if a client accesses the file before the scrubber marks it as bad, AFR finds a metadata mismatch (since we modified the contents of the file in the backend) and performs data and metadata self-heal, copying the contents of the bad copy onto the good copy. From then on, clients accessing that object always get bad data.
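
To make the race window concrete, here is a rough sketch of the kind of check involved (this is only an illustration, not our actual test script; the brick paths and the trusted.bit-rot.bad-file xattr name are assumptions for a replica 2 volume with bitrot enabled):

#!/usr/bin/env python
# Illustrative only: corrupt one replica directly on the brick and watch
# whether AFR self-heal propagates the bad data before the scrubber marks
# the object as bad. Paths and xattr name are assumptions, not the real test.
import subprocess
import time

BAD_COPY  = "/bricks/brick1/testfile"   # replica corrupted in the backend (assumed path)
GOOD_COPY = "/bricks/brick2/testfile"   # the other replica (assumed path)

def checksum(path):
    # md5sum of the backend copy of the file
    return subprocess.check_output(["md5sum", path]).split()[0]

def marked_bad(path):
    # Has the scrubber set the bad-object xattr on this copy?
    try:
        subprocess.check_output(
            ["getfattr", "-n", "trusted.bit-rot.bad-file",
             "--absolute-names", path],
            stderr=subprocess.STDOUT)
        return True
    except subprocess.CalledProcessError:
        return False

# Simulate bit rot: overwrite part of one replica directly in the backend.
with open(BAD_COPY, "r+b") as f:
    f.write(b"garbage")

good_sum = checksum(GOOD_COPY)
while not marked_bad(BAD_COPY):
    if checksum(GOOD_COPY) != good_sum:
        print("self-heal copied the bad data onto the good copy before the scrubber ran")
        break
    time.sleep(5)
else:
    print("scrubber marked the object as bad first")

Depending on which of the two wins, the corrupted data either gets denied (once the patches above are in) or silently replaces the good copy.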

Pranith, do you have any solution for this? Venky and I are trying to come up with one.

But does this issue block the above patches in any way? (Those 2 patches are still needed to deny access to objects once they are marked as bad by the scrubber.)


Regards,
Raghavendra Bhat
