Re: [Gluster-devel] Rebalance data migration and corruption

Joe Julian Sun, 07 Feb 2016 22:51:14 -0800

Is this in current release versions?

On 02/07/2016 07:43 PM, Shyam wrote:

On 02/06/2016 06:36 PM, Raghavendra Gowdappa wrote:
----- Original Message -----
From: "Raghavendra Gowdappa" <[email protected]>
To: "Sakshi Bansal" <[email protected]>, "Susant Palai" <[email protected]>
Cc: "Gluster Devel" <[email protected]>, "Nithya Balachandran" <[email protected]>, "Shyamsundar Ranganathan" <[email protected]>
Sent: Friday, February 5, 2016 4:32:40 PM
Subject: Re: Rebalance data migration and corruption

+gluster-devel
Hi Sakshi/Susant,
- There is a data corruption issue in migration code. Rebalance process,
   1. Reads data from src
   2. Writes (say w1) it to dst
However, 1 and 2 are not atomic, so another write (say w2) to the same region can happen between 1 and 2. These two writes can then reach dst in the order (w2, w1), leaving stale data on dst. This issue is not fixed yet and can cause subtle data corruption. The fix is simple and involves the rebalance process acquiring a mandatory lock to make 1 and 2 atomic.
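To make the (w2, w1) ordering concrete, here is a minimal Python sketch. It is not GlusterFS code: plain bytearrays stand in for the src and dst bricks, and the function name is made up for illustration. It assumes, as the ordering above implies, that the racing client write reaches dst as well as src.

```python
# Illustrative model only: bytearrays stand in for the src and dst bricks.
def migrate_without_lock():
    src = bytearray(b"AAAA")
    dst = bytearray(4)

    # Step 1: rebalance reads the region from src.
    w1 = bytes(src)

    # A client write (w2) to the same region lands between steps 1 and 2.
    # Assumption: the client write is applied to both src and dst.
    w2 = b"BBBB"
    src[0:4] = w2
    dst[0:4] = w2

    # Step 2: rebalance writes its now-stale copy (w1) to dst, after w2.
    dst[0:4] = w1

    return bytes(src), bytes(dst)


src_data, dst_data = migrate_without_lock()
print(src_data, dst_data)  # src holds w2, dst holds the stale w1
```

Once migration finishes and dst becomes the only copy, the client's w2 is lost for that region.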
We can make use of the compound fop framework to make sure we don't suffer a significant performance hit. Following will be the sequence of operations done by the rebalance process:

1. issues a compound (mandatory lock, read) operation on src.
2. writes this data to dst.
3. issues unlock of lock acquired in 1.
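The three steps above can be sketched roughly as follows. This is a toy Python model, not the features/locks or compound fop API: `Brick`, `lock_and_read`, `migrate_region` and `demo` are hypothetical names, and a conflicting write is simply refused here (a real mandatory lock could block it instead).

```python
class Brick:
    """Toy brick: one region plus a single mandatory internal lock."""

    def __init__(self, data):
        self.data = bytearray(data)
        self.lock_owner = None

    def lock_and_read(self, owner):
        # Stand-in for the compound (mandatory lock, read) operation.
        if self.lock_owner is not None:
            raise BlockingIOError("region already locked")
        self.lock_owner = owner
        return bytes(self.data)

    def write(self, owner, buf):
        # While the lock is held, writes from anyone else are refused.
        if self.lock_owner not in (None, owner):
            raise BlockingIOError("region is locked by " + self.lock_owner)
        self.data[:] = buf

    def unlock(self, owner):
        if self.lock_owner == owner:
            self.lock_owner = None


def migrate_region(src, dst, racing_write=None):
    """racing_write: optional callable fired between read and write to model w2."""
    data = src.lock_and_read("rebalance")      # 1. compound (lock, read) on src
    refused = False
    try:
        if racing_write is not None:
            try:
                racing_write()                 # client write arriving mid-migration
            except BlockingIOError:
                refused = True
        dst.write("rebalance", data)           # 2. write the data to dst
    finally:
        src.unlock("rebalance")                # 3. unlock the lock taken in 1
    return refused


def demo():
    src, dst = Brick(b"AAAA"), Brick(b"\x00\x00\x00\x00")

    def w2():
        src.write("client", b"BBBB")
        dst.write("client", b"BBBB")

    refused = migrate_region(src, dst, racing_write=w2)
    w2()  # the client retries once the lock has been released
    return refused, bytes(src.data), bytes(dst.data)


print(demo())
```

In this model w2 can no longer slip in between the read and the write, so it is applied only after w1 and both copies end up identical.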
Please co-ordinate with Anuradha for implementation of this compound fop.
Following are the issues I see with this approach:
1. features/locks provides mandatory lock functionality only for posix-locks (flock and fcntl based locks). So, mandatory locks will be posix-locks which will conflict with locks held by application. So, if an application has held an fcntl/flock, migration cannot proceed.
We can implement a "special" domain for mandatory internal locks. These locks will behave similar to posix mandatory locks in that conflicting fops (like write, read) are blocked/failed if they are done while a lock is held.
2. data migration will be less efficient because of an extra unlock (with compound lock + read) or extra lock and unlock (for non-compound fop based implementation) for every read it does from src.
Can we use delegations here? Rebalance process can acquire a mandatory-write-delegation (an exclusive lock with a functionality that delegation is recalled when a write operation happens). In that case the rebalance process can do something like:
1. Acquire a read delegation for entire file.
2. Migrate the entire file.
3. Remove/unlock/give-back the delegation it has acquired.
If a recall is issued from brick (when a write happens from mount), it completes the current write to dst (or throws away the read from src) to maintain atomicity. Before doing the next set of (read, src) and (write, dst), it tries to reacquire the lock.
With delegations this simplifies the normal path, when a file is exclusively handled by rebalance. It also improves the case where a client and rebalance are conflicting on a file, to degrade to mandatory locks by either party.
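A rough Python sketch of the recall flow described above, under stated assumptions. It is not an existing GlusterFS API: `MigratingFile`, `migrate` and `demo` are hypothetical names, and the model assumes the brick holds a conflicting client write until the delegation is given back, which the description above does not spell out.

```python
class MigratingFile:
    """Toy model of one file under migration, with a recallable delegation."""

    def __init__(self, data):
        self.src = bytearray(data)
        self.dst = bytearray(len(data))
        self.delegated = False
        self.recalled = False
        self.pending = []  # client writes held until the delegation is returned

    def acquire_delegation(self):
        self.delegated, self.recalled = True, False

    def return_delegation(self):
        self.delegated = False
        for off, buf in self.pending:  # held client writes are applied now
            self._apply(off, buf)
        self.pending.clear()

    def _apply(self, off, buf):
        # Assumption: client writes go to both src and dst during migration.
        self.src[off:off + len(buf)] = buf
        self.dst[off:off + len(buf)] = buf

    def client_write(self, off, buf):
        if self.delegated:
            self.recalled = True  # the brick recalls the delegation
            self.pending.append((off, buf))
        else:
            self._apply(off, buf)


def migrate(f, chunk, on_read=None):
    """on_read: optional hook called after each read so a client write can race in."""
    f.acquire_delegation()                 # 1. acquire delegation for the file
    off, retries = 0, 0
    while off < len(f.src):                # 2. migrate the file chunk by chunk
        data = bytes(f.src[off:off + chunk])
        if on_read is not None:
            on_read(off)
        if f.recalled:
            # Throw away the read, give the delegation back, then reacquire.
            f.return_delegation()
            f.acquire_delegation()
            retries += 1
            continue
        f.dst[off:off + chunk] = data
        off += chunk
    f.return_delegation()                  # 3. give back the delegation
    return retries


def demo():
    f = MigratingFile(b"AAAABBBB")
    fired = []

    def race(off):
        # Inject exactly one client write while the second chunk is in flight.
        if off == 4 and not fired:
            fired.append(off)
            f.client_write(4, b"CCCC")

    retries = migrate(f, 4, on_read=race)
    return retries, bytes(f.src), bytes(f.dst)


print(demo())
```

When no client touches the file, the loop never sees a recall and pays for one acquire and one release in total, which is the simpler normal path mentioned above.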
I would prefer we take the delegation route for such needs in the future.
@Soumyak, can something like this be done with delegations?

@Pranith,
Afr does transactions for writing to its subvols. Can you suggest any optimizations here so that the rebalance process can have a transaction for (read, src) and (write, dst) with minimal performance overhead?
regards,
Raghavendra.
Comments?
regards,
Raghavendra.
_______________________________________________
Gluster-devel mailing list
[email protected]
http://www.gluster.org/mailman/listinfo/gluster-devel

