Hi Mark,
Since Joyce is on holiday, I'll try to answer your doubt.

On 2015/9/1 5:31, Mark Fasheh wrote:
> On Wed, Aug 26, 2015 at 03:11:43PM -0700, Andrew Morton wrote:
>> From: Xue jiufei <xuejiu...@huawei.com>
>> Subject: ocfs2: extend transaction for ocfs2_remove_rightmost_path() and 
>> ocfs2_update_edge_lengths() before to avoid inconsistency between inode and 
>> et
>>
>> I found that jbd2_journal_restart() is called in some places without
>> keeping things consistently before.  However, jbd2_journal_restart() may
>> commit the handle's transaction and restart another one.  If the first
>> transaction is committed successfully while another not, it may cause
>> filesystem inconsistency or read only.  This is an effort to fix this kind
>> of problems.
>>
>>
>> This patch (of 3):
>>
>> The following functions will be called while truncating an extent:
>> ocfs2_remove_btree_range
>>   -> ocfs2_start_trans
>>   -> ocfs2_remove_extent
>>      -> ocfs2_truncate_rec
>>        -> ocfs2_extend_rotate_transaction
>>          -> jbd2_journal_restart if jbd2_journal_extend fail
>>        -> ocfs2_rotate_tree_left
>>          -> ocfs2_remove_rightmost_path
>>              -> ocfs2_extend_rotate_transaction
>>                -> ocfs2_unlink_subtree
>>                 -> ocfs2_update_edge_lengths
>>                   -> ocfs2_extend_trans
>>                     -> jbd2_journal_restart if jbd2_journal_extend fail
>>   -> ocfs2_et_update_clusters
>>   -> ocfs2_commit_trans
>>
>> jbd2_journal_restart() may be called and it may happened that the buffers
>> dirtied in ocfs2_truncate_rec() are committed while buffers dirtied in
>> ocfs2_et_update_clusters() are not, the total clusters on extent tree and
>> i_clusters in ocfs2_dinode is inconsistency.  So the clusters got from
>> ocfs2_dinode is incorrect, and it also cause read-only problem when call
>> ocfs2_commit_truncate() with the error message: "Inode %llu has empty
>> extent block at %llu".
>>
>> We should extend enough credits for function ocfs2_remove_rightmost_path
>> and ocfs2_update_edge_lengths to avoid this inconsistency.
>>
>> Signed-off-by: joyce.xue <xuejiu...@huawei.com>
>> Cc: Mark Fasheh <mfas...@suse.com>
>> Cc: Joel Becker <jl...@evilplan.org>
>> Signed-off-by: Andrew Morton <a...@linux-foundation.org>
>> ---
>>
>>  fs/ocfs2/alloc.c |   82 +++++++++++++++++++++++++++++----------------
>>  1 file changed, 54 insertions(+), 28 deletions(-)
>>
>> diff -puN 
>> fs/ocfs2/alloc.c~ocfs2-extend-transaction-for-ocfs2_remove_rightmost_path-and-ocfs2_update_edge_lengths-before-to-avoid-inconsistency-between-inode-and-et
>>  fs/ocfs2/alloc.c
>> --- 
>> a/fs/ocfs2/alloc.c~ocfs2-extend-transaction-for-ocfs2_remove_rightmost_path-and-ocfs2_update_edge_lengths-before-to-avoid-inconsistency-between-inode-and-et
>> +++ a/fs/ocfs2/alloc.c
>> @@ -2526,21 +2526,6 @@ static int ocfs2_update_edge_lengths(han
>>      struct ocfs2_extent_block *eb;
>>      u32 range;
>>  
>> -    /*
>> -     * In normal tree rotation process, we will never touch the
>> -     * tree branch above subtree_index and ocfs2_extend_rotate_transaction
>> -     * doesn't reserve the credits for them either.
>> -     *
>> -     * But we do have a special case here which will update the rightmost
>> -     * records for all the bh in the path.
>> -     * So we have to allocate extra credits and access them.
>> -     */
>> -    ret = ocfs2_extend_trans(handle, subtree_index);
>> -    if (ret) {
>> -            mlog_errno(ret);
>> -            goto out;
>> -    }
>> -
>>      ret = ocfs2_journal_access_path(et->et_ci, handle, path);
>>      if (ret) {
>>              mlog_errno(ret);
>> @@ -2967,7 +2952,7 @@ static int __ocfs2_rotate_tree_left(hand
>>                   right_path->p_node[subtree_root].bh->b_blocknr,
>>                   right_path->p_tree_depth);
>>  
>> -            ret = ocfs2_extend_rotate_transaction(handle, subtree_root,
>> +            ret = ocfs2_extend_rotate_transaction(handle, 0,
> 
> I don't understand why you changed the subtree depth parameter here to zero.
> 
> Also, I don't understand why it's zero in all the calls below either. Is
> there something wrong with the way the math in
> ocfs2_extend_rotate_transaction() is working out?

The credits in ocfs2_extend_rotate_transaction is calculated as
(path->p_tree_depth - subtree_depth) * 2 + 1 + op_credits.
So changing the subtree_depth parameter to 0 means we get extra credits
in ocfs2_truncate_rec ASAP. Then extending credits in
ocfs2_update_edge_lengths is no longer needed.
In other words, Joyce wants to resolve the issue by extending enough
credits at the very beginning.

Thanks,
Joseph

> 
> 
>>                                                    orig_credits, left_path);
>>              if (ret) {
>>                      mlog_errno(ret);
>> @@ -3040,21 +3025,9 @@ static int ocfs2_remove_rightmost_path(h
>>      struct ocfs2_extent_block *eb;
>>      struct ocfs2_extent_list *el;
>>  
>> -
>>      ret = ocfs2_et_sanity_check(et);
>>      if (ret)
>>              goto out;
>> -    /*
>> -     * There's two ways we handle this depending on
>> -     * whether path is the only existing one.
>> -     */
>> -    ret = ocfs2_extend_rotate_transaction(handle, 0,
>> -                                          handle->h_buffer_credits,
>> -                                          path);
>> -    if (ret) {
>> -            mlog_errno(ret);
>> -            goto out;
>> -    }
>>  
>>      ret = ocfs2_journal_access_path(et->et_ci, handle, path);
>>      if (ret) {
>> @@ -3628,6 +3601,14 @@ static int ocfs2_merge_rec_left(struct o
>>               */
>>              if (le16_to_cpu(right_rec->e_leaf_clusters) == 0 &&
>>                  le16_to_cpu(el->l_next_free_rec) == 1) {
>> +                    /* extend credit for ocfs2_remove_rightmost_path */
>> +                    ret = ocfs2_extend_rotate_transaction(handle, 0,
>> +                                    handle->h_buffer_credits,
>> +                                    right_path);
>> +                    if (ret) {
>> +                            mlog_errno(ret);
>> +                            goto out;
>> +                    }
>>  
>>                      ret = ocfs2_remove_rightmost_path(handle, et,
>>                                                        right_path,
>> @@ -3666,6 +3647,14 @@ static int ocfs2_try_to_merge_extent(han
>>      BUG_ON(ctxt->c_contig_type == CONTIG_NONE);
>>  
>>      if (ctxt->c_split_covers_rec && ctxt->c_has_empty_extent) {
>> +            /* extend credit for ocfs2_remove_rightmost_path */
>> +            ret = ocfs2_extend_rotate_transaction(handle, 0,
>> +                            handle->h_buffer_credits,
>> +                            path);
>> +            if (ret) {
>> +                    mlog_errno(ret);
>> +                    goto out;
>> +            }
>>              /*
>>               * The merge code will need to create an empty
>>               * extent to take the place of the newly
>> @@ -3714,6 +3703,15 @@ static int ocfs2_try_to_merge_extent(han
>>               */
>>              BUG_ON(!ocfs2_is_empty_extent(&el->l_recs[0]));
>>  
>> +            /* extend credit for ocfs2_remove_rightmost_path */
>> +            ret = ocfs2_extend_rotate_transaction(handle, 0,
>> +                                    handle->h_buffer_credits,
>> +                                    path);
>> +            if (ret) {
>> +                    mlog_errno(ret);
>> +                    goto out;
>> +            }
>> +
>>              /* The merge left us with an empty extent, remove it. */
>>              ret = ocfs2_rotate_tree_left(handle, et, path, dealloc);
>>              if (ret) {
> 
> A few of these were added, where we do the transaction extend before calling
> ocfs2_rotate_tree_left(), can we move the call into ocfs2_rotate_tree_left()
> then?
> 
> Thanks,
>       --Mark
> 
> --
> Mark Fasheh
> 
> _______________________________________________
> Ocfs2-devel mailing list
> Ocfs2-devel@oss.oracle.com
> https://oss.oracle.com/mailman/listinfo/ocfs2-devel
> 
> .
> 



_______________________________________________
Ocfs2-devel mailing list
Ocfs2-devel@oss.oracle.com
https://oss.oracle.com/mailman/listinfo/ocfs2-devel

Reply via email to