> On 2025/3/28 11:40, yohan.joung wrote:
> >> From: Chao Yu <c...@kernel.org>
> >> Sent: Thursday, March 27, 2025 10:48 PM
> >> To: 정요한(JOUNG YOHAN) Mobile AE <yohan.jo...@sk.com>; Yohan Joung
> >> <jyh...@gmail.com>; jaeg...@kernel.org; daeh...@gmail.com
> >> Cc: c...@kernel.org; linux-f2fs-devel@lists.sourceforge.net; linux-
> >> ker...@vger.kernel.org; 김필현(KIM PILHYUN) Mobile AE
> >> <pilhyun....@sk.com>
> >> Subject: [External Mail] Re: [External Mail] Re: [External Mail] Re:
> >> [PATCH] f2fs: prevent the current section from being selected as a
> >> victim during garbage collection
> >>
> >> On 2025/3/27 16:00, yohan.jo...@sk.com wrote:
> >>>> From: Chao Yu <c...@kernel.org>
> >>>> Sent: Thursday, March 27, 2025 4:30 PM
> >>>> To: 정요한(JOUNG YOHAN) Mobile AE <yohan.jo...@sk.com>; Yohan Joung
> >>>> <jyh...@gmail.com>; jaeg...@kernel.org; daeh...@gmail.com
> >>>> Cc: c...@kernel.org; linux-f2fs-devel@lists.sourceforge.net; linux-
> >>>> ker...@vger.kernel.org; 김필현(KIM PILHYUN) Mobile AE
> >>>> <pilhyun....@sk.com>
> >>>> Subject: [External Mail] Re: [External Mail] Re: [PATCH] f2fs:
> >>>> prevent the current section from being selected as a victim during
> >>>> garbage collection
> >>>>
> >>>> On 3/27/25 14:43, yohan.jo...@sk.com wrote:
> >>>>>> From: Chao Yu <c...@kernel.org>
> >>>>>> Sent: Thursday, March 27, 2025 3:02 PM
> >>>>>> To: Yohan Joung <jyh...@gmail.com>; jaeg...@kernel.org;
> >>>>>> daeh...@gmail.com
> >>>>>> Cc: c...@kernel.org; linux-f2fs-devel@lists.sourceforge.net;
> >>>>>> linux- ker...@vger.kernel.org; 정요한(JOUNG YOHAN) Mobile AE
> >>>>>> <yohan.jo...@sk.com>
> >>>>>> Subject: [External Mail] Re: [PATCH] f2fs: prevent the current
> >>>>>> section from being selected as a victim during garbage collection
> >>>>>>
> >>>>>> On 3/26/25 22:14, Yohan Joung wrote:
> >>>>>>> When selecting a victim using next_victim_seg in a large
> >>>>>>> section, the selected section might already have been cleared
> >>>>>>> and designated as the new current section, making it actively in
> use.
> >>>>>>> This behavior causes inconsistency between the SIT and SSA.
> >>>>>>
> >>>>>> Hi, does this fix your issue?
> >>>>>
> >>>>> This is an issue that arises when dividing a large section into
> >>>>> segments for garbage collection.
> >>>>> caused by the background GC (garbage collection) thread in large
> >>>>> section
> >>>>> f2fs_gc(victim_section) ->
> >>>>> f2fs_clear_prefree_segments(victim_section)->
> >>>>> cursec(victim_section) -> f2fs_gc(victim_section by
> >>>>> next_victim_seg)
> >>>>
> >>>> I didn't get it, why f2fs_get_victim() will return section which is
> >>>> used by curseg? It should be avoided by checking w/ sec_usage_check().
> >>>>
> >>>> Or we missed to check gcing section which next_victim_seg points to
> >>>> during get_new_segment()?
> >>>>
> >>>> Can this happen?
> >>>>
> >>>> e.g.
> >>>> - bggc selects sec #0
> >>>> - next_victim_seg: seg #0
> >>>> - migrate seg #0 and stop
> >>>> - next_victim_seg: seg #1
> >>>> - checkpoint, set sec #0 free if sec #0 has no valid blocks
> >>>> - allocate seg #0 in sec #0 for curseg
> >>>> - curseg moves to seg #1 after allocation
> >>>> - bggc tries to migrate seg #1
> >>>>
> >>>> Thanks,
> >>> That's correct
> >>> In f2fs_get_victim, we use next_victim_seg to directly jump to
> >>> got_result, thereby bypassing sec_usage_check What do you think
> >>> about this change?
> >>>
> >>> @@ -850,15 +850,20 @@ int f2fs_get_victim(struct f2fs_sb_info *sbi,
> >> unsigned int *result,
> >>>                           p.min_segno = sbi->next_victim_seg[BG_GC];
> >>>                           *result = p.min_segno;
> >>>                           sbi->next_victim_seg[BG_GC] = NULL_SEGNO;
> >>> -                       goto got_result;
> >>>                   }
> >>>                   if (gc_type == FG_GC &&
> >>>                                   sbi->next_victim_seg[FG_GC] != 
> >>> NULL_SEGNO) {
> >>>                           p.min_segno = sbi->next_victim_seg[FG_GC];
> >>>                           *result = p.min_segno;
> >>>                           sbi->next_victim_seg[FG_GC] = NULL_SEGNO;
> >>> -                       goto got_result;
> >>>                   }
> >>> +
> >>> +               secno = GET_SEC_FROM_SEG(sbi, segno);
> >>> +
> >>> +               if (sec_usage_check(sbi, secno))
> >>> +                       goto next;
> >>> +
> >>> +               goto got_result;
> >>>           }
> >>
> >> But still allocator can assign this segment after sec_usage_check()
> >> in race condition, right?
> > Since the BG GC using next_victim  takes place after the SIT update in
> > do_checkpoint, it seems unlikely that a race condition with
> sec_usage_check will occur.
> 
> I mean this:
> 
> - gc_thread
>   - f2fs_gc
>    - f2fs_get_victim
>     - sec_usage_check --- segno #1 is not used in any cursegs
>                                       - f2fs_allocate_data_block
>                                        - new_curseg
>                                         - get_new_segment find segno #1
> 
>    - do_garbage_collect
> 
> Thanks,

                                                  do_checkpoint sec0 free 
                                                  If sec0 is not freed, then 
segno1 within sec0 cannot be allocated
- gc_thread
  - f2fs_gc
   - f2fs_get_victim
    - sec_usage_check  --- segno #1 is not used in any cursegs (but sec0 is 
already used)
                                                        - 
f2fs_allocate_data_block
                                                        - new_curseg
                                                        - get_new_segment find 
segno #1
                                        
   - do_garbage_collect

I appreciate your patch, it is under testing.
but I'm wondering if there's a risk of a race condition in this situation


> 
> >>
> >> IMO, we can clear next_victim_seg[] once section is free in
> >> __set_test_and_free()? something like this:
> > I will test it according to your suggestion.
> > If there are no issues, can I submit it again with the patch?
> > Thanks
> >>
> >> ---
> >>    fs/f2fs/segment.h | 13 ++++++++++---
> >>    1 file changed, 10 insertions(+), 3 deletions(-)
> >>
> >> diff --git a/fs/f2fs/segment.h b/fs/f2fs/segment.h index
> >> 0465dc00b349..826e37999085 100644
> >> --- a/fs/f2fs/segment.h
> >> +++ b/fs/f2fs/segment.h
> >> @@ -473,9 +473,16 @@ static inline void __set_test_and_free(struct
> >> f2fs_sb_info *sbi,
> >>                            goto skip_free;
> >>                    next = find_next_bit(free_i->free_segmap,
> >>                                    start_segno + SEGS_PER_SEC(sbi), 
> >> start_segno);
> >> -          if (next >= start_segno + usable_segs) {
> >> -                  if (test_and_clear_bit(secno, free_i->free_secmap))
> >> -                          free_i->free_sections++;
> >> +          if ((next >= start_segno + usable_segs) &&
> >> +                  test_and_clear_bit(secno, free_i->free_secmap)) {
> >> +                  free_i->free_sections++;
> >> +
> >> +                  if (GET_SEC_FROM_SEG(sbi->next_victim_seg[BG_GC]) ==
> >> +                                                                  secno)
> >> +                          sbi->next_victim_seg[BG_GC] = NULL_SEGNO;
> >> +                  if (GET_SEC_FROM_SEG(sbi->next_victim_seg[FG_GC]) ==
> >> +                                                                  secno)
> >> +                          sbi->next_victim_seg[FG_GC] = NULL_SEGNO;
> >>                    }
> >>            }
> >>    skip_free:
> >> --
> >> 2.40.1
> >>
> >>>>
> >>>>>
> >>>>> Because the call stack is different, I think that in order to
> >>>>> handle everything at once, we need to address it within
> >>>>> do_garbage_collect, or otherwise include it on both sides.
> >>>>> What do you think?
> >>>>>
> >>>>> [30146.337471][ T1300] F2FS-fs (dm-54): Inconsistent segment
> >>>>> (70961) type [0, 1] in SSA and SIT [30146.346151][ T1300] Call trace:
> >>>>> [30146.346152][ T1300]  dump_backtrace+0xe8/0x10c [30146.346157][
> >>>>> T1300]  show_stack+0x18/0x28 [30146.346158][ T1300]
> >>>>> dump_stack_lvl+0x50/0x6c [30146.346161][ T1300]
> >>>>> dump_stack+0x18/0x28 [30146.346162][ T1300]
> >>>>> f2fs_stop_checkpoint+0x1c/0x3c [30146.346165][ T1300]
> >>>>> do_garbage_collect+0x41c/0x271c [30146.346167][ T1300]
> >>>>> f2fs_gc+0x27c/0x828 [30146.346168][ T1300]
> >>>>> gc_thread_func+0x290/0x88c [30146.346169][ T1300]
> >>>>> kthread+0x11c/0x164 [30146.346172][ T1300]
> >>>>> ret_from_fork+0x10/0x20
> >>>>>
> >>>>> struct curseg_info : 0xffffff803f95e800 {
> >>>>>         segno        : 0x11531 : 70961
> >>>>> }
> >>>>>
> >>>>> struct f2fs_sb_info : 0xffffff8811d12000 {
> >>>>>         next_victim_seg[0] : 0x11531 : 70961 }
> >>>>>
> >>>>>>
> >>>>>> https://lore.kernel.org/linux-f2fs-devel/20250325080646.3291947-2
> >>>>>> -
> >>>>>> c...@kernel.org
> >>>>>>
> >>>>>> Thanks,
> >>>>>>
> >>>>>>>
> >>>>>>> Signed-off-by: Yohan Joung <yohan.jo...@sk.com>
> >>>>>>> ---
> >>>>>>>    fs/f2fs/gc.c | 4 ++++
> >>>>>>>    1 file changed, 4 insertions(+)
> >>>>>>>
> >>>>>>> diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c index
> >>>>>>> 2b8f9239bede..4b5d18e395eb 100644
> >>>>>>> --- a/fs/f2fs/gc.c
> >>>>>>> +++ b/fs/f2fs/gc.c
> >>>>>>> @@ -1926,6 +1926,10 @@ int f2fs_gc(struct f2fs_sb_info *sbi,
> >>>>>>> struct
> >>>>>> f2fs_gc_control *gc_control)
> >>>>>>>               goto stop;
> >>>>>>>       }
> >>>>>>>
> >>>>>>> +     if (__is_large_section(sbi) &&
> >>>>>>> +                     IS_CURSEC(sbi, GET_SEC_FROM_SEG(sbi, segno)))
> >>>>>>> +             goto stop;
> >>>>>>> +
> >>>>>>>       seg_freed = do_garbage_collect(sbi, segno, &gc_list, gc_type,
> >>>>>>>                               gc_control->should_migrate_blocks,
> >>>>>>>                               gc_control->one_time);
> >>>>>
> >>>
> >



_______________________________________________
Linux-f2fs-devel mailing list
Linux-f2fs-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel

Reply via email to