On 2019/8/21 12:48, Shin'ichiro Kawasaki wrote:
> When sudden f2fs shutdown happens on zoned block devices, write
> pointers can be inconsistent with valid blocks counts in meta data.
> The failure scenario is as follows:
> 
> - Just before a sudden shutdown, a new segment in a new zone is selected
>   for a current segment. Write commands were executed to the segment.
>   and the zone has a write pointer not at zone start.
> - Before the write commands complete, shutdown happens. Meta data is
>   not updated and still keeps zero valid blocks count for the zone.
> - After next mount of the file system, the zone is selected for the next
>   write target because it has zero valid blocks count. However, it has
>   the write pointer not at zone start. Then "Unaligned write command"
>   error happens.
> 
> To avoid this potential error path, reset write pointers if the zone
> does not have a current segment, the write pointer is not at the zone
> start and the zone has no valid blocks.
> 
> Signed-off-by: Shin'ichiro Kawasaki <shinichiro.kawas...@wdc.com>
> ---
>  fsck/fsck.c | 30 +++++++++++++++++++++++++++++-
>  1 file changed, 29 insertions(+), 1 deletion(-)
> 
> diff --git a/fsck/fsck.c b/fsck/fsck.c
> index 21a06ac..cc9bbc0 100644
> --- a/fsck/fsck.c
> +++ b/fsck/fsck.c
> @@ -2595,6 +2595,7 @@ static int fsck_chk_write_pointer(int i, struct 
> blk_zone *blkz, void *opaque)
>       int log_sectors_per_block = sbi->log_blocksize - SECTOR_SHIFT;
>       unsigned int segs_per_zone = sbi->segs_per_sec * sbi->secs_per_zone;
>       void *zero_blk;
> +     block_t zone_valid_blocks = 0;
>  
>       if (blk_zone_conv(blkz))
>               return 0;
> @@ -2615,8 +2616,35 @@ static int fsck_chk_write_pointer(int i, struct 
> blk_zone *blkz, void *opaque)
>                       break;
>       }
>  
> -     if (cs_index >= NR_CURSEG_TYPE)
> +     if (cs_index >= NR_CURSEG_TYPE) {
> +             for (b = zone_block; b < zone_block + c.zone_blocks &&
> +                          IS_VALID_BLK_ADDR(sbi, b); b += c.blks_per_seg) {
> +                     se = get_seg_entry(sbi, GET_SEGNO(sbi, b));
> +                     zone_valid_blocks += se->valid_blocks;
> +             }
> +             if (wp_block == zone_block || zone_valid_blocks)
> +                     return 0;
> +
> +             /*
> +              * The write pointer is not at zone start but there is no valid
> +              * block in the zone. Segments in the zone can be selected for
> +              * next write. Need to reset the write pointer to avoid
> +              * unaligned write command error.

In SPOR (sudden power-off recovery) of kernel side, we may revalidate blocks
belong to fsynced file in such zone within range of [0, write pointer], if we
just reset zone, will we lose those data for ever?

BTW, how you think enabling f2fs kernel module to recover incorrect write
pointer of zone? Once f2fs-tools doesn't upgrade, however kernel does...

Thanks,

> +              */
> +             if (c.fix_on) {
> +                     FIX_MSG("Reset write pointer at segment 0x%x",
> +                             zone_segno);
> +                     ret = f2fs_reset_zone(dev, blkz);
> +                     if (ret)
> +                             return ret;
> +                     fsck->chk.wp_fixed_zones++;
> +             } else {
> +                     MSG(0, "Inconsistent write pointer at segment 0x%x\n",
> +                         zone_segno);
> +                     fsck->chk.wp_inconsistent_zones++;
> +             }
>               return 0;
> +     }
>  
>       /* check write pointer consistency with the curseg in the zone */
>       cs_block = START_BLOCK(sbi, cs->segno) + cs->next_blkoff;
> 


_______________________________________________
Linux-f2fs-devel mailing list
Linux-f2fs-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel

Reply via email to