RE: [PATCH v6 1/1] f2fs: ioctl for removing a range from F2FS

From: sunqiuyang
Date: Tue Jun 04 2019 - 06:30:00 EST


I have tried to add power failure after committing the new SB (resized) and before writing the new CP. This will cause FS inconsistency:

[FSCK] free segment_count matched with CP [Fail]

which can be fixed by the fsck tool, resulting in a resized FS.

________________________________________
From: Yuchao (T)
Sent: Tuesday, June 04, 2019 9:43
To: Jaegeuk Kim
Cc: sunqiuyang; linux-kernel@xxxxxxxxxxxxxxx; linux-fsdevel@xxxxxxxxxxxxxxx; linux-f2fs-devel@xxxxxxxxxxxxxxxxxxxxx
Subject: Re: [PATCH v6 1/1] f2fs: ioctl for removing a range from F2FS

On 2019/6/4 0:23, Jaegeuk Kim wrote:
> On 05/31, Chao Yu wrote:
>> On 2019/5/31 0:06, Jaegeuk Kim wrote:
>>> On 05/24, sunqiuyang wrote:
>>>> From: Qiuyang Sun <sunqiuyang@xxxxxxxxxx>
>>>>
>>>> This ioctl shrinks a given length (aligned to sections) from end of the
>>>> main area. Any cursegs and valid blocks will be moved out before
>>>> invalidating the range.
>>>>
>>>> This feature can be used for adjusting partition sizes online.
>>>> --
>>>> Changlog v1 ==> v2:
>>>>
>>>> Sahitya Tummala:
>>>> - Add this ioctl for f2fs_compat_ioctl() as well.
>>>> - Fix debugfs status to reflect the online resize changes.
>>>> - Fix potential race between online resize path and allocate new data
>>>> block path or gc path.
>>>>
>>>> Others:
>>>> - Rename some identifiers.
>>>> - Add some error handling branches.
>>>> - Clear sbi->next_victim_seg[BG_GC/FG_GC] in shrinking range.
>>>> --
>>>> Changelog v2 ==> v3:
>>>> Implement this interface as ext4's, and change the parameter from shrunk
>>>> bytes to new block count of F2FS.
>>>> --
>>>> Changelog v3 ==> v4:
>>>> - During resizing, force to empty sit_journal and forbid adding new
>>>> entries to it, in order to avoid invalid segno in journal after resize.
>>>> - Reduce sbi->user_block_count before resize starts.
>>>> - Commit the updated superblock first, and then update in-memory metadata
>>>> only when the former succeeds.
>>>> - Target block count must align to sections.
>>>> --
>>>> Changelog v4 ==> v5:
>>>> Write checkpoint before and after committing the new superblock, w/o
>>>> CP_FSCK_FLAG respectively, so that the FS can be fixed by fsck even if
>>>> resize fails after the new superblock is committed.
>>>> --
>>>> Changelog v5 ==> v6:
>>>> - In free_segment_range(), reduce granularity of gc_mutex.
>>>> - Add protection on curseg migration.
>>>>
>>>> Signed-off-by: Qiuyang Sun <sunqiuyang@xxxxxxxxxx>
>>>> Signed-off-by: Chao Yu <yuchao0@xxxxxxxxxx>
>>>> Signed-off-by: Sahitya Tummala <stummala@xxxxxxxxxxxxxx>
>>>> ---
>>>> fs/f2fs/checkpoint.c | 5 +-
>>>> fs/f2fs/debug.c | 7 +++
>>>> fs/f2fs/f2fs.h | 7 +++
>>>> fs/f2fs/file.c | 28 +++++++++++
>>>> fs/f2fs/gc.c | 134 ++++++++++++++++++++++++++++++++++++++++++++++++++-
>>>> fs/f2fs/segment.c | 54 +++++++++++++++++----
>>>> fs/f2fs/segment.h | 1 +
>>>> fs/f2fs/super.c | 4 ++
>>>> 8 files changed, 228 insertions(+), 12 deletions(-)
>>>>
>>>> diff --git a/fs/f2fs/checkpoint.c b/fs/f2fs/checkpoint.c
>>>> index ed70b68..4706d0a 100644
>>>> --- a/fs/f2fs/checkpoint.c
>>>> +++ b/fs/f2fs/checkpoint.c
>>>> @@ -1313,8 +1313,11 @@ static void update_ckpt_flags(struct f2fs_sb_info *sbi, struct cp_control *cpc)
>>>> else
>>>> __clear_ckpt_flags(ckpt, CP_ORPHAN_PRESENT_FLAG);
>>>>
>>>> - if (is_sbi_flag_set(sbi, SBI_NEED_FSCK))
>>>> + if (is_sbi_flag_set(sbi, SBI_NEED_FSCK) ||
>>>> + is_sbi_flag_set(sbi, SBI_IS_RESIZEFS))
>>>> __set_ckpt_flags(ckpt, CP_FSCK_FLAG);
>>>> + else
>>>> + __clear_ckpt_flags(ckpt, CP_FSCK_FLAG);
>>>
>>> We don't need to clear this flag.
>>
>> During resizefs, we may face inconsistent status of filesystem's on-disk data,
>> so I propose to use below flow, so once some thing breaks resizefs, fsck can
>> detect the corruption by the CP_FSCK_FLAG directly.
>>
>> - resizefs()
>> - set SBI_IS_RESIZEFS
>> - do_checkpoint()
>> - if (is_resizing)
>> - set CP_FSCK_FLAG
>>
>> - clear SBI_IS_RESIZEFS
>> - do_checkpoint()
>> - if (!is_resizing && not_need_fsck)
>> - clear CP_FSCK_FLAG
>>
>> It's safe to clear CP_FSCK_FLAG if there is no resizing and corruption, as once
>> the inconsistency was detected we will keep SBI_NEED_FSCK in memory anyway, then
>> checkpoint can set CP_FSCK_FLAG again.
>
> This tries to resize the image and I mostly worried whether fsck is able to fix

So, Qiuyang, could you try break resizefs at some key points with power-cut, to
check whether fsck can repair all corruption cases? and what is the result
(resized fs or origianl fs)?

> the corrupted metadata area. Moreover, I'm in doubt we really need to do this in
> parallel with FS operations.

What do you mean? We have wrapped main resizefs operaion with
{freeze,thaw}_bdev, so there should be no parallel FS operations.

Thanks,