Re: [f2fs-dev] [PATCH v2 2/2] f2fs: relocate chksum_offset for large_nat_bitmap feature

From: Chao Yu
Date: Sun May 19 2019 - 22:20:49 EST


On 2019/5/20 1:31, Ju Hyung Park wrote:
> Hi Jaegeuk and Chao,
>
> I was semi-forced today to use the new kernel and test f2fs.
>
> My Ubuntu initramfs got a bit wonky and I had to boot into live CD and
> fix some stuffs. The live CD was using 4.15 kernel, and just mounting
> the f2fs partition there corrupted f2fs and my 4.19(with 5.1-rc1-4.19
> f2fs-stable merged) refused to mount with "SIT is corrupted node"
> message.
>
> I used the latest f2fs-tools sent by Chao including "fsck.f2fs: fix to
> repair cp_loads blocks at correct position"
>
> It spit out 140M worth of output, but at least I didn't have to run it
> twice. Everything returned "Ok" in the 2nd run.
> The new log is at
> http://arter97.com/f2fs/final
>
> After fixing the image, I used my 4.19 kernel with 5.2-rc1-4.19
> f2fs-stable merged and it mounted.
>
> But, I got this:
> [ 1.047791] F2FS-fs (nvme0n1p3): layout of large_nat_bitmap is
> deprecated, run fsck to repair, chksum_offset: 4092
> [ 1.081307] F2FS-fs (nvme0n1p3): Found nat_bits in checkpoint
> [ 1.161520] F2FS-fs (nvme0n1p3): recover fsync data on readonly fs
> [ 1.162418] F2FS-fs (nvme0n1p3): Mounted with checkpoint version = 761c7e00
>
> But after doing a reboot, the message is gone:
> [ 1.098423] F2FS-fs (nvme0n1p3): Found nat_bits in checkpoint
> [ 1.177771] F2FS-fs (nvme0n1p3): recover fsync data on readonly fs
> [ 1.178365] F2FS-fs (nvme0n1p3): Mounted with checkpoint version = 761c7eda
>
> I'm not exactly sure why the kernel detected that I'm still using the
> old layout on the first boot. Maybe fsck didn't fix it properly, or
> the check from the kernel is improper.

The fsck should have fixed layout problem, that's just unnecessary message, we
can avoid it by relocating sanity check of layout.

I've sent one patch for that, and tested with it, it acts as expected. :)

>
> I also noticed that Jaegeuk sent v1 of this patch to upstream. (Maybe
> that's why the kernel detected old layout?) Please send v2 to upstream
> soon, as running older fsck will cause much more headaches.

Since v1 was upstreamed in 5.2-rc1, I guess we need another patch for that,
maybe in below patch?

[PATCH] f2fs: fix to check layout on last valid checkpoint park

Thanks,

>
> Thanks.
>
>
> On Fri, Apr 26, 2019 at 11:26 AM Chao Yu <yuchao0@xxxxxxxxxx> wrote:
>>
>> For large_nat_bitmap feature, there is a design flaw:
>>
>> Previous:
>>
>> struct f2fs_checkpoint layout:
>> +--------------------------+ 0x0000
>> | checkpoint_ver |
>> | ...... |
>> | checksum_offset |------+
>> | ...... | |
>> | sit_nat_version_bitmap[] |<-----|-------+
>> | ...... | | |
>> | checksum_value |<-----+ |
>> +--------------------------+ 0x1000 |
>> | | nat_bitmap + sit_bitmap
>> | payload blocks | |
>> | | |
>> +--------------------------|<-------------+
>>
>> Obviously, if nat_bitmap size + sit_bitmap size is larger than
>> MAX_BITMAP_SIZE_IN_CKPT, nat_bitmap or sit_bitmap may overlap
>> checkpoint checksum's position, once checkpoint() is triggered
>> from kernel, nat or sit bitmap will be damaged by checksum field.
>>
>> In order to fix this, let's relocate checksum_value's position
>> to the head of sit_nat_version_bitmap as below, then nat/sit
>> bitmap and chksum value update will become safe.
>>
>> After:
>>
>> struct f2fs_checkpoint layout:
>> +--------------------------+ 0x0000
>> | checkpoint_ver |
>> | ...... |
>> | checksum_offset |------+
>> | ...... | |
>> | sit_nat_version_bitmap[] |<-----+
>> | ...... |<-------------+
>> | | |
>> +--------------------------+ 0x1000 |
>> | | nat_bitmap + sit_bitmap
>> | payload blocks | |
>> | | |
>> +--------------------------|<-------------+
>>
>> Related report and discussion:
>>
>> https://sourceforge.net/p/linux-f2fs/mailman/message/36642346/
>>
>> Reported-by: Park Ju Hyung <qkrwngud825@xxxxxxxxx>
>> Signed-off-by: Chao Yu <yuchao0@xxxxxxxxxx>
>> ---
>> v2:
>> - improve hint message suggested by Ju Hyung.
>> - move verification to f2fs_sanity_check_ckpt().
>> fs/f2fs/f2fs.h | 4 +++-
>> fs/f2fs/super.c | 13 +++++++++++++
>> 2 files changed, 16 insertions(+), 1 deletion(-)
>>
>> diff --git a/fs/f2fs/f2fs.h b/fs/f2fs/f2fs.h
>> index 119bc5a9783e..aa71c1aa9eaa 100644
>> --- a/fs/f2fs/f2fs.h
>> +++ b/fs/f2fs/f2fs.h
>> @@ -1909,9 +1909,11 @@ static inline void *__bitmap_ptr(struct f2fs_sb_info *sbi, int flag)
>> int offset;
>>
>> if (is_set_ckpt_flags(sbi, CP_LARGE_NAT_BITMAP_FLAG)) {
>> + unsigned int chksum_size = sizeof(__le32);
>> +
>> offset = (flag == SIT_BITMAP) ?
>> le32_to_cpu(ckpt->nat_ver_bitmap_bytesize) : 0;
>> - return &ckpt->sit_nat_version_bitmap + offset;
>> + return &ckpt->sit_nat_version_bitmap + offset + chksum_size;
>> }
>>
>> if (__cp_payload(sbi) > 0) {
>> diff --git a/fs/f2fs/super.c b/fs/f2fs/super.c
>> index fefc8cc6e756..22241bb866df 100644
>> --- a/fs/f2fs/super.c
>> +++ b/fs/f2fs/super.c
>> @@ -2714,6 +2714,19 @@ int f2fs_sanity_check_ckpt(struct f2fs_sb_info *sbi)
>> return 1;
>> }
>>
>> + if (__is_set_ckpt_flags(ckpt, CP_LARGE_NAT_BITMAP_FLAG)) {
>> + unsigned int chksum_offset;
>> +
>> + chksum_offset = le32_to_cpu(ckpt->checksum_offset);
>> + if (chksum_offset != CP_MIN_CHKSUM_OFFSET) {
>> + f2fs_msg(sbi->sb, KERN_WARNING,
>> + "using deprecated layout of large_nat_bitmap, "
>> + "please run fsck v1.13.0 or higher to repair, "
>> + "chksum_offset: %u", chksum_offset);
>> + return 1;
>> + }
>> + }
>> +
>> if (unlikely(f2fs_cp_error(sbi))) {
>> f2fs_msg(sbi->sb, KERN_ERR, "A bug case: need to run fsck");
>> return 1;
>> --
>> 2.18.0.rc1
>>
>>
>>
>> _______________________________________________
>> Linux-f2fs-devel mailing list
>> Linux-f2fs-devel@xxxxxxxxxxxxxxxxxxxxx
>> https://lists.sourceforge.net/lists/listinfo/linux-f2fs-devel
> .
>