Re: [syzbot] KMSAN: uninit-value in longest_match

From: Eric Biggers
Date: Tue Dec 13 2022 - 01:40:35 EST


On Fri, Dec 09, 2022 at 01:19:41AM -0800, syzbot wrote:
> Hello,
>
> syzbot found the following issue on:
>
> HEAD commit: 30d2727189c5 kmsan: fix memcpy tests
> git tree: https://github.com/google/kmsan.git master
> console output: https://syzkaller.appspot.com/x/log.txt?x=117d38f5880000
> kernel config: https://syzkaller.appspot.com/x/.config?x=a2144983ada8b4f3
> dashboard link: https://syzkaller.appspot.com/bug?extid=14d9e7602ebdf7ec0a60
> compiler: clang version 15.0.0 (https://github.com/llvm/llvm-project.git 610139d2d9ce6746b3c617fb3e2f7886272d26ff), GNU ld (GNU Binutils for Debian) 2.35.2
> userspace arch: i386
>
> Unfortunately, I don't have any reproducer for this issue yet.
>
> Downloadable assets:
> disk image: https://storage.googleapis.com/syzbot-assets/1e8c2d419c2e/disk-30d27271.raw.xz
> vmlinux: https://storage.googleapis.com/syzbot-assets/9e8a728a72a9/vmlinux-30d27271.xz
> kernel image: https://storage.googleapis.com/syzbot-assets/89f71c80c707/bzImage-30d27271.xz
>
> IMPORTANT: if you fix the issue, please add the following tag to the commit:
> Reported-by: syzbot+14d9e7602ebdf7ec0a60@xxxxxxxxxxxxxxxxxxxxxxxxx
>
> =====================================================
> BUG: KMSAN: uninit-value in longest_match+0xc88/0x1220 lib/zlib_deflate/deflate.c:668
> longest_match+0xc88/0x1220 lib/zlib_deflate/deflate.c:668
> deflate_fast+0x1838/0x2280 lib/zlib_deflate/deflate.c:954
> zlib_deflate+0x1783/0x22b0 lib/zlib_deflate/deflate.c:410
> zlib_compress_pages+0xd34/0x1f90 fs/btrfs/zlib.c:178
> compression_compress_pages fs/btrfs/compression.c:77 [inline]
> btrfs_compress_pages+0x325/0x440 fs/btrfs/compression.c:1208
> compress_file_range+0x11ac/0x3510 fs/btrfs/inode.c:730
> async_cow_start+0x33/0xd0 fs/btrfs/inode.c:1458
> btrfs_work_helper+0x55a/0x990 fs/btrfs/async-thread.c:280
> process_one_work+0xb27/0x13e0 kernel/workqueue.c:2289
> worker_thread+0x1076/0x1d60 kernel/workqueue.c:2436
> kthread+0x31b/0x430 kernel/kthread.c:376
> ret_from_fork+0x1f/0x30 arch/x86/entry/entry_64.S:306

zlib has long been known to use initialized values in longest_match(). This
issue is mentioned in the zlib FAQ. I personally consider this to be a bug, as
the code could be written in a way such that it doesn't use uninitialized
memory. However, zlib considers it to be "safe" and "working as intended".

Note that the copy of zlib in Linux is not really being maintained, and it is
based on a 25-year old version of zlib. However, upstream zlib does not change
much anyway (it's very hard to get changes accepted into it), and as far as I
can tell even the latest version of upstream zlib has this same issue.

So I suppose the way to resolve this syzbot report is to just add
__no_kmsan_checks to longest_match(). The real issue, though, is that zlib
hasn't kept up with the times (nor has Linux kept up with zlib).

- Eric