Re: KCSAN: data-race in __alloc_file / __alloc_file

From: Eric Dumazet
Date: Fri Nov 08 2019 - 08:28:07 EST




On 11/8/19 5:16 AM, syzbot wrote:
> Hello,
>
> syzbot found the following crash on:
>
> HEAD commit:    05f22368 x86, kcsan: Enable KCSAN for x86
> git tree:       https://github.com/google/ktsan.git kcsan
> console output: https://syzkaller.appspot.com/x/log.txt?x=10d7fd88e00000
> kernel config:  https://syzkaller.appspot.com/x/.config?x=87d111955f40591f
> dashboard link: https://syzkaller.appspot.com/bug?extid=3ef049d50587836c0606
> compiler:       gcc (GCC) 9.0.0 20181231 (experimental)
>
> Unfortunately, I don't have any reproducer for this crash yet.
>
> IMPORTANT: if you fix the bug, please add the following tag to the commit:
> Reported-by: syzbot+3ef049d50587836c0606@xxxxxxxxxxxxxxxxxxxxxxxxx
>
> ==================================================================
> BUG: KCSAN: data-race in __alloc_file / __alloc_file
>
> write to 0xffff8880bb157398 of 4 bytes by task 10993 on cpu 0:
>  get_cred include/linux/cred.h:253 [inline]
>  __alloc_file+0x74/0x210 fs/file_table.c:105
>  alloc_empty_file+0x8f/0x180 fs/file_table.c:151
>  alloc_file+0x4e/0x2b0 fs/file_table.c:193
>  alloc_file_pseudo+0x11c/0x1b0 fs/file_table.c:232
>  anon_inode_getfile fs/anon_inodes.c:91 [inline]
>  anon_inode_getfile+0x103/0x1d0 fs/anon_inodes.c:74
>  __do_sys_perf_event_open+0xd32/0x1ac0 kernel/events/core.c:11100
>  __se_sys_perf_event_open kernel/events/core.c:10867 [inline]
>  __x64_sys_perf_event_open+0x70/0x90 kernel/events/core.c:10867
>  do_syscall_64+0xcc/0x370 arch/x86/entry/common.c:290
>  entry_SYSCALL_64_after_hwframe+0x44/0xa9
>
> write to 0xffff8880bb157398 of 4 bytes by task 11004 on cpu 1:
>  get_cred include/linux/cred.h:253 [inline]
>  __alloc_file+0x74/0x210 fs/file_table.c:105
>  alloc_empty_file+0x8f/0x180 fs/file_table.c:151
>  path_openat+0x74/0x36e0 fs/namei.c:3514
>  do_filp_open+0x11e/0x1b0 fs/namei.c:3555
>  do_sys_open+0x3b3/0x4f0 fs/open.c:1097
>  __do_sys_open fs/open.c:1115 [inline]
>  __se_sys_open fs/open.c:1110 [inline]
>  __x64_sys_open+0x55/0x70 fs/open.c:1110
>  do_syscall_64+0xcc/0x370 arch/x86/entry/common.c:290
>  entry_SYSCALL_64_after_hwframe+0x44/0xa9
>
> Reported by Kernel Concurrency Sanitizer on:
> CPU: 1 PID: 11004 Comm: syz-executor.5 Not tainted 5.4.0-rc3+ #0
> Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011
> ==================================================================
>
>
> ---
> This bug is generated by a bot. It may contain errors.
> See https://goo.gl/tpsmEJ for more information about syzbot.
> syzbot engineers can be reached at syzkaller@xxxxxxxxxxxxxxxxx
>
> syzbot will keep track of this bug report. See:
> https://goo.gl/tpsmEJ#status for how to communicate with syzbot.

Linus, what do you think of the following fix?

I also took the opportunity to avoid dirtying a cache line when the flag is already clear.

diff --git a/include/linux/cred.h b/include/linux/cred.h
index 18639c069263fbe79dfd5a36163c656dca5da220..01b5b7d4e054ddca0df676dc1ceb068e5d71a3f8 100644
--- a/include/linux/cred.h
+++ b/include/linux/cred.h
@@ -250,7 +250,14 @@ static inline const struct cred *get_cred(const struct cred *cred)
 	if (!cred)
 		return cred;
 	validate_creds(cred);
-	nonconst_cred->non_rcu = 0;
+
+	/*
+	 * Avoid dirtying one cache line. The WRITE_ONCE() also pairs
+	 * with itself, since we run without protection of a lock.
+	 */
+	if (READ_ONCE(nonconst_cred->non_rcu))
+		WRITE_ONCE(nonconst_cred->non_rcu, 0);
+
 	return get_new_cred(nonconst_cred);
 }

@@ -262,7 +269,14 @@ static inline const struct cred *get_cred_rcu(const struct cred *cred)
 	if (!atomic_inc_not_zero(&nonconst_cred->usage))
 		return NULL;
 	validate_creds(cred);
-	nonconst_cred->non_rcu = 0;
+
+	/*
+	 * Avoid dirtying one cache line. The WRITE_ONCE() also pairs
+	 * with itself, since we run without protection of a lock.
+	 */
+	if (READ_ONCE(nonconst_cred->non_rcu))
+		WRITE_ONCE(nonconst_cred->non_rcu, 0);
+
 	return cred;
 }
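
For anyone who wants to play with the idea outside the kernel tree, the
check-before-write pattern used in the patch can be sketched in userspace C
roughly as follows. This is only an illustration under assumptions, not the
kernel code: struct fake_cred, fake_get_cred(), run_demo() and the
READ_ONCE_INT()/WRITE_ONCE_INT() macros are hypothetical names, and relaxed
C11 atomics stand in for the kernel's READ_ONCE()/WRITE_ONCE(). The real
struct cred layout (and therefore the actual cache line behaviour) is not
modelled here.

```c
#include <pthread.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical stand-in for the two struct cred fields touched by get_cred(). */
struct fake_cred {
	atomic_int usage;	/* reference count */
	atomic_int non_rcu;	/* flag that get_cred() wants to be 0 */
};

/* Assumption: relaxed atomics approximate READ_ONCE()/WRITE_ONCE() here. */
#define READ_ONCE_INT(p)	atomic_load_explicit((p), memory_order_relaxed)
#define WRITE_ONCE_INT(p, v)	atomic_store_explicit((p), (v), memory_order_relaxed)

static struct fake_cred *fake_get_cred(struct fake_cred *cred)
{
	if (!cred)
		return cred;

	/* Store only when the flag is set, so the common case is a plain load. */
	if (READ_ONCE_INT(&cred->non_rcu))
		WRITE_ONCE_INT(&cred->non_rcu, 0);

	/* Take a reference, loosely mirroring get_new_cred(). */
	atomic_fetch_add_explicit(&cred->usage, 1, memory_order_relaxed);
	return cred;
}

static void *worker(void *arg)
{
	for (int i = 0; i < 1000; i++)
		fake_get_cred(arg);
	return NULL;
}

/*
 * Run two workers that race on the same object, like the two tasks in the
 * report. Returns the final usage count, or -1 if a thread could not start.
 */
static int run_demo(struct fake_cred *cred)
{
	pthread_t a, b;

	if (pthread_create(&a, NULL, worker, cred))
		return -1;
	if (pthread_create(&b, NULL, worker, cred)) {
		pthread_join(a, NULL);
		return -1;
	}
	pthread_join(a, NULL);
	pthread_join(b, NULL);
	return atomic_load(&cred->usage);
}
```

To try it, initialise a struct fake_cred with atomic_init(), call run_demo()
from a main() of your own, and build with -pthread. Both threads may still
store 0 at the same time, but every access is marked and all writers store
the same value, which is the property the patch relies on.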