Re: [syzbot] KASAN: use-after-free Read in rfkill_blocked
From: Dmitry Vyukov
Date: Wed Nov 30 2022 - 04:33:52 EST
On Mon, 28 Nov 2022 at 11:15, Krzysztof Kozlowski
<krzysztof.kozlowski@xxxxxxxxxx> wrote:
>
> On 28/11/2022 11:07, Dmitry Vyukov wrote:
> > On Sun, 27 Nov 2022 at 20:59, Krzysztof Kozlowski
> > <krzysztof.kozlowski@xxxxxxxxxx> wrote:
> >>
> >> On 25/11/2022 10:09, Johannes Berg wrote:
> >>> Looks like an NFC issue to me, Krzysztof?
> >>>
> >>> I mean, rfkill got allocated by nfc_register_device(), freed by
> >>> nfc_unregister_device(), and then used by nfc_dev_up(). Seems like the
> >>> last bit shouldn't be possible after nfc_unregister_device()?
> >>>
> >>> johannes
> >>>
> >>> On Wed, 2022-11-23 at 22:24 -0800, syzbot wrote:
> >>>> Hello,
> >>>>
> >>>> syzbot found the following issue on:
> >>>>
> >>>> HEAD commit: 0966d385830d riscv: Fix auipc+jalr relocation range checks
> >>>> git tree: git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux.git fixes
> >>>> console output: https://syzkaller.appspot.com/x/log.txt?x=11196d0d880000
> >>>> kernel config: https://syzkaller.appspot.com/x/.config?x=6295d67591064921
> >>>> dashboard link: https://syzkaller.appspot.com/bug?extid=0299462c067009827b2a
> >>>> compiler: riscv64-linux-gnu-gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.2
> >>>> userspace arch: riscv64
> >>>>
> >>>> Unfortunately, I don't have any reproducer for this issue yet.
> >>>>
> >>>> IMPORTANT: if you fix the issue, please add the following tag to the commit:
> >>>> Reported-by: syzbot+0299462c067009827b2a@xxxxxxxxxxxxxxxxxxxxxxxxx
> >>>>
> >>>> ==================================================================
> >>>> BUG: KASAN: use-after-free in __lock_acquire+0x8ee/0x333e kernel/locking/lockdep.c:4897
> >>>> Read of size 8 at addr ffffaf8024249018 by task syz-executor.0/7946
> >>>>
> >>>> CPU: 0 PID: 7946 Comm: syz-executor.0 Not tainted 5.17.0-rc1-syzkaller-00002-g0966d385830d #0
> >>>> Hardware name: riscv-virtio,qemu (DT)
> >>>> Call Trace:
> >>>> [<ffffffff8000a228>] dump_backtrace+0x2e/0x3c arch/riscv/kernel/stacktrace.c:113
> >>>> [<ffffffff831668cc>] show_stack+0x34/0x40 arch/riscv/kernel/stacktrace.c:119
> >>>> [<ffffffff831756ba>] __dump_stack lib/dump_stack.c:88 [inline]
> >>>> [<ffffffff831756ba>] dump_stack_lvl+0xe4/0x150 lib/dump_stack.c:106
> >>>> [<ffffffff8047479e>] print_address_description.constprop.0+0x2a/0x330 mm/kasan/report.c:255
> >>>> [<ffffffff80474d4c>] __kasan_report mm/kasan/report.c:442 [inline]
> >>>> [<ffffffff80474d4c>] kasan_report+0x184/0x1e0 mm/kasan/report.c:459
> >>>> [<ffffffff80475b20>] check_region_inline mm/kasan/generic.c:183 [inline]
> >>>> [<ffffffff80475b20>] __asan_load8+0x6e/0x96 mm/kasan/generic.c:256
> >>>> [<ffffffff80112b70>] __lock_acquire+0x8ee/0x333e kernel/locking/lockdep.c:4897
> >>>> [<ffffffff80116582>] lock_acquire.part.0+0x1d0/0x424 kernel/locking/lockdep.c:5639
> >>>> [<ffffffff8011682a>] lock_acquire+0x54/0x6a kernel/locking/lockdep.c:5612
> >>>> [<ffffffff831afa2c>] __raw_spin_lock_irqsave include/linux/spinlock_api_smp.h:110 [inline]
> >>>> [<ffffffff831afa2c>] _raw_spin_lock_irqsave+0x3e/0x62 kernel/locking/spinlock.c:162
> >>>> [<ffffffff83034f0a>] rfkill_blocked+0x22/0x62 net/rfkill/core.c:941
> >>>> [<ffffffff830b8862>] nfc_dev_up+0x8e/0x26c net/nfc/core.c:102
> >>>> [<ffffffff830bb742>] nfc_genl_dev_up+0x5e/0x8a net/nfc/netlink.c:770
> >>>> [<ffffffff8296f9ae>] genl_family_rcv_msg_doit+0x19a/0x23c net/netlink/genetlink.c:731
> >>>> [<ffffffff82970420>] genl_family_rcv_msg net/netlink/genetlink.c:775 [inline]
> >>>> [<ffffffff82970420>] genl_rcv_msg+0x236/0x3ba net/netlink/genetlink.c:792
> >>>> [<ffffffff8296ded2>] netlink_rcv_skb+0xf8/0x2be net/netlink/af_netlink.c:2494
> >>>> [<ffffffff8296ecb2>] genl_rcv+0x36/0x4c net/netlink/genetlink.c:803
> >>>> [<ffffffff8296cbcc>] netlink_unicast_kernel net/netlink/af_netlink.c:1317 [inline]
> >>>> [<ffffffff8296cbcc>] netlink_unicast+0x40e/0x5fe net/netlink/af_netlink.c:1343
> >>>> [<ffffffff8296d29c>] netlink_sendmsg+0x4e0/0x994 net/netlink/af_netlink.c:1919
> >>>> [<ffffffff826d264e>] sock_sendmsg_nosec net/socket.c:705 [inline]
> >>>> [<ffffffff826d264e>] sock_sendmsg+0xa0/0xc4 net/socket.c:725
> >>>> [<ffffffff826d4dd4>] ____sys_sendmsg+0x46e/0x484 net/socket.c:2413
> >>>> [<ffffffff826d8bca>] ___sys_sendmsg+0x16c/0x1f6 net/socket.c:2467
> >>>> [<ffffffff826d8e78>] __sys_sendmsg+0xba/0x150 net/socket.c:2496
> >>>> [<ffffffff826d8f3a>] __do_sys_sendmsg net/socket.c:2505 [inline]
> >>>> [<ffffffff826d8f3a>] sys_sendmsg+0x2c/0x3a net/socket.c:2503
> >>>> [<ffffffff80005716>] ret_from_syscall+0x0/0x2
> >>>>
> >>>> Allocated by task 7946:
> >>>> stack_trace_save+0xa6/0xd8 kernel/stacktrace.c:122
> >>>> kasan_save_stack+0x2c/0x58 mm/kasan/common.c:38
> >>>> kasan_set_track mm/kasan/common.c:45 [inline]
> >>>> set_alloc_info mm/kasan/common.c:436 [inline]
> >>>> ____kasan_kmalloc mm/kasan/common.c:515 [inline]
> >>>> ____kasan_kmalloc mm/kasan/common.c:474 [inline]
> >>>> __kasan_kmalloc+0x80/0xb2 mm/kasan/common.c:524
> >>>> kasan_kmalloc include/linux/kasan.h:270 [inline]
> >>>> __kmalloc+0x190/0x318 mm/slub.c:4424
> >>>> kmalloc include/linux/slab.h:586 [inline]
> >>>> kzalloc include/linux/slab.h:715 [inline]
> >>>> rfkill_alloc+0x96/0x1aa net/rfkill/core.c:983
> >>>> nfc_register_device+0xe4/0x29e net/nfc/core.c:1129
> >>>> nci_register_device+0x538/0x612 net/nfc/nci/core.c:1252
> >>>> virtual_ncidev_open+0x82/0x12c drivers/nfc/virtual_ncidev.c:143
> >>>> misc_open+0x272/0x2c8 drivers/char/misc.c:141
> >>>> chrdev_open+0x1d4/0x478 fs/char_dev.c:414
> >>>> do_dentry_open+0x2a4/0x7d4 fs/open.c:824
> >>>> vfs_open+0x52/0x5e fs/open.c:959
> >>>> do_open fs/namei.c:3476 [inline]
> >>>> path_openat+0x12b6/0x189e fs/namei.c:3609
> >>>> do_filp_open+0x10e/0x22a fs/namei.c:3636
> >>>> do_sys_openat2+0x174/0x31e fs/open.c:1214
> >>>> do_sys_open fs/open.c:1230 [inline]
> >>>> __do_sys_openat fs/open.c:1246 [inline]
> >>>> sys_openat+0xdc/0x164 fs/open.c:1241
> >>>> ret_from_syscall+0x0/0x2
> >>>>
> >>>> Freed by task 7944:
> >>>> stack_trace_save+0xa6/0xd8 kernel/stacktrace.c:122
> >>>> kasan_save_stack+0x2c/0x58 mm/kasan/common.c:38
> >>>> kasan_set_track+0x1a/0x26 mm/kasan/common.c:45
> >>>> kasan_set_free_info+0x1e/0x3a mm/kasan/generic.c:370
> >>>> ____kasan_slab_free mm/kasan/common.c:366 [inline]
> >>>> ____kasan_slab_free+0x15e/0x180 mm/kasan/common.c:328
> >>>> __kasan_slab_free+0x10/0x18 mm/kasan/common.c:374
> >>>> kasan_slab_free include/linux/kasan.h:236 [inline]
> >>>> slab_free_hook mm/slub.c:1728 [inline]
> >>>> slab_free_freelist_hook+0x8e/0x1cc mm/slub.c:1754
> >>>> slab_free mm/slub.c:3509 [inline]
> >>>> kfree+0xe0/0x3e4 mm/slub.c:4562
> >>>> rfkill_release+0x20/0x2a net/rfkill/core.c:831
> >>>> device_release+0x66/0x148 drivers/base/core.c:2229
> >>>> kobject_cleanup lib/kobject.c:705 [inline]
> >>>> kobject_release lib/kobject.c:736 [inline]
> >>>> kref_put include/linux/kref.h:65 [inline]
> >>>> kobject_put+0x1bc/0x38e lib/kobject.c:753
> >>>> put_device+0x28/0x3a drivers/base/core.c:3512
> >>>> rfkill_destroy+0x2a/0x3c net/rfkill/core.c:1142
> >>>> nfc_unregister_device+0xac/0x232 net/nfc/core.c:1167
> >>>> nci_unregister_device+0x168/0x182 net/nfc/nci/core.c:1298
> >>>> virtual_ncidev_close+0x9c/0xbc drivers/nfc/virtual_ncidev.c:163
> >>
> >> There were several issues found recently in virtual NCI driver, so this
> >> might be one of them. There is no reproducer, though...
> >
> >
> > Hi Krzysztof,
> >
> > Do you think it's related specifically to the virtual driver?
>
> Both, although maybe not this particular issue. There were like five
> separate reports last few days...
>
> >
> > I would assume it's a bug in the NCI core itself related to dynamic
> > device destructions. This should affect e.g. USB devices as well.
>
> Earlier this year there was a bigger fix for unregister path in NFC -
> see commits:
> da5c0f119203 (nfc_unregister_device+nfc_fw_download
> ef27324e2c (nci_unregister_device+nci_cmd_work)
> 1b0e81416 (rfkill related)
> and these pointed out inherent issues in locking/synchronization of NFC
> core modules. I don't think we fixed all of the core issues, rather only
> what was reported, so some specific scenarios.
>
> > It's an issue only in the virtual driver. It means that the virtual
> > driver uses the NCI core incorrectly, not the way all real drivers use
> > it. If so the question is: what is the difference? We need to fix it.
> > It's not useful to have unrealistic test drivers -- we both get false
> > positives and don't get true positives.
> >
> > I think the issue may be localized from the KASAN report itself w/o a
> > reproducer.
> > Is there proper synchronization between
> > nfc_unregister_device/rfkill_destroy and nfc_dev_up/rfkill_blocked?
> > Something that prevents rfkill_blocked to be called after
> > rfkill_destroy? If not, then that's the issue.
>
> Mentioned 1b0e81416a tried to do this and that time I had impression fix
> is correct. However it seems it is not... (or not enough)
>
> Best regards,
> Krzysztof
#syz dup: WARNING in nci_send_cmd