Re: WARNING in get_unlocked_entry
From: Dmitry Vyukov
Date: Wed Oct 31 2018 - 05:45:37 EST
On Wed, Oct 31, 2018 at 4:18 AM, Matthew Wilcox <willy@xxxxxxxxxxxxx> wrote:
> On Tue, Oct 30, 2018 at 08:00:03AM -0700, syzbot wrote:
>> syzbot found the following crash on:
>>
>> HEAD commit: 4b42745211af Merge tag 'armsoc-soc' of git://git.kernel.or..
>> git tree: upstream
>> console output: https://syzkaller.appspot.com/x/log.txt?x=1187d06d400000
>> kernel config: https://syzkaller.appspot.com/x/.config?x=93932074d01b4a5
>> dashboard link: https://syzkaller.appspot.com/bug?extid=4fd0c066d82852499145
>> compiler: gcc (GCC) 8.0.1 20180413 (experimental)
>>
>> Unfortunately, I don't have any reproducer for this crash yet.
>
> Hmmpf. Would have been nice if syzbot had caught this during its tests
> of the -next tree ...
>
>> IMPORTANT: if you fix the bug, please add the following tag to the commit:
>> Reported-by: syzbot+4fd0c066d82852499145@xxxxxxxxxxxxxxxxxxxxxxxxx
>>
>> EXT4-fs (sda1): Cannot specify journal on remount
>> EXT4-fs (sda1): DAX enabled. Warning: EXPERIMENTAL, use at your own risk
>> EXT4-fs (sda1): warning: refusing change of dax flag with busy inodes while
>> remounting
>> WARNING: CPU: 0 PID: 12870 at fs/dax.c:227 get_unlocked_entry+0x3ac/0x4d0
>> fs/dax.c:227
>> Kernel panic - not syncing: panic_on_warn set ...
>>
>> CPU: 0 PID: 12870 Comm: syz-executor3 Not tainted 4.19.0+ #87
>> Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS
>> Google 01/01/2011
>> Call Trace:
>> __dump_stack lib/dump_stack.c:77 [inline]
>> dump_stack+0x244/0x39d lib/dump_stack.c:113
>> panic+0x238/0x4e7 kernel/panic.c:184
>> __warn.cold.8+0x20/0x4a kernel/panic.c:536
>> report_bug+0x254/0x2d0 lib/bug.c:186
>> fixup_bug arch/x86/kernel/traps.c:178 [inline]
>> do_error_trap+0x11b/0x200 arch/x86/kernel/traps.c:271
>> do_invalid_op+0x36/0x40 arch/x86/kernel/traps.c:290
>> invalid_op+0x14/0x20 arch/x86/entry/entry_64.S:966
>> RIP: 0010:get_unlocked_entry+0x3ac/0x4d0 fs/dax.c:227
>> Code: e8 31 ff 83 e0 01 48 89 c6 48 89 85 10 ff ff ff e8 09 18 96 ff 48 8b
>> 85 10 ff ff ff 48 85 c0 0f 85 34 fe ff ff e8 c4 16 96 ff <0f> 0b e8 bd 16 96
>> ff 48 b8 00 00 00 00 00 fc ff df 48 03 85 f8 fe
>> RSP: 0018:ffff88018025e7b8 EFLAGS: 00010012
>> RAX: 0000000000040000 RBX: ffff88018025ead8 RCX: ffffc90008983000
>> RDX: 0000000000000341 RSI: ffffffff81e94d3c RDI: 0000000000000007
>> RBP: ffff88018025e8c8 R08: ffff88017f862280 R09: ffffed003004bd08
>> R10: ffffed003004bd08 R11: 0000000000000003 R12: dffffc0000000000
>> R13: ffffea0005e01040 R14: ffff88018025e800 R15: ffff88018025e8a0
>> grab_mapping_entry fs/dax.c:447 [inline]
>
> OK, so I'd like to know what value actually got loaded into 'entry'.
> Maybe my x86-fu is weak, but I don't understand how this register dump
> correlates with the instructions I'm seeing.
>
> Here's the C code:
>
> entry = xas_load(xas);
> if (!entry || xa_is_internal(entry) ||
> WARN_ON_ONCE(!xa_is_value(entry)) ||
> !dax_is_locked(entry))
> return entry;
>
>
> I downloaded the .config that syzbot was using and built fs/dax.o.
> Disassembling it, I get:
>
> WARN_ON_ONCE(!xa_is_value(entry)) ||
> 5e53: 31 ff xor %edi,%edi
> 5e55: 83 e0 01 and $0x1,%eax
> 5e58: 48 89 c6 mov %rax,%rsi
> 5e5b: 48 89 85 10 ff ff ff mov %rax,-0xf0(%rbp)
> 5e62: e8 00 00 00 00 callq 5e67 <get_unlocked_entry+0x397>
> 5e63: R_X86_64_PLT32 __sanitizer_cov_trace_const_cmp8
> -0x4
> 5e67: 48 8b 85 10 ff ff ff mov -0xf0(%rbp),%rax
> 5e6e: 48 85 c0 test %rax,%rax
> 5e71: 0f 85 34 fe ff ff jne 5cab <get_unlocked_entry+0x1db>
> 5e77: e8 00 00 00 00 callq 5e7c <get_unlocked_entry+0x3ac>
> 5e78: R_X86_64_PLT32 __sanitizer_cov_trace_pc-0x4
> 5e7c: 0f 0b ud2
>
> That ud2 is the WARN_ON trigger. It's at offset 0x3ac from the start of
> get_unlocked_entry() so it matches the dump.
>
> The key is 'test %rax,%rax' at 5e6e. If that passes, we jump to 5cab
> where we do xa_to_value() in order to implement dax_is_locked(). So is
> 'entry' actually 0x0000000000040000 like the dump says?
>
> That's where I get confused. The previous insn (5e67) loads rax from what
> I presume is the stack, having previously stored it there at insn 535b.
> But insn 5e51 should have reduced %rax to just 0 or 1 in the low bit of
> the register. So clearly I have my understanding of x86 insns confused.
> If someone could help me out here, I'd be grateful.
There is a call to __sanitizer_cov_trace_pc before ud2 and rax is not
a caller-preserved register. So I think we can just assume that
xa_is_value returned 0.