Re: [Nouveau] Machine freeze on latest Linus kernel, seems related to nouveau
From: Ilia Mirkin
Date: Sat May 17 2014 - 14:33:29 EST
On Sat, May 17, 2014 at 2:13 PM, Damien Wyart <damien.wyart@xxxxxxxxx> wrote:
> Hi;
>
> After further tests, I can reproduce the problem on 3.14.4 also,
> mainly by visiting the following URL with Firefox (29.0.1) :
> http://lavieestmaloptimisee.blogspot.fr/
Amazing. I get the same thing in chrome on my setup (G96).
[235255.701101] BUG: unable to handle kernel paging request at ffffc90013d00000
[235255.701119] IP: [<ffffffff8131f3de>] iowrite32+0xe/0x31
[235255.701130] PGD 1a880e067 PUD 1a880f067 PMD 1a7533067 PTE 0
[235255.701221] CPU: 0 PID: 22304 Comm: chrome Not tainted 3.15.0-rc5+ #83
and a slightly different backtrace, although both eventually try to
create a gpuobj:
[235255.701381] [<ffffffffa02455d0>] ? nouveau_barobj_wr32+0x14/0x16 [nouveau]
[235255.701400] [<ffffffffa0242d28>] _nouveau_gpuobj_wr32+0x2a/0x2c [nouveau]
[235255.701418] [<ffffffffa0242fe8>]
nouveau_gpuobj_create_+0x1f7/0x247 [nouveau]
[235255.701437] [<ffffffffa0243075>] _nouveau_gpuobj_ctor+0x3d/0x4b [nouveau]
[235255.701457] [<ffffffffa02442c6>] nouveau_object_ctor+0x32/0xaf [nouveau]
[235255.701475] [<ffffffffa02430d1>] nouveau_gpuobj_new+0x4e/0x50 [nouveau]
[235255.701504] [<ffffffffa026d823>] nouveau_vm_get+0x151/0x27a [nouveau]
[235255.701545] [<ffffffffa02a3f47>] ?
nouveau_gem_object_open+0x7a/0xbe [nouveau]
[235255.701585] [<ffffffffa02a377d>] nouveau_bo_vma_add+0x36/0x9f [nouveau]
[235255.701624] [<ffffffffa02a3f61>]
nouveau_gem_object_open+0x94/0xbe [nouveau]
[235255.701640] [<ffffffffa007fb47>]
drm_gem_handle_create_tail+0xe0/0x106 [drm]
[235255.701654] [<ffffffffa007fba6>] drm_gem_handle_create+0x39/0x40 [drm]
[235255.701693] [<ffffffffa02a4279>] nouveau_gem_ioctl_new+0xc9/0x117 [nouveau]
[235255.701707] [<ffffffffa007e216>] drm_ioctl+0x2ae/0x416 [drm]
[235255.701745] [<ffffffffa02a41b0>] ? nouveau_gem_new+0xe9/0xe9 [nouveau]
[235255.701755] [<ffffffff813d0eca>] ? __pm_runtime_resume+0x6b/0x7a
[235255.701793] [<ffffffffa029e158>] nouveau_drm_ioctl+0x5d/0x92 [nouveau]
[235255.701803] [<ffffffff8115fa37>] do_vfs_ioctl+0x3f7/0x441
[235255.701811] [<ffffffff81152a10>] ? fput+0x17/0x8d
[235255.701820] [<ffffffff81135971>] ? SyS_mmap_pgoff+0x1a2/0x1d4
[235255.701827] [<ffffffff8115fabf>] SyS_ioctl+0x3e/0x67
[235255.701837] [<ffffffff81724e12>] system_call_fastpath+0x16/0x1b
This is on top of a 3.15-rc5+ kernel.
Normally these things are very hard to debug because they're
~impossible to reproduce. However this website seems to do the trick
(I'm guessing with the help of the youtube embeds).
Ben, any advice on debugging? (Or even better, an idea as to what's
wrong... I seem to recall this happening a lot a while back and then
it magically fixed itself for a while...)
>
> Firefox becomes unresponsive and I get some messages in the kernel log :
>
> May 17 20:01:36 brouette kernel: BUG: unable to handle kernel paging
> request at ffffc90015300000
> May 17 20:01:36 brouette kernel: IP: [<ffffffff8127ff38>] iowrite32+0x38/0x40
> May 17 20:01:36 brouette kernel: PGD 1b880f067 PUD 1b8850067 PMD 1b6cfd067 PTE 0
> May 17 20:01:36 brouette kernel: Oops: 0002 [#1] PREEMPT SMP
> May 17 20:01:36 brouette kernel: Modules linked in: usb_storage
> nf_conntrack_ipv6 nf_defrag_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4
> xt_dscp xt_mark cls_flow cls_fw sch_sfq sch_htb xt_helper xt_length
> nf_conntrack_ftp nf_conntrack ip6table_mangle ip6_tables
> iptable_mangle ip_tables x_tables cpufreq_powersave cpufreq_userspace
> cpufreq_conservative deadline_iosched binfmt_misc hid_roccat_konepure
> hid_roccat hid_roccat_common hid_generic snd_usb_audio snd_usbmidi_lib
> usbhid snd_hwdep nouveau snd_hda_codec_realtek snd_hda_codec_generic
> wmi snd_ca0106 video i2c_algo_bit snd_ac97_codec ac97_bus
> snd_seq_dummy snd_seq_midi snd_seq_oss ttm snd_seq_midi_event
> drm_kms_helper snd_seq drm i2c_core backlight snd_rawmidi
> snd_hda_intel snd_hda_codec snd_seq_device sr_mod snd_pcm_oss pcspkr
> cdrom nvidiafb snd_mixer_oss vgastate snd_pcm snd_timer ehci_pci
> uhci_hcd ehci_hcd usbcore evdev usb_common acpi_cpufreq loop fuse
> autofs4
> May 17 20:01:36 brouette kernel: CPU: 1 PID: 12671 Comm:
> plugin-containe Tainted: P O 3.14.4 #1
> May 17 20:01:36 brouette kernel: Hardware name: System manufacturer
> System Product Name/P6T SE, BIOS 0808 03/08/2010
> May 17 20:01:36 brouette kernel: task: ffff8800ba42a160 ti:
> ffff8801ac584000 task.ti: ffff8801ac584000
> May 17 20:01:36 brouette kernel: RIP: 0010:[<ffffffff8127ff38>]
> [<ffffffff8127ff38>] iowrite32+0x38/0x40
> May 17 20:01:36 brouette kernel: RSP: 0018:ffff8801ac585b90 EFLAGS: 00010292
> May 17 20:01:36 brouette kernel: RAX: ffff8800813e65a0 RBX:
> ffff8801b5aed600 RCX: 0000000000000000
> May 17 20:01:36 brouette kernel: RDX: ffffc90015300000 RSI:
> ffffc90015300000 RDI: 0000000000000000
> May 17 20:01:36 brouette kernel: RBP: ffff8801ac585b98 R08:
> ffffffffa0423100 R09: 0000000000000000
> May 17 20:01:36 brouette kernel: R10: 0000000000000000 R11:
> 000000000000000f R12: 0000000000060004
> May 17 20:01:36 brouette kernel: R13: 0000000000000000 R14:
> ffff8801ac585bd8 R15: 0000000000000000
> May 17 20:01:36 brouette kernel: FS: 00007f1767d15a40(0000)
> GS:ffff8801bfc20000(0000) knlGS:0000000000000000
> May 17 20:01:36 brouette kernel: CS: 0010 DS: 0000 ES: 0000 CR0:
> 0000000080050033
> May 17 20:01:36 brouette kernel: CR2: ffffc90015300000 CR3:
> 000000019d752000 CR4: 00000000000007e0
> May 17 20:01:36 brouette kernel: Stack:
> May 17 20:01:36 brouette kernel: ffffffffa036be6f ffff8801ac585ba8
> ffffffffa0368f25 ffff8801ac585c08
> May 17 20:01:36 brouette kernel: ffffffffa03691c6 0100000077359400
> ffffffffa0443ff0 000000001de8d000
> May 17 20:01:36 brouette kernel: ffff8800813e65a0 0000000000000000
> ffff8801b5aed700 ffff880165e83740
> May 17 20:01:36 brouette kernel: Call Trace:
> May 17 20:01:36 brouette kernel: [<ffffffffa036be6f>] ?
> nouveau_barobj_wr32+0xf/0x20 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa0368f25>]
> _nouveau_gpuobj_wr32+0x25/0x30 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa03691c6>]
> nouveau_gpuobj_create_+0x1c6/0x2c0 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa036848f>]
> nouveau_engctx_create_+0x24f/0x2a0 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa03daddc>]
> nv50_graph_context_ctor+0x3c/0x80 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa036a8ad>]
> nouveau_object_ctor+0x2d/0xc0 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa036b09b>]
> nouveau_object_new+0xeb/0x200 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa03edc1b>]
> nouveau_abi16_ioctl_grobj_alloc+0x6b/0xe0 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffffa0253a62>]
> drm_ioctl+0x4c2/0x5f0 [drm]
> May 17 20:01:36 brouette kernel: [<ffffffff810dbcb2>] ?
> do_mmap_pgoff+0x2c2/0x380
> May 17 20:01:36 brouette kernel: [<ffffffffa03e3799>]
> nouveau_drm_ioctl+0x9/0x10 [nouveau]
> May 17 20:01:36 brouette kernel: [<ffffffff81109850>] do_vfs_ioctl+0x2e0/0x4c0
> May 17 20:01:36 brouette kernel: [<ffffffff81113239>] ? __fget+0x69/0xb0
> May 17 20:01:36 brouette kernel: [<ffffffff81109ab1>] SyS_ioctl+0x81/0xa0
> May 17 20:01:36 brouette kernel: [<ffffffff81482f62>]
> system_call_fastpath+0x16/0x1b
> May 17 20:01:36 brouette kernel: Code: 81 fe 00 00 01 00 76 0b 0f b7
> d6 89 f8 ef c3 0f 1f 40 00 55 48 c7 c6 e1 9e 58 81 48 89 d7 48 89 e5
> e8 1d fe ff ff 5d c3 0f 1f 00 <89> 3e c3 0f 1f 44 00 00 48 81 ff ff ff
> 03 00 77 37 48 81 ff 00
> May 17 20:01:36 brouette kernel: RIP [<ffffffff8127ff38>] iowrite32+0x38/0x40
> May 17 20:01:36 brouette kernel: RSP <ffff8801ac585b90>
> May 17 20:01:36 brouette kernel: CR2: ffffc90015300000
> May 17 20:01:36 brouette kernel: ---[ end trace e4c3bdfb0b08f505 ]---
>
> Thanks in advance for any help.
>
> Best
>
> Damien Wyart
>
> On Fri, May 16, 2014 at 10:05 AM, Damien Wyart <damien.wyart@xxxxxxxxx> wrote:
>> Hi,
>>
>> I am running the latest kernel from Linus. Once yesterday and once
>> today, I got a freeze of my machine. The first time, I could reboot
>> with sysrq, but not the second one (completely unresponsive). The
>> first time,, there was nothng in the logs, but today, I got some
>> traces which I attach to this email.
>>
>> I can provide more details if needed ; the card is GeForce 9600 GT and
>> the OS Debian Sid.
>>
>> Thanks in advance for any feedback.
>>
>> Damien Wyart
> _______________________________________________
> Nouveau mailing list
> Nouveau@xxxxxxxxxxxxxxxxxxxxx
> http://lists.freedesktop.org/mailman/listinfo/nouveau
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/