Re: general protection fault: 0000 [#1] PREEMPT SMP

From: Yong Zhang
Date: Tue Aug 24 2010 - 05:08:50 EST


On Fri, Aug 13, 2010 at 9:47 PM, Sergey Senozhatsky
<sergey.senozhatsky@xxxxxxxxx> wrote:
> Hello,
>
> Got this traces today:
>
> [ Â 29.940248] CPU 1 is now offline
> [ Â 29.941025] general protection fault: 0000 [#1] PREEMPT SMP
> [ Â 29.941103] last sysfs file: /sys/devices/system/cpu/cpu1/online
> [ Â 29.941157] CPU 0
> [ Â 29.941178] Modules linked in: snd_hwdep snd_hda_codec_atihdmi snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device battery ac wmi snd_pcm_oss snd_mixer_oss button snd_hda_codec_realtek radeon broadcom snd_hda_intel
> snd_hda_codec snd_pcm snd_timer snd soundcore snd_page_alloc usbhid hid tg3 libphy psmouse serio_raw evdev ttm drm_kms_helper ehci_hcd sr_mod usbcore cdrom sd_mod ahci libahci
> [ Â 29.941679]
> [ Â 29.941699] Pid: 5208, comm: bash Not tainted 2.6.36-rc0-git12-07921-g60bf26a-dirty #124 Aspire 5741G Â Â/Aspire 5741G
> [ Â 29.941792] RIP: 0010:[<ffffffff810640a1>] Â[<ffffffff810640a1>] __lock_acquire+0x4e9/0x17fd
> [ Â 29.941878] RSP: 0018:ffff88015751dbc8 ÂEFLAGS: 00010082
> [ Â 29.941926] RAX: 0000000000000001 RBX: ffff880152284920 RCX: 0000000000000000
> [ Â 29.941988] RDX: dead4ead00000202 RSI: 0000000000000000 RDI: ffff880152284920
> [ Â 29.942049] RBP: ffff88015751dca8 R08: 0000000000000002 R09: 0000000000000001
> [ Â 29.942111] R10: 0000000000000000 R11: 0000000000000005 R12: ffff8801504c3f60
> [ Â 29.942171] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000002
> [ Â 29.942233] FS: Â00007f53580ca700(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
> [ Â 29.942303] CS: Â0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [ Â 29.942353] CR2: 00007fb030079000 CR3: 0000000151b5a000 CR4: 00000000000006f0
> [ Â 29.942414] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> [ Â 29.942475] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
> [ Â 29.942536] Process bash (pid: 5208, threadinfo ffff88015751c000, task ffff8801504c3f60)
> [ Â 29.942604] Stack:
> [ Â 29.942625] Âffff8801569a4888 ffff88015751dc48 ffff88015751dcb8 ffffffff81132b2a
> [ Â 29.942702] <0> ffff8801504c3f60 0000000000000001 ffff880100000000 ffffffff8186c0a0
> [ Â 29.942789] <0> ffffffff00000000 0000000000000004 ffff8801504c3f60 ffff8801504c3f60
> [ Â 29.942880] Call Trace:
> [ Â 29.942910] Â[<ffffffff81132b2a>] ? sysfs_deactivate+0x3e/0xec
> [ Â 29.942966] Â[<ffffffff81062ddd>] ? mark_held_locks+0x50/0x72
> [ Â 29.943019] Â[<ffffffff81065893>] lock_acquire+0x97/0xb6
> [ Â 29.943072] Â[<ffffffff8137145b>] ? percpu_counter_hotcpu_callback+0x3e/0x93
> [ Â 29.943136] Â[<ffffffff81374321>] ? mutex_lock_nested+0x2f3/0x31b
> [ Â 29.943192] Â[<ffffffff81371446>] ? percpu_counter_hotcpu_callback+0x29/0x93
> [ Â 29.943257] Â[<ffffffff8137568d>] _raw_spin_lock_irqsave+0x4e/0x60
> [ Â 29.943313] Â[<ffffffff8137145b>] ? percpu_counter_hotcpu_callback+0x3e/0x93
> [ Â 29.943376] Â[<ffffffff8137145b>] percpu_counter_hotcpu_callback+0x3e/0x93
> [ Â 29.943441] Â[<ffffffff81057344>] notifier_call_chain+0x32/0x5e
> [ Â 29.943494] Â[<ffffffff8105738f>] __raw_notifier_call_chain+0x9/0xb
> [ Â 29.943552] Â[<ffffffff8103c6e3>] __cpu_notify+0x1b/0x2d
> [ Â 29.943602] Â[<ffffffff8103c703>] cpu_notify+0xe/0x10
> [ Â 29.943649] Â[<ffffffff8103c70e>] cpu_notify_nofail+0x9/0x11
> [ Â 29.943703] Â[<ffffffff81362d82>] _cpu_down+0x151/0x206
> [ Â 29.943751] Â[<ffffffff81362ea8>] cpu_down+0x28/0x35
> [ Â 29.943798] Â[<ffffffff8136430d>] store_online+0x27/0x6e
> [ Â 29.943850] Â[<ffffffff812923ab>] sysdev_store+0x1b/0x1d
> [ Â 29.943899] Â[<ffffffff811321b2>] sysfs_write_file+0x103/0x13f
> [ Â 29.943955] Â[<ffffffff810daf92>] vfs_write+0xb0/0x14f
> [ Â 29.944003] Â[<ffffffff810db22e>] sys_write+0x45/0x6c
> [ Â 29.944054] Â[<ffffffff81002002>] system_call_fastpath+0x16/0x1b
> [ Â 29.946132] Code: 85 c0 0f 84 a4 12 00 00 be 0b 03 00 00 83 3d 5e c4 f5 00 00 0f 85 92 12 00 00 e9 a4 11 00 00 45 31 f6 48 85 d2 0f 84 81 12 00 00 <f0> ff 82 98 01 00 00 45 8b 84 24 20 07 00 00 83 3d 79 ce 6a 00
> [ Â 29.951109] RIP Â[<ffffffff810640a1>] __lock_acquire+0x4e9/0x17fd
> [ Â 29.953421] ÂRSP <ffff88015751dbc8>
> [ Â 29.965605] ---[ end trace 34832156140843b2 ]---
> [ Â 29.967758] note: bash[5208] exited with preempt_count 1
> [ Â 29.969997] BUG: scheduling while atomic: bash/5208/0x10000002
> [ Â 29.972098] INFO: lockdep is turned off.
> [ Â 29.974167] Modules linked in: snd_hwdep snd_hda_codec_atihdmi snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device battery ac wmi snd_pcm_oss snd_mixer_oss button snd_hda_codec_realtek radeon broadcom snd_hda_intel
> snd_hda_codec snd_pcm snd_timer snd soundcore snd_page_alloc usbhid hid tg3 libphy psmouse serio_raw evdev ttm drm_kms_helper ehci_hcd sr_mod usbcore cdrom sd_mod ahci libahci
> [ Â 29.981076] Pid: 5208, comm: bash Tainted: G Â Â ÂD Â Â 2.6.36-rc0-git12-07921-g60bf26a-dirty #124
> [ Â 29.983375] Call Trace:
> [ Â 29.985672] Â[<ffffffff8102dd6a>] __schedule_bug+0x72/0x77
> [ Â 29.987996] Â[<ffffffff81372790>] schedule+0xdc/0x8f2
> [ Â 29.990342] Â[<ffffffff810360f9>] __cond_resched+0x13/0x1f
> [ Â 29.992607] Â[<ffffffff813730c5>] _cond_resched+0x29/0x30
> [ Â 29.994912] Â[<ffffffff810bbe97>] unmap_vmas+0x747/0x92b
> [ Â 29.997198] Â[<ffffffff810c15cd>] exit_mmap+0xda/0x184
> [ Â 29.999410] Â[<ffffffff8103888e>] mmput+0x28/0xcf
> [ Â 30.001623] Â[<ffffffff8103cd21>] exit_mm+0x109/0x116
> [ Â 30.003791] Â[<ffffffff81375db0>] ? _raw_spin_unlock_irq+0x55/0x59
> [ Â 30.005931] Â[<ffffffff8103e387>] do_exit+0x1fe/0x6c0
> [ Â 30.008070] Â[<ffffffff8103c6a5>] ? kmsg_dump+0x14f/0x16a
> [ Â 30.010205] Â[<ffffffff810060c4>] oops_end+0x8f/0x94
> [ Â 30.012285] Â[<ffffffff81006206>] die+0x55/0x5e
> [ Â 30.014369] Â[<ffffffff8100371c>] do_general_protection+0x135/0x13d
> [ Â 30.016453] Â[<ffffffff81376304>] ? irq_return+0x0/0xc
> [ Â 30.018546] Â[<ffffffff813764e5>] general_protection+0x25/0x30
> [ Â 30.020643] Â[<ffffffff810640a1>] ? __lock_acquire+0x4e9/0x17fd
> [ Â 30.022719] Â[<ffffffff81132b2a>] ? sysfs_deactivate+0x3e/0xec
> [ Â 30.024804] Â[<ffffffff81062ddd>] ? mark_held_locks+0x50/0x72
> [ Â 30.026883] Â[<ffffffff81065893>] lock_acquire+0x97/0xb6
> [ Â 30.028932] Â[<ffffffff8137145b>] ? percpu_counter_hotcpu_callback+0x3e/0x93
> [ Â 30.031017] Â[<ffffffff81374321>] ? mutex_lock_nested+0x2f3/0x31b
> [ Â 30.033076] Â[<ffffffff81371446>] ? percpu_counter_hotcpu_callback+0x29/0x93
> [ Â 30.035171] Â[<ffffffff8137568d>] _raw_spin_lock_irqsave+0x4e/0x60
> [ Â 30.037272] Â[<ffffffff8137145b>] ? percpu_counter_hotcpu_callback+0x3e/0x93
> [ Â 30.039372] Â[<ffffffff8137145b>] percpu_counter_hotcpu_callback+0x3e/0x93
> [ Â 30.041515] Â[<ffffffff81057344>] notifier_call_chain+0x32/0x5e
> [ Â 30.043653] Â[<ffffffff8105738f>] __raw_notifier_call_chain+0x9/0xb
> [ Â 30.045777] Â[<ffffffff8103c6e3>] __cpu_notify+0x1b/0x2d
> [ Â 30.047927] Â[<ffffffff8103c703>] cpu_notify+0xe/0x10
> [ Â 30.050049] Â[<ffffffff8103c70e>] cpu_notify_nofail+0x9/0x11
> [ Â 30.052172] Â[<ffffffff81362d82>] _cpu_down+0x151/0x206
> [ Â 30.054308] Â[<ffffffff81362ea8>] cpu_down+0x28/0x35
> [ Â 30.056413] Â[<ffffffff8136430d>] store_online+0x27/0x6e
> [ Â 30.058546] Â[<ffffffff812923ab>] sysdev_store+0x1b/0x1d
> [ Â 30.060537] Â[<ffffffff811321b2>] sysfs_write_file+0x103/0x13f
> [ Â 30.062357] Â[<ffffffff810daf92>] vfs_write+0xb0/0x14f
> [ Â 30.064167] Â[<ffffffff810db22e>] sys_write+0x45/0x6c
> [ Â 30.065943] Â[<ffffffff81002002>] system_call_fastpath+0x16/0x1b

Seems this is resolved by commit 602586a8, can you try it?

Thanks,
Yong
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/