Re: [PATCH] perf/x86/intel/uncore: fix IMC missing box initialization

From: Vince Weaver
Date: Fri Apr 24 2015 - 09:33:21 EST


On Thu, 23 Apr 2015, Stephane Eranian wrote:

> This patch fixes a bug introduced by:
>
> commit c05199e5a57a579fea1e8fa65e2b511ceb524ffc
> Author: Kan Liang <kan.liang@xxxxxxxxx>
> Date: Tue Jan 20 04:54:25 2015 +0000

I was hoping this would fix the uncore/imc bug that the perf_fuzzer
triggers on my Haswell box, but even with this patch I can still quickly
hard lock the system

[ 79.468201] ------------[ cut here ]------------
[ 79.473121] kernel BUG at mm/vmalloc.c:1335!
[ 79.477705] invalid opcode: 0000 [#1] SMP
[ 79.482141] Modules linked in: fuse x86_pkg_temp_thermal intel_powerclamp intel_rapl iosf_mbi coretemp kvm snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi crct10dif_pclmul snd_hda_intel snd_hda_controller snd_hda_codec crc32_pclmul snd_hda_core snd_hwdep ghash_clmulni_intel i915 aesni_intel aes_x86_64 lrw gf128mul iTCO_wdt glue_helper ppdev snd_pcm iTCO_vendor_support evdev ablk_helper drm_kms_helper psmouse snd_timer cryptd drm tpm_tis serio_raw pcspkr xhci_pci snd parport_pc soundcore tpm xhci_hcd lpc_ich parport mei_me wmi mfd_core processor video battery i2c_i801 i2c_algo_bit button mei sg sr_mod cdrom sd_mod ehci_pci ehci_hcd ahci e1000e libahci ptp crc32c_intel usbcore libata scsi_mod usb_common pps_core thermal fan thermal_sys
[ 79.554768] CPU: 0 PID: 0 Comm: swapper/0 Not tainted 4.0.0+ #135
[ 79.561315] Hardware name: LENOVO 10AM000AUS/SHARKBAY, BIOS FBKT72AUS 01/26/2014
[ 79.569250] task: ffffffff81c14580 ti: ffffffff81c00000 task.ti: ffffffff81c00000
[ 79.577270] RIP: 0010:[<ffffffff811b0cd0>] [<ffffffff811b0cd0>] __get_vm_area_node+0x170/0x180
[ 79.586597] RSP: 0000:ffff88011ea03cb8 EFLAGS: 00010006
[ 79.592301] RAX: 0000000080010000 RBX: 0000000000006000 RCX: ffffc90000000000
[ 79.599950] RDX: 0000000000000001 RSI: 0000000000000001 RDI: 0000000000006000
[ 79.607581] RBP: ffff88011ea03cf8 R08: ffffe8ffffffffff R09: 00000000ffffffff
[ 79.615220] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000002
[ 79.622850] R13: 00000000fed16000 R14: 00000000fed10000 R15: 00000000fed10000
[ 79.630480] FS: 0000000000000000(0000) GS:ffff88011ea00000(0000) knlGS:0000000000000000
[ 79.639167] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 79.645302] CR2: 000000000061c218 CR3: 0000000001c0d000 CR4: 00000000001407f0
[ 79.652953] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 79.660571] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 79.668188] Stack:
[ 79.670344] 00000000fed10000 ffffc90000000000 ffff88011ea03d18 0000000000006000
[ 79.678352] 0000000000000002 00000000fed16000 00000000fed10000 00000000fed10000
[ 79.686346] ffff88011ea03d18 ffffffff811b2130 00000000000000d0 ffffffff81039f4d
[ 79.694391] Call Trace:
[ 79.696997] <IRQ>
[ 79.699090] [<ffffffff811b2130>] get_vm_area_caller+0x40/0x50
[ 79.705505] [<ffffffff81039f4d>] ? snb_uncore_imc_init_box+0x6d/0x90
[ 79.712414] [<ffffffff810635e5>] __ioremap_caller+0x195/0x350
[ 79.718610] [<ffffffff81039f4d>] ? snb_uncore_imc_init_box+0x6d/0x90
[ 79.725462] [<ffffffff81427f6b>] ? debug_object_activate+0x14b/0x1e0
[ 79.732346] [<ffffffff810637b7>] ioremap_nocache+0x17/0x20
[ 79.738283] [<ffffffff81039f4d>] snb_uncore_imc_init_box+0x6d/0x90
[ 79.744945] [<ffffffff81039cf7>] snb_uncore_imc_event_start+0xb7/0x110
[ 79.752020] [<ffffffff81039d97>] snb_uncore_imc_event_add+0x47/0x60
[ 79.758832] [<ffffffff81162cbb>] event_sched_in.isra.85+0xfb/0x330
[ 79.765519] [<ffffffff81162f5f>] group_sched_in+0x6f/0x1e0
[ 79.771481] [<ffffffff8101df1a>] ? native_sched_clock+0x2a/0x90
[ 79.777858] [<ffffffff811637bc>] __perf_event_enable+0x25c/0x2a0
[ 79.784418] [<ffffffff810f3e69>] ? tick_nohz_irq_exit+0x29/0x30
[ 79.790820] [<ffffffff8115ef30>] ? cpu_clock_event_start+0x40/0x40
[ 79.797546] [<ffffffff8115ef80>] remote_function+0x50/0x60
[ 79.803535] [<ffffffff810f8cd1>] flush_smp_call_function_queue+0x81/0x180
[ 79.810840] [<ffffffff810f9763>] generic_smp_call_function_single_interrupt+0x13/0x60
[ 79.819328] [<ffffffff8104b5e8>] smp_trace_call_function_single_interrupt+0x38/0xc0
[ 79.827614] [<ffffffff816de9be>] trace_call_function_single_interrupt+0x6e/0x80
[ 79.835465] <EOI>
[ 79.837543] [<ffffffff8156e8b5>] ? cpuidle_enter_state+0x65/0x160
[ 79.844377] [<ffffffff8156e8a1>] ? cpuidle_enter_state+0x51/0x160
[ 79.851015] [<ffffffff8156e9e7>] cpuidle_enter+0x17/0x20
[ 79.856791] [<ffffffff810b6e39>] cpu_startup_entry+0x399/0x440
[ 79.863165] [<ffffffff816c9ddb>] rest_init+0xbb/0xd0
[ 79.868555] [<ffffffff81d46f74>] start_kernel+0x44e/0x45b
[ 79.874433] [<ffffffff81d46120>] ? early_idt_handlers+0x120/0x120
[ 79.881083] [<ffffffff81d464d7>] x86_64_start_reservations+0x2a/0x2c
[ 79.888625] [<ffffffff81d46614>] x86_64_start_kernel+0x13b/0x14a
[ 79.895775] Code: fe ff ff 0f 1f 84 00 00 00 00 00 4c 89 ef e8 d8 b9 01 00 48 83 c4 18 31 c0 5b 41 5c 41 5d 41 5e 41 5f 5d c3 0f 1f 80 00 00 00 00 <0f> 0b 66 66 66 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00
[ 79.918001] RIP [<ffffffff811b0cd0>] __get_vm_area_node+0x170/0x180
[ 79.925511] RSP <ffff88011ea03cb8>
[ 79.929921] ---[ end trace 838bf78817e86b4e ]---
[ 79.935575] Kernel panic - not syncing: Fatal exception in interrupt
[ 79.943131] Kernel Offset: disabled
[ 79.947644] drm_kms_helper: panic occurred, switching back to text console
[ 79.955802] ---[ end Kernel panic - not syncing: Fatal exception in interrupt
[ 79.964192] ------------[ cut here ]------------


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/