Re: [GIT PULL][PATCH] perf/ftrace: Fix paranoid level for enablingfunction tracer

From: Steven Rostedt
Date: Wed Nov 06 2013 - 15:04:42 EST


On Wed, 6 Nov 2013 15:04:17 -0500 (EST)
Vince Weaver <vincent.weaver@xxxxxxxxx> wrote:

> On Wed, 6 Nov 2013, Vince Weaver wrote:
>
> > I've run this patch on top of 3.12 and it seems pretty solid.
> > I've run my perf_fuzzer for hours (which wasn't possible w/o
> > quickly crashing on a stock kernel).
>
> It turns out there are still tracepoint issues though.

This is a different bug.

>
> I managed to lock my machine hard, even with the patch, but it took a few
> hours to trigger:
>
> [11559.476002] WARNING: CPU: 0 PID: 2994 at kernel/watchdog.c:245 watchdog_overflow_callback+0x9b/0xa6()
> [11559.476002] Watchdog detected hard LOCKUP on cpu 0
> [11559.476002] Modules linked in: cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_conservative f71882fg evdev coretemp acpi_cpufreq psmouse serio_raw video wmi pcspkr processor thermal_sys button mcs7830 i2c_nforce2 ohci_pci usbnet ohci_hcd sg ehci_pci ehci_hcd sd_mod usbcore usb_common
> [11559.476002] CPU: 0 PID: 2994 Comm: perf_fuzzer Tainted: G W 3.12.0 #2
> [11559.476002] Hardware name: AOpen DE7000/nMCP7ALPx-DE R1.06 Oct.19.2012, BIOS 080015 10/19/2012
> [11559.476002] 00000000000000f5 ffff88011fc07c48 ffffffff8151aad6 ffff88011fc07c58
> [11559.476002] ffffffff8103cd68 ffff88011fc07c78 ffffffff810a101c ffff88011b313400
> [11559.476002] ffff88011fc07ef8 ffff88011fc07d58 0000000000000000 ffff88011fc07ca8
> [11559.476002] Call Trace:
> [11559.476002] <NMI> [<ffffffff8151aad6>] ? dump_stack+0x41/0x53
> [11559.476002] [<ffffffff8103cd68>] ? warn_slowpath_common+0x81/0x9c
> [11559.476002] [<ffffffff810a101c>] ? watchdog_overflow_callback+0x9b/0xa6
> [11559.476002] [<ffffffff8103ce25>] ? warn_slowpath_fmt+0x46/0x48
> [11559.476002] [<ffffffff810c9e5b>] ? perf_prepare_sample+0x1d0/0x2f4
> [11559.476002] [<ffffffff810a101c>] ? watchdog_overflow_callback+0x9b/0xa6
> [11559.476002] [<ffffffff810ca4a2>] ? __perf_event_overflow+0x12e/0x1ab
> [11559.476002] [<ffffffff810167d3>] ? intel_pmu_lbr_read+0x253/0x349
> [11559.476002] [<ffffffff81018c95>] ? intel_pmu_handle_irq+0x2b8/0x34d
> [11559.476002] [<ffffffff8101866a>] ? intel_pmu_enable_all+0x4e/0xa7
> [11559.476002] [<ffffffff8151edfc>] ? nmi_handle+0xa8/0x13a
> [11559.476002] [<ffffffff8151f473>] ? perf_event_nmi_handler+0x24/0x3d
> [11559.476002] [<ffffffff8151edb2>] ? nmi_handle+0x5e/0x13a
> [11559.476002] [<ffffffff8151ef35>] ? do_nmi+0xa7/0x2bd
> [11559.476002] [<ffffffff8151e537>] ? end_repeat_nmi+0x1e/0x2e
> [11559.476002] [<ffffffff81049982>] ? __sigqueue_alloc+0x9e/0xc4
> [11559.476002] [<ffffffff81049982>] ? __sigqueue_alloc+0x9e/0xc4
> [11559.476002] [<ffffffff81049982>] ? __sigqueue_alloc+0x9e/0xc4
> [11559.476002] <<EOE>> <IRQ> [<ffffffff8104acaa>] ? __send_signal+0xd8/0x26f

Can you find the exact location of __send_signal()?

-- Steve

> [11559.476002] [<ffffffff8104b074>] ? do_send_sig_info+0x4b/0x75
> [11559.476002] [<ffffffff810ca8a5>] ? perf_tp_event+0xe9/0x1f6
> [11559.476002] [<ffffffff8111f65e>] ? send_sigio_to_task+0xc6/0xf5
> [11559.476002] [<ffffffff8111f737>] ? send_sigio+0xaa/0xdd
> [11559.476002] [<ffffffff8111f7d3>] ? kill_fasync+0x69/0x86
> [11559.476002] [<ffffffff810cabf9>] ? perf_event_wakeup+0x8c/0xa1
> [11559.476002] [<ffffffff810c494d>] ? __irq_work_run+0x5f/0x7b
> [11559.476002] [<ffffffff81005f82>] ? smp_trace_irq_work_interrupt+0x4b/0x7e
> [11559.476002] [<ffffffff815261ca>] ? trace_irq_work_interrupt+0x6a/0x70
> [11559.476002] [<ffffffff810461bf>] ? perf_trace_timer_class+0x39/0x98
> [11559.476002] [<ffffffff8151da07>] ? _raw_spin_unlock_irqrestore+0xf/0x11
> [11559.476002] [<ffffffff810a3a89>] ? rcu_process_callbacks+0x4a4/0x4b8
> [11559.476002] [<ffffffff8105b8f3>] ? enqueue_hrtimer+0x51/0x7b
> [11559.476002] [<ffffffff81041027>] ? __do_softirq+0x102/0x208
> [11559.476002] [<ffffffff81027608>] ? lapic_next_event+0x1d/0x21
> [11559.476002] [<ffffffff8105c187>] ? hrtimer_interrupt+0x119/0x1c6
> [11559.476002] [<ffffffff815263dc>] ? call_softirq+0x1c/0x30
> [11559.476002] [<ffffffff810041b9>] ? do_softirq+0x38/0x6d
> [11559.476002] [<ffffffff81040d1c>] ? irq_exit+0x45/0xa1
> [11559.476002] [<ffffffff81526a14>] ? smp_trace_apic_timer_interrupt+0x83/0x91
> [11559.476002] [<ffffffff8152597a>] ? trace_apic_timer_interrupt+0x6a/0x70
> [11559.476002] <EOI>
> [11559.476002] ---[ end trace 1274fa8d9fb13c66 ]---
> [11559.476002] perf samples too long (7791497 > 10000), lowering kernel.perf_event_max_sample_rate to 12500
> [11559.476002] INFO: NMI handler (perf_event_nmi_handler) took too long to run: 1025.396 msecs
> [11570.756001] perf samples too long (7730632 > 20000), lowering kernel.perf_event_max_sample_rate to 6250
> [11571.580001] perf samples too long (7670242 > 40000), lowering kernel.perf_event_max_sample_rate to 3250
> [11572.428001] perf samples too long (7610324 > 76923), lowering kernel.perf_event_max_sample_rate to 1750
> [11573.276001] perf samples too long (7550874 > 142857), lowering kernel.perf_event_max_sample_rate to 1000
> [11574.124001] perf samples too long (7491888 > 250000), lowering kernel.perf_event_max_sample_rate to 500
> [11574.968001] perf samples too long (7433363 > 500000), lowering kernel.perf_event_max_sample_rate to 250

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/