Re: [PATCH][GIT PULL] tracing: Fix compile issue fortrace_sched_wakeup.c
From: Ingo Molnar
Date:  Mon Oct 25 2010 - 08:11:47 EST
* Ingo Molnar <mingo@xxxxxxx> wrote:
> and here's a new crash with a new config:
> 
> [   11.810471] Testing event timer_expire_exit: OK
> [   11.850475] Testing event timer_cancel: OK
> [   11.890508] Testing event hrtimer_init: OK
> [   11.930469] Testing event hrtimer_start: OK
> [   11.970475] Testing event hrtimer_expire_entry: 
> [   11.980002] BUG: unable to handle kernel NULL pointer dereference at (null)
> [   11.980010] IP: [<(null)>] (null)
> [   11.980010] *pde = 00000000 
> [   11.980010] Oops: 0000 [#1] SMP 
> [   11.980010] last sysfs file: 
> [   11.980010] Modules linked in:
> [   11.980010] 
> [   11.980010] Pid: 0, comm: swapper Not tainted 2.6.36-tip-05833-g9db2fad-dirty #52316 A8N-E/System Product Name
> [   11.980010] EIP: 0060:[<00000000>] EFLAGS: 00010046 CPU: 0
> [   11.980010] EIP is at 0x0
> [   11.980010] EAX: f6806a94 EBX: f6806a94 ECX: 00010000 EDX: 00000096
> [   11.980010] ESI: f65bdf50 EDI: f6806a00 EBP: f6806a30 ESP: c13dff04
> [   11.980010]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
> [   11.980010] Process swapper (pid: 0, ti=c13de000 task=c13e2f20 task.ti=c13de000)
> [   11.980010] Stack:
> [   11.980010]  c103d297 00000000 c10460c1 c13dff4c ca105369 00000002 ffffffff 7fffffff
> [   11.980010]  c103d52b ca105369 00000002 ca105369 0000002c f6806a00 00000000 f6806a04
> [   11.980010]  ca105369 00000002 ca105369 00000002 00000000 f6805dac 00000000 c1420788
> [   11.980010] Call Trace:
> [   11.980010]  [<c103d297>] ? __run_hrtimer+0x91/0x105
> [   11.980010]  [<c10460c1>] ? tick_sched_timer+0x0/0x1a1
> [   11.980010]  [<c103d52b>] ? hrtimer_interrupt+0x108/0x20a
> [   11.980010]  [<c1012294>] ? smp_apic_timer_interrupt+0x66/0x75
> [   11.980010]  [<c12c202a>] ? apic_timer_interrupt+0x36/0x3c
> [   11.980010]  [<c10163f0>] ? native_safe_halt+0x2/0x3
> [   11.980010]  [<c10072c6>] ? default_idle+0x66/0x91
> [   11.980010]  [<c10020f6>] ? cpu_idle+0x98/0xda
> [   11.980010]  [<c142280a>] ? start_kernel+0x2f7/0x2fc
> [   11.980010] Code:  Bad EIP value.
> [   11.980010] EIP: [<00000000>] 0x0 SS:ESP 0068:c13dff04
> [   11.980010] CR2: 0000000000000000
> [   11.980010] ---[ end trace 74b10a949febd52e ]---
> [   11.980011] BUG: unable to handle kernel NULL pointer dereference at (null)
> [   11.980011] IP: [<(null)>] (null)
> [   11.980011] *pde = 00000000 
> [   11.980011] Oops: 0000 [#2] SMP 
> [   11.980011] last sysfs file: 
> [   11.980011] Modules linked in:
> [   11.980011] 
> [   11.980011] Pid: 0, comm: kworker/0:0 Tainted: G      D     2.6.36-tip-05833-g9db2fad-dirty #52316 A8N-E/System Product Name
> [   11.980011] EIP: 0060:[<00000000>] EFLAGS: 00010046 CPU: 1
> [   11.980011] EIP is at 0x0
> [   11.980011] EAX: f6a06a94 EBX: f6a06a94 ECX: 00010000 EDX: 00000092
> [   11.980011] ESI: f65bdf50 EDI: f6a06a00 EBP: f6a06a30 ESP: f6479ec0
> [   11.980011]  DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068
> [   11.980011] Process kworker/0:0 (pid: 0, ti=f6478000 task=f64364e0 task.ti=f6478000)
> [   11.980011] Stack:
> [   11.980011]  c103d297 00000000 c10460c1 f6479f08 ca1058de 00000002 ffffffff 7fffffff
> [   11.980011]  c103d52b ca1058de 00000002 ca1058de 0000002c f6a06a00 00000000 f6a06a04
> [   11.980011]  ca1058de 00000002 ca1058de 00000002 00000000 f6a05dac 00000001 c1420788
> [   11.980011] Call Trace:
> [   11.980011]  [<c103d297>] ? __run_hrtimer+0x91/0x105
> [   11.980011]  [<c10460c1>] ? tick_sched_timer+0x0/0x1a1
> [   11.980011]  [<c103d52b>] ? hrtimer_interrupt+0x108/0x20a
> [   11.980011]  [<c1012294>] ? smp_apic_timer_interrupt+0x66/0x75
> [   11.980011]  [<c12c202a>] ? apic_timer_interrupt+0x36/0x3c
> [   11.980011]  [<c10163f0>] ? native_safe_halt+0x2/0x3
> [   11.980011]  [<c10072c6>] ? default_idle+0x66/0x91
> [   11.980011]  [<c10020f6>] ? cpu_idle+0x98/0xda
> [   11.980011]  [<c14501ce>] ? start_secondary+0x200/0x205
> [   11.980011] Code:  Bad EIP value.
> [   11.980011] EIP: [<00000000>] 0x0 SS:ESP 0068:f6479ec0
> [   11.980011] CR2: 0000000000000000
> [   11.980011] ---[ end trace 74b10a949febd52f ]---
> [   11.980011] Kernel panic - not syncing: Fatal exception in interrupt
> [   11.980011] Pid: 0, comm: kworker/0:0 Tainted: G      D     2.6.36-tip-05833-g9db2fad-dirty #52316
> [   11.980011] Call Trace:
> [   11.980011]  [<c12bfaa3>] ? panic+0x4d/0x13a
> [   11.980011]  [<c12c2b31>] ? oops_end+0x72/0x7c
> [   11.980011]  [<c1017475>] ? no_context+0x10d/0x117
> [   11.980011]  [<c12c42d7>] ? do_page_fault+0x0/0x37c
> [   11.980011]  [<c1017591>] ? bad_area_nosemaphore+0xa/0xc
> [   11.980011]  [<c12c44a4>] ? do_page_fault+0x1cd/0x37c
> [   11.980011]  [<c105a808>] ? rb_reserve_next_event+0x289/0x336
> [   11.980011]  [<c113ea0c>] ? trace_hardirqs_off_thunk+0xc/0x10
> [   11.980011]  [<c12c42d7>] ? do_page_fault+0x0/0x37c
> [   11.980011]  [<c12c42d7>] ? do_page_fault+0x0/0x37c
> [   11.980011]  [<c12c21ec>] ? error_code+0x6c/0x74
> [   11.980011]  [<c103d297>] ? __run_hrtimer+0x91/0x105
> [   11.980011]  [<c10460c1>] ? tick_sched_timer+0x0/0x1a1
> [   11.980011]  [<c103d52b>] ? hrtimer_interrupt+0x108/0x20a
> [   11.980011]  [<c1012294>] ? smp_apic_timer_interrupt+0x66/0x75
> [   11.980011]  [<c12c202a>] ? apic_timer_interrupt+0x36/0x3c
> [   11.980011]  [<c10163f0>] ? native_safe_halt+0x2/0x3
> [   11.980011]  [<c10072c6>] ? default_idle+0x66/0x91
> [   11.980011]  [<c10020f6>] ? cpu_idle+0x98/0xda
> [   11.980011]  [<c14501ce>] ? start_secondary+0x200/0x205
> [   11.980010] Kernel panic - not syncing: Fatal exception in interrupt
> [   11.980010] Pid: 0, comm: swapper Tainted: G      D     2.6.36-tip-05833-g9db2fad-dirty #52316
> [   11.980010] Call Trace:
> [   11.980010]  [<c12bfaa3>] ? panic+0x4d/0x13a
> [   11.980010]  [<c12c2b31>] ? oops_end+0x72/0x7c
> [   11.980010]  [<c1017475>] ? no_context+0x10d/0x117
> [   11.980010]  [<c12c42d7>] ? do_page_fault+0x0/0x37c
> [   11.980010]  [<c1017591>] ? bad_area_nosemaphore+0xa/0xc
> [   11.980010]  [<c12c44a4>] ? do_page_fault+0x1cd/0x37c
> [   11.980010]  [<c105a808>] ? rb_reserve_next_event+0x289/0x336
> [   11.980010]  [<c113ea0c>] ? trace_hardirqs_off_thunk+0xc/0x10
> [   11.980010]  [<c12c42d7>] ? do_page_fault+0x0/0x37c
> [   11.980010]  [<c12c42d7>] ? do_page_fault+0x0/0x37c
> [   11.980010]  [<c12c21ec>] ? error_code+0x6c/0x74
> [   11.980010]  [<c103d297>] ? __run_hrtimer+0x91/0x105
> [   11.980010]  [<c10460c1>] ? tick_sched_timer+0x0/0x1a1
> [   11.980010]  [<c103d52b>] ? hrtimer_interrupt+0x108/0x20a
> [   11.980010]  [<c1012294>] ? smp_apic_timer_interrupt+0x66/0x75
> [   11.980010]  [<c12c202a>] ? apic_timer_interrupt+0x36/0x3c
> [   11.980010]  [<c10163f0>] ? native_safe_halt+0x2/0x3
> [   11.980010]  [<c10072c6>] ? default_idle+0x66/0x91
> [   11.980010]  [<c10020f6>] ? cpu_idle+0x98/0xda
> [   11.980010]  [<c142280a>] ? start_kernel+0x2f7/0x2fc
Here's the disassembly of the crash site:
c103d282:       89 da                   mov    %ebx,%edx
c103d284:       8b 4c 24 04             mov    0x4(%esp),%ecx
c103d288:       ff 16                   call   *(%esi)
c103d28a:       83 c6 08                add    $0x8,%esi
c103d28d:       83 3e 00                cmpl   $0x0,(%esi)
c103d290:       eb eb                   jmp    c103d27d <__run_hrtimer+0x77>
c103d292:       89 d8                   mov    %ebx,%eax
c103d294:       ff 14 24                call   *(%esp)
c103d297:       89 04 24                mov    %eax,(%esp)
c103d29a:       e9 00 00 00 00          jmp    c103d29f <__run_hrtimer+0x99>
c103d29f:       eb 19                   jmp    c103d2ba <__run_hrtimer+0xb4>
c103d2a1:       8b 35 50 f1 40 c1       mov    0xc140f150,%esi
c103d2a7:       85 f6                   test   %esi,%esi
c103d2a9:       74 0f                   je     c103d2ba <__run_hrtimer+0xb4>
c103d2ab:       8b 46 04                mov    0x4(%esi),%eax
c103d2ae:       89 da                   mov    %ebx,%edx
c103d2b0:       ff 16                   call   *(%esi)
c103d2b2:       83 c6 08                add    $0x8,%esi
c103d2b5:       83 3e 00                cmpl   $0x0,(%esi)
c103d2b8:       eb ef                   jmp    c103d2a9 <__run_hrtimer+0xa3>
c103d2ba:       89 f8                   mov    %edi,%eax
c103d2bc:       e8 ea 43 28 00          call   c12c16ab <_raw_spin_lock>
c103d2c1:       83 3c 24 00             cmpl   $0x0,(%esp)
(gdb) list *0xc103d297
0xc103d297 is in __run_hrtimer (kernel/hrtimer.c:1227).
1222		 * they get migrated to another cpu, therefore its safe to unlock
1223		 * the timer base.
1224		 */
1225		raw_spin_unlock(&cpu_base->lock);
1226		trace_hrtimer_expire_entry(timer, now);
1227		restart = fn(timer);
1228		trace_hrtimer_expire_exit(timer);
1229		raw_spin_lock(&cpu_base->lock);
1230	
1231		/*
(gdb) list *0xc103d294
0xc103d294 is in __run_hrtimer (kernel/hrtimer.c:1227).
1222		 * they get migrated to another cpu, therefore its safe to unlock
1223		 * the timer base.
1224		 */
1225		raw_spin_unlock(&cpu_base->lock);
1226		trace_hrtimer_expire_entry(timer, now);
1227		restart = fn(timer);
1228		trace_hrtimer_expire_exit(timer);
1229		raw_spin_lock(&cpu_base->lock);
1230	
1231		/*
i.e. the 'fn(timer)' call crashed.
Thanks,
	Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/