Kernel panic on HT machine - full logs with debug from a few boots(some successful)

From: wzab
Date: Sun Jun 05 2011 - 16:22:36 EST

I tried to find the source of the problem I experience, supposing, that it may affect also other users of machine with Hyper-Threaded CPU.

I've recompiled the kernel with debugging support.
If I remember correctly, machine was booted 5 times. Two of restarts
were successful and even GDM started.

Attached (in gzipped form to spare bandwidth) are logs from serial console (with overwritten MAC addresses of NICs), and configuration of the kernel.

Crash seems to happen in random places:
1. [ 9.589453] BUG: unable to handle kernel paging request at
[ 9.590022] IP: [<c02e3fb2>] acpi_evaluate_object+0xf1/0x1f2
[ 9.590022] *pde = 00000000
[ 9.590022] Oops: 0000 [#1] PREEMPT SMP DEBUG_PAGEALLOC
[ 9.590022] last sysfs file:
[ 9.590022] Modules linked in: snd_page_alloc processor(+) unix

2. [ 15.995196] pci_hotplug: PCI Hot Plug PCI Core version: 0.5
[ 16.064443] agpgart-intel 0000:00:00.0: Intel 865 Chipset
[ 16.133686] BUG: sleeping function called from invalid context at kernel/mutex.c:278
[ 16.133693] BUG: unable to handle kernel NULL pointer dereference at (null)
[ 16.133702] IP: [<001d9726>] 0x1d9725
[ 16.133711] *pde = 00000000
[ 16.133716] Oops: 0000 [#1] PREEMPT SMP DEBUG_PAGEALLOC
[ 16.133724] last sysfs file: /sys/devices/LNXSYSTM:00/device:00/PNP0A03:00/device:12/PNP0303:00/uevent

As crash occurs only with HT on and doesn't happen on another machine with 2 cores, it seems that maybe the problem is associated with incorrect allocation of resources or locking for HT enabled CPU...


Attachment: panic5.log.gz
Description: GNU Zip compressed data

Attachment: .config.gz
Description: GNU Zip compressed data