linux troubleshooting help
From: Dimitri Chausson
Date: Sun Aug 13 2006 - 07:12:04 EST
since about 1 week, I am experiencing system crashes. I tried kernel versions 2.6.15 and 2.6.16 but it occurs in both. I first thougth it was X related (so not a kernel problem), but the machine is not reachable via network. All logs I list below were obtained with a 2.6.15-1-k7 kernel (debian):
1- When booting, contains:
Unable to handle kernel NULL pointer dereference at virtual address 00000006
*pde = 00000000
Recursive die() failure, output supressed
<0> Kernel panic - not syncing: Fatal exception in interrupt
2- Running on a terminal (no X running):
CPU 0: Machine Check Exception: 0000000000000004
Bank 1: ......... at ...........
Kernel panic - not syncing: CPU context corrupt
And there were other similar crashes. It crashes really often (several times a day, while the computer is not running the whole day).
Since I did not add any hardware, I thought some hardware may be dying... but is there a way to know what ? Until now I ran a memtest86, and I checked the hard disk with smartmontools. Both went well...
I do not know how to proceed now, and I would appreciate any hint or help on how to further isolate the problem,
thanks for your time,
PS: of course, I can provide you with more details if necessary
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/