2.2.5-15SMP dying without OOPS

Thierry Danis (danis@sagem.fr)
Sat, 10 Jul 1999 16:59:49 +0200


Hello,

Yesterday, one of my collegue reported a complete lock up of our SMP
box using 2.2.5-15SMP from RH 6.0.

The box is a Dual PII 400 acting as a NFS server with a very light
load at the current time.

The box did not answer, and from memory, the collegue reported
the message shown on the console before it rebooted.

The message comes from function show() in
/usr/src/linux-2.2.5/arch/i386/kernel/irq.c.

Only CPU %d, irq : %d [%d %d], bh : %d [%d %d] were displayed,
but not the following 40 addresses.

The call path was :
__global_cli
get_irqlock
wait_on_irq
show("wait_on_irq")

Sorry that he did not remember what the %d above were.

After the reboot, the date was reset to January 1st 1990, 01:23:00
(the last date before reboot was Jul 9 13:22:13).

In /var/log/messages :
Jul 9 17:18:30 strichnine kernel: number of MP IRQ sources: 15.
Jul 9 17:18:30 strichnine kernel: number of IO-APIC registers: 24.
Jul 9 17:18:30 strichnine kernel: testing the IO APIC.......................
Jul 9 17:18:30 strichnine kernel: .... register #00: 02000000
Jul 9 17:18:30 strichnine kernel: ....... : physical APIC id: 02
Jul 9 17:18:30 strichnine kernel: .... register #01: 00170011
Jul 9 17:18:30 strichnine kernel: ....... : max redirection entries: 0017
Jul 9 17:18:30 strichnine kernel: ....... : IO APIC version: 0011
Jul 9 17:18:30 strichnine kernel: .... register #02: 00000000
Jul 9 17:18:30 strichnine kernel: ....... : arbitration: 00
Jul 9 17:18:30 strichnine kernel: .... IRQ redirection table:

# lspci
00:00.0 Host bridge: Intel Corporation 440BX/ZX - 82443BX/ZX Host bridge (rev
+03)
00:01.0 PCI bridge: Intel Corporation 440BX/ZX - 82443BX/ZX AGP bridge (rev 03)
00:04.0 ISA bridge: Intel Corporation 82371AB PIIX4 ISA (rev 02)
00:04.1 IDE interface: Intel Corporation 82371AB PIIX4 IDE (rev 01)
00:04.2 USB Controller: Intel Corporation 82371AB PIIX4 USB (rev 01)
00:04.3 Bridge: Intel Corporation 82371AB PIIX4 ACPI (rev 02)
00:0a.0 Ethernet controller: Standard Microsystems 9432 TX (rev 08)
00:0c.0 SCSI storage controller: Symbios Logic Inc. (formerly NCR) 53c875 (rev
+26)
01:00.0 VGA compatible controller: ATI Technologies Inc 215GB [Mach64 GB] (rev
+5c)

No X, just a very light NFS load if any. Uptime, 10 days, but a lot of

set_rtc_mmss: can't update from 49 to 0

messages. And some :

Jul 9 12:43:24 strichnine kernel: EXT2-fs warning (device sd(8,17)): ext2_free_
blocks: bit already cleared for block 11985685
Jul 9 12:43:24 strichnine kernel: EXT2-fs warning (device sd(8,17)): ext2_free_
inode: bit already cleared for inode 2990140

A+,

-- 
	Thierry Danis
	Poste : 53 53	danis@spmo.sagem.fr

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/