Re: Dell 2650 Dual Xeon freezing up frequently

From: Mikael Pettersson (mikpe@csd.uu.se)
Date: Wed Jul 30 2003 - 17:50:51 EST


On Wed, 30 Jul 2003 15:52:58 -0600, Ian S. Nelson wrote:
>I'm running a RedHat 2.4.20 kernel on some 2650's all dual xeon
>(pentium 4 jacksonized so it looks like 4 procsessors) 2 have 1GB of
>RAM and 1 has 2GB of RAM. THey all wedge, some times after a few
>minutes, sometimes after hours.
>
>I hooked up a serial consol to capture a kernel panic or something else
>that would be fun to debug, no such luck.. It just locks up. No nothing.

Welcome to the club. Our PE2650 used to hang hard anywhere from a few
days to two weeks after each reboot. This started after our RH7.3 to
RH8.0 upgrade and may be related to problems with the tg3 NIC driver.
(RH7.3 was stable, but then I think we used a bcm<something> driver.)

After having these problems from January to March or April, with a
a series of RH upgrade kernels, I accidentally found that enabling
the I/O-APIC nmi_watchdog solved all problems. (I can only speculate
that somehow the regular I/O-APIC NMIs prevent some hang somewhere.)
It's now running stable as a rock since April.

/Mikael
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Thu Jul 31 2003 - 22:00:47 EST