Re: 2.6.16.1 lost cpu, was: 2.6.16-rc5 'lost' cpu

From: Zwane Mwaikambo
Date: Sat Apr 08 2006 - 07:56:07 EST


On Fri, 7 Apr 2006, Ashok Raj wrote:

> On Fri, Apr 07, 2006 at 06:45:36PM +0200, jensmh@xxxxxx wrote:
>
> Oh well, seems like that CPU has trouble booting, per message below
> we seemed to start it, but processor didnt run startup code... Suspect its a
> failing part probably..
>
> > CPU1: Intel P4/Xeon Extended MCE MSRs (12) available
> > CPU1: Thermal monitoring enabled
> > CPU1: Intel(R) Xeon(TM) CPU 2.80GHz stepping 09
> > Booting processor 2/6 eip 2000
> > CPU 2 irqstacks, hard=c04b8000 soft=c04b0000
> > Not responding.
> > Inquiring remote APIC #6...
> > ... APIC #6 ID: failed
> > ... APIC #6 VERSION: failed
> > ... APIC #6 SPIV: failed
> > CPU #6 not responding - cannot use it.

Ok i've seen that about 2years ago on a similar Xeon system, it was hard
to reproduce as it only happened on the occassional boot i was thinking of
making the processor startup delays longer but could never get it to
reliably fail. The system is still running today.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/