Re: MCE triggered with v3.1 and v3.2 on Xeon E5

From: Tony Luck
Date: Fri Mar 30 2012 - 13:07:57 EST


On Fri, Mar 30, 2012 at 9:15 AM, Arnaud Lacombe <lacombar@xxxxxxxxx> wrote:
> Currently, I would suspect an hardware issue as the machine is brand
> new. I'll see if v3.3 trigger the same MCE and eventually run a
> memtest.

Probably a bad DIMM (you have 4 corrected errors from addresses close
to each other - then an uncorrected error which causes the panic.

The DIMM is in socket 0, channel 3 ... but if you have more than one DIMM
in channel 3 you'll have to try each in turn to see which is causing the problem
(or compile and load drivers/edac/sb_edac.c to see if it gives you a
more precise
location).

-Tony
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/