Re: Machine Check Exception on Opteron 265

From: Alan Cox
Date: Tue Apr 17 2007 - 09:53:44 EST


On Sat, 14 Apr 2007 16:58:43 +0200
Espen FjellvÃr Olsen <espen@xxxxxxxxxxxxxxxxx> wrote:

> Hi!
> Today our Opteron 265, 2x2, paniced after many months uptime, giving
> only this error message:
>
> HARDWARE ERROR
> CPU 2: Machine Check Exception: 4 Bank 4: b60a200100000813
> TSC 6bb9fd0142921a ADDR a891e9b8
> This is not a software problem!
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^]

This is there for a good reason.

> We updated glibc yesterday, but that shouldnt really cause such a problem.
> So now we wonder if this might be an MCE bug, or really a HW problem,
> and if it is one of the CPUs, or the RAM thats faulty.

Consult your hardware vendor but if its a single event in a year it might
be anything - even cosmic rays.

Alan
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/