Re: mcelog ?

From: Bernd Pfrommer
Date: Mon May 22 2006 - 06:59:41 EST


Stephan von Krawczynski <skraw <at> ithnet.com> writes:

>
> Hello,
>
> can some kind soul please shortly explain what this message tells me:
>
> HARDWARE ERROR
> CPU 1: Machine Check Exception: 4 Bank 4: b60a200170080813
> TSC 89cfb4725b17 ADDR 1025cb3f0
> This is not a software problem!
> Run through mcelog --ascii to decode and contact your hardware vendor
> Kernel panic - not syncing: Machine check
>
> Of course I ran mcelog but I don't quite understand how the additional info
> helps me finding the problem.
> Is this a problem with RAM? And if, which one?
>
> The box is a dual opteron with two banks of mem (4 sockets each), each socket
> holding a 1 GB mem module.
>
> Thanks for any hints.


I got a very similar error on a supermicro H8QC8+ (4way dual-core opteron)
during heavy disk writes. It only happened once so far. The error message also
mentioned
4 Bank 4: b608a00100000813 (strange that the last 4 digits agree).

Bernd


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/