Re: Machine Check Exception on Opteron 265

From: Matthew Garrett
Date: Tue Apr 17 2007 - 09:58:32 EST


On Tue, Apr 17, 2007 at 02:56:18PM +0100, Alan Cox wrote:
> On Sat, 14 Apr 2007 16:58:43 +0200
> Espen Fjellvær Olsen <espen@xxxxxxxxxxxxxxxxx> wrote:
> > Hi!
> > Today our Opteron 265, 2x2, paniced after many months uptime, giving
> > only this error message:
> >
> > HARDWARE ERROR
> > CPU 2: Machine Check Exception: 4 Bank 4: b60a200100000813
> > TSC 6bb9fd0142921a ADDR a891e9b8
> > This is not a software problem!
> ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^]
>
> This is there for a good reason.

Though we saw MCEs being generated by running the HPA code on sata_nv
with awdma, so it's not always true. I agree that in this case, it
probably is.

--
Matthew Garrett | mjg59@xxxxxxxxxxxxx
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/