Re: Machine Check Exception: 0...04

From: Jesper Juhl
Date: Sun Jun 17 2007 - 18:43:02 EST


On 18/06/07, Mr. James W. Laferriere <babydr@xxxxxxxxxxxxxxxx> wrote:
Hello All , As a continuation .

On Sat, 16 Jun 2007, Mr. James W. Laferriere wrote:
> Hello All , Does anoyone know howto identify a cause for these(*) ?
> Or of any tools to help in the identification of the cause ?
> So far the Machine checks only happen when I am running bonnie++
> against
> my software raid6 array .
>
> I have done everything I know to do to attempt to ascertain what is
> causing the machine checks .
> ie:
> 1) memtest86+ for days , no errors .
> 2) cpuburnP6 , The tests run were 'cpuburnP6 E' & 'cpuburnP6 H' for ~
> 60 minutes each . All CPU's & HT were at 96+<->100% for 60+ Minutes
> ,
> no excessive heating or lockups . In single user mode of course .
> I know cpuburn is old but it can excersize the comms between l1 & cpu
> & l1 & l2 -> cpu if done right .
>
> (*)
> CPU 5: Machine Check Exception: 0000000000000004
> CPU 4: Machine Check Exception: 0000000000000004
> Kernel panic - not syncing: Unable to continue
> <system reboots>
>

An MCE is an error reported by the hardware. It is most likely not a
software problem, not much kernel people can do about it.

Google for "parsemce.c" to find a program that'll decode most MCE's for you.

You may also want to contact your hardware vendor to get an exact
explanation for the error.

--
Jesper Juhl <jesper.juhl@xxxxxxxxx>
Don't top-post http://www.catb.org/~esr/jargon/html/T/top-post.html
Plain text mails only, please http://www.expita.com/nomime.html
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/