RE: [PATCH v3] x86/mce: Try printing all machine check banks known before panic

From: Luck, Tony
Date: Fri Nov 21 2014 - 16:32:04 EST


>
> 	/*
> 	 * No machine check event found. Must be some external
> 	 * source or one CPU is hung. Panic.
> 	 */
> 	if (global_worst <= MCE_KEEP_SEVERITY && mca_cfg.tolerant < 3)
> 		mce_panic("Machine check from unknown source", NULL, NULL);
>
> Provided this comment is correct, it doesn't sound like any MCE record
> will ever tell us what causes the error as an external source or a hung
> CPU doesn't generate an MCE record in any bank, does it?

That means there were no VALID=1, EN=1, S=1 errors anywhere. But there
might be some other things logged that would help us understand.
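
To make that distinction concrete, here is a rough sketch (not the kernel code) of
how one might sort raw bank status values into "signaled this machine check"
versus "logged something, but did not signal". The bit positions are the
architectural IA32_MCi_STATUS ones; the macro and function names are made up
for this example, and it assumes a CPU where the S bit is defined (MCG_SER_P).

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* IA32_MCi_STATUS bit positions; names are local to this sketch. */
#define STATUS_VAL	(1ULL << 63)	/* register contents are valid */
#define STATUS_EN	(1ULL << 60)	/* error reporting was enabled */
#define STATUS_S	(1ULL << 56)	/* error was signaled via #MC */

/* True if this bank holds a valid, enabled, signaled error. */
static bool bank_signaled(uint64_t status)
{
	const uint64_t need = STATUS_VAL | STATUS_EN | STATUS_S;

	return (status & need) == need;
}

/*
 * Count banks that have something valid logged but did not signal
 * the machine check. These are the "other things" worth printing.
 */
static size_t count_logged_not_signaled(const uint64_t *status, size_t nbanks)
{
	size_t n = 0;

	for (size_t i = 0; i < nbanks; i++)
		if ((status[i] & STATUS_VAL) && !bank_signaled(status[i]))
			n++;
	return n;
}
```

If bank_signaled() is false for every bank we are in the "unknown source" case,
and anything count_logged_not_signaled() finds is all we have to go on.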

We are into CPU errata territory here though ... we aren't supposed to get
machine checks that don't have a logged cause. We panic for spurious
machine checks because we know something has gone horribly wrong,
even if we don't know what that something was.

-Tony