RE: [PATCH 3/3] ie31200_edac: Add driver

From: Luck, Tony
Date: Wed Apr 09 2014 - 17:33:25 EST


> Unfortunately, the box reporting the ue errors just went into transit (so
> that I can better examine this issue), so I will probably not be able to
> run this experiment on that specific box until next week.

Do you have any other logs from this machine. Is there something
logged in one (or more) of the machine check banks when your EDAC
driver says that there are uncorrected errors?

When the box is back online again - I'd be interested to know if mcelog(8)
daemon reports any errors. Grab the latest from mcelog.org, compile
and run as "mcelog --daemon". Logs show up in /var/log/mcelog

> # ./rdmsr 0x179
> c09

So this processor does support CMCI - next question is whether each
bank support it (and got enabled by Linux) [can run on any system ... don't
need to wait for the one to finish transit)]

# for I in `seq 0 8`
do
./rdmsr 0x28$i
done

will print the MCi_CTL2 registers from each bank. Bit 30 (0x40000000)
shows CMCI enabled.

On the name of the driver - can you throw in an underscore: ie3_12xx.c ?

Do you have systems from Sandy Bridge, Ivy Bridge and Haswell generations
(no suffix for Sandy Bridge, then v2 and v3) ... and does this driver work across all of
them? If it is just for Haswell ... then "ie3_12xx_v3.c" might be a better name.

-Tony
N‹§²æ¸›yú²X¬¶ÇvØ–)Þ{.nlj·¥Š{±‘êX§¶›¡Ü}©ž²ÆzÚj:+v‰¨¾«‘êZ+€Êzf£¢·hšˆ§~†­†Ûÿû®w¥¢¸?™¨è&¢)ßf”ùy§m…á«a¶Úÿ 0¶ìå