Q: EDAC/kprintf/Xen issue (long logs inline)
From: Ulrich Windl
Date: Tue Oct 07 2014 - 08:05:31 EST
Hi!
I have a somewhat strange issue on a Xen host running SLES11 SP3 on an HP DL380 G7 server (two 6-core Xeon 5650 CPUs): At some point the system had RAM problems, and in one case the messages seemed to overwrite each other, as seen in syslog. I wonder whether the locking of kprintf() is broken. See for yourself:
Mar 14 10:06:40 h05 kernel: [679593.489003] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:40 h05 kernel: [679593.489010] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:40 h05 kernel: [679593.489014] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:40 h05 kernel: [679593.489019] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:40 h05 kernel: [679593.489023] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:40 h05 kernel: [679593.489027] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:40 h05 kernel: [679593.489031] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
[...and so on...]
Mar 14 10:06:41 h05 kernel: [679593.501561] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501568] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501575] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501583] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501590] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501597] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501604] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501611] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501618] EDAC MC1: CE rohannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1hanne
l 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501647] EDAhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=hannel 0, label "": Corrected error (Socket=1 chhannel 0, label
"": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected
error (Socket=1 hannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1
hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 hannel 0, label
"": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected e
rror (Socket=1
Mar 14 10:06:41 h05 kernel: hannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected err
or (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.501830] EDAC MC1: CE row 6, channehannel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error
(Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 h
annel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label
"": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected er
ror (Socket=1 channel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1
hannel 0, labe
Mar 14 10:06:41 h05 kernel: l "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=han
nel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label ""
: Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected
error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Sock
et=1 chahannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502074] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Sockehannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 cha
hannel 0, label "": Corrected error (Sohannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "":
Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected er
ror (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502135] EDAC hannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chhannel 0, l
abel "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502159] EDAChannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0,
label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Co
rrected error (Socket=1 channel 0, label "": Corrected error (Sochannel 0, label "": Corrected error (Socket=1 channelhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error
(Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1
chhannel 0, label "":hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Sockehannel 0, label "": Corrected error (Socket
=1 chahannel 0
Mar 14 10:06:41 h05 kernel: , label "": Corrected error (Socket=1hannel 0, label "": Correctedhannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502258] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502262] EDAC MC1: CE row 6, channel 0, label "": Corrected errhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel=2
dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502275] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502281] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Shannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhann
el 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label
"": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502314] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502318] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502322] EDAC MC1:hannel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhann
el 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502342] EDAChannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, la
bel "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Co
rrected error (Sockethannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502379] EDAhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, la
bel "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corre
cted error (Socket=1 channel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected errorhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 ch
hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel=2 di
mm=0)
Mar 14 10:06:41 h05 kernel: [679593.502448] EDAhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502456] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.50hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "":
Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected
error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Sochannel 0, label "": Corrected error (Socket=1
chahannel 0, label "": Corrected error (Sockethannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channehannel 0,
label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channehannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502544] EDAChannel 0, label "": Corrected error (Sockethannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label
"": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Sockhannel 0, label "": Corrected err
or (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Sockehannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 cha
nnel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.502600] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [67959hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corre
cted error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected erro
r (Socket=1 chhannel 0, label "": Corrected error (Sockhannel 0, label "": Corrected error (Sockethannel 0, label "": Corrected error (Socket=1 channhannel 0, label "": Corrected error (Socket=1 chanhan
nel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label
"": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Correc
ted error (Soc
Mar 14 10:06:41 h05 kernel: ket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Cor
rected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (S
ocket=1 hannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 cha
nnel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, lab
el "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Correc
ted error (Socket
Mar 14 10:06:41 h05 kernel: =1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Correcte
d error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Sock
et=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel
0, label "": Corrected error (Socket=hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Sochannel 0, label "": Corrected error (Sockehannel 0, label "": Corrected error
(Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Sochan
nel 0, label "":
Mar 14 10:06:41 h05 kernel: Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chh
annel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, la
bel "": Corrected error (Sochannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Sohannel 0, label "": Corrected error (So
cket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Sockhannel 0, label "": Corrected error (Socket=1 channel 0,
label "": Corrected error (Sockethannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Correc
ted error (Socket
Mar 14 10:06:41 h05 kernel: =1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Correct
ed error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Soc
ket=1 chhannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0
, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Sockhannel 0, label "": Correc
ted error (Sockehannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Sockethannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 c
hannel 0, label "
Mar 14 10:06:41 h05 kernel: ": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1
channhannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503068] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [67959hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503082] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503086] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (S
ocket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.5031hannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503104] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Sochannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 cha
nnel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503116] EDAChannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Sockehannel 0, label "": Corrected error (Socket=1 chahannel 0, label
"": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected e
rror (Socket=hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 c
hannel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503170] EDAC Mhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0,
label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": C
orrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503213] EDAhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, lab
el "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Sockehannel 0, label "": Corrected er
ror (Socket=1 chhannel 0, label "": Corrected error (Sockethannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 c
hhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0,
label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label "":
Corrected erro
Mar 14 10:06:41 h05 kernel: r (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label
"": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Correcte
d error (Socket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (S
ocket=1 hannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chan
nel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 chahannel 0,
label "": Correct
Mar 14 10:06:41 h05 kernel: ed error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503386] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
[...]
Mar 14 10:06:41 h05 kernel: [679593.503646] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503664] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503668] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.50hannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503676] hannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503681] EDhannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503689] EDAhannel 0, label "": Corrected error (Sockehannel 0, label "": Channel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Socket=
1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503715] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503719] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503723] EDAChannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503734] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: hannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503747] EDAC MC1: CE row 6, channel 0, label "": Corrected ehannel 0, label "": Corrected error (Socket=1 channehannel 0, label "": Corrected error (Sockhannel 0, lab
el "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected error (Sochannel 0, label "": Corrected error (Socket=1 chhannel 0, label "": Corrected
error (Socket=hannel 0, label "": Corrected error (Socket=1 hannel 0, label "": Corrected error (Sockehannel 0, label "": Corrected error (Socket=1 chahannel 0, label "": Corrected error (Sockhannel 0,
label "": Corrected error (Sochannel 0, label "": Corrected error (Socket=1 chanhannel 0, label "": Corrected error (Sockethannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected err
or (Socket=1 chhannel 0, label "": Corrected error (Sockehannel 0, label "": Corrected error (Socket=1hannel 0, label "": Corrected error (Socket=1 channel 0, label "": Corrected error (Socket=1 channel
=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503841] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503845] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503849] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503853] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503857] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503861] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503865] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
Mar 14 10:06:41 h05 kernel: [679593.503869] EDAC MC1: CE row 6, channel 0, label "": Corrected error (Socket=1 channel=2 dimm=0)
[...]
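To put a number on the damage in an excerpt like the one above, a rough heuristic sketch could classify each syslog line as intact, damaged, or unrelated. This is only an illustration: the regular expression is inferred from the intact lines shown here, the names `INTACT`, `FRAGMENT`, `classify_line` and `summarize` are made up for this example, and very short wrapped tails (such as "mm=0)") will be counted as unrelated.

```python
import re
from collections import Counter

# Shape of an intact EDAC corrected-error record, inferred from the
# undamaged lines in the excerpt (assumption: other hosts log the same way).
INTACT = re.compile(
    r'^\w{3} +\d+ \d\d:\d\d:\d\d \S+ kernel: \[ *\d+\.\d{6}\] '
    r'EDAC MC\d+: CE row \d+, channel \d+, label "[^"]*": '
    r'Corrected error \(Socket=\d+ channel=\d+ dimm=\d+\)$'
)

# Any piece of an EDAC record; used to spot lines that carry fragments
# of the message without being a complete record.
FRAGMENT = re.compile(r'EDAC|Corrected|label ""|Socket=|dimm=')


def classify_line(line):
    """Return 'intact', 'damaged' or 'other' for one syslog line."""
    line = line.rstrip("\n")
    if INTACT.match(line):
        return "intact"
    if FRAGMENT.search(line):
        return "damaged"
    return "other"


def summarize(lines):
    """Count how many lines fall into each class."""
    return Counter(classify_line(line) for line in lines)
```

Running `summarize(open(path))` over the saved syslog (with `path` pointing at wherever the log is stored) gives a count per class, which makes it easier to compare the Xen host against the non-Xen one.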
On a non-Xen host (same hardware) I don't see this kind of message corruption:
Jan 17 01:05:11 h04 kernel: [2724087.160257] EDAC MC0: CE row 2, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=2)
[...]
Aug 13 05:01:40 h04 kernel: [2797680.835057] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 05:01:40 h04 kernel: [2797680.835064] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 05:01:40 h04 kernel: [2797680.835068] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 05:01:40 h04 kernel: [2797680.835073] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 05:58:28 h04 kernel: [2801088.028505] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 05:58:28 h04 kernel: [2801088.028511] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:01 h04 kernel: [2801180.743866] CMCI storm detected: switching to poll mode
Aug 13 06:00:01 h04 kernel: [2801181.003188] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:01 h04 kernel: [2801181.003194] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:01 h04 kernel: [2801181.003198] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:01 h04 kernel: [2801181.003202] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 06:00:02 h04 kernel: [2801182.003227] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:02 h04 kernel: [2801182.003230] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:02 h04 kernel: [2801182.003232] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:02 h04 kernel: [2801182.003234] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:07 h04 kernel: [2801187.001381] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:16 h04 kernel: [2801195.998847] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:24 h04 kernel: [2801203.900612] CMCI storm subsided: switching to interrupt mode
Aug 13 06:00:24 h04 kernel: [2801203.900618] CPU 2 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 06:00:24 h04 kernel: [2801204.000640] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:00:53 h04 kernel: [2801232.916638] CPU 0 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 06:00:54 h04 kernel: [2801233.652425] CPU 22 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 06:00:54 h04 kernel: [2801233.676407] CPU 20 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 06:00:54 h04 kernel: [2801233.701573] CPU 18 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 06:00:54 h04 kernel: [2801233.724421] CPU 16 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 06:01:46 h04 kernel: [2801285.986361] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:01:46 h04 kernel: [2801285.986368] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:01:46 h04 kernel: [2801285.986372] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:01:46 h04 kernel: [2801285.986376] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 06:01:47 h04 kernel: [2801286.985912] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 07:54:17 h04 kernel: [2808034.584062] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:17 h04 kernel: [2808034.584064] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:17 h04 kernel: [2808034.584067] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:17 h04 kernel: [2808034.584069] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:17 h04 kernel: [2808034.584071] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:17 h04 kernel: [2808034.931978] CMCI storm detected: switching to poll mode
Aug 13 07:54:18 h04 kernel: [2808035.583593] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:18 h04 kernel: [2808035.583599] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:18 h04 kernel: [2808035.583603] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:18 h04 kernel: [2808035.583607] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:18 h04 kernel: [2808035.583612] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 07:54:18 h04 kernel: [2808035.583653] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:18 h04 kernel: [2808035.583657] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:23 h04 kernel: [2808040.582274] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:54:47 h04 kernel: [2808064.923646] CMCI storm subsided: switching to interrupt mode
Aug 13 07:54:47 h04 kernel: [2808064.923656] CPU 22 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 07:55:17 h04 kernel: [2808094.915444] CPU 0 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 07:55:17 h04 kernel: [2808094.915455] CPU 20 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 07:55:17 h04 kernel: [2808094.915473] CPU 2 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 07:55:17 h04 kernel: [2808094.915688] CPU 4 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 07:55:17 h04 kernel: [2808094.915842] CPU 6 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 07:55:54 h04 kernel: [2808131.557694] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:55:54 h04 kernel: [2808131.557700] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 07:55:54 h04 kernel: [2808131.557704] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 08:42:09 h04 kernel: [2810906.123879] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:09 h04 kernel: [2810906.123881] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:09 h04 kernel: [2810906.123883] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:09 h04 kernel: [2810906.123886] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:09 h04 kernel: [2810906.368343] CMCI storm detected: switching to poll mode
Aug 13 08:42:10 h04 kernel: [2810907.123636] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:10 h04 kernel: [2810907.123643] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:10 h04 kernel: [2810907.123648] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:10 h04 kernel: [2810907.123652] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 08:42:13 h04 kernel: [2810910.122787] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:13 h04 kernel: [2810910.122791] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:13 h04 kernel: [2810910.122795] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:13 h04 kernel: [2810910.122800] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:42:39 h04 kernel: [2810936.360923] CMCI storm subsided: switching to interrupt mode
Aug 13 08:42:39 h04 kernel: [2810936.360932] CPU 20 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 08:42:47 h04 kernel: [2810944.009597] CPU 22 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 08:43:00 h04 kernel: [2810957.118128] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:43:00 h04 kernel: [2810957.118132] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:43:09 h04 kernel: [2810966.351528] CPU 16 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 08:43:09 h04 kernel: [2810966.351563] CPU 0 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 08:43:09 h04 kernel: [2810966.351580] CPU 18 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 08:43:09 h04 kernel: [2810966.351692] CPU 2 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 08:44:14 h04 kernel: [2811031.102138] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:44:14 h04 kernel: [2811031.102142] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 08:44:14 h04 kernel: [2811031.102145] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:22:33 h04 kernel: [2816928.092940] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:26:10 h04 kernel: [2817145.046007] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:26:17 h04 kernel: [2817152.044197] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:26:52 h04 kernel: [2817187.050696] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:18 h04 kernel: [2817393.365271] CMCI storm detected: switching to poll mode
Aug 13 10:30:19 h04 kernel: [2817394.042582] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:19 h04 kernel: [2817394.042588] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:19 h04 kernel: [2817394.042592] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:30:48 h04 kernel: [2817423.042713] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:48 h04 kernel: [2817423.042715] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:48 h04 kernel: [2817423.042717] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:48 h04 kernel: [2817423.042720] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:48 h04 kernel: [2817423.042722] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:48 h04 kernel: [2817423.042724] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:48 h04 kernel: [2817423.354451] CMCI storm subsided: switching to interrupt mode
Aug 13 10:30:48 h04 kernel: [2817423.354456] CPU 20 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 10:30:48 h04 kernel: [2817423.553247] CMCI storm detected: switching to poll mode
Aug 13 10:30:49 h04 kernel: [2817424.046318] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:49 h04 kernel: [2817424.046321] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:49 h04 kernel: [2817424.046324] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:49 h04 kernel: [2817424.046326] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:30:49 h04 kernel: [2817424.046328] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:31:18 h04 kernel: [2817453.046749] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:18 h04 kernel: [2817453.046751] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:18 h04 kernel: [2817453.046753] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:18 h04 kernel: [2817453.046755] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:18 h04 kernel: [2817453.543102] CMCI storm subsided: switching to interrupt mode
Aug 13 10:31:18 h04 kernel: [2817453.543108] CPU 14 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 10:31:19 h04 kernel: [2817453.729057] CMCI storm detected: switching to poll mode
Aug 13 10:31:19 h04 kernel: [2817454.047029] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:19 h04 kernel: [2817454.047033] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:19 h04 kernel: [2817454.047036] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:19 h04 kernel: [2817454.047039] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:19 h04 kernel: [2817454.047042] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:31:45 h04 kernel: [2817480.043281] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:45 h04 kernel: [2817480.043286] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:45 h04 kernel: [2817480.043290] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:45 h04 kernel: [2817480.043295] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:31:49 h04 kernel: [2817483.718417] CMCI storm subsided: switching to interrupt mode
Aug 13 10:31:49 h04 kernel: [2817483.718426] CPU 6 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 10:32:03 h04 kernel: [2817498.038262] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:32:03 h04 kernel: [2817498.038265] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:32:03 h04 kernel: [2817498.038271] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:32:04 h04 kernel: [2817499.037942] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:32:04 h04 kernel: [2817499.037948] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:32:09 h04 kernel: [2817504.404466] CPU 8 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:32:18 h04 kernel: [2817513.526004] CPU 0 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:32:18 h04 kernel: [2817513.526029] CPU 2 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:32:19 h04 kernel: [2817513.709916] CPU 16 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:32:19 h04 kernel: [2817513.709945] CPU 22 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:32:42 h04 kernel: [2817537.027747] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:16 h04 kernel: [2817570.937450] CMCI storm detected: switching to poll mode
Aug 13 10:33:16 h04 kernel: [2817571.026519] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:16 h04 kernel: [2817571.026525] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:16 h04 kernel: [2817571.026529] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:33:45 h04 kernel: [2817600.034964] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:45 h04 kernel: [2817600.034968] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:45 h04 kernel: [2817600.034972] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:45 h04 kernel: [2817600.034976] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817600.926265] CMCI storm subsided: switching to interrupt mode
Aug 13 10:33:46 h04 kernel: [2817600.926274] CPU 18 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 10:33:46 h04 kernel: [2817601.034923] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817601.034927] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817601.034930] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817601.034932] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:33:46 h04 kernel: [2817601.035080] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817601.035082] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817601.035084] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817601.035086] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:46 h04 kernel: [2817601.142363] CMCI storm detected: switching to poll mode
Aug 13 10:33:47 h04 kernel: [2817602.034580] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:47 h04 kernel: [2817602.034587] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:47 h04 kernel: [2817602.034591] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:47 h04 kernel: [2817602.034595] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:33:47 h04 kernel: [2817602.034599] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:34:16 h04 kernel: [2817631.026523] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:16 h04 kernel: [2817631.026528] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:16 h04 kernel: [2817631.026532] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:16 h04 kernel: [2817631.026536] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:16 h04 kernel: [2817631.134412] CMCI storm subsided: switching to interrupt mode
Aug 13 10:34:16 h04 kernel: [2817631.134420] CPU 8 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 10:34:16 h04 kernel: [2817631.334460] CMCI storm detected: switching to poll mode
Aug 13 10:34:17 h04 kernel: [2817632.026573] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:17 h04 kernel: [2817632.026577] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:17 h04 kernel: [2817632.026579] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
[...]
Aug 13 10:34:42 h04 kernel: [2817657.019136] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:42 h04 kernel: [2817657.019139] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:42 h04 kernel: [2817657.019141] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:42 h04 kernel: [2817657.019143] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:42 h04 kernel: [2817657.019145] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:42 h04 kernel: [2817657.019147] EDAC MC0: CE row 0, channel 0, label "": Corrected error (Socket=0 channel=0 dimm=0)
Aug 13 10:34:46 h04 kernel: [2817661.327218] CMCI storm subsided: switching to interrupt mode
Aug 13 10:34:46 h04 kernel: [2817661.327224] CPU 22 MCA banks CMCI:2 CMCI:3 CMCI:5 CMCI:6 CMCI:8
Aug 13 10:34:48 h04 kernel: [2817663.289320] CPU 20 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:34:49 h04 kernel: [2817663.669468] CPU 4 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:34:49 h04 kernel: [2817663.669612] CPU 6 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:35:16 h04 kernel: [2817691.317774] CPU 0 MCA banks CMCI:2 CMCI:3 CMCI:5
Aug 13 10:35:16 h04 kernel: [2817691.317837] CPU 2 MCA banks CMCI:2 CMCI:3 CMCI:5
[...no more EDAC messages since then...]
The kernel on this machine is "kernel-default-3.0.101-0.31.1". CPU details:
processor : 23
vendor_id : GenuineIntel
cpu family : 6
model : 44
model name : Intel(R) Xeon(R) CPU X5650 @ 2.67GHz
stepping : 2
microcode : 26
I feel the EDAC log messages are not very informative, and I think they should be throttled and summarized somehow.
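To illustrate what "summarized" could mean for the excerpts above, here is a minimal user-space sketch in Python that collapses runs of consecutive identical kernel messages into one line with a repeat count. It is not a kernel-side fix and not an existing tool: the names LINE_RE, summarize and format_runs are hypothetical, and the regular expression assumes the syslog layout seen in the lines quoted in this message.

```python
import re
from itertools import groupby

# Assumed layout, taken from the lines quoted in this message:
# "<Mon DD HH:MM:SS> <host> kernel: [<uptime>] <message>"
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+ \d\d:\d\d:\d\d) (?P<host>\S+) kernel: "
    r"\[\s*(?P<uptime>\d+\.\d+)\] (?P<msg>.*)$"
)


def summarize(lines):
    """Collapse runs of consecutive identical kernel messages.

    Returns a list of (first_stamp, last_stamp, count, message) tuples,
    one per run. Lines that do not match LINE_RE are skipped.
    """
    parsed = []
    for line in lines:
        m = LINE_RE.match(line.rstrip("\n"))
        if m:
            parsed.append((m.group("stamp"), m.group("msg")))

    runs = []
    # groupby only merges adjacent items, so ordering of the log is preserved.
    for msg, group in groupby(parsed, key=lambda item: item[1]):
        stamps = [stamp for stamp, _ in group]
        runs.append((stamps[0], stamps[-1], len(stamps), msg))
    return runs


def format_runs(runs):
    """Render each run as one line; repeated runs get a time span and count."""
    out = []
    for first, last, count, msg in runs:
        if count == 1:
            out.append(f"{first} {msg}")
        else:
            out.append(f"{first} .. {last} [{count}x] {msg}")
    return out


if __name__ == "__main__":
    import sys

    # Usage (hypothetical file name): python summarize_edac.py < messages
    for line in format_runs(summarize(sys.stdin)):
        print(line)
```

Grouping only adjacent lines keeps the "CMCI storm" and "MCA banks" messages in their original position between the EDAC runs, so the sequence of events stays readable while the repeated lines shrink to a count.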
Regards,
Ulrich