[QUESTION] Why is additional info printed when running "mce-inject uncorrect_error"?

From: Yu, Mingli
Date: Wed Dec 04 2024 - 03:12:07 EST


Hello,

I have noticed some additional output that seems unrelated to MCE when running "mce-inject uncorrect_error", even though the "quiet" option is added to the boot parameters. I am confused about why output unrelated to MCE is printed. The additional output disappears after adding "selinux=0 enforcing=0" to the boot parameters, and I don't understand why "selinux=0 enforcing=0" affects what is printed. Could you help explain? Many thanks!

# cat uncorrect_error
# this should just kill the process
CPU 0 BANK 1
STATUS UNCORRECTED
ADDR 0x1234

# mce-inject uncorrect_error
intel_rapl_common: Found RAPL domain package
intel_rapl_common: Found RAPL domain core
intel_rapl_common: Found RAPL domain dram
[00] BAD 00 ff ff ff ff ff ff 00 04 92 19 02 01 01 01 01
[00] BAD 15 12 01 03 6d 00 00 96 ef ee 91 a3 54 4c 99 26
[00] BAD ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
[00] BAD ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
[00] BAD ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
[00] BAD ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
[00] BAD ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
[00] BAD ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
fbcon: astdrmfb (fb0) is primary device
Console: switching to colour frame buffer device 128x48
ast 0000:06:00.0: [drm] fb0: astdrmfb frame buffer device
cfg80211: Loading compiled-in X.509 certificates for regulatory database
cfg80211: Loaded X.509 cert 'sforshee: 00b28ddf47aef9cea7'
cfg80211: Loaded X.509 cert 'wens: 61c038651aabdcf94bd0ac7ff06c7248db18c600'
8021q: 802.1Q VLAN Support v1.8
8021q: adding VLAN 0 to HW filter on device eth0
8021q: adding VLAN 0 to HW filter on device eth1
8021q: adding VLAN 0 to HW filter on device eth2
8021q: adding VLAN 0 to HW filter on device eth3
pps pps0: new PPS source ptp4
ixgbe 0000:07:00.0: registered PHC device on eth4
8021q: adding VLAN 0 to HW filter on device eth4
pps pps1: new PPS source ptp5
ixgbe 0000:07:00.1: registered PHC device on eth5
8021q: adding VLAN 0 to HW filter on device eth5
pps pps2: new PPS source ptp6
ixgbe 0000:08:00.0: registered PHC device on eth6
8021q: adding VLAN 0 to HW filter on device eth6
pps pps3: new PPS source ptp7
ixgbe 0000:08:00.1: registered PHC device on eth7
8021q: adding VLAN 0 to HW filter on device eth7
bridge: filtering via arp/ip/ip6tables is no longer available by default. Update your scripts to load br_netfilter if you need this.
igb 0000:04:00.1 eth1: igb: eth1 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
IPv6: ADDRCONF(NETDEV_CHANGE): eth1: link becomes ready
igb 0000:04:00.2 eth2: igb: eth2 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
IPv6: ADDRCONF(NETDEV_CHANGE): eth2: link becomes ready
igb 0000:04:00.3 eth3: igb: eth3 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX/TX
IPv6: ADDRCONF(NETDEV_CHANGE): eth3: link becomes ready
mce: CMCI storm subsided: switching to interrupt mode
mce: Triggering MCE exception on CPU 0
CPU: 0 PID: 1727 Comm: mce-inject Not tainted 5.15.169-rt76-yocto-preempt-rt #1
Hardware name: Supermicro SYS-E300-9A-16CN8TP/A2SDi-16C-TP8F, BIOS 1.4 01/29/2021
Call Trace:
<TASK>
dump_stack_lvl+0x38/0x4d
dump_stack+0x10/0x16
panic+0xaa/0x2ed
mce_panic+0x27d/0x29d
mce_timed_out+0x80/0xa2
do_machine_check+0x29b/0x684
raise_exception.constprop.0+0x61/0xa0
raise_local.isra.0+0x67/0xfc
mce_inject_raise.cold+0x26c/0x331
blocking_notifier_call_chain+0x68/0x90
mce_chrdev_write+0xc2/0x110
vfs_write+0xc7/0x300
ksys_write+0x6d/0xf0
__x64_sys_write+0x19/0x20
x64_sys_call+0x47c/0x1fa0
do_syscall_64+0x3d/0x90
entry_SYSCALL_64_after_hwframe+0x6c/0xd6
RIP: 0033:0x7fb4274c728f
Code: 89 54 24 18 48 89 74 24 10 89 7c 24 08 e8 69 ad f8 ff 48 8b 54 24 18 48 8b 74 24 10 41 89 c0 8b 7c 24 08 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 31 44 89 c7 48 89 44 24 08 e8 bc ad f8 ff 48
RSP: 002b:00007fb4273c6d70 EFLAGS: 00000293 ORIG_RAX: 0000000000000001
RAX: ffffffffffffffda RBX: 0000558b0c4ffd40 RCX: 00007fb4274c728f
RDX: 0000000000000058 RSI: 0000558b0c4f84a0 RDI: 0000000000000004
RBP: 0000000000000000 R08: 0000000000000000 R09: 00007ffd77a1bf4f
R10: 0000000000000008 R11: 0000000000000293 R12: 00007fb4273c7640
R13: 0000000000000000 R14: 00007fb427455820 R15: 0000000000000000
</TASK>
Kernel Offset: 0x2c400000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
Rebooting in 30 seconds..


When "selinux=0 enforcing=0" is added to the boot parameters and "mce-inject uncorrect_error" is run again, no additional info is printed, as shown below.

# mce-inject uncorrect_error
mce: CMCI storm subsided: switching to interrupt mode
mce: Triggering MCE exception on CPU 0
CPU: 0 PID: 1727 Comm: mce-inject Not tainted 5.15.169-rt76-yocto-preempt-rt #1
Hardware name: Supermicro SYS-E300-9A-16CN8TP/A2SDi-16C-TP8F, BIOS 1.4 01/29/2021
Call Trace:
<TASK>
dump_stack_lvl+0x38/0x4d
dump_stack+0x10/0x16
panic+0xaa/0x2ed
mce_panic+0x27d/0x29d
mce_timed_out+0x80/0xa2
do_machine_check+0x29b/0x684
raise_exception.constprop.0+0x61/0xa0
raise_local.isra.0+0x67/0xfc
mce_inject_raise.cold+0x26c/0x331
blocking_notifier_call_chain+0x68/0x90
mce_chrdev_write+0xc2/0x110
vfs_write+0xc7/0x300
ksys_write+0x6d/0xf0
__x64_sys_write+0x19/0x20
x64_sys_call+0x47c/0x1fa0
do_syscall_64+0x3d/0x90
entry_SYSCALL_64_after_hwframe+0x6c/0xd6
RIP: 0033:0x7fb4274c728f
Code: 89 54 24 18 48 89 74 24 10 89 7c 24 08 e8 69 ad f8 ff 48 8b 54 24 18 48 8b 74 24 10 41 89 c0 8b 7c 24 08 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 31 44 89 c7 48 89 44 24 08 e8 bc ad f8 ff 48
RSP: 002b:00007fb4273c6d70 EFLAGS: 00000293 ORIG_RAX: 0000000000000001
RAX: ffffffffffffffda RBX: 0000558b0c4ffd40 RCX: 00007fb4274c728f
RDX: 0000000000000058 RSI: 0000558b0c4f84a0 RDI: 0000000000000004
RBP: 0000000000000000 R08: 0000000000000000 R09: 00007ffd77a1bf4f
R10: 0000000000000008 R11: 0000000000000293 R12: 00007fb4273c7640
R13: 0000000000000000 R14: 00007fb427455820 R15: 0000000000000000
</TASK>
Kernel Offset: 0x2c400000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
Rebooting in 30 seconds..
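
In case it helps narrow this down, below is a minimal sketch for comparing the console loglevel on both boots (with and without "selinux=0 enforcing=0"). It only reads /proc/sys/kernel/printk, whose four fields are the current console loglevel, the default message loglevel, the minimum console loglevel and the default console loglevel. The helper names parse_printk and reaches_console are hypothetical and only for illustration. It assumes the extra lines are ordinary informational messages (level 6) that "quiet" kept off the console; it does not show why they appear at panic time.

```python
from pathlib import Path

# Standard procfs location of the printk loglevel settings.
PRINTK_PATH = Path("/proc/sys/kernel/printk")

# Field order as documented for the kernel.printk sysctl.
FIELD_NAMES = (
    "console_loglevel",
    "default_message_loglevel",
    "minimum_console_loglevel",
    "default_console_loglevel",
)


def parse_printk(text):
    """Parse the four whitespace-separated integers of kernel.printk."""
    values = [int(v) for v in text.split()]
    if len(values) != len(FIELD_NAMES):
        raise ValueError("expected 4 fields, got %d" % len(values))
    return dict(zip(FIELD_NAMES, values))


def reaches_console(message_level, console_loglevel):
    """A message is shown on the console only if its level is
    numerically lower than the current console loglevel."""
    return message_level < console_loglevel


if __name__ == "__main__" and PRINTK_PATH.exists():
    levels = parse_printk(PRINTK_PATH.read_text())
    print(levels)
    # 6 is KERN_INFO, the level assumed here for the unrelated lines.
    print("KERN_INFO reaches console:",
          reaches_console(6, levels["console_loglevel"]))
```

Running it before "mce-inject uncorrect_error" on each boot would show whether the two configurations start from the same console loglevel.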

Thanks,
Mingli