How to debug random bus errors?

From: Orion Poplawski
Date: Fri Sep 22 2006 - 13:12:50 EST


We're seeing programs die with "bus error" (SIGBUS) randomly on a dual processor Opteron machine. I've run memtest86+ and cpuburn stress tests with no failure. gdb on a core file seems uninteresting. Is there some way to trace the kernel to try to get more insight?

Thanks!

--
Orion Poplawski
System Administrator 303-415-9701 x222
NWRA/CoRA Division FAX: 303-415-9702
3380 Mitchell Lane orion@xxxxxxxxxxxxx
Boulder, CO 80301 http://www.cora.nwra.com

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/