I posted this on alt.comp.periphs.mainboard.abit where someone suggested
I forward it to this list.
I don't actually read the list so if any further information is required
please email me.
Cheers, Jas
-- Jason D. Jordan Perth, Western Australia www.e3.com.au - e3 technology - the way of the future ...less than 13 months til the start of the 3rd Millennium! --------------B065428D91474C9A7258CBC9 Content-Type: message/rfc822 Content-Transfer-Encoding: 7bit Content-Disposition: inlineX-Mozilla-Status2: 00000000 Message-ID: <3858B30B.5EA7E398@swami.pcguru.com.au> Date: Thu, 16 Dec 1999 17:38:19 +0800 From: Jason Jordan <guru@swami.pcguru.com.au> Reply-To: guru@swami.pcguru.com.au X-No-Archive: Yes X-Mailer: Mozilla 4.7 [en] (X11; I; Linux 2.2.13 i686) X-Accept-Language: en MIME-Version: 1.0 Newsgroups: alt.comp.periphs.mainboard.abit Subject: Re: Dual Celery on BP6 References: <837p5o$t3c$1@the-fly.zip.com.au> <RQtYONVwbCD+16Aqq2+kb4VA6M80@4ax.com> <38581877.45E63209@inreach.com> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit
"Christopher R. Carlen" wrote:
> It seems they run well with "normal" (effectively light) loading, but > when Linux users and some BSD people tax the hell out of the system, > they lock. > > The verdict seems to still be out though, if this is a kernel or other > SMP bug in the OS, or a BX chip overheat problem, a Cel bug, or the > board.
I'm getting the lockup problems and have tried everything I've seen in here.
Yesterday, hoping that the problem might be related to 2.2 kernels, I compiled 2.3.33 and booted.
Apparently 2.3.33 starts logging APIC errors for the first time:
I saw the following over & pver until I locked up after about 40 minutes.
Dec 15 23:24:10 swami kernel: APIC error interrupt on CPU#0, should never happen. Dec 15 23:24:10 swami kernel: ... APIC ESR0: 00000000 Dec 15 23:24:10 swami kernel: ... APIC ESR1: 00000008 Dec 15 23:24:10 swami kernel: ... bit 3: APIC Receive Accept Error. Dec 15 23:24:55 swami kernel: APIC error interrupt on CPU#1, should never happen. Dec 15 23:24:55 swami kernel: ... APIC ESR0: 00000000 Dec 15 23:24:55 swami kernel: ... APIC ESR1: 00000008 Dec 15 23:24:55 swami kernel: ... bit 3: APIC Receive Accept Error. Dec 15 23:25:15 swami kernel: APIC error interrupt on CPU#0, should never happen. Dec 15 23:25:15 swami kernel: ... APIC ESR0: 00000008 Dec 15 23:25:15 swami kernel: ... APIC ESR1: 0000000a Dec 15 23:25:15 swami kernel: ... bit 1: APIC Receive CS Error (hw problem). Dec 15 23:25:15 swami kernel: ... bit 3: APIC Receive Accept Error. Dec 15 23:26:12 swami kernel: APIC error interrupt on CPU#1, should never happen. Dec 15 23:26:12 swami kernel: ... APIC ESR0: 00000008 Dec 15 23:26:12 swami kernel: ... APIC ESR1: 0000000a Dec 15 23:26:12 swami kernel: ... bit 1: APIC Receive CS Error (hw problem). Dec 15 23:26:12 swami kernel: ... bit 3: APIC Receive Accept Error.
This system is a Dual Celeron 400 - not overclocked - at 2.0v. I also tried 2.1v.
It is not overheating as I have the i2c stuff installed and can monitor temperatures.
It seems to me that this is demonstrating a *real* hardware fault.
Least we have somewhere to start.
Cheers, Jas
-- Jason D. Jordan Perth, Western Australia www.e3.com.au - e3 technology - the way of the future ...less than 13 months til the start of the 3rd Millennium!--------------B065428D91474C9A7258CBC9--
- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/