Re: Lockups with Intel EtherExpress Pro 10/100

Fred Reimer (Fred.Reimer@bellsouth.net)
Fri, 16 Apr 1999 10:18:41 -0400


Although it's not a "solution," did you try compiling the driver as a
module? If you do that, can you rmmod the driver when this happens and
insmod it again instead of rebooting the whole box? Is there any debugging
options (compile flags) that you can turn on in the driver to give more
information? I don't have the sources available now, but check out the
source for the driver and see if there are any such optional debug flags.

Hopefully, someone with more knowledge than me about the drivers (which is
about nil) will be able to help...

fwr

----- Original Message -----
From: Leslie F. Donaldson <donaldlf@cs.rose-hulman.edu>
To: <linux-kernel@vger.rutgers.edu>
Sent: Thursday, April 15, 1999 1:50 PM
Subject: Lockups with Intel EtherExpress Pro 10/100

> Hello,
> I thought I would report this to the kernel group to see if anyone
> has a suggestion or solution:
> Hardware:
> Intel Nightshade motherboard Dual Pentium II 450
> 512 Meg memory
> 3 Intel EtherExpress Pro 10/100
> on board ncr53c8xx
>
> Software:
> kernel 2.2.1 or 2.2.5
> NO X
> (These are massive servers)
>
> Situation:
> The machine is crusing along with a load average of 0.01 and all of a
> sudden one of the ethernet boards will lock up.
>
> Message:
> Apr 14 10:36:56 dsidev kernel: eth0: Trying to restart the transmitter...
> Apr 14 10:37:00 dsidev kernel: nfs: server nfs1 not responding, timed out
> Apr 14 10:37:01 dsidev kernel: eth0: Transmit timed out: status 7048 0000
at 31
> 048594/31048609 command 000ca000.
> Apr 14 10:37:01 dsidev kernel: eth0: Trying to restart the transmitter...
> Apr 14 10:37:06 dsidev kernel: eth0: Transmit timed out: status 7048 0000
at 31
> 048594/31048609 command 000ca000.
> Apr 14 10:37:06 dsidev kernel: eth0: Trying to restart the transmitter...
> Apr 14 10:37:10 dsidev kernel: nfs: server nfs1 not responding, timed out
> Apr 14 10:37:11 dsidev kernel: eth0: Transmit timed out: status 7048 0000
at 31
> 048594/31048609 command 000ca000.
> Apr 14 10:37:11 dsidev kernel: eth0: Trying to restart the transmitter...
> Apr 14 10:37:16 dsidev kernel: eth0: Transmit timed out: status 7048 0000
at 31
> 048594/31048609 command 000ca000.
>
>
> The only way to reset the card is to reboot the machine.
> This has happend in different PCI slots different kernel versions
> different PCI cards and at both 10 an 100 speeds.
>
> Here are two proc infos to show machine setups
> Linux 2.2.5 (root@rhdsdev) (gcc 2.7.2.3) #2 [rhdsdev.]
>
> Memory: Total Used Free Shared Buffers
Cached
> Mem: 517376 440208 77168 63012 251096
61392
> Swap: 130748 0 130748
>
> Bootup: Wed Apr 14 11:57:34 1999 Load average: 0.00 0.01 0.00 1/72 6503
>
> user : 0:14:09.33 0.9% page in : 849163 disk 1: 23491r
16195w
> nice : 0:00:00.00 0.0% page out: 703315 disk 2: 15648r
16064w
> system: 0:07:06.13 0.5% swap in : 1 disk 3: 15440r
15604w
> idle : 2d 1:20:01.52 198.6% swap out: 0 disk 4: 15490r
15704w
> uptime: 1d 0:50:38.47 context : 4787905
>
> irq 0: 8943849 -edge timer irq 13: 1 fpu
> irq 1: 1060 -edge keyboard irq 14: 6 -edge ide0
> irq 2: 0 cascade [4] irq 18: 3930608 -level Intel
EtherE
> irq 6: 3 irq 19: 1960
> irq 7: 5 irq 21: 163899 -level
ncr53c8xx
> irq 12: 4 irq 22: 35 -level
ncr53c8xx
>
>
> Linux 2.2.5 (root@ds6) (gcc 2.7.2.3) #5 SMP Tue Mar 30 08:06:24 CST 1999
[ds6.]
>
> Memory: Total Used Free Shared Buffers
Cached
> Mem: 517376 493868 23508 56852 202028
250020
> Swap: 130748 4 130744
>
> Bootup: Thu Apr 1 19:58:59 1999 Load average: 0.00 0.00 0.00 2/63 2380
>
> user : 3:48:00.12 1.2% page in : 3291034 disk 1: 123228r
1051476w
> nice : 1:57:55.60 0.6% page out: 9199211 disk 2: 125769r
724545w
> system: 4:43:50.57 1.4% swap in : 1 disk 3: 136692r
406799w
> idle : 26d 21:09:09.51 196.8% swap out: 1 disk 4: 117176r
503785w
> uptime: 13d 15:49:27.87 context :669141590
>
> irq 0: 118016790 -edge timer irq 14: 16486 -edge ide0
> irq 1: 1708 -edge keyboard irq 18: 137458199 -level Intel
EtherE
> irq 2: 0 cascade [4] irq 19: 181790630 -level Intel
EtherE
> irq 6: 3 irq 20: 784312 -level Intel
EtherE
> irq 7: 5 irq 21: 3568079 -level
ncr53c8xx
> irq 12: 4 irq 22: 32 -level
ncr53c8xx
> irq 13: 1 fpu
>
>
>
> Both of these machines have had at least one lockup. I have 8 machines
> with near identical setups. The lookups occur at about a rate of
> 1 per week.
>
> Please cross mail any info to:
>
> donaldlf@cs.rose-hulman.edu
>
> Thank You
> Leslie Donaldson
> /----------------------------\ Current Contractor: Onsite Engineering
> | Leslie F. Donaldson | Current Customer : Entergy - Nuclear GGNS
> | Computer Contractor | Skills:
Unix/OS9/VMS/Linux/SUN-OS/C/C++/assembly
> | Have Computer will travel. | Curr :
http://www.cs.rose-hulman.edu/~donaldlf
> \----------------------------/ Current Email :
donaldlf@cs.rose-hulman.edu
> Goth Code V1.1: GoCS$$ TYg(T6,T9) B11Bk!^1 C6b-- P0(1,7) M+ a24 n--- b++:+
> H6'11" g m---- w+ r+++ D--~!% h+ s10 k+++ R-- Ssw LusCA++
>
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.rutgers.edu
> Please read the FAQ at http://www.tux.org/lkml/
>

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/