System locks on network activity under 2.0.0 and 1.99.7

Chuck Mattern (cmattern@mindspring.com)
4 Jul 1996 19:30:38 GMT


I had posted about this earlier but have sine found new facts. I am
running a Triton Chipset based P133 with AHA2940 scsi card, Linksys
LNEPCI ethernet card, STB 2meg pci video card, Soundblaster 16, 32
megs of ram. With the 2.0.0 kernel any network activity that hits the
ethernet card causes a hard lock with no output to syslog. This
happens with the Linksys and also with an NE2000 isa clone. This
system is rock solid under the 1.2.13 kernel and I am running it that
way now. All of the 2.0.0 upgrades to Netkit, ppp, procps, etc have
been applied and are stable with the 1.2.13 kernel.

I began to try to back trace through the kernel revisions by building
1.99.7 and installing that. The system seemed to come up fine but I
was unable to ping anything on the local lan. The system did not
crash when packets hit the card but nothing got out or in. Other
machines on the net could not get any responses to pings. When I
terminated the outbound ping on the machine with the problem I got an
error message indicated that a socket had been destroyed:

Socket destroyed (r=0 w=2992)

I then tried a netstat -r to check the routing and the system locked
hard agin apparently while trying to resolve an ip address to a name.
The behavior under 2.0.0 is similar, excpet that the system locks at
the ping, and never come back.

I've included boot time messages from the last 1.99.7 boot below for
anyone who is interested in helping. Note that the message at the end
about eth0 have no media is incorrect, I get it under all three of the
kernels I am working with, and occaisionally under 1.2.13 I will see a
message like this:

Jul 4 12:27:14 ronin kernel: eth0: Media is AUI.

This is truly bogus since the card supports 10baseT and 2baseT, not
even an AUI on it. Nevertheless everything runs fine under 1.2.13. I
would REALLY like to move up to 2.0.x but am not sure what to do next
in trying to get this resolved. I'd appreciate any advice.

Thanks in advance,
Chuck

[boot time messages follow]

p.. ok - 53.25 BogoMIPS
Memory: 30848k/32768k available (776k kernel code, 384k reserved, 760k data)
Swansea University Computer Society NET3.035 for Linux 2.0
NET3: Unix domain sockets 0.12 for Linux NET3.035.
Swansea University Computer Society TCP/IP for NET3.034
IP Protocols: ICMP, UDP, TCP
Checking 386/387 coupling... Ok, fpu using exception 16 error reporting.
Checking 'hlt' instruction... Ok.
Linux version 1.99.9 (root@ronin.mindspring.com) (gcc version 2.7.2) #2-pre-2.0 Thu Jul 4 00:19:26 EDT 1996
Serial driver version 4.12 with no serial options enabled
tty00 at 0x03f8 (irq = 4) is a 16550A
tty01 at 0x02f8 (irq = 3) is a 16550A
lp0 at 0x03bc, (polling)
Sound initialization started
<SoundBlaster 16 4.11> at 0x220 irq 5 dma 1,5
<SoundBlaster 16 4.11> at 0x220 irq 5 dma 1,5
Sound initialization complete
Ramdisk driver initialized : 16 ramdisks of 4096K size
Floppy drive(s): fd0 is 2.88M AMI BIOS
Started kswapd v 1.4.2.2
FDC 0 is an 8272A
aic7xxx: BurstLen = 8 DWDs, Latency Timer = 32 PCLKS
aic7xxx: AHA-2940 Rev B.
aic7xxx: devconfig = 0x500.
aic7xxx: Reading SEEPROM...done.
aic7xxx: Extended translation enabled.
aic7xxx: Using 16 SCB's after checking for SCB memory.
aic7xxx: Enabling wide channel of AHA-2940-Wide.
AHA-2940-WIDE (PCI-bus):
irq 9
bus release time 40 bclks
data fifo threshold 100%
SCSI CHANNEL A:
scsi id 7
scsi selection timeout 256 ms
scsi bus reset at power-on enabled
scsi bus parity enabled
scsi bus termination (low byte) enabled
scsi bus termination (high byte) enabled
aic7xxx: Downloading sequencer code...done.
aic7xxx: Resetting the SCSI bus...done.
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 3.2/3.1/3.0
scsi : 1 host.
aic7xxx: Scanning channel A for devices.
aic7xxx: Received MSG_WDTR, Target 0, channel A needwdtr(0xffff).
aic7xxx: Target 0, channel A, now synchronous at 10.0MHz, offset(0xf).
Vendor: CONNER Model: CFP1060S SUN1.05 Rev: 245C
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
aic7xxx: Received MSG_WDTR, Target 1, channel A needwdtr(0xfffe).
aic7xxx: Target 1, channel A, now synchronous at 10.0MHz, offset(0xf).
Vendor: CONNER Model: CFP1060S SUN1.05 Rev: 245C
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdb at scsi0, channel 0, id 1, lun 0
aic7xxx: Target 2, channel A, refusing WIDE negotiation. Using 8 bit transfers.
aic7xxx: Target 2, channel A, now synchronous at 10.0MHz, offset(0xf).
Vendor: CONNER Model: CFP1080S SUN1.05 Rev: 3131
Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdc at scsi0, channel 0, id 2, lun 0
scsi : detected 3 SCSI disks total.
SCSI device sda: hdwr sector= 512 bytes. Sectors= 2070400 [1010 MB] [1.0 GB]
SCSI device sdb: hdwr sector= 512 bytes. Sectors= 2070400 [1010 MB] [1.0 GB]
SCSI device sdc: hdwr sector= 512 bytes. Sectors= 2110812 [1030 MB] [1.0 GB]
PPP: version 2.2.0 (dynamic channel allocation)
TCP compression code copyright 1989 Regents of the University of California
PPP Dynamic channel allocation code copyright 1995 Caldera, Inc.
PPP line discipline registered.
SLIP: version 0.8.4-NET3.019-NEWTTY (dynamic channels, max=256).
CSLIP: code copyright 1989 Regents of the University of California.
eth0: DC21041 at 6100 (PCI bus 0, device 15), h/w address 00:40:05:23:2d:38,
and requires IRQ10 (provided by PCI BIOS).
de4x5.c:v0.42 96/4/26 davies@wanton.lkg.dec.com
Partition check:
sda: sda1 sda2 sda3 sda4 < sda5 sda6 sda7 sda8 >
sdb: sdb1 sdb2 sdb3
sdc: sdc1 sdc2 sdc3 sdc4 < sdc5 sdc6 sdc7 sdc8 >
VFS: Mounted root (ext2 filesystem) readonly.
Adding Swap: 32764k swap-space
Adding Swap: 32764k swap-space
Adding Swap: 32124k swap-space
eth0: media is unconnected!

-- 
-------------------------------------------------------------------------------
|Chuck Mattern            | "That which does not kill us, makes us stronger.."|
|cmattern@mindspring.com  | -Friedrich Nietzsche-                             |
-------------------------------------------------------------------------------