AIC and scsi: aborting command due to timeout

From: Vladimir Ivaschenko (vladimir@jfs-cy.com)
Date: Mon Jan 17 2000 - 03:38:24 EST


Hello,

Recently I got two Pentium III-600 servers (RedHat 6.1) and have been
experiencing problems with SCSI. Both of them have AHA Ultra2
SCSI adapters installed on motherboard (kernel reports them as
AIC-7896). The problem is sometimes (once a week or two) the kernel
falls into continuos loop of messages like

scsi: aborting command due to timeout: pid 3912514, scsi 0, channel 0,
id 0, lun 0 Red (10) [hex numbers here]

and servers hangs (it seems it can't do any IO operations anymore).
Both servers have two ST39175LW 9G HDDs installed and a SEAGATE DDS2 DAT
drive.

Anyway I attached dmesg and /proc/pci to this message.

The problems occured on 2.2.13 kernels, I noticed that 2.2.14 did some
updates to the AIC driver and yesterday I have upgraded to it - will it
help?

Please CC the answers to my email, as I'm not subscribed to the list.

Thanks,
Vladimir Ivaschenko

PCI devices found:
  Bus 0, device 0, function 0:
    Host bridge: Intel Unknown device (rev 0).
      Vendor id=8086. Device id=71a0.
      Medium devsel. Master Capable. Latency=64.
      Prefetchable 32 bit memory at 0xf8000000 [0xf8000008].
  Bus 0, device 1, function 0:
    PCI bridge: Intel Unknown device (rev 0).
      Vendor id=8086. Device id=71a1.
      Medium devsel. Master Capable. Latency=64. Min Gnt=132.
  Bus 0, device 12, function 0:
    SCSI storage controller: Adaptec AIC-7896/7 (rev 0).
      Medium devsel. Fast back-to-back capable. BIST capable. IRQ 11. Master Capable. Latency=64. Min Gnt=39.Max Lat=25.
      I/O at 0x1400 [0x1401].
      Non-prefetchable 64 bit memory at 0xf4100000 [0xf4100004].
  Bus 0, device 12, function 1:
    SCSI storage controller: Adaptec AIC-7896/7 (rev 0).
      Medium devsel. Fast back-to-back capable. BIST capable. IRQ 11. Master Capable. Latency=64. Min Gnt=39.Max Lat=25.
      I/O at 0x1800 [0x1801].
      Non-prefetchable 64 bit memory at 0xf4101000 [0xf4101004].
  Bus 0, device 14, function 0:
    Ethernet controller: Intel 82557 (rev 8).
      Medium devsel. Fast back-to-back capable. IRQ 10. Master Capable. Latency=64. Min Gnt=8.Max Lat=56.
      Non-prefetchable 32 bit memory at 0xf4102000 [0xf4102000].
      I/O at 0x1080 [0x1081].
      Non-prefetchable 32 bit memory at 0xf4000000 [0xf4000000].
  Bus 0, device 18, function 0:
    ISA bridge: Intel 82371AB PIIX4 ISA (rev 2).
      Medium devsel. Fast back-to-back capable. Master Capable. No bursts.
  Bus 0, device 18, function 1:
    IDE interface: Intel 82371AB PIIX4 IDE (rev 1).
      Medium devsel. Fast back-to-back capable. Master Capable. Latency=64.
      I/O at 0x1050 [0x1051].
  Bus 0, device 18, function 2:
    USB Controller: Intel 82371AB PIIX4 USB (rev 1).
      Medium devsel. Fast back-to-back capable. IRQ 10. Master Capable. Latency=64.
      I/O at 0x1060 [0x1061].
  Bus 0, device 18, function 3:
    Bridge: Intel 82371AB PIIX4 ACPI (rev 2).
      Medium devsel. Fast back-to-back capable.
  Bus 0, device 20, function 0:
    VGA compatible controller: Cirrus Logic GD 5480 (rev 35).
      Medium devsel. Master Capable. Latency=64. Min Gnt=2.Max Lat=10.
      Prefetchable 32 bit memory at 0xf6000000 [0xf6000008].
      Non-prefetchable 32 bit memory at 0xf4103000 [0xf4103000].
  Bus 1, device 15, function 0:
    PCI bridge: DEC Unknown device (rev 6).
      Vendor id=1011. Device id=23.
      Medium devsel. Fast back-to-back capable. Master Capable. Latency=240. Min Gnt=6.

Linux version 2.2.14 (root@zyon.jfs-cy.com) (gcc version egcs-2.91.66 19990314/Linux (egcs-1.1.2 release)) #1 Mon Jan 17 10:14:14 EET 2000
Detected 596924418 Hz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 594.74 BogoMIPS
Memory: 127360k/131008k available (1520k kernel code, 416k reserved, 1652k data, 60k init)
Dentry hash table entries: 16384 (order 5, 128k)
Buffer cache hash table entries: 131072 (order 7, 512k)
Page cache hash table entries: 32768 (order 5, 128k)
Pentium-III serial number disabled.
CPU: Intel Pentium III (Katmai) stepping 03
Checking 386/387 coupling... OK, FPU using exception 16 error reporting.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.35a (19990819) Richard Gooch (rgooch@atnf.csiro.au)
PCI: PCI BIOS revision 2.10 entry at 0xfdab0
PCI: Using configuration type 1
PCI: Probing PCI hardware
Linux NET4.0 for Linux 2.2
Based upon Swansea University Computer Society NET3.039
NET4: Unix domain sockets 1.0 for Linux NET4.0.
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
TCP: Hash tables configured (ehash 131072 bhash 65536)
Linux IP multicast router 0.06 plus PIM-SM
Initializing RT netlink socket
Starting kswapd v 1.5
Detected PS/2 Mouse Port.
Serial driver version 4.27 with MANY_PORTS SHARE_IRQ enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
pty: 256 Unix98 ptys configured
Real Time Clock Driver v1.09
PIIX4: IDE controller on PCI bus 00 dev 91
PIIX4: not 100% native mode: will probe irqs later
    ide0: BM-DMA at 0x1050-0x1057, BIOS settings: hda:DMA, hdb:pio
    ide1: BM-DMA at 0x1058-0x105f, BIOS settings: hdc:pio, hdd:pio
hda: ATAPI 48X CDROM, ATAPI CDROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hda: ATAPI 48X CD-ROM drive, 128kB Cache
Uniform CDROM driver Revision: 2.56
Floppy drive(s): fd0 is 1.44M
FDC 0 is a National Semiconductor PC87306
md driver 0.90.0 MAX_MD_DEVS=256, MAX_REAL=12
translucent personality registered
linear personality registered
raid0 personality registered
raid1 personality registered
raid5 personality registered
raid5: measuring checksumming speed
raid5: MMX detected, trying high-speed MMX checksum routines
   pII_mmx : 1328.928 MB/sec
   p5_mmx : 1395.603 MB/sec
   8regs : 1024.890 MB/sec
   32regs : 574.929 MB/sec
using fastest function: p5_mmx (1395.603 MB/sec)
(scsi0) <Adaptec AIC-7896/7 Ultra2 SCSI host adapter> found at PCI 12/0
(scsi0) Wide Channel A, SCSI ID=7, 32/255 SCBs
(scsi0) Downloading sequencer code... 385 instructions downloaded
(scsi1) <Adaptec AIC-7896/7 Ultra2 SCSI host adapter> found at PCI 12/1
(scsi1) Wide Channel B, SCSI ID=7, 32/255 SCBs
(scsi1) Downloading sequencer code... 385 instructions downloaded
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.22/3.2.4
       <Adaptec AIC-7896/7 Ultra2 SCSI host adapter>
scsi1 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 5.1.22/3.2.4
       <Adaptec AIC-7896/7 Ultra2 SCSI host adapter>
scsi : 2 hosts.
(scsi0:0:0:0) Synchronous at 40.0 Mbyte/sec, offset 15.
  Vendor: SEAGATE Model: ST39175LW Rev: 0001
  Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sda at scsi0, channel 0, id 0, lun 0
(scsi0:0:1:0) Synchronous at 40.0 Mbyte/sec, offset 15.
  Vendor: SEAGATE Model: ST39175LW Rev: 0001
  Type: Direct-Access ANSI SCSI revision: 02
Detected scsi disk sdb at scsi0, channel 0, id 1, lun 0
(scsi0:0:2:0) Synchronous at 6.67 Mbyte/sec, offset 15.
  Vendor: ARCHIVE Model: Python 04687-XXX Rev: 6610
  Type: Sequential-Access ANSI SCSI revision: 02
Detected scsi tape st0 at scsi0, channel 0, id 2, lun 0
scsi : detected 1 SCSI tape 2 SCSI disks total.
SCSI device sda: hdwr sector= 512 bytes. Sectors= 17783240 [8683 MB] [8.7 GB]
SCSI device sdb: hdwr sector= 512 bytes. Sectors= 17783240 [8683 MB] [8.7 GB]
PPP: version 2.3.7 (demand dialling)
TCP compression code copyright 1989 Regents of the University of California
PPP line discipline registered.
SLIP: version 0.8.4-NET3.019-NEWTTY (dynamic channels, max=256).
eth0: Intel EtherExpress Pro 10/100 at 0x1080, 00:90:27:8D:46:D2, IRQ 10.
  Board assembly 000000-000, Physical connectors present: RJ45
  Primary interface chip i82555 PHY #1.
  General self-test: passed.
  Serial sub-system self-test: passed.
  Internal registers self-test: passed.
  ROM checksum self-test: passed (0x04f4518b).
  Receiver lock-up workaround activated.
IP-Config: Incomplete network configuration information.
Partition check:
 sda: sda1 sda2 < sda5 sda6 sda7 >
 sdb: sdb1 sdb2 < sdb5 sdb6 sdb7 sdb8 sdb9 >
md.c: sizeof(mdp_super_t) = 4096
autodetecting RAID arrays
(read) sda6's sb offset: 4200896 [events: 00000069]
(read) sdb1's sb offset: 4200896 [events: 00000069]
autorun ...
considering sdb1 ...
  adding sdb1 ...
  adding sda6 ...
created md0
bind<sda6,1>
bind<sdb1,2>
running: <sdb1><sda6>
now!
sdb1's event counter: 00000069
sda6's event counter: 00000069
md0: max total readahead window set to 128k
md0: 1 data-disks, max readahead per data-disk: 128k
raid1: device sdb1 operational as mirror 1
raid1: device sda6 operational as mirror 0
(checking disk 0)
(really checking disk 0)
(checking disk 1)
(really checking disk 1)
(checking disk 2)
(checking disk 3)
(checking disk 4)
(checking disk 5)
(checking disk 6)
(checking disk 7)
(checking disk 8)
(checking disk 9)
(checking disk 10)
(checking disk 11)
raid1: raid set md0 active with 2 out of 2 mirrors
md: updating md0 RAID superblock on device
sdb1 [events: 0000006a](write) sdb1's sb offset: 4200896
sda6 [events: 0000006a](write) sda6's sb offset: 4200896
.
... autorun DONE.
Coda Kernel/Venus communications, v4.6.0, braam@cs.cmu.edu
NTFS version 990411
VFS: Mounted root (ext2 filesystem) readonly.
Freeing unused kernel memory: 60k freed
Adding Swap: 530104k swap-space (priority -1)
VFS: Disk change detected on device fd(2,0)

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu
Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Sun Jan 23 2000 - 21:00:15 EST