aic79xx trouble
From: Bernd Schubert
Date: Thu May 13 2004 - 15:30:09 EST
Hello,
we are just in the process of setting up a new server, which will serve the
data of an IDE/SCSI raid system (transtec 5008). Some partions of this raid
device are also mirrored via drbd to a failover system. During a full resync
of all (3) failover partitions *from* the failover server, the main-server
first logs many scsi errors and later the access to the raid-partitions
completely locks up.
Below is some relevant dmesg output, I already enabled the verbose option for
the aic79xx driver. Should I also enable debugging, if so, which mode?
Any help is highly appreciated.
Thanks in advance,
Bernd
SCSI subsystem driver Revision: 1.00
ahd_pci:2:6:1: Reading VPD from SEEPROM...ahd_pci:2:6:1: VPD parsing
successful
ahd_pci:2:6:1: Reading SEEPROM...done.
ahd_pci:2:6:1: STPWLEVEL is on
ahd_pci:2:6:1: Manual Primary Termination
ahd_pci:2:6:1: Manual Secondary Termination
ahd_pci:2:6:1: Primary High byte termination Enabled
ahd_pci:2:6:1: Primary Low byte termination Enabled
ahd_pci:2:6:1: Secondary High byte termination Disabled
ahd_pci:2:6:1: Secondary Low byte termination Disabled
ahd_pci:2:6:1: Downloading Sequencer Program... 656 instructions downloaded
ahd_pci:2:6:1: Features 0x1c101, Bugs 0x700002, Flags 0x43f0
ahd_pci:2:6:0: Reading VPD from SEEPROM...ahd_pci:2:6:0: VPD parsing
successful
ahd_pci:2:6:0: Reading SEEPROM...done.
ahd_pci:2:6:0: STPWLEVEL is on
ahd_pci:2:6:0: Manual Primary Termination
ahd_pci:2:6:0: Manual Secondary Termination
ahd_pci:2:6:0: Primary High byte termination Enabled
ahd_pci:2:6:0: Primary Low byte termination Enabled
ahd_pci:2:6:0: Secondary High byte termination Disabled
ahd_pci:2:6:0: Secondary Low byte termination Disabled
ahd_pci:2:6:0: Downloading Sequencer Program... 656 instructions downloaded
ahd_pci:2:6:0: Features 0x1c101, Bugs 0x700002, Flags 0x43f1
scsi0 : Adaptec AIC79XX PCI-X SCSI HBA DRIVER, Rev 1.3.10
<Adaptec AIC7902 Ultra320 SCSI adapter>
aic7902: Ultra320 Wide Channel A, SCSI Id=7, PCI-X 67-100Mhz, 512 SCBs
scsi1 : Adaptec AIC79XX PCI-X SCSI HBA DRIVER, Rev 1.3.10
<Adaptec AIC7902 Ultra320 SCSI adapter>
aic7902: Ultra320 Wide Channel B, SCSI Id=7, PCI-X 67-100Mhz, 512 SCBs
blk: queue f7961e18, I/O limit 4095Mb (mask 0xffffffff)
scsi0:A:0:0: DV failed to configure device. Please file a bug report against
this driver.
(scsi0:A:0:0): Sending PPR bus_width 1, period 9, offset 7f, ppr_options 2
(scsi0:A:0:0): Received PPR width 1, period 9, offset 1f,options 2
Filtered to width 1, period 9, offset 1f, options 2
(scsi0:A:0): 6.600MB/s transfers (16bit)
scsi0: target 0 using 16bit transfers
(scsi0:A:0): 160.000MB/s transfers (80.000MHz DT, 16bit)
scsi0: target 0 synchronous with period = 0x9, offset = 0x1f(DT)
Vendor: transtec Model: Rev: 0001
Type: Direct-Access ANSI SCSI revision: 03
blk: queue f7961c18, I/O limit 4095Mb (mask 0xffffffff)
(scsi0:A:0): 160.000MB/s transfers (80.000MHz DT, 16bit)
scsi0:A:0:0: Tagged Queuing enabled. Depth 32
Attached scsi disk sda at scsi0, channel 0, id 0, lun 0
SCSI device sda: 4101521408 512-byte hdwr sectors (2099979 MB)
sda: sda1 sda2 sda3 < sda5 sda6 sda7 sda8 >
drbd: initialised. Version: 0.6.12 (api:64/proto:62)
drbd0: Connection established. size=24410736 KB / blksize=4096 B
drbd1: Connection established. size=19535008 KB / blksize=4096 B
drbd1: Synchronisation started blks=15
drbd2: Connection established. size=195760026 KB / blksize=4096 B
drbd2: Synchronisation started blks=15
scsi0:0:0:0: Attempting to abort cmd f78fce00: 0x28 0x0 0x5 0xd1 0x2a 0x89 0x0
0x0 0x78 0x0
scsi0: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi0: Dumping Card State at program address 0x25 Mode 0x11
Card was paused
HS_MAILBOX[0x0] INTCTL[0xc0]:(SWTMINTEN|SWTMINTMASK)
SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x33]:(CURRFIFO_NONE|FIFO0FREE|
FIFO1FREE)
SCSISIGI[0x0]:(P_DATAOUT) SCSIPHASE[0x0] SCSIBUS[0x0]
LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0]
SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x0] SEQINTCTL[0x0]
SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED) SEQ_FLAGS2[0x0]
SSTAT0[0x0] SSTAT1[0x8]:(BUSFREE) SSTAT2[0x0] SSTAT3[0x0]
PERRDIAG[0x0] SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
LQOSTAT1[0x0] LQOSTAT2[0x0]
SCB Count = 32 CMDS_PENDING = 2 LASTSCB 0xffff CURRSCB 0x8 NEXTSCB 0x0
qinstart = 53192 qinfifonext = 53192
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
5 FIFO_USE[0x0] SCB_CONTROL[0x64]:(DISCONNECTED|TAG_ENB|DISCENB)
SCB_SCSIID[0x7]
11 FIFO_USE[0x0] SCB_CONTROL[0x64]:(DISCONNECTED|TAG_ENB|DISCENB)
SCB_SCSIID[0x7]
Total 2
Kernel Free SCB list: 8 25 23 24 6 13 18 26 20 30 22 9 19 29 17 12 14 28 21 3
15 27 0 10 2 31 1 16 7 4
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
scsi0: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|
ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
scsi0: FIFO1 Free, LONGJMP == 0x81d7, SCB 0x8
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|
ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x4]:(DIRECTION) DFSTATUS[0x89]:(FIFOEMP|HDONE|
PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0
scsi0: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x52
scsi0: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
CCSCBCTL[0x0]
scsi0: REG0 == 0x10, SINDEX = 0x1e0, DINDEX = 0xe1
scsi0: SCBPTR == 0x8, SCB_NEXT == 0xff00, SCB_NEXT2 == 0xff97
CDB 2a 0 1 80 1 7a
STACK: 0x13 0x0 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:0:0): 0 waiting
(scsi0:A:0:0): Device is disconnected, re-queuing SCB
Recovery code sleeping
(scsi0:A:0:0): Abort Tag Message Sent
(scsi0:A:0:0): SCB 11 - Abort Completed.
Recovery SCB completes
found == 0x1
Recovery code awake
scsi0:0:0:0: Attempting to abort cmd f78fd000: 0x28 0x0 0x3 0x7c 0x71 0x99 0x0
0x0 0x78 0x0
scsi0:0:0:0: Command not found
scsi0:A:0: no active SCB for reconnecting target - issuing BUS DEVICE RESET
SAVED_SCSIID == 0x7, SAVED_LUN == 0x0, REG0 == 0xff00 ACCUM = 0x0
SEQ_FLAGS == 0xc0, SCBPTR == 0xb, BTT == 0xff00, SINDEX == 0x104
SELID == 0x0, SCB_SCSIID == 0x7, SCB_LUN == 0x0, SCB_CONTROL == 0x68
SCSIBUS[0] == 0xb, SCSISIGI == 0xe6
SXFRCTL0 == 0x88
SEQCTL0 == 0x0
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi0: Dumping Card State at program address 0x14a Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0xc0]:(SWTMINTEN|SWTMINTMASK)
SEQINTSTAT[0x0] SAVED_MODE[0x11] DFFSTAT[0x33]:(CURRFIFO_NONE|FIFO0FREE|
FIFO1FREE)
SCSISIGI[0xe6]:(P_MESGIN|REQI|BSYI) SCSIPHASE[0x8]:(MSG_IN_PHASE)
SCSIBUS[0xb] LASTPHASE[0xe0]:(P_MESGIN) SCSISEQ0[0x0]
SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x0] SEQINTCTL[0x0]
SEQ_FLAGS[0xc0]:(NO_CDB_SENT|NOT_IDENTIFIED) SEQ_FLAGS2[0x0]
SSTAT0[0x2]:(SPIORDY) SSTAT1[0x9]:(REQINIT|BUSFREE)
SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0x0] SIMODE1[0xac]:(ENSCSIPERR|ENBUSFREE|
ENSCSIRST|ENSELTIMO)
LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
LQOSTAT1[0x0] LQOSTAT2[0x0]
SCB Count = 32 CMDS_PENDING = 0 LASTSCB 0xffff CURRSCB 0xb NEXTSCB 0x0
qinstart = 53195 qinfifonext = 53195
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
Total 0
Kernel Free SCB list: 11 5 8 25 23 24 6 13 18 26 20 30 22 9 19 29 17 12 14 28
21 3 15 27 0 10 2 31 1 16 7 4
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
scsi0: FIFO0 Free, LONGJMP == 0x80ff, SCB 0x0
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|
ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
scsi0: FIFO1 Free, LONGJMP == 0x81d7, SCB 0xb
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|
ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
LQIN: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
0x0 0x0
scsi0: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x52
scsi0: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x0
SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
CCSCBCTL[0x0]
scsi0: REG0 == 0xff00, SINDEX = 0x104, DINDEX = 0xa9
scsi0: SCBPTR == 0xb, SCB_NEXT == 0xff00, SCB_NEXT2 == 0xff97
CDB 28 0 5 80 a9 53
STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
DevQ(0:0:0): 0 waiting
scsi0:A:0:0: Target did not send an IDENTIFY message. LASTPHASE = 0x60.
(scsi0:A:0): 80.000MB/s transfers (80.000MHz DT)
scsi0: target 0 using 8bit transfers
(scsi0:A:0): 3.300MB/s transfers
scsi0: target 0 using asynchronous transfers
scsi0: target 1 using 8bit transfers
scsi0: target 1 using asynchronous transfers
scsi0: target 2 using 8bit transfers
scsi0: target 2 using asynchronous transfers
scsi0: target 3 using 8bit transfers
scsi0: target 3 using asynchronous transfers
scsi0: target 4 using 8bit transfers
scsi0: target 4 using asynchronous transfers
scsi0: target 5 using 8bit transfers
scsi0: target 5 using asynchronous transfers
scsi0: target 6 using 8bit transfers
scsi0: target 6 using asynchronous transfers
scsi0: target 8 using 8bit transfers
scsi0: target 8 using asynchronous transfers
scsi0: target 9 using 8bit transfers
scsi0: target 9 using asynchronous transfers
scsi0: target 10 using 8bit transfers
scsi0: target 10 using asynchronous transfers
scsi0: target 11 using 8bit transfers
scsi0: target 11 using asynchronous transfers
scsi0: target 12 using 8bit transfers
scsi0: target 12 using asynchronous transfers
scsi0: target 13 using 8bit transfers
scsi0: target 13 using asynchronous transfers
scsi0: target 14 using 8bit transfers
scsi0: target 14 using asynchronous transfers
scsi0: target 15 using 8bit transfers
scsi0: target 15 using asynchronous transfers
scsi0: Issued Channel A Bus Reset. 0 SCBs aborted
scsi0: target 0 using 8bit transfers
scsi0: target 0 using asynchronous transfers
scsi0: SCSI bus reset delivered. 0 SCBs aborted.
scsi0:A:0:0: DV failed to configure device. Please file a bug report against
this driver.
(scsi0:A:0:0): Sending PPR bus_width 1, period 9, offset 7f, ppr_options 2
(scsi0:A:0:0): Received PPR width 1, period 9, offset 1f,options 2
Filtered to width 1, period 9, offset 1f, options 2
(scsi0:A:0): 6.600MB/s transfers (16bit)
scsi0: target 0 using 16bit transfers
(scsi0:A:0): 160.000MB/s transfers (80.000MHz DT, 16bit)
scsi0: target 0 synchronous with period = 0x9, offset = 0x1f(DT)
--
Bernd Schubert
Physikalisch Chemisches Institut / Theoretische Chemie
Universität Heidelberg
INF 229
69120 Heidelberg
e-mail: bernd.schubert@xxxxxxxxxxxxxxxxxxxxx
Attachment:
pgp00000.pgp
Description: signature