Re: Hang in wait_on_inode with SMP 2.1.87

Steve Hsieh (steveh@eecs.umich.edu)
Sun, 22 Feb 1998 14:38:16 -0500 (EST)


On Sat, 21 Feb 1998, Carsten Gross wrote:

> On Sat, 21 Feb 1998, Bill Hawes wrote:
> > [...] get trashed somehow, so it would be very helpful if you could
> > determine which filesystem's inode is getting stuck. Did this problem just
> > start recently?
>
> The problem occurs only with newer 2.1.x kernels and is (unfortunality) not
> reproducable. 2.0.3x runs solid on my system. Typically heavy disc activity
> seems to provoke the hang (Large copies with lots of files, activity from
> different processes and so on)

I think I have a similar problem, I believe starting around 2.1.8x.
If there's heavy disk activity, whatever process is involved gets
stuck, and I can't kill it. Unlike Carsten, though, it is repeatable
-- if I do a 'cp -a /usr /mnt' where a different drive partition is
mounted in /mnt, cp will hang.

ps -axlw shows cp as such:

100 0 190 181 0 0 2632 1844 wait_on_pa D p0 0:26 cp -a /usr /mnt

My /var/log/messages file also has the following messages:

Feb 21 12:13:49 ord kernel: (scsi0:0:0) No active SCB for reconnecting target - Issuing BUS DEVICE RESET.
Feb 21 12:13:49 ord kernel: SAVED_TCL=0x0, ARG_1=0x0, SEQADDR=0x114
Feb 21 12:13:49 ord kernel: (scsi0:0:0) Reset device, active_scb 0
Feb 21 12:13:49 ord kernel: scsi0: (targ 0/chan A) matching scb to (targ 0/chan A)
Feb 21 12:13:49 ord last message repeated 2 times
Feb 21 12:13:49 ord kernel: scsi0: (targ 0/chan A) matching scb to (targ 1/chan A)
Feb 21 12:13:49 ord kernel: scsi0: (targ 0/chan A) matching scb to (targ 0/chan A)
Feb 21 12:13:49 ord kernel: scsi0: (targ 0/chan A) matching scb to (targ 1/chan A)
Feb 21 12:13:49 ord kernel: (scsi0:0:0) Bus Device Reset delivered, 1 SCBs aborted.
Feb 21 12:13:49 ord kernel: (scsi0:0:0) Aborting scb 0
Feb 21 12:13:49 ord kernel: (scsi0:0:0) Reset device, active_scb 0
Feb 21 12:13:49 ord kernel: scsi0: (targ 0/chan A) matching scb to (targ 0/chan A)
Feb 21 12:13:49 ord kernel: scsi0: (targ 0/chan A) matching scb to (targ 1/chan A)
Feb 21 12:13:49 ord kernel: scsi0: (targ 0/chan A) matching scb to (targ 1/chan A)
Feb 21 12:13:49 ord kernel: (scsi0:0:0) Bus Device Reset delivered, 1 SCBs aborted.
Feb 21 12:13:49 ord kernel: (scsi0:0:0) Aborting scb 0
Feb 22 14:20:08 ord kernel: scsi0: CMDCMPLT without command for SCB 2, QOUTCNT 0, QINCNT 0, SCB flags 0x0, cmd 0xc001d400

My system configuration is

Dell Quad PPro running SMP 2.1.87/88, 256MB

aic7xxx: <Adaptec AIC-7880 Ultra SCSI host adapter> at PCI 11
aic7xxx: BIOS enabled, IO Port 0xec00, IO Mem 0xfc4ff000, IRQ 11, Revision B
aic7xxx: Wide Channel, SCSI ID 7, 16/16 SCBs, QFull 16, QMask 0x1f
aic7xxx: <Adaptec AIC-7880 Ultra SCSI host adapter> at PCI 12
aic7xxx: BIOS enabled, IO Port 0xe800, IO Mem 0xfc4fe000, IRQ 10, Revision B
aic7xxx: Wide Channel, SCSI ID 7, 16/16 SCBs, QFull 16, QMask 0x1f
scsi0 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 4.1/3.2
scsi1 : Adaptec AHA274x/284x/294x (EISA/VLB/PCI-Fast SCSI) 4.1/3.2
scsi : 2 hosts.

eepro100.c:v0.36 10/20/97 Donald Becker linux-eepro100@cesdis.gsfc.nasa.gov
eth0: Intel EtherExpress Pro 10/100 at 0xff80, 00:A0:C9:98:67:A8, IRQ 10.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.rutgers.edu