Re: ATA Write Error and Time-out Notification in User Space

From: John Treubig
Date: Tue Jan 03 2006 - 13:28:48 EST


Alan,

Happy new year and many thanks for your help. I took the information from the prior submissions and wanted to see if the Promise sub-system was my failure point. I put a drive on the Secondary IDE bus hanging off the motherboard Nvidia NForce 2 controller, began an access and pulled the plug. Sure enough the failures occured and were passed back to user level, but the system did not hang. I've repeated this a number of times. I moved the same drive to the Promise Controller and the hang occurs. Thus it seems we have proved the Promise sub-system is my problem.

You suggested the something like IORDY hanging was not likely the problem. Given the data, what would you suspect? I've included the message logs for the NForce for comparison.


From: Alan Cox <alan@xxxxxxxxxxxxxxxxxxx>
To: Drew Winstel <raw@xxxxxxxx>
CC: linux-ide@xxxxxxxxxxxxxxx, John Treubig <jtreubig@xxxxxxxxxxx>,linux-kernel@xxxxxxxxxxxxxxx, linux-scsi@xxxxxxxxxxxxxxx
Subject: Re: ATA Write Error and Time-out Notification in User Space
Date: Thu, 22 Dec 2005 01:09:16 +0000

On Maw, 2005-12-20 at 18:31 -0600, Drew Winstel wrote:
> With the application that John is using (namely, it delivers reads and writes
> directly to the drive via various SG ioctls), the file system is not an
> issue, hence wanting the errors to be returned to userspace.
>
> I presume this means that John would have to look at the block level error
> handling as opposed to the SCSI level?

If you are using the sg ioctls then the commands are dispatched and the
results come through the request queues but not the block layer above.

In that case you really shouldn't be seeing a hang.

Alan

-
To unsubscribe from this list: send the line "unsubscribe linux-ide" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html

Jan 3 10:35:43 localhost kernel: [ 1326.485012] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 00 00 00 60 00 44 ce 00
Jan 3 10:35:43 localhost kernel: [ 1326.485018] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.485248] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 00 00 fc 00 44 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.485358] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 80 00 fc 00 44 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.485403] ata_scsi_dump_cdb: CDB (1:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.485407] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.485477] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 00 00 00 60 00 a3 cf 00
Jan 3 10:35:43 localhost kernel: [ 1326.485481] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.485519] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 00 00 fc 00 a3 cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.485589] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 80 00 fc 00 a3 cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.485632] ata_scsi_dump_cdb: CDB (6:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.485635] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.504714] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 00 00 00 60 00 27 cd 00
Jan 3 10:35:43 localhost kernel: [ 1326.504721] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.504939] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 00 00 fc 00 27 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.505048] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 80 00 fc 00 27 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.505093] ata_scsi_dump_cdb: CDB (1:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.505097] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.505168] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 00 00 00 60 00 92 cf 00
Jan 3 10:35:43 localhost kernel: [ 1326.505172] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.505226] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 00 00 fc 00 92 cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.505297] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 80 00 fc 00 92 cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.505340] ata_scsi_dump_cdb: CDB (6:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.505343] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.524300] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 00 00 00 60 00 e5 cd 00
Jan 3 10:35:43 localhost kernel: [ 1326.524307] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.524523] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 00 00 fc 00 e5 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.524633] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 80 00 fc 00 e5 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.524678] ata_scsi_dump_cdb: CDB (1:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.524682] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.524750] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 00 00 00 60 00 44 ce 00
Jan 3 10:35:43 localhost kernel: [ 1326.524754] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.524793] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 00 00 fc 00 44 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.524864] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 80 00 fc 00 44 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.524907] ata_scsi_dump_cdb: CDB (6:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.524910] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.543782] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 00 00 00 60 00 f9 cd 00
Jan 3 10:35:43 localhost kernel: [ 1326.543789] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.544003] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 00 00 fc 00 f9 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.544113] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 80 00 fc 00 f9 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.544176] ata_scsi_dump_cdb: CDB (1:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.544180] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.544251] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 00 00 00 60 00 2a cf 00
Jan 3 10:35:43 localhost kernel: [ 1326.544254] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.544292] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 00 00 fc 00 2a cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.544361] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 80 00 fc 00 2a cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.544404] ata_scsi_dump_cdb: CDB (6:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.544407] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.563356] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 00 00 00 60 00 9c cf 00
Jan 3 10:35:43 localhost kernel: [ 1326.563363] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.563581] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 00 00 fc 00 9c cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.563690] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 80 00 fc 00 9c cf 58
Jan 3 10:35:43 localhost kernel: [ 1326.563736] ata_scsi_dump_cdb: CDB (1:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.563739] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.563809] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 00 00 00 60 00 45 ce 00
Jan 3 10:35:43 localhost kernel: [ 1326.563813] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.563850] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 00 00 fc 00 45 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.563922] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 80 00 fc 00 45 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.563965] ata_scsi_dump_cdb: CDB (6:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.563968] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.582915] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 00 00 00 60 00 9e c9 00
Jan 3 10:35:43 localhost kernel: [ 1326.582922] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.583152] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 00 00 fc 00 9e c9 58
Jan 3 10:35:43 localhost kernel: [ 1326.583264] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 80 00 fc 00 9e c9 58
Jan 3 10:35:43 localhost kernel: [ 1326.583309] ata_scsi_dump_cdb: CDB (1:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.583313] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.583383] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 00 00 00 60 00 67 cd 00
Jan 3 10:35:43 localhost kernel: [ 1326.583387] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.583424] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 00 00 fc 00 67 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.583494] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 80 00 fc 00 67 cd 58
Jan 3 10:35:43 localhost kernel: [ 1326.583537] ata_scsi_dump_cdb: CDB (6:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.583540] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.602537] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 00 00 00 60 00 b1 c4 00
Jan 3 10:35:43 localhost kernel: [ 1326.602544] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.602760] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 00 00 fc 00 b1 c4 58
Jan 3 10:35:43 localhost kernel: [ 1326.602869] ata_scsi_dump_cdb: CDB (1:0,0,0) 12 01 80 00 fc 00 b1 c4 58
Jan 3 10:35:43 localhost kernel: [ 1326.602914] ata_scsi_dump_cdb: CDB (1:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.602918] ata_scsiop_read_cap: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.602987] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 00 00 00 60 00 48 ce 00
Jan 3 10:35:43 localhost kernel: [ 1326.602991] ata_scsiop_inq_std: ENTER
Jan 3 10:35:43 localhost kernel: [ 1326.603028] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 00 00 fc 00 48 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.603114] ata_scsi_dump_cdb: CDB (6:0,0,0) 12 01 80 00 fc 00 48 ce 58
Jan 3 10:35:43 localhost kernel: [ 1326.603157] ata_scsi_dump_cdb: CDB (6:0,0,0) 25 00 00 00 00 00 00 00 00
Jan 3 10:35:43 localhost kernel: [ 1326.603161] ata_scsiop_read_cap: ENTER
Jan 3 10:35:44 localhost kernel: [ 1327.700597] raddio: raddio_open() - file->f_mode=15
Jan 3 10:36:21 localhost kernel: [ 1364.170793] hdc: dma_timer_expiry: dma status == 0x61
Jan 3 10:36:31 localhost kernel: [ 1374.156104] hdc: DMA timeout error
Jan 3 10:36:31 localhost kernel: [ 1374.156114] hdc: dma timeout error: status=0x7f { DriveReady DeviceFault SeekComplete DataRequest CorrectedError Index Error }
Jan 3 10:36:31 localhost kernel: [ 1374.156121] hdc: dma timeout error: error=0x7f { DriveStatusError UncorrectableError SectorIdNotFound TrackZeroNotFound AddrMarkNotFound }, LBAsect=260013951, sector=58557208
Jan 3 10:36:31 localhost kernel: [ 1374.156132] ide: failed opcode was: unknown
Jan 3 10:36:31 localhost kernel: [ 1374.158087] hdc: DMA disabled
Jan 3 10:36:31 localhost kernel: [ 1374.206030] ide1: reset: master: error (0x0a?)
Jan 3 10:37:01 localhost kernel: [ 1404.161969] hdc: lost interrupt
Jan 3 10:37:01 localhost kernel: [ 1404.161976] hdc: set_geometry_intr: status=0x7f { DriveReady DeviceFault SeekComplete DataRequest CorrectedError Index Error }
Jan 3 10:37:01 localhost kernel: [ 1404.161984] hdc: set_geometry_intr: error=0x7f { DriveStatusError UncorrectableError SectorIdNotFound TrackZeroNotFound AddrMarkNotFound }, LBAsect=260013951, sector=58557208
Jan 3 10:37:01 localhost kernel: [ 1404.161994] ide: failed opcode was: unknown
Jan 3 10:37:01 localhost kernel: [ 1404.211892] ide1: reset: master: error (0x0e?)
Jan 3 10:37:01 localhost kernel: [ 1404.211905] end_request: I/O error, dev hdc, sector 58557208
Jan 3 10:37:01 localhost kernel: [ 1404.295241] end_request: I/O error, dev hdc, sector 11436648
Jan 3 10:37:01 localhost kernel: [ 1404.295598] end_request: I/O error, dev hdc, sector 35034528
Jan 3 10:37:01 localhost kernel: [ 1404.295851] end_request: I/O error, dev hdc, sector 71396488
Jan 3 10:37:01 localhost kernel: [ 1404.296074] end_request: I/O error, dev hdc, sector 35035064
Jan 3 10:37:01 localhost kernel: [ 1404.296288] end_request: I/O error, dev hdc, sector 29730312
Jan 3 10:37:01 localhost kernel: [ 1404.296507] end_request: I/O error, dev hdc, sector 18232768
Jan 3 10:37:01 localhost kernel: [ 1404.297303] end_request: I/O error, dev hdc, sector 50093576
Jan 3 10:37:01 localhost kernel: [ 1404.297496] end_request: I/O error, dev hdc, sector 13031360
Jan 3 10:37:01 localhost kernel: [ 1404.297718] end_request: I/O error, dev hdc, sector 4525024
Jan 3 10:37:01 localhost kernel: [ 1404.298029] end_request: I/O error, dev hdc, sector 68605856
Jan 3 10:37:01 localhost kernel: [ 1404.298248] end_request: I/O error, dev hdc, sector 20995192
Jan 3 10:37:01 localhost kernel: [ 1404.298445] end_request: I/O error, dev hdc, sector 41058880
Jan 3 10:37:01 localhost kernel: [ 1404.298632] end_request: I/O error, dev hdc, sector 73172512
Jan 3 10:37:01 localhost kernel: [ 1404.298837] end_request: I/O error, dev hdc, sector 48503288
Jan 3 10:37:01 localhost kernel: [ 1404.299022] end_request: I/O error, dev hdc, sector 46497312
Jan 3 10:37:01 localhost kernel: [ 1404.299167] end_request: I/O error, dev hdc, sector 60576872
Jan 3 10:37:01 localhost kernel: [ 1404.299308] end_request: I/O error, dev hdc, sector 9632736
Jan 3 10:37:01 localhost kernel: [ 1404.299449] end_request: I/O error, dev hdc, sector 40871200
Jan 3 10:37:01 localhost kernel: [ 1404.299627] end_request: I/O error, dev hdc, sector 68270824
Jan 3 10:37:01 localhost kernel: [ 1404.299800] end_request: I/O error, dev hdc, sector 22557072
Jan 3 10:37:01 localhost kernel: [ 1404.299805] Buffer I/O error on device hdc, logical block 2819634
Jan 3 10:37:01 localhost kernel: [ 1404.299807] lost page write due to I/O error on hdc
Jan 3 10:37:01 localhost kernel: [ 1404.299952] end_request: I/O error, dev hdc, sector 58118752
Jan 3 10:37:01 localhost kernel: [ 1404.300100] end_request: I/O error, dev hdc, sector 74922088
Jan 3 10:37:01 localhost kernel: [ 1404.300240] end_request: I/O error, dev hdc, sector 38151592
Jan 3 10:37:01 localhost kernel: [ 1404.300380] end_request: I/O error, dev hdc, sector 76381056
Jan 3 10:37:01 localhost kernel: [ 1404.300520] end_request: I/O error, dev hdc, sector 62933696
Jan 3 10:37:01 localhost kernel: [ 1404.300660] end_request: I/O error, dev hdc, sector 42973464
Jan 3 10:37:01 localhost kernel: [ 1404.300810] end_request: I/O error, dev hdc, sector 37614272
Jan 3 10:37:01 localhost kernel: [ 1404.300950] end_request: I/O error, dev hdc, sector 14795552
Jan 3 10:37:01 localhost kernel: [ 1404.301137] end_request: I/O error, dev hdc, sector 73067064
Jan 3 10:37:01 localhost kernel: [ 1404.301141] Buffer I/O error on device hdc, logical block 9133383
Jan 3 10:37:01 localhost kernel: [ 1404.301143] lost page write due to I/O error on hdc
Jan 3 10:37:01 localhost kernel: [ 1404.301292] end_request: I/O error, dev hdc, sector 68217912
Jan 3 10:37:01 localhost kernel: [ 1404.301442] end_request: I/O error, dev hdc, sector 73352768
Jan 3 10:37:01 localhost kernel: [ 1404.301446] Buffer I/O error on device hdc, logical block 9169096
Jan 3 10:37:01 localhost kernel: [ 1404.301448] lost page write due to I/O error on hdc
Jan 3 10:37:01 localhost kernel: [ 1404.301590] end_request: I/O error, dev hdc, sector 46747304
Jan 3 10:37:01 localhost kernel: [ 1404.301730] end_request: I/O error, dev hdc, sector 65496040
Jan 3 10:37:01 localhost kernel: [ 1404.301881] end_request: I/O error, dev hdc, sector 66611208
Jan 3 10:37:01 localhost kernel: [ 1404.302021] end_request: I/O error, dev hdc, sector 3644328
Jan 3 10:37:01 localhost kernel: [ 1404.302167] end_request: I/O error, dev hdc, sector 17088312
Jan 3 10:37:01 localhost kernel: [ 1404.302316] end_request: I/O error, dev hdc, sector 47087584
Jan 3 10:37:01 localhost kernel: [ 1404.302320] Buffer I/O error on device hdc, logical block 5885948
Jan 3 10:37:01 localhost kernel: [ 1404.302322] lost page write due to I/O error on hdc
Jan 3 10:37:01 localhost kernel: [ 1404.302474] end_request: I/O error, dev hdc, sector 15981504
Jan 3 10:37:01 localhost kernel: [ 1404.302478] Buffer I/O error on device hdc, logical block 1997688
Jan 3 10:37:01 localhost kernel: [ 1404.302480] lost page write due to I/O error on hdc
Jan 3 10:37:01 localhost kernel: [ 1404.302653] end_request: I/O error, dev hdc, sector 70501312
Jan 3 10:37:01 localhost kernel: [ 1404.303509] end_request: I/O error, dev hdc, sector 0
Jan 3 10:37:01 localhost kernel: [ 1404.303512] Buffer I/O error on device hdc, logical block 0
Jan 3 10:56:42 localhost /sbin/hotplug: no runnable /etc/hotplug/vc.agent is installed