Re: [PATCH] scsi_lib.c: continue after MEDIUM_ERROR

From: Ric Wheeler
Date: Wed Jan 31 2007 - 11:35:58 EST




Alan wrote:
When libata reports a MEDIUM_ERROR to us, we *know* it's non-recoverable,
as the drive itself has already done internal retries (libata uses the
"with retry" ATA opcodes for this).

This depends on the firmware. Some of the "raid firmware" drives don't
appear to do retries in firmware.

I think that even for these devices, the actual drives behind the controller will do retries. In any case, it would be reasonable to be able to set this retry/no-retry via /sys to handle exceptional cases...


But meanwhile, we still have the original issue too, where a single stray
bad sector can blow a system out of the water, because the mid-layer
currently aborts everything after it from a large merged request.

Thus the original patch from this thread. :)

Agreed


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/