Re: sata_promise disk error 2.6.22-rc5 with hrt1 patch

From: Mikael Pettersson
Date: Sun Jun 24 2007 - 17:31:01 EST


(cc: linux-ide added)

On Sun, 24 Jun 2007 20:49:05 +0200, otto Meier wrote:
> I got the following in my logs:
>
> Jun 24 19:06:40 gate2 kernel: ata4.00: exception Emask 0x0 SAct 0x0 SErr
> 0x0 action 0x0
> Jun 24 19:06:40 gate2 kernel: ata4.00: (port_status 0x00001000)
> Jun 24 19:06:40 gate2 kernel: ata4.00: cmd
> 35/00:00:c1:17:92/00:04:17:00:00/e0 tag 0 cdb 0x0 data 524288 out
> Jun 24 19:06:40 gate2 kernel: res
> 50/00:00:c0:1b:92/00:00:17:00:00/e0 Emask 0x20 (host bus error)
> Jun 24 19:06:40 gate2 kernel: ata4.00: ata_hpa_resize 1: sectors =
> 488397168, hpa_sectors = 488397168
> Jun 24 19:06:40 gate2 kernel: ata4.00: ata_hpa_resize 1: sectors =
> 488397168, hpa_sectors = 488397168
> Jun 24 19:06:40 gate2 kernel: ata4.00: configured for UDMA/133
> Jun 24 19:06:40 gate2 kernel: ata4: EH complete
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] 488397168 512-byte
> hardware sectors (250059 MB)
> Jun 24 19:06:40 gate2 kernel: ata4.00: exception Emask 0x0 SAct 0x0 SErr
> 0x0 action 0x0
> Jun 24 19:06:40 gate2 kernel: ata4.00: (port_status 0x00001000)
> Jun 24 19:06:40 gate2 kernel: ata4.00: cmd
> 25/00:00:c1:15:92/00:02:17:00:00/e0 tag 0 cdb 0x0 data 262144 in
> Jun 24 19:06:40 gate2 kernel: res
> 50/00:00:c0:17:92/00:00:17:00:00/e0 Emask 0x20 (host bus error)
> Jun 24 19:06:40 gate2 kernel: ata4.00: ata_hpa_resize 1: sectors =
> 488397168, hpa_sectors = 488397168
> Jun 24 19:06:40 gate2 kernel: ata4.00: ata_hpa_resize 1: sectors =
> 488397168, hpa_sectors = 488397168
> Jun 24 19:06:40 gate2 kernel: ata4.00: configured for UDMA/133
> Jun 24 19:06:40 gate2 kernel: ata4: EH complete
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] Write Protect is off
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] Write cache: enabled,
> read cache: enabled, doesn't support DPO or FUA
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] 488397168 512-byte
> hardware sectors (250059 MB)
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] Write Protect is off
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] Mode Sense: 00 3a 00 00
> Jun 24 19:06:40 gate2 kernel: sd 3:0:0:0: [sdd] Write cache: enabled,
> read cache: enabled, doesn't support DPO or FUA
>
> The system continued without seeing other effects.
>
> It is runnuning a 2.6.22-rc5 kernel with hrt1 patch.
> The drive ist part of a raid5 with 4 identical drives
>
> lspci -vvv gives:
>
> 02:06.0 Mass storage controller: Promise Technology, Inc. PDC40718 (SATA
> 300 TX4) (rev 02)

The 300 TX4 model is causing transient errors for several people,
and we don't yet know why. In your case, port_status 0x00001000
means that "host bus is busy more than 256 clock cycles for every
ATA I/O transfer" (quoting the docs). Basically it's a timeout.

As long as it's transient, don't worry.

You may want to try without hrt1 to see if hrt1 is causing the issue.

/Mikael
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/