Re: [PATCH 2/4 v2] libata: Implement disk shock protection support

From: Elias Oltmanns
Date: Fri Sep 19 2008 - 10:07:30 EST


Tejun Heo <htejun@xxxxxxxxx> wrote:
> Hello, Elias.
>
> Elias Oltmanns wrote:
[...]
>> so I'm just resending the whole patch. Please let me know what
>> you think.
>>
>> @@ -2830,6 +2878,51 @@ int ata_eh_recover(struct ata_port *ap, ata_prereset_fn_t prereset,
>> }
>> }
>>
>> + do {
>> + unsigned long now;
>> +
>> + ata_eh_pull_park_action(ap);
>> + deadline = jiffies;
>> + ata_port_for_each_link(link, ap) {
>> + ata_link_for_each_dev(dev, link) {
>> + struct ata_eh_context *ehc = &link->eh_context;
>> + unsigned long tmp;
>> +
>> + if (dev->class != ATA_DEV_ATA)
>> + continue;
>> + if (!(ehc->i.dev_action[dev->devno] &
>> + ATA_EH_PARK))
>> + continue;
>> + tmp = dev->unpark_deadline;
>> + if (time_before(deadline, tmp))
>> + deadline = tmp;
>> + else if (time_before_eq(tmp, jiffies))
>> + continue;
>> + if (ehc->unloaded_mask & (1 << dev->devno))
>> + continue;
>> +
>> + ata_eh_park_issue_cmd(dev, 1);
>> + }
>> + }
>> +
>> + now = jiffies;
>> + if (time_before_eq(deadline, now))
>> + break;
>> +
>> + deadline = wait_for_completion_timeout(&ap->park_req_pending,
>> + deadline - now);
>> + } while (deadline);
>
> This should basically work but completion isn't really designed for
> this type of continuous events where single consumption should clear
> all pending events. INIT_COMPLETION comes close but it doesn't lock,
> so can't guarantee anything. What's necessary is the counterpart for
> complete_all() for the wait.

You are right that it isn't designed for this use case and my approach
is somewhat hackish. Still, it really does exactly what we want. Please
note that ap->park_req_pending is protected by the host lock; the call
to complete() is atomic wrt the setting of ATA_EH_PARK for one of the
devices on the port and so is the call to INIT_COMPLETION() wrt clearing
ATA_EH_PARK requests for *all* devices on the port.

>
> Well, anyways, I think the issue is slightly out of scope for this
> patch and the only side effect is possibly looping over the do {}
> while () block several times unnecessarily on certain cases, so I
> think just noting about it should be enough for now.
>
> Can you please add explanation above wait_for_complete_timeout() that
> all done counts should be cleared here but aren't and as a result the
> loop might repeat determinate number of times unnecessarily and resend
> as proper patch?

Well, we don't really care about the done count after
wait_for_completion_timeout() has returned. All that matters is that the
done counter is cleared when all ATA_EH_PARK actions have been pulled in
which happens at the start of each cycle over the loop.

Perhaps I should add comments to this effect before
wait_for_completion_timeout() as well as INIT_COMPLETION()?

Regards,

Elias
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/