Yeah, the draining is done before the cpu is offline. But the drain isIf the erroneous IO is still in-flight from blk-mq perspective, then how canI mentioned before that if no hctx->cpumask is online then we don't needNo, it is really not OK, if all cpus in hctx->cpumask are offline, you
to allocate a request. That is because if no hctx->cpumask is online,
this means that original erroneous IO must be completed due to nature of
how blk-mq cpu hotplug handler works, i.e. drained, and then we don't
actually need to abort it any longer, so ok to not get a request.
can't allocate
request on the specified hw queue, then the erroneous IO can't be handled,
then cpu hotplug handler may hang for ever.
hctx->cpumask still be offline? I thought that we guarantee that
hctx->cpumask cannot go offline until drained.
simply waiting for the inflight IO to be completed. If the IO is failed
during the waiting, you can't allocate such reserved request for error
handling, then hang ever in blk_mq_hctx_notify_offline().
If you just make it one driver private command, there can't be such
issue.
Block layer is supposed for handling common case(normal io and pt io),
I'd suggest to not put such special cases into block layer.