Re: [PATCH RFC 3/3] nvme: delay failover by command quiesce timeout

From: Randy Jennings
Date: Tue Apr 15 2025 - 19:57:54 EST

Next message: yohan.joung: "Re: [f2fs-dev] [PATCH v1] f2fs: Improve large section GC by locating valid block segments"
Previous message: Vishal Moola (Oracle): "Re: [PATCH 4/5] mm/vmalloc: optimize function vm_unmap_aliases()"
In reply to: Sagi Grimberg: "Re: [PATCH RFC 3/3] nvme: delay failover by command quiesce timeout"
Next in thread: Daniel Wagner: "Re: [PATCH RFC 3/3] nvme: delay failover by command quiesce timeout"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Tue, Apr 15, 2025 at 4:35 PM Sagi Grimberg <sagi@xxxxxxxxxxx> wrote:
>
>
> >> What I meant was that the user can no longer set kato to be arbitrarily
> >> long when we
> >> now introduce failover dependency on it.
> >>
> >> We need to set a sane maximum value that will failover in a reasonable
> >> timeframe.
> >> In other words, kato cannot be allowed to be set by the user to 60
> >> minutes. While we didn't
> >> care about it before, now it means that failover may take 60+ minutes.
> >>
> >> Hence, my request to set kato to a max absolute value of seconds. My
> >> vote was 10 (2x of the default),
> >> but we can also go with 30.
> > Adding a maximum value for KATO makes a lot of sense to me. This will
> > help keep us away from a hung task timeout when the full delay is
> > taken into account. 30 makes sense to me from the perspective that
> > the maximum should be long enough to handle non-ideal situations
> > functionally, but not a value that you expect people to use regularly.
> >
> > I think CQT should have a maximum allowed value for similar reasons.
> > If we do clamp down on the CQT, we could be opening ourselves to the
> > target not completely cleaning up, but it keeps us from a hung task
> > timeout, and _any_ delay will help most of the time.
>
> CQT comes from the controller, and if it is high, it effectively means
> that the
> controller cannot handle faster failover reliably. So I think we should
> leave it
> as is. It is the vendor problem.
Okay, that is one way to approach it. However, because of the hung
task issue, we would be allowing the vendor to panic the initiator
with a hung task. Until CCR, and without implementing other checks
(for events which might not happen), this hung task would happen on
every messy disconnect with that vendor/array.

Sincerely,
Randy Jennings

Next message: yohan.joung: "Re: [f2fs-dev] [PATCH v1] f2fs: Improve large section GC by locating valid block segments"
Previous message: Vishal Moola (Oracle): "Re: [PATCH 4/5] mm/vmalloc: optimize function vm_unmap_aliases()"
In reply to: Sagi Grimberg: "Re: [PATCH RFC 3/3] nvme: delay failover by command quiesce timeout"
Next in thread: Daniel Wagner: "Re: [PATCH RFC 3/3] nvme: delay failover by command quiesce timeout"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]