Re: [PATCH 2/9] IB: add a proper completion queue abstraction

From: Bart Van Assche
Date: Mon Nov 23 2015 - 16:04:36 EST

Next message: Toshi Kani: "Re: [PATCH] dax: Split pmd map when fallback on COW"
Previous message: Russell King - ARM Linux: "Re: [RFC/PATCH 2/3] recordmcount: Record locations of __aeabi_{u}idiv() calls on ARM"
In reply to: Jason Gunthorpe: "Re: [PATCH 2/9] IB: add a proper completion queue abstraction"
Next in thread: Jason Gunthorpe: "Re: [PATCH 2/9] IB: add a proper completion queue abstraction"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On 11/23/2015 12:37 PM, Jason Gunthorpe wrote:

On Sat, Nov 14, 2015 at 08:13:44AM +0100, Christoph Hellwig wrote:

On Fri, Nov 13, 2015 at 03:06:36PM -0700, Jason Gunthorpe wrote:

Looking at that thread and then at the patch a bit more..

+void ib_process_cq_direct(struct ib_cq *cq)
[..]
+ __ib_process_cq(cq, INT_MAX);

INT_MAX is not enough, it needs to loop.
This is missing a ib_req_notify also.

No. Direct cases _never_ calls ib_req_notify. Its for the case where
the SRP case polls the send CQ only from the same context it sends for
without any interrupt notification at al.

Hurm, okay, that is not at all what I was thinking this was for..

So the only use of this function is to drain a send cq, in a state
where it is guarenteed no new entries can be added, and only if the cq
is not already event driven. I'd stick those notes in the comment..

Hum. I wonder if this is even a reasonable way to run a ULP. It is
important that rx completions are not used to drive reaping of
resources that are still committed to the send queue. ie do not
trigger send buffer reuse based on a rx completion.

So, if a ULP uses this API, how does it handle the sendq becoming
full? As above, a ULP cannot use recvs to infer available sendq
space. It must directly reap the sendq. So a correct ULP would have to
spin calling ib_process_direct_cq until it makes enough progress to
add more things to the sendq. I don't obviously see that in SRP - so
I'm guessing it has buggered up sendq flow control?

NFS had similar problems lately too, I wrote a long explanation to
Chuck on this subject.

That said, the demand poll almost seems like a reasonable way for a
ULP to run the sendq, do the polls on send occasionally or when more
space is needed to better amortize the reaping overhead at the cost of
send latency. But API wise it needs to be able to switch over to a
sleep if enough progress hasn't been made.

So.. maybe also add to the comment that ib_process_cq_direct is
deprecated and should not be used in new code until SRP gets sorted?

Hello Jason,

Considerable time ago the send queue in the SRP initiator driver was modified from signaled to non-signaled to reduce the number of interrupts triggered by the SRP initiator driver. The SRP initiator driver polls the send queue every time before a SCSI command is sent to the target. I think this is a pattern that is also useful for other ULP's so I'm not convinced that ib_process_cq_direct() should be deprecated :-)

Bart.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Toshi Kani: "Re: [PATCH] dax: Split pmd map when fallback on COW"
Previous message: Russell King - ARM Linux: "Re: [RFC/PATCH 2/3] recordmcount: Record locations of __aeabi_{u}idiv() calls on ARM"
In reply to: Jason Gunthorpe: "Re: [PATCH 2/9] IB: add a proper completion queue abstraction"
Next in thread: Jason Gunthorpe: "Re: [PATCH 2/9] IB: add a proper completion queue abstraction"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]