Re: [PATCH v4 21/28] cxl/extent: Process DCD events and realize region extents

From: Fan Ni
Date: Fri Oct 11 2024 - 15:15:04 EST


On Thu, Oct 10, 2024 at 03:50:14PM +0100, Jonathan Cameron wrote:
> On Wed, 9 Oct 2024 14:49:09 -0500
> Ira Weiny <ira.weiny@xxxxxxxxx> wrote:
>
> > Li, Ming4 wrote:
> > > On 10/8/2024 7:16 AM, ira.weiny@xxxxxxxxx wrote:
> > > > From: Navneet Singh <navneet.singh@xxxxxxxxx>
> > > >
> >
> > [snip]
> >
> > > >
> > > > Signed-off-by: Navneet Singh <navneet.singh@xxxxxxxxx>
> > > > Co-developed-by: Ira Weiny <ira.weiny@xxxxxxxxx>
> > > > Signed-off-by: Ira Weiny <ira.weiny@xxxxxxxxx>
> > > >
> > > Hi Ira,
> > >
> > > I guess you missed my comments for V3, I comment it again for this patch.
> >
> > Apologies. Yes I totally missed your reply. :-(
> >
> > >
> > > > +static bool extents_contain(struct cxl_dax_region *cxlr_dax,
> > > > + struct cxl_endpoint_decoder *cxled,
> > > > + struct range *new_range)
> > > > +{
> > > > + struct device *extent_device;
> > > > + struct match_data md = {
> > > > + .cxled = cxled,
> > > > + .new_range = new_range,
> > > > + };
> > > > +
> > > > + extent_device = device_find_child(&cxlr_dax->dev, &md, match_contains);
> > > > + if (!extent_device)
> > > > + return false;
> > > > +
> > > > + put_device(extent_device);
> > > could use __free(put_device) to drop this 'put_device(extent_device)'
> >
> > Yep.
> >
> > > > + return true;
> > > > +}
> > > [...]
> > > > +static bool extents_overlap(struct cxl_dax_region *cxlr_dax,
> > > > + struct cxl_endpoint_decoder *cxled,
> > > > + struct range *new_range)
> > > > +{
> > > > + struct device *extent_device;
> > > > + struct match_data md = {
> > > > + .cxled = cxled,
> > > > + .new_range = new_range,
> > > > + };
> > > > +
> > > > + extent_device = device_find_child(&cxlr_dax->dev, &md, match_overlaps);
> > > > + if (!extent_device)
> > > > + return false;
> > > > +
> > > > + put_device(extent_device);
> > > Same as above.
> >
> > Done.
> >
> > > > + return true;
> > > > +}
> > > > +
> > > [...]
> > > > +static int cxl_send_dc_response(struct cxl_memdev_state *mds, int opcode,
> > > > + struct xarray *extent_array, int cnt)
> > > > +{
> > > > + struct cxl_mailbox *cxl_mbox = &mds->cxlds.cxl_mbox;
> > > > + struct cxl_mbox_dc_response *p;
> > > > + struct cxl_mbox_cmd mbox_cmd;
> > > > + struct cxl_extent *extent;
> > > > + unsigned long index;
> > > > + u32 pl_index;
> > > > + int rc;
> > > > +
> > > > + size_t pl_size = struct_size(p, extent_list, cnt);
> > > > + u32 max_extents = cnt;
> > > > +
> > > > + /* May have to use more bit on response. */
> > > > + if (pl_size > cxl_mbox->payload_size) {
> > > > + max_extents = (cxl_mbox->payload_size - sizeof(*p)) /
> > > > + sizeof(struct updated_extent_list);
> > > > + pl_size = struct_size(p, extent_list, max_extents);
> > > > + }
> > > > +
> > > > + struct cxl_mbox_dc_response *response __free(kfree) =
> > > > + kzalloc(pl_size, GFP_KERNEL);
> > > > + if (!response)
> > > > + return -ENOMEM;
> > > > +
> > > > + pl_index = 0;
> > > > + xa_for_each(extent_array, index, extent) {
> > > > +
> > > > + response->extent_list[pl_index].dpa_start = extent->start_dpa;
> > > > + response->extent_list[pl_index].length = extent->length;
> > > > + pl_index++;
> > > > + response->extent_list_size = cpu_to_le32(pl_index);
> > > > +
> > > > + if (pl_index == max_extents) {
> > > > + mbox_cmd = (struct cxl_mbox_cmd) {
> > > > + .opcode = opcode,
> > > > + .size_in = struct_size(response, extent_list,
> > > > + pl_index),
> > > > + .payload_in = response,
> > > > + };
> > > > +
> > > > + response->flags = 0;
> > > > + if (pl_index < cnt)
> > > > + response->flags &= CXL_DCD_EVENT_MORE;
> > >
> > > It should be 'response->flags |= CXL_DCD_EVENT_MORE' here.
> >
> > Ah yea. Good catch.
> >
> > >
> > > Another issue is if 'cnt' is N times bigger than 'max_extents'(e,g. cnt=20, max_extents=10). all responses will be sent in this xa_for_each(), and CXL_DCD_EVENT_MORE will be set in the last response but it should not be set in these cases.
> > >
> >
> > Ah yes. cnt must be decremented. As I looked at the patch just now the
> >
> > if (cnt == 0 || pl_index)
> >
> > ... seemed very wrong to me. That change masked this bug.
> >
> > This should fix it:
> >
> > diff --git a/drivers/cxl/core/mbox.c b/drivers/cxl/core/mbox.c
> > index d66beec687a0..99200274dea8 100644
> > --- a/drivers/cxl/core/mbox.c
> > +++ b/drivers/cxl/core/mbox.c
> > @@ -1119,10 +1119,11 @@ static int cxl_send_dc_response(struct cxl_memdev_state *mds, int opcode,
> > if (rc)
> > return rc;
> > pl_index = 0;
> > + cnt -= pl_index;
> > }
> > }
> >
> > - if (cnt == 0 || pl_index) {
>
> I thought this cnt == 0 check was to deal with the no valid
> extents case where an empty reply is needed.

Agreed. Based on current code logic, there are two cases that cnt == 0:
1. no extent is accepted so cnt is passed as 0;
2. cnt was decreased to 0 and response has already been sent.

For case 1, we still need to send a response with zero extents;
For case 2, we do not need to handle.

Fan

>
>
> > + if (pl_index) {
> > mbox_cmd = (struct cxl_mbox_cmd) {
> > .opcode = opcode,
> > .size_in = struct_size(response, extent_list,
> >
> >
> > Thank you, and sorry again for missing your feedback.
> >
> > Ira
> >
> > [snip]
> >
>

--
Fan Ni