Re: [PATCH] ixgbe: let the xdpdrv work with more than 64 cpus

From: Jason Xing
Date: Wed Aug 25 2021 - 07:59:51 EST


On Tue, Aug 24, 2021 at 11:48 PM Maciej Fijalkowski
<maciej.fijalkowski@xxxxxxxxx> wrote:
>
> On Tue, Aug 24, 2021 at 11:23:29PM +0800, Jason Xing wrote:
> > On Tue, Aug 24, 2021 at 9:32 PM Jesper Dangaard Brouer
> > <jbrouer@xxxxxxxxxx> wrote:
> > >
> > >
> > >
> > > On 24/08/2021 12.49, kerneljasonxing@xxxxxxxxx wrote:
> > > > From: Jason Xing <xingwanli@xxxxxxxxxxxx>
> > > >
> > > > Originally, ixgbe driver doesn't allow the mounting of xdpdrv if the
> > > > server is equipped with more than 64 cpus online. So it turns out that
> > > > the loading of xdpdrv causes the "NOMEM" failure.
> > > >
> > > > Actually, we can adjust the algorithm and then make it work, which has
> > > > no harm at all, only if we set the maxmium number of xdp queues.
> > >
> > > This is not true, it can cause harm, because XDP transmission queues are
> > > used without locking. See drivers ndo_xdp_xmit function ixgbe_xdp_xmit().
> > > As driver assumption is that each CPU have its own XDP TX-queue.
>
> Thanks Jesper for chiming in.
>
> > >
> >
> > Point taken. I indeed miss that part which would cause bad behavior if it
> > happens.
> >
> > At this point, I think I should find all the allocation and use of XDP
> > related, say,
> > queues and rings, then adjust them all?
> >
> > Let's say if the server is shipped with 128 cpus, we could map 128 cpus to 64
> > rings in the function ixgbe_xdp_xmit(). However, it sounds a little bit odd.
> >
> > Do you think that it makes any sense?
>
> We need a fallback path for ixgbe. I did the following for ice:
> https://x-lore.kernel.org/bpf/20210819120004.34392-9-maciej.fijalkowski@xxxxxxxxx/T/#u
>

Thanks. I'm ready to send the v2 patch. Please help me review the next
submission.

Jason

> >
> > Thanks,
> > Jason
> >
> > > This patch is not a proper fix.
> > >
> > > I do think we need a proper fix for this issue on ixgbe.
> > >
> > >
> > > > Fixes: 33fdc82f08 ("ixgbe: add support for XDP_TX action")
> > > > Co-developed-by: Shujin Li <lishujin@xxxxxxxxxxxx>
> > > > Signed-off-by: Shujin Li <lishujin@xxxxxxxxxxxx>
> > > > Signed-off-by: Jason Xing <xingwanli@xxxxxxxxxxxx>
> > > > ---
> > > > drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c | 2 +-
> > > > drivers/net/ethernet/intel/ixgbe/ixgbe_main.c | 3 ---
> > > > 2 files changed, 1 insertion(+), 4 deletions(-)
> > > >
> > > > diff --git a/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c b/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c
> > > > index 0218f6c..5953996 100644
> > > > --- a/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c
> > > > +++ b/drivers/net/ethernet/intel/ixgbe/ixgbe_lib.c
> > > > @@ -299,7 +299,7 @@ static void ixgbe_cache_ring_register(struct ixgbe_adapter *adapter)
> > > >
> > > > static int ixgbe_xdp_queues(struct ixgbe_adapter *adapter)
> > > > {
> > > > - return adapter->xdp_prog ? nr_cpu_ids : 0;
> > > > + return adapter->xdp_prog ? min_t(int, MAX_XDP_QUEUES, nr_cpu_ids) : 0;
> > > > }
> > > >
> > > > #define IXGBE_RSS_64Q_MASK 0x3F
> > > > diff --git a/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c b/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c
> > > > index 14aea40..b36d16b 100644
> > > > --- a/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c
> > > > +++ b/drivers/net/ethernet/intel/ixgbe/ixgbe_main.c
> > > > @@ -10130,9 +10130,6 @@ static int ixgbe_xdp_setup(struct net_device *dev, struct bpf_prog *prog)
> > > > return -EINVAL;
> > > > }
> > > >
> > > > - if (nr_cpu_ids > MAX_XDP_QUEUES)
> > > > - return -ENOMEM;
> > > > -
> > > > old_prog = xchg(&adapter->xdp_prog, prog);
> > > > need_reset = (!!prog != !!old_prog);
> > > >
> > > >
> > >