Re: [PATCH net-next v4 1/8] net: gro: decouple GRO from the NAPI layer

From: Alexander Lobakin
Date: Fri Feb 07 2025 - 07:00:30 EST


From: Eric Dumazet <edumazet@xxxxxxxxxx>
Date: Thu, 6 Feb 2025 19:35:50 +0100

> On Thu, Feb 6, 2025 at 1:15 PM Alexander Lobakin
> <aleksander.lobakin@xxxxxxxxx> wrote:
>>
>> From: Eric Dumazet <edumazet@xxxxxxxxxx>
>> Date: Wed, 5 Feb 2025 18:48:50 +0100
>>
>>> On Wed, Feb 5, 2025 at 5:46 PM Alexander Lobakin
>>> <aleksander.lobakin@xxxxxxxxx> wrote:
>>>>
>>>> In fact, these two are not tied closely to each other. The only
>>>> requirements to GRO are to use it in the BH context and have some
>>>> sane limits on the packet batches, e.g. NAPI has a limit of its
>>>> budget (64/8/etc.).
>>>> Move purely GRO fields into a new tagged group, &gro_node. Embed it
>>>> into &napi_struct and adjust all the references. napi_id doesn't
>>>> really belong to GRO, but:
>>>>
>>>> 1. struct gro_node has a 4-byte padding at the end anyway. If you
>>>> leave napi_id outside, struct napi_struct takes additional 8 bytes
>>>> (u32 napi_id + another 4-byte padding).
>>>> 2. gro_receive_skb() uses it to mark skbs. We don't want to split it
>>>> into two functions or add an `if`, as this would be less efficient,
>>>> but we need it to be NAPI-independent. The current approach doesn't
>>>> change anything for NAPI-backed GROs; for standalone ones (which
>>>> are less important currently), the embedded napi_id will be just
>>>> zero => no-op.
>>>>
>>>> Three Ethernet drivers use napi_gro_flush() not really meant to be
>>>> exported, so move it to <net/gro.h> and add that include there.
>>>> napi_gro_receive() is used in more than 100 drivers, keep it
>>>> in <linux/netdevice.h>.
>>>> This does not make GRO ready to use outside of the NAPI context
>>>> yet.
>>>>
>>>> Tested-by: Daniel Xu <dxu@xxxxxxxxx>
>>>> Acked-by: Jakub Kicinski <kuba@xxxxxxxxxx>
>>>> Reviewed-by: Toke Høiland-Jørgensen <toke@xxxxxxxxxx>
>>>> Signed-off-by: Alexander Lobakin <aleksander.lobakin@xxxxxxxxx>
>>>> ---
>>>> include/linux/netdevice.h | 26 +++++---
>>>> include/net/busy_poll.h | 11 +++-
>>>> include/net/gro.h | 35 +++++++----
>>>> drivers/net/ethernet/brocade/bna/bnad.c | 1 +
>>>> drivers/net/ethernet/cortina/gemini.c | 1 +
>>>> drivers/net/wwan/t7xx/t7xx_hif_dpmaif_rx.c | 1 +
>>>> net/core/dev.c | 60 ++++++++-----------
>>>> net/core/gro.c | 69 +++++++++++-----------
>>>> 8 files changed, 112 insertions(+), 92 deletions(-)
>>>>
>>>> diff --git a/include/linux/netdevice.h b/include/linux/netdevice.h
>>>> index 2a59034a5fa2..d29b6ebde73f 100644
>>>> --- a/include/linux/netdevice.h
>>>> +++ b/include/linux/netdevice.h
>>>> @@ -340,8 +340,8 @@ struct gro_list {
>>>> };
>>>>
>>>> /*
>>>> - * size of gro hash buckets, must less than bit number of
>>>> - * napi_struct::gro_bitmask
>>>> + * size of gro hash buckets, must be <= the number of bits in
>>>> + * gro_node::bitmask
>>>> */
>>>> #define GRO_HASH_BUCKETS 8
>>>>
>>>> @@ -370,7 +370,6 @@ struct napi_struct {
>>>> unsigned long state;
>>>> int weight;
>>>> u32 defer_hard_irqs_count;
>>>> - unsigned long gro_bitmask;
>>>> int (*poll)(struct napi_struct *, int);
>>>> #ifdef CONFIG_NETPOLL
>>>> /* CPU actively polling if netpoll is configured */
>>>> @@ -379,11 +378,14 @@ struct napi_struct {
>>>> /* CPU on which NAPI has been scheduled for processing */
>>>> int list_owner;
>>>> struct net_device *dev;
>>>> - struct gro_list gro_hash[GRO_HASH_BUCKETS];
>>>> struct sk_buff *skb;
>>>> - struct list_head rx_list; /* Pending GRO_NORMAL skbs */
>>>> - int rx_count; /* length of rx_list */
>>>> - unsigned int napi_id; /* protected by netdev_lock */
>>>> + struct_group_tagged(gro_node, gro,
>>>> + unsigned long bitmask;
>>>> + struct gro_list hash[GRO_HASH_BUCKETS];
>>>> + struct list_head rx_list; /* Pending GRO_NORMAL skbs */
>>>> + int rx_count; /* length of rx_list */
>>>> + u32 napi_id; /* protected by netdev_lock */
>>>> +
>>>
>>> I am old school, I would prefer a proper/standalone old C construct.
>>>
>>> struct gro_node {
>>> unsigned long bitmask;
>>> struct gro_list hash[GRO_HASH_BUCKETS];
>>> struct list_head rx_list; /* Pending GRO_NORMAL skbs */
>>> int rx_count; /* length of rx_list */
>>> u32 napi_id; /* protected by netdev_lock */
>>> };
>>>
>>> Really, what struct_group_tagged() can possibly bring here, other than
>>> obfuscation ?
>>
>> You'd need to adjust every ->napi_id access, which is a lot.
>> Plus, as I wrote previously, napi_id doesn't really belong here, but
>> embedding it here eases life.
>>
>> I'm often an old school, too, but sometimes this helps a lot.
>> Unless you have very strong preference on this.
>>
>
> Is struct_group_tagged even supported by ctags ?
>
> In terms of maintenance, I am sorry to say this looks bad to me.
>
> Even without ctags, I find git grep -n "struct xxxx {" quite good.

compile_commands.json (already supported natively by Kbuild) + clangd is
not enough?

Elixir correctly tags struct_group()s.

napi->napi_id is used in a lot of core files and drivers, adjusting all
the references is not what I wanted to do in the series which does
completely different things.

Page Pool uses tagged struct groups, as well a ton of other different
files. Do you want to revert all this and adjust a couple thousand
references only due to ctags and grep?

(instead of just clicking on the references generated by clangd)

Thanks,
Olek