çå: çå: [PATCH][v2] page_pool: handle page recycle for NUMA_NO_NODE condition

From: Li,Rongqing
Date: Mon Dec 16 2019 - 06:14:43 EST




> -----éäåä-----
> åää: Ilias Apalodimas [mailto:ilias.apalodimas@xxxxxxxxxx]
> åéæé: 2019å12æ16æ 18:17
> æää: Li,Rongqing <lirongqing@xxxxxxxxx>
> æé: Yunsheng Lin <linyunsheng@xxxxxxxxxx>; Jesper Dangaard Brouer
> <brouer@xxxxxxxxxx>; Saeed Mahameed <saeedm@xxxxxxxxxxxx>;
> jonathan.lemon@xxxxxxxxx; netdev@xxxxxxxxxxxxxxx; mhocko@xxxxxxxxxx;
> peterz@xxxxxxxxxxxxx; Greg Kroah-Hartman <gregkh@xxxxxxxxxxxxxxxxxxx>;
> bhelgaas@xxxxxxxxxx; linux-kernel@xxxxxxxxxxxxxxx; BjÃrn TÃpel
> <bjorn.topel@xxxxxxxxx>
> äé: Re: çå: [PATCH][v2] page_pool: handle page recycle for
> NUMA_NO_NODE condition
>
> > > >
> > > > Simply clearing the pool->alloc.cache when calling
> > > > page_pool_update_nid() seems better.
> > > >
> > >
> > > How about the below codes, the driver can configure p.nid to any, which will
> be adjusted in NAPI polling, irq migration will not be problem, but it will add a
> check into hot path.
> >
> > We'll have to check the impact on some high speed (i.e 100gbit)
> > interface between doing anything like that. Saeed's current patch runs
> > once per NAPI. This runs once per packet. The load might be measurable.
> > The READ_ONCE is needed in case all producers/consumers run on the
> > same CPU
>
> I meant different cpus!
>

If no READ_ONCE, pool->p.nid will be always written and become dirty although it is unshared by multiple cpus

See Eric' patch:

https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=503978aca46124cd714703e180b9c8292ba50ba7

-Li
> > right?
> >
> >
> > Thanks
> > /Ilias
> > >
> > > diff --git a/net/core/page_pool.c b/net/core/page_pool.c index
> > > a6aefe989043..4374a6239d17 100644
> > > --- a/net/core/page_pool.c
> > > +++ b/net/core/page_pool.c
> > > @@ -108,6 +108,10 @@ static struct page
> *__page_pool_get_cached(struct page_pool *pool)
> > > if (likely(pool->alloc.count)) {
> > > /* Fast-path */
> > > page =
> > > pool->alloc.cache[--pool->alloc.count];
> > > +
> > > + if (unlikely(READ_ONCE(pool->p.nid) !=
> numa_mem_id()))
> > > + WRITE_ONCE(pool->p.nid,
> > > + numa_mem_id());
> > > +
> > > return page;
> > > }
> > > refill = true;
> > > @@ -155,6 +159,10 @@ static struct page
> *__page_pool_alloc_pages_slow(struct page_pool *pool,
> > > if (pool->p.order)
> > > gfp |= __GFP_COMP;
> > >
> > > +
> > > + if (unlikely(READ_ONCE(pool->p.nid) != numa_mem_id()))
> > > + WRITE_ONCE(pool->p.nid, numa_mem_id());
> > > +
> > > /* FUTURE development:
> > > *
> > > * Current slow-path essentially falls back to single page
> > > Thanks
> > >
> > > -Li
> > > > >
> > >