Re: [patch 00/10] [RFC] SLUB patches for more functionality,performance and maintenance
From: Christoph Lameter
Date: Mon Aug 13 2007 - 18:28:47 EST
On Mon, 13 Aug 2007, Mathieu Desnoyers wrote:
> > @@ -1554,23 +1564,20 @@ static void __always_inline *slab_alloc(
> > gfp_t gfpflags, int node, void *addr)
> > {
> > void **object;
> > - unsigned long flags;
> > struct kmem_cache_cpu *c;
> >
>
> What if we prefetch c->freelist here ? I see in this diff that the other
> code just reads it sooner as a condition for the if().
Not sure as to what this may bring. If you read it earlier then you may
get the wrong value and then may have to refetch the cacheline.
We cannot fetch c->freelist without determining c. I can remove the
check for c->page == page so that the fetch of c->freelist comes
immeidately after detemination of c. But that does not change performance.
> > - c->freelist = object;
> > - } else
> > - __slab_free(s, page, x, addr, c->offset);
> > +redo:
> > + freelist = c->freelist;
>
> I suspect this smp_rmb() may be the cause of a major slowdown.
> Therefore, I think we should try taking a copy of c->page and simply
> check if it has changed right after the cmpxchg_local:
Thought so too and I removed that smp_rmb and tested this modification
on UP again without any performance gains. I think the cacheline fetches
dominates the execution thread here and cmpxchg does not bring us
anything.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/