Re: [PATCH] mm: cache largest vma

From: Davidlohr Bueso
Date: Sun Nov 03 2013 - 23:04:33 EST

Next message: Davidlohr Bueso: "Re: [PATCH] mm: cache largest vma"
Previous message: Yuanhan Liu: "Re: [PATCH 0/4] per anon_vma lock and turn anon_vma rwsem lock torwlock_t"
In reply to: Linus Torvalds: "Re: [PATCH] mm: cache largest vma"
Next in thread: Ingo Molnar: "Re: [PATCH] mm: cache largest vma"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Sun, 2013-11-03 at 10:51 -0800, Linus Torvalds wrote:
> Ugh. This patch makes me angry. It looks way too ad-hoc.
>
> I can well imagine that our current one-entry cache is crap and could
> be improved, but this looks too random.

Indeed, my approach is random *because* I wanted to keep things as
simple and low overhead as possible. Caching the largest VMA is probably
as least invasive and as low as overhead as you can get in find_vma().

> Different code for the
> CONFIG_MMU case? Same name, but for non-MMU it's a single entry, for
> MMU it's an array? And the whole "largest" just looks odd. Plus why do
> you set LAST_USED if you also set LARGEST?
>
> Did you try just a two- or four-entry pseudo-LRU instead, with a
> per-thread index for "last hit"? Or even possibly a small fixed-size
> hash table (say "idx = (add >> 10) & 3" or something)?
>
> And what happens for threaded models? Maybe we'd be much better off
> making the cache be per-thread, and the flushing of the cache would be
> a sequence number that has to match (so "vma_clear_cache()" ends up
> just incrementing a 64-bit sequence number in the mm)?

I will look into doing the vma cache per thread instead of mm (I hadn't
really looked at the problem like this) as well as Ingo's suggestion on
the weighted LRU approach. However, having seen that we can cheaply and
easily reach around ~70% hit rate in a lot of workloads, makes me wonder
how good is good enough?

> Basically, my complaints boil down to "too random" and "too
> specialized", and I can see (and you already comment on) this patch
> being grown with even *more* ad-hoc random new cases (LAST, LARGEST,
> MOST_USED - what's next?). And while I don't know if we should worry
> about the threaded case, I do get the feeling that this ad-hoc
> approach is guaranteed to never work for that, which makes me feel
> that it's not just ad-hoc, it's also fundamentally limited.
>
> I can see us merging this patch, but I would really like to hear that
> we do so because other cleaner approaches don't work well. In
> particular, pseudo-LRU tends to be successful (and cheap) for caches.

OK, will report back with comparisons, hopefully I'll have a better
picture by then.

Thanks,
Davidlohr

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Davidlohr Bueso: "Re: [PATCH] mm: cache largest vma"
Previous message: Yuanhan Liu: "Re: [PATCH 0/4] per anon_vma lock and turn anon_vma rwsem lock torwlock_t"
In reply to: Linus Torvalds: "Re: [PATCH] mm: cache largest vma"
Next in thread: Ingo Molnar: "Re: [PATCH] mm: cache largest vma"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]