Re: [rfc] balance-on-fork NUMA placement
From: Nick Piggin
Date: Thu Aug 02 2007 - 23:14:40 EST
On Thu, Aug 02, 2007 at 06:34:04PM -0700, Christoph Lameter wrote:
> On Fri, 3 Aug 2007, Nick Piggin wrote:
>
> > Yeah it only gets set if the parent is initially using a default policy
> > at this stage (and then is restored afterwards of course).
>
> Uggh. Looks like more hackery ahead. I think this cannot be done in the
> desired clean way until we have some revving of the memory policy
> subsystem that makes policies task context independent so that you can do
Well what's wrong with it? It seems to use memory policies for exactly
what they are intended (aside from it being kernel directed...).
> alloc_pages(...., memory_policy)
That still doesn't completely help because again it would require modifying
call sites (at which point I could just do alloc_pages_node).
> The cleanest solution that I can think of at this point is certainly to
> switch to another processor and do the allocation and copying actions from
> there. We have the migration process context right? Can that be used to
> start the new thread and can the original processor wait on some flag
> until that is complete?
I guess you could, but that is going to add a context switch to fork
(although it usually already has one in single-CPU situation because we
run child first)... I bet it will slow something down, but it would be
interesting to see.
I don't know the fork path well enough off the top of my head to know if
it will be that simple (with error handling etc). But I think it could
be done.
> Forking off from there not only places the data correctly but it also
> warms up the caches for the new process and avoids evicting cacheline on
> the original processor.
Yeah, you might be right there. If the numbers say that approach is
better, then I'd not be against it. But we'd still need the simpler
mpol approach to compare it with.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/