Re: [patch] HT scheduler, sched-2.5.68-A9

From: Ingo Molnar (mingo@redhat.com)
Date: Tue Apr 22 2003 - 05:34:45 EST

Next message: Arjan van de Ven: "Re: 2.5.67-mm4 & IRQ balancing"
Previous message: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
In reply to: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
Next in thread: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
Reply: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
Reply: Bill Davidsen: "Re: several messages"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Tue, 22 Apr 2003, Rick Lindsley wrote:

> Ingo, several questions.
>
> What makes this statement:
>
> * At this point it's sure that we have a SMT
> * imbalance: this (physical) CPU is idle but
> * another CPU has two (or more) tasks running.
>
> true? Do you mean "this cpu/sibling set are all idle but another
> cpu/sibling set are all non-idle"? [...]

yes, precisely.

> [...] Are we assuming that because both a physical processor and its
> sibling are not idle, that it is better to move a task from the sibling
> to a physical processor? In other words, we are presuming that the case
> where the task on the physical processor and the task(s) on the
> sibling(s) are actually benefitting from the relationship is rare?

yes. This 'un-sharing' of contexts happens unconditionally, whenever we
notice the situation. (ie. whenever a CPU goes completely idle and notices
an overloaded physical CPU.) On the HT system i have i have measure this
to be a beneficial move even for the most trivial things like infinite
loop-counting.

the more per-logical-CPU cache a given SMT implementation has, the less
beneficial this move becomes - in that case the system should rather be
set up as a NUMA topology and scheduled via the NUMA scheduler.

> * We wake up one of the migration threads (it
> * doesnt matter which one) and let it fix things up:
>
> So although a migration thread normally pulls tasks to it, we've altered
> migration_thread now so that when cpu_active_balance() is set for its
> cpu, it will go find a cpu/sibling set in which all siblings are busy
> (knowing it has a good chance of finding one), decrement nr_running in
> the runqueue it has found, call load_balance() on the queue which is
> idle, and hope that load_balance will again find the busy queue (the
> same queue as the migration thread's) and decide to move a task over?

yes.

> whew. So why are we perverting the migration thread to push rather than
> pull? If active_load_balance() finds a imbalance, why must we use such
> indirection? Why decrement nr_running? Couldn't we put together a
> migration_req_t for the target queue's migration thread?

i'm not sure what you mean by perverting the migration thread to push
rather to pull, as migration threads always push - it's not different in
this case either. Since the task in question is running in an
un-cooperative way at the moment of active-balancing, that CPU needs to
run the high-prio migration thread, which pushes the task to the proper
CPU after that point. [if the push is still necessary.]

we could use a migration_req_t for this, in theory, but active balancing
is independent of ->cpus_allowed, so some special code would still be
needed. Also, active balancing is non-queued by nature. Is there a big
difference?

> Making the migration thread TASK_UNINTERRUPTIBLE has the nasty side
> effect of artificially raising the load average. Why is this changed?

agreed, this is an oversight, i fixed it in my tree.

Ingo

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Arjan van de Ven: "Re: 2.5.67-mm4 & IRQ balancing"
Previous message: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
In reply to: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
Next in thread: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
Reply: Rick Lindsley: "Re: [patch] HT scheduler, sched-2.5.68-A9"
Reply: Bill Davidsen: "Re: several messages"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Wed Apr 23 2003 - 22:00:32 EST