Re: [RFC] sched_tick with interrupts enabled

From: Christoph Lameter
Date: Thu Oct 19 2006 - 11:51:28 EST


On Wed, 18 Oct 2006, Siddha, Suresh B wrote:

> On Wed, Oct 18, 2006 at 02:59:07PM -0700, Christoph Lameter wrote:
> > load_balancing has the potential of running for some time if f.e.
> > sched_domains for a system with 1024 processors have to be balanced.
> > We currently do all of that with interrupts disabled. So we may be unable
> > to service interrupts for some time. Most of that time is potentially
> > spend in rebalance_tick.
>
> Did you see an issue because of this or just theoretical?

Yes we actually had some livelocks when things got really busy and even
missed timer interrupts. We have another workaround by enabling/disabling
interrupts in the timer tick on IA64 now but it would be good
to address this issue in a general way.

> > +static void rebalance_tick(unsigned long dummy)
> > {
> > + int this_cpu = smp_processor_id();
> > + struct rq *this_rq = cpu_rq(this_cpu);
> > + enum idle_type idle;
> > unsigned long this_load, interval, j = cpu_offset(this_cpu);
> > struct sched_domain *sd;
> > int i, scale;
> >
> > + idle = (current == this_rq->idle) ? SCHED_IDLE : NOT_IDLE;
>
> We need to add nr_running check too, to determine if we want to do
> idle/not-idle balancing. This is in the context of wake_priority_sleeper()

Ok. Thanks. Would this work?

Index: linux-2.6.19-rc2-mm1/kernel/sched.c
===================================================================
--- linux-2.6.19-rc2-mm1.orig/kernel/sched.c 2006-10-19 09:39:08.000000000 -0500
+++ linux-2.6.19-rc2-mm1/kernel/sched.c 2006-10-19 09:42:10.733631242 -0500
@@ -2846,7 +2846,8 @@ static void rebalance_tick(unsigned long
struct sched_domain *sd;
int i, scale;

- idle = (current == this_rq->idle) ? SCHED_IDLE : NOT_IDLE;
+ idle = (current == this_rq->idle && !this_rq->nr_running) ?
+ SCHED_IDLE : NOT_IDLE;
this_load = this_rq->raw_weighted_load;

/* Update our load: */
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/