Re: [PATCH] i386 No-Idle-Hz aka Dynamic-Ticks 3
From: Tony Lindgren
Date: Mon Aug 08 2005 - 10:20:56 EST
* Srivatsa Vaddagiri <vatsa@xxxxxxxxxx> [050808 07:53]:
> On Mon, Aug 08, 2005 at 12:26:01AM -0700, Tony Lindgren wrote:
> > Good point, and it would be nice to have it resolved for systems that support
> > idling individual CPUs. The current setup was done because when I was tinkering
> > with the amd76x_pm patch a while a back, I noticed that idling the cpu
> > disconnects all cpus from the bus. (As far as I remember)
> > So this may need to be configured depending on the system.
>
> What do you mean by "idling" a cpu? Is it reprogramming the timer source
> to skip ticks? Or putting the cpu actually in some low-power state?
> And what do you mean by "disconnect all CPUs from the bus"? Is it
> the system bus? And what happens if the CPUs are disconnected from
> the bus like this while it is executing?
As far as I remember enabling AMD stop grant disconnects all cpus. This
means the system won't be able to do any work until the dyntick timer
interrupt wakes up the system.
> Both requirements (idling all CPUs together vs individually) I think
> will make the patch more complex. Are such systems (which require having to
> idle all CPUs together) pretty common that we have to care about?!
Probably all AMD SMP based systems? Somebody with better knowledge should
verify this.
> > > - All CPUs seem to cut off the same number of ticks (dyn_tick->skip). Isn't
> > > this wrong, considering that the timer list is per-CPU? This will cause
> > > some timers to be serviced much later than usual.
> >
> > Yes if it's done on per-CPU basis. In the current setup the first interrupt
> > will kick the system off the dyn-tick state and the timers get checked again.
>
> But that may be too late on some CPUs. If dyn_tick->skip = 100, all
> CPUs skip 100 ticks. However some CPUs may have timers that need to be
> service much before that.
Not in the current case, as the system is completely idle until some
interrupt wakes up the system. Of course it would be different if you make
it per-CPU.
> > > - The fact that dyn_tick_state is global and accessed from all CPUs
> > > is probably a scalability concern, especially if we allow the ticks
> > > to be cut off on per-CPU basis.
> >
> > >From idling devices point of view, we still need some global variable I
> > believe. How else would you be able to tell all devices that the whole
> > system does not have any timers for next 2 seconds?
>
> Why would anyone want to know that? Moreover, when we enable this
> per-CPU, the no-timer-interval is also distributed across CPUs.
Well we need to be able to do various things in the idle loop depending on
the length of the estimated sleep. For example, if next_timer_interrupt is
2 jiffies away, we cannot do much. But if next_timer_interrupt is 2 seconds
away, we can idle pretty much all devices.
> > But in any case on P4 systems the APIC timer is not the bottleneck as
> > stopping or reprogramming PIT also kills APIC. (This does not happen on P3
> > systems). So the bottleneck most likely is the length of PIT.
>
> On these systems, do you disabled APIC (dyntick=noapic)?
Yeah. It only seems to work on P3 systems.
> > HRT + VST depend on APIC only, and does not use next_timer_interrupt().
>
> AFAIK, it supports pm timer too and yes it has its own implementation
> of next_timer_interrupt, which makes it more work for merging.
> At this point, looks like dyn-tick is gaining more momentum and is
> more likely a candidate for merging. I will send out my SMP-support
> changes to dynamic tick soon.
Cool :) Sounds like SMP and HPET are about the only things that still need
work. Or can somebody think of anything else?
> > You may also want to check out the ARM implementation as it does not have
> > the issues listed above, which are mostly x86 specific issues.
>
> Thanks for the pointer. Will look at it.
OK
Tony
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/