Re: [PATCH] ARM: Don't ever downscale loops_per_jiffy in SMP systems

From: Nicolas Pitre
Date: Thu May 08 2014 - 12:04:43 EST


On Thu, 8 May 2014, Doug Anderson wrote:

> 1. Initially CPU1 and CPU2 at 200MHz. Pretend loops_per_jiffy is 1000.
>
> 2. CPU1 starts a delay. It reads global lpj (1000) and sets up its
> local registers up for the loop.
>
> 3. At the same time, CPU2 is transitioning the system to 2000MHz.
> Right after CPU1 reads lpj CPU2 stores it as 10000.
>
> 4. Now CPU1 and CPU2 are running at 2000MHz but CPU1 is only looping
> 1000 times. It will complete too fast.
>
> ...you could possibly try to account for this in the delay loop code
> (being careful to handle all of the corner cases and races). ...or we
> could make the delay loop super conservative and suggest that people
> should be using a real timer.

I don't see how you can possibly solve this issue without a timer based
delay. Even if you scale the loop count in only one direction, it will
still have this problem even though the window for the race would happen
much less often. Yet having a delay which is way longer than expected
might cause problems in some cases.

Yet clock frequency changes with the kind of magnitude you give in your
example are usually not instantaneous. Given udelay should be used for
very short delays, it is likely that CPU1 will complete its count before
the higher clock frequency is effective in most cases.

> How exactly do you do this in a generic way? I know that our systems
> don't always boot up at full speed. The HP Chromebook 11 might boot
> up at 900MHz and stay that way for a while until the battery gets
> enough juice. The Samsung Chromebook 2 will boot up at 1.8GHz
> although some of them can go to 1.9GHz and others to 2.0GHz. Waiting
> to actually see the cpufreq transition is a safe way, though it does
> end up with some extra function calls.

The Samsung Chromebook uses an A15 which does use a timer based udelay.
What about the others? IOW is this a real problem in practice or a
theoretical one?

SMP with shared clock for DVFS simply doesn't allow pure loop counts to
always be accurate. Trying to fix a broken implementation with
something that is still broken to some extent, and maybe more in
some cases, doesn't look like much progress to me.


Nicolas
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/