Re: [PATCH] ARM: Don't ever downscale loops_per_jiffy in SMP systems#

From: Nicolas Pitre
Date: Fri May 09 2014 - 14:01:01 EST


On Fri, 9 May 2014, Russell King - ARM Linux wrote:

> On Thu, May 08, 2014 at 09:37:15PM -0400, Nicolas Pitre wrote:
> > On Thu, 8 May 2014, Russell King - ARM Linux wrote:
> >
> > > If you're in a preempt or SMP environment, provide a timer for udelay().
> > > IF you're in an environment with IRQs which can take a long time, use
> > > a timer for udelay(). If you're in an environment where the CPU clock
> > > can change unexpectedly, use a timer for udelay().
> >
> > Longer delays are normally not a problem. If they are, then simply
> > disabling IRQs may solve it if absolutely required. With much shorter
> > delays than expected this is another story.
> >
> > What about the following:
> >
> > diff --git a/arch/arm/kernel/smp.c b/arch/arm/kernel/smp.c
> > index 7c4fada440..10030cc5a0 100644
> > --- a/arch/arm/kernel/smp.c
> > +++ b/arch/arm/kernel/smp.c
> > @@ -682,6 +682,15 @@ static int cpufreq_callback(struct notifier_block *nb,
> > cpufreq_scale(per_cpu(l_p_j_ref, cpu),
> > per_cpu(l_p_j_ref_freq, cpu),
> > freq->new);
> > + /*
> > + * Another CPU might have called udelay() just before LPJ
> > + * and a shared CPU clock is increased. That other CPU still
> > + * looping on the old LPJ value would return significantly
> > + * sooner than expected. The actual fix is to provide a
> > + * timer based udelay() implementation instead.
> > + */
> > + if (freq->old < freq->new)
> > + pr_warn_once("*** udelay() on SMP is racy and may be much shorter than expected ***\n");
> > }
> > return NOTIFY_OK;
> > }
>
> No, because you're assuming this is just a SMP problem. What about
> preempt, where you could preempt away from a udelay loop to change
> the CPU frequency, and then back again, possibly resulting in the
> CPU clock rate increasing and maybe a shorter delay if the switch
> from-change-clock-and-back is fast enough? Remember that udelay()
> can be used for up to 2ms delays.

Well... that would be somewhat less likely but still possible yes.

So the only way to "solve" this might look similar in spirit to what
Doug alluded to earlier i.e. increase a sequence number on
CPUFREQ_PRECHANGE and increase it again on CPUFREQ_POSTCHANGE, and have
udelay() compare the count sampled before reading lpj and after
returning from the loop code. When the sequence count doesn't match
then suffice to perform some arbitrarily large extra loops.


Nicolas
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/