Re: [PATCH RFC] rcutorture: Fully test lazy RCU

Next message: Eric Biggers: "Re: [PATCH] lib/crypto: tests: Depend on library options rather than selecting them"
Previous message: kernel test robot: "[tip:perf/core] BUILD SUCCESS b2ea0f541d354c10186a894e8bf5bf83abf8a517"
In reply to: Paul E. McKenney: "[PATCH RFC] rcutorture: Fully test lazy RCU"
Next in thread: Paul E. McKenney: "Re: [PATCH RFC] rcutorture: Fully test lazy RCU"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

From: Joel Fernandes

Date: Sat Feb 28 2026 - 22:09:22 EST

> On Feb 27, 2026, at 7:39 PM, Paul E. McKenney <paulmck@xxxxxxxxxx> wrote:
>
> Currently, rcutorture bypasses lazy RCU by using call_rcu_hurry().
> This works, avoiding the dreaded rtort_pipe_count WARN(), but fails to
> fully test lazy RCU. The rtort_pipe_count WARN() splats because lazy RCU
> could delay the start of an RCU grace period for a full stutter period,
> which defaults to only three seconds.

But call_rcu_hurry() should completely flush, so if we are still
splatting with it, does that not mean that there is a real bug the
splat uncovered?

> This commit therefore reverts the call_rcu_hurry() instances
> back to call_rcu(), but, in kernels built with CONFIG_RCU_LAZY=y,
> queues a workqueue handler just before the call to stutter_wait() in
> rcu_torture_writer(). This workqueue handler invokes rcu_barrier(),
> which motivates any lingering lazy callbacks, thus avoiding the splat.

But nothing should be lingering with _hurry().

> Questions for review:
>
> 1. Should we avoid queueing work for RCU implementations not
> supporting lazy callbacks?

I think so, best to isolate non Lazy cases so the barrier call does
not cause side effects.

> 2. Should we avoid queueing work in kernels built with
> CONFIG_RCU_LAZY=y, but that were not booted with the
> rcutree.enable_rcu_lazy kernel boot parameter set? (Note that
> this requires some ugliness to access this parameter, and must
> also handle Tiny RCU.)

Yes I think we should avoid.

> 3. Does the rcu_torture_ops structure need a ->call_hurry() field,
> and if so, why? If not, why not?
>
> 4. Your additional questions here!

Do we have a reproducer for the splat? If there is a link to the
report, I could take a look and investigate.

thanks,

--
Joel Fernandes