Re: [PATCH v2 04/12] tick/nohz: Transition to dynamic full dynticks state management

From: Thomas Gleixner

Date: Tue Apr 14 2026 - 18:04:13 EST


On Mon, Apr 13 2026 at 15:43, Qiliang Yuan wrote:

> Context:
> Full dynticks (NOHZ_FULL) is typically a static configuration determined
> at boot time. DHEI extends this to support runtime activation.

I have no idea what DHEI is. Provide proper information and not magic
acronyms.

> Problem:
> Switching to NOHZ_FULL at runtime requires careful synchronization
> of context tracking and housekeeping states. Re-invoking setup logic
> multiple times could lead to inconsistencies or warnings, and RCU
> dependency checks often prevented tick suppression in Zero-Conf setups.

And that careful synchronization is best achieved with an opaque
notifier callchain which relies on build time ordering. Impressive.

> Solution:
> - Replace the static tick_nohz_full_enabled() checks with a dynamic
> tick_nohz_full_running state variable.

That variable existed before and you are telling the what and not why
this is required and how that is correct vs. the other checks in
tick_nohz_full_enabled(). Also what's static about that function aside
of being marked static inline?

> - Refactor tick_nohz_full_setup to be safe for runtime invocation,
> adding guards against re-initialization and ensuring IRQ work
> interrupt support.

Refactoring has to be done in a preparatory patch and not

> - Implement boot-time pre-activation of context tracking (shadow
> init) for all possible CPUs to avoid instruction flow issues during
> dynamic transitions.

Again lot's of hand waving without a proper explanation.

> - Hook into housekeeping_notifier_list to update NO_HZ states dynamically.

See above.

> This provides the core state machine for reliable, on-demand tick
> suppression and high-performance isolation.

I can find a lot of hacks, but definitely not the slightest notion of a
state machine. Don't throw random buzzwords into a changelog if there is
no evidence for their existance.

> +static int tick_nohz_housekeeping_reconfigure(struct notifier_block *nb,
> + unsigned long action, void *data)
> +{
> + struct housekeeping_update *upd = data;
> + int cpu;
> +
> + if (action == HK_UPDATE_MASK && upd->type == HK_TYPE_TICK) {
> + cpumask_var_t non_housekeeping_mask;
> +
> + if (!alloc_cpumask_var(&non_housekeeping_mask, GFP_KERNEL))
> + return NOTIFY_BAD;
> +
> + cpumask_andnot(non_housekeeping_mask, cpu_possible_mask, upd->new_mask);
> +
> + if (!tick_nohz_full_mask) {
> + if (!zalloc_cpumask_var(&tick_nohz_full_mask, GFP_KERNEL)) {
> + free_cpumask_var(non_housekeeping_mask);
> + return NOTIFY_BAD;
> + }
> + }
> +
> + /* Kick all CPUs to re-evaluate tick dependency before change */
> + for_each_online_cpu(cpu)
> + tick_nohz_full_kick_cpu(cpu);

That solves what?

> + cpumask_copy(tick_nohz_full_mask, non_housekeeping_mask);

What's the exact point of this non_housekeeping_mask?

Why can't you simply do:

cpumask_andnot(tick_nohz_full_mask, cpu_possible_mask, upd->new_mask);

That'd be too simple and comprehensible, right?

> + tick_nohz_full_running = !cpumask_empty(tick_nohz_full_mask);
> +
> + /*
> + * If nohz_full is running, the timer duty must be on a housekeeper.
> + * If the current timer CPU is not a housekeeper, or no duty is assigned,
> + * pick the first housekeeper and assign it.
> + */
> + if (tick_nohz_full_running) {
> + int timer_cpu = READ_ONCE(tick_do_timer_cpu);

New line between declaration and code.

> + if (timer_cpu == TICK_DO_TIMER_NONE ||
> + !cpumask_test_cpu(timer_cpu, upd->new_mask)) {

No line break required. You have 100 characters

> + int next_timer = cpumask_first(upd->new_mask);

next_timer? Please pick variable names which are comprehensible and self
explaining. Also why can't you re-use timer_cpu, which would be actually useful?


> + if (next_timer < nr_cpu_ids)

How can upd->new_mask be empty? That'd be a bug, no?

> + WRITE_ONCE(tick_do_timer_cpu, next_timer);
> + }
> + }
> +
> + /* Kick all CPUs again to apply new nohz full state */
> + for_each_online_cpu(cpu)
> + tick_nohz_full_kick_cpu(cpu);

This whole thing lacks an explanation why it is even remotely correct.

> void __init tick_nohz_init(void)
...
> + if (!tick_nohz_full_mask) {
> + if (!slab_is_available())
> + alloc_bootmem_cpumask_var(&tick_nohz_full_mask);
> + else
> + zalloc_cpumask_var(&tick_nohz_full_mask, GFP_KERNEL);
> }

I've seen the same code sequence before. Copy & paste is simpler than
providing helper functions.....

> - if (IS_ENABLED(CONFIG_PM_SLEEP_SMP) &&
> - !IS_ENABLED(CONFIG_PM_SLEEP_SMP_NONZERO_CPU)) {
> - cpu = smp_processor_id();
> + housekeeping_register_notifier(&tick_nohz_housekeeping_nb);
>
> - if (cpumask_test_cpu(cpu, tick_nohz_full_mask)) {
> - pr_warn("NO_HZ: Clearing %d from nohz_full range "
> - "for timekeeping\n", cpu);
> - cpumask_clear_cpu(cpu, tick_nohz_full_mask);
> + if (tick_nohz_full_running) {

This indentation and the resulting goto mess can be completely avoided
if you actually refactor the code and not just claim to do so.

Again, this does too many things at once and then explains them badly,
which makes it unreviewable.

Thanks,

tglx