Re: [PATCH] fs/resctrl: Fix use-after-free in resctrl_offline_mon_domain()

Next message: Tim Chen: "Re: [PATCH v2 RFC 08/13] sched/qos: Add a new sched-qos interface"
Previous message: Sean Christopherson: "Re: [PATCH v2] KVM: x86: Rate-limit global clock updates on vCPU load"
In reply to: Luck, Tony: "Re: [PATCH] fs/resctrl: Fix use-after-free in resctrl_offline_mon_domain()"
Next in thread: Luck, Tony: "RE: [PATCH] fs/resctrl: Fix use-after-free in resctrl_offline_mon_domain()"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

From: Reinette Chatre

Date: Wed May 06 2026 - 16:33:56 EST

Hi Tony,

On 5/6/26 1:02 PM, Luck, Tony wrote:
> Question?
>
>> + if (!is_percpu_thread()) {
>> + list_for_each_entry(d, &r->mon_domains, hdr.list) {
>> + if (d->mbm_work_cpu == nr_cpu_ids)
>> + mbm_setup_overflow_handler(d, MBM_OVERFLOW_INTERVAL, RESCTRL_PICK_ANY_CPU);
>
> Should that "MBM_OVERFLOW_INTERVAL" be "0"? This worker is presumably
> already slightly late because of the offline CPU overhead and time to
> be picked up by another CPU. Maybe it should run right away on whatever
> new CPU in the domain is picked?

The delay is intentionally _not_ zero and there should probably be a comment
to make that clear. My module experiment demonstrated that when the work associated
with the work_struct is already running then no matter which CPU is provided as parameter
to schedule_delayed_work_on() the workqueue handling will schedule the work on the same
CPU as the currently executing work. Second time around is_percpu_thread() will still be
false but this time mbm_work_cpu will be set to CPU it should have been scheduled to and
work will exit without re-arming the worker and the associated domain loses its worker.

By setting the delay to MBM_OVERFLOW_INTERVAL it guarantees that the current executing
worker will be done by the time the newly scheduled worker should run and thus
be scheduled on correct CPU. I assume you are hinting that if the memory bandwidth is
under pressure there may thus be a risk that an overflow occurred? Perhaps
MBM_OVERFLOW_INTERVAL is too big - the delay only needs to be big enough to ensure that
current worker is done before new worker is scheduled. Do you have suggestions?

Reinette