Re: [PATCH v2] x86/resctrl: Preserve CDP enable over cpuhp
From: James Morse
Date: Fri Feb 21 2020 - 10:25:27 EST
Hi Reinette,
On 14/02/2020 19:24, Reinette Chatre wrote:
> On 2/14/2020 10:16 AM, James Morse wrote:
>> Resctrl assumes that all CPUs are online when the filesystem is
>> mounted, and that CPUs remember their CDP-enabled state over CPU
>> hotplug.
>>
>> This goes wrong when resctrl's CDP-enabled state changes while all
>> the CPUs in a domain are offline.
>>
>> When a domain comes online, enable (or disable!) CDP to match resctrl's
>> current setting.
>> diff --git a/arch/x86/kernel/cpu/resctrl/rdtgroup.c b/arch/x86/kernel/cpu/resctrl/rdtgroup.c
>> index 064e9ef44cd6..5967320a1951 100644
>> --- a/arch/x86/kernel/cpu/resctrl/rdtgroup.c
>> +++ b/arch/x86/kernel/cpu/resctrl/rdtgroup.c
>> @@ -1831,6 +1831,9 @@ static int set_cache_qos_cfg(int level, bool enable)
>> struct rdt_domain *d;
>> int cpu;
>>
>> + /* CDP state is restored during cpuhp, which takes this lock */
>> + lockdep_assert_held(&rdtgroup_mutex);
>> +
>
> I think this hunk can be dropped. (1) The code path where this
> annotation is added is not part of this fix. (2) The comment implies
> that the taking of the mutex is something new/unique added in the CPU
> hotplug path but that is not accurate since this mutex is also taken in
> the only other existing call path of this snippet that is handling the
> mounting of the filesystem.
These things answer the question: "what stops rdt_domain_reconfigure_cdp() racing with
set_cache_qos_cfg() on the mount path, causing the wrong value to be restored?".
We can try and answer that in the commit message, or comments, but these will quickly be
lost, stale, or wrong.
These annotations serve as a comment, and let lockdep check its still true.
(I think you can never have enough lockdep annotations!)
> You do mention that these annotations is helpful for the MPAM work.
Indeed, it splits up the, er, "big RDT mutex", these annotations mean lockdep catches me
out if I do something wrong, and makes it very clear when changing something subtle.
> Could the annotations instead be added as a separate patch forming part
> of that work?
Ideally these things are there from the beginning. Adding them over time as part of other
reviewed patches works. I don't think adding them in one go before refactoring helps: you
wouldn't have the confidence that they were correct in the first place.
I'll drop these.
>> if (level == RDT_RESOURCE_L3)
>> update = l3_qos_cfg_update;
>> else if (level == RDT_RESOURCE_L2)
>> @@ -1859,6 +1862,21 @@ static int set_cache_qos_cfg(int level, bool enable)
>> return 0;
>> }
>>
>> +/* Restore the qos cfg state when a package comes online */
>
> s/package/domain/? When, for example, considering L2 then "package" is
> not the right term to use.
Sure,
Thanks,
James