Re: [PATCH v20 00/18] Add support for Sub-NUMA cluster (SNC) systems
From: Reinette Chatre
Date: Fri Jun 14 2024 - 19:19:13 EST
Hi Babu,
On 6/14/24 2:29 PM, Moger, Babu wrote:
Hi Reinette,
On 6/14/2024 11:46 AM, Reinette Chatre wrote:
Hi Babu,
On 6/14/24 9:27 AM, Moger, Babu wrote:
Hi Reinette,
On 6/13/24 15:32, Reinette Chatre wrote:
Hi Babu,
On 6/13/24 12:17 PM, Moger, Babu wrote:
I may be little bit out of sync here. Also, sorry to come back late in the
series.
Looking at the series again, I see this approach adds lots of code.
Look at this structure.
@@ -187,10 +196,12 @@ struct rdt_resource {
bool alloc_capable;
bool mon_capable;
int num_rmid;
- enum resctrl_scope scope;
+ enum resctrl_scope ctrl_scope;
+ enum resctrl_scope mon_scope;
struct resctrl_cache cache;
struct resctrl_membw membw;
- struct list_head domains;
+ struct list_head ctrl_domains;
+ struct list_head mon_domains;
char *name;
int data_width;
u32 default_ctrl;
There are two scope fields.
There are two domains fields.
These are very confusing and very hard to maintain. Also, I am not sure if
these fields are useful for anything other than SNC feature. This approach
adds quite a bit of code for no specific advantage.
Why don't we just split the RDT_RESOURCE_L3 resource
into separate resources, one for control, one for monitoring.
We already have "control" only resources (MBA, SMBA, L2). Lets create new
"monitor" only resource. I feel it will be much cleaner approach.
Tony has already tried that approach and showed that it is much simpler.
v15-RFC :
https://lore.kernel.org/lkml/20240130222034.37181-1-tony.luck@xxxxxxxxx/
What do you think?
Some highlights of my thoughts in response to that series, but the whole
thread
may be of interest to you:
https://lore.kernel.org/lkml/78c88c6d-2e8d-42d1-a6f2-1c5ac38fb258@xxxxxxxxx/
https://lore.kernel.org/lkml/59944211-d34a-4ba3-a1de-095822c0b3f0@xxxxxxxxx/
Went through the thread, in summary:
The main concerns are related to duplication of code and data structures.
The solutions are
a) Split the domains.
This is what this series is doing now. This creates members like
ctrl_scope, mon_scope, ctrl_domains etc.. These fields are added to all
the resources (MBA, SMBA and L2). Then there is additional domain header.
b) Split the resource.
Split RDT_RESOURCE_L3 into two, one for "monitor" and one for "control".
There will be one domain structure for "monitor" and one for "control"
Both these approaches have code and data duplication. So, there is no
difference that way.
Could you please elaborate where code and data duplication of (a) is?
We have ctrl_scope, mon_scope, ctrl_domains. mon_domains. Only one
resource, RDT_RESOURCE_L3 is going to use these fields. Rest of the
resources don't need these fields. But these fields are part of all
the resources.
Correct. There are two new empty fields per resource that does
not support monitoring. Having the new mon_domains list results in
the benefit of eliminating monitoring fields from all the domains
forming part of resources that do not support monitoring. Providing
more details below but the additional pointer results in a significant
net reduction of unused fields. Having the new mon_scope field results
in the benefit to support SNC.
I am not too worried about the size of the patch. But, I don't
foresee these fields will be used anytime soon in these
resources(MBA. L3. SMBA). Why add it now? In future we may have to
cleanup all these anyways.
This work does indeed go through the effort to _eliminate_ unused fields.
Note how all domains of all resources (whether they support monitoring or
not) currently have to carry a significant number of monitoring fields.
These can be found in both struct rdt_domain (*rmid_busy_llc, *mbm_total,
*mbm_local, mbm_over, cqm_limbo, mbm_work_cpu, cqm_work_cpu) as
well as struct rdt_hw_domain (*arch_mbm_total, *arch_mbm_local).
For a resource that does not support monitoring it is of course
unnecessary to carry all of this for _every_ domain instance and
after this series it no longer will.
Reinette