Re: [PATCH v1 2/2] KVM: arm64: allow the VM to select DEVICE_* and NORMAL_NC for IO memory
From: Jason Gunthorpe
Date: Thu Oct 12 2023 - 14:36:41 EST
On Thu, Oct 12, 2023 at 05:39:31PM +0100, Will Deacon wrote:
> All I'm asking for is justification as to why Normal-NC is the right
> memory type rather than any other normal memory type. If it's not possible
> to explain that architecturally, then I'm not sure this change belongs in
> architecture code.
Well, I think Catalin summarized it nicely, I second his ask at the end:
We are basically at your scenario below - can you justify why
DEVICE_GRE is correct today, architecturally? We could not. Earlier
someone said uncontained failure prevention, but a deep dive on that
found it is not so.
> Ultimately, we need to be able to maintain this stuff, so we can't just
> blindly implement changes based on a combination of off-list discussions
> and individual product needs. For example, if somebody else rocks up
> tomorrow and asks for this to be Normal-writethrough, what grounds do
> we have to say no if we've taken this change already?
Hmm. Something got lost here.
This patch is talking about the S2 MemAttr[3:0]. There are only 4
relevant values (when FEAT_S2FWB) (see D5.5.5 in ARM DDI 0487F.c)
0b001 - Today: force VM to be Device nGnRE
0b101 - Proposed: prevent the VM from selecting cachable, allow it to
choose Device-* or NormalNC
0b110 - Force write back. Would totally break MMIO, ruled out
0b111 - Allow the VM to select anything, including cachable.
This is nice, but summarizing Catalin's remarks:
a) The kernel can't do cache maintenance (defeats FWB)
b) Catalin's concerns about MTE and Atomics triggering
uncontained failures
c) It is unclear about uncontained failures for cachable
in general
So the only argument is 001 v 110 v 111
Catalin has explained why 111 is not good as a default. Most likely
with some future ACPI/BSA/etc work, and some cache syncing in the
kernel, someone could define a way to allow 111 as a choice. So, I
think we can rule out 111 as being the default choice without more the
kernel getting more detailed system level knowledge.
Further, patch 1 is about making 110 a driver-opt-in choice for VFIO
memory which reduces the need for 111.
For 001 v 110: 001 is less functional in the VM. 001 offers no
advantage.
!FEAT_S2FWB has similar choices and similar argument.
So, IMHO, if someone comes to ask for something it would be to ask for
111 and we do have a set of pretty clear reasons why it should not be
111. (indeed we wanted to ask for that instead of patch 1 but there
are too many things required to get there),
Jason