Re: [PATCH v3 0/6] KVM: guest_memfd: support for uffd minor

From: Nikita Kalyazin
Date: Mon Apr 07 2025 - 10:12:44 EST




On 07/04/2025 14:40, Liam R. Howlett wrote:
* Nikita Kalyazin <kalyazin@xxxxxxxxxx> [250407 07:04]:


On 04/04/2025 18:12, Liam R. Howlett wrote:
+To authors of v7 series referenced in [1]

* Nikita Kalyazin <kalyazin@xxxxxxxxxx> [250404 11:44]:
This series is built on top of the Fuad's v7 "mapping guest_memfd backed
memory at the host" [1].

I didn't see their addresses in the to/cc, so I added them to my
response as I reference the v7 patch set below.

Hi Liam,

Thanks for the feedback and for extending the list.



With James's KVM userfault [2], it is possible to handle stage-2 faults
in guest_memfd in userspace. However, KVM itself also triggers faults
in guest_memfd in some cases, for example: PV interfaces like kvmclock,
PV EOI and page table walking code when fetching the MMIO instruction on
x86. It was agreed in the guest_memfd upstream call on 23 Jan 2025 [3]
that KVM would be accessing those pages via userspace page tables.

Thanks for being open about the technical call, but it would be better
to capture the reasons and not the call date. I explain why in the
linking section as well.

Thanks for bringing that up. The document mostly contains the decision
itself. The main alternative considered previously was a temporary
reintroduction of the pages to the direct map whenever a KVM-internal access
is required. It was coming with a significant complexity of guaranteeing
correctness in all cases [1]. Since the memslot structure already contains
a guest memory pointer supplied by the userspace, KVM can use it directly
when in the VMM or vCPU context. I will add this in the cover for the next
version.

Thank you.


[1] https://lore.kernel.org/kvm/20240709132041.3625501-1-roypat@xxxxxxxxxxxx/T/#m4f367c52bbad0f0ba7fb07ca347c7b37258a73e5


In
order for such faults to be handled in userspace, guest_memfd needs to
support userfaultfd.

Changes since v2 [4]:
- James: Fix sgp type when calling shmem_get_folio_gfp
- James: Improved vm_ops->fault() error handling
- James: Add and make use of the can_userfault() VMA operation
- James: Add UFFD_FEATURE_MINOR_GUEST_MEMFD feature flag
- James: Fix typos and add more checks in the test

Nikita

Please slow down...

This patch is at v3, the v7 patch that you are building off has lockdep
issues [1] reported by one of the authors, and (sorry for sounding harsh
about the v7 of that patch) the cover letter reads a bit more like an
RFC than a set ready to go into linux-mm.

AFAIK the lockdep issue was reported on a v7 of a different change.
I'm basing my series on [2] ("KVM: Mapping guest_memfd backed memory at the
host for software protected VMs"), while the issue was reported on [2]
("KVM: Restricted mapping of guest_memfd at the host and arm64 support"),
which is also built on top of [2]. Please correct me if I'm missing
something.

I think you messed up the numbering in your statement above.

I did, in an attempt to make it "even more clear" :) Sorry about that, glad you got the intention.


I believe you are making the point that I messed up which patches depend
on what and your code does not depend on faulty locking, which appears
to be the case.

There are a few issues with the required patch set?

There are indeed, but not in the part this series depends on, as far as I can see.



The key feature that is required by my series is the ability to mmap
guest_memfd when the VM type allows. My understanding is no-one is opposed
to that as of now, that's why I assumed it's safe to build on top of that.

[2] https://lore.kernel.org/kvm/20250318161823.4005529-1-tabba@xxxxxxxxxx/T/
[3] https://lore.kernel.org/all/diqz1puanquh.fsf@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx/T/

All of this is extremely confusing because the onus of figuring out what
the final code will look like is put on the reviewer. As it is, we have
issues with people not doing enough review of the code (due to limited
time). One way to get reviews is to make the barrier of entry as low as
possible.

I spent Friday going down a rabbit hole of patches referring to each
other as dependencies and I gave up. It looks like I mistook one set of
patches as required vs them requiring the same in-flight ones as your
patches.

I am struggling to see how we can adequately support all of you given
the way the patches are sent out in batches with dependencies - it is
just too time consuming to sort out.

I'm happy to do whatever I can to make the review easier. I suppose the extreme case is to wait for the dependencies to get accepted, effectively serialising submissions, but that slows the process down significantly. For example, I received very good feedback on v1 and v2 of this series and was able to address it instead of waiting for the dependency. Would including the required patches directly in the series help? My only concern is in that case the same patch will be submitted multiple times (as a part of every depending series), but if it's better, I'll be doing that instead.


Thank you,
Liam