[PATCH v2 01/10] KVM: Document KVM_MAP_MEMORY ioctl

From: isaku . yamahata
Date: Wed Apr 10 2024 - 18:08:15 EST


From: Isaku Yamahata <isaku.yamahata@xxxxxxxxx>

Adds documentation of KVM_MAP_MEMORY ioctl. [1]

It populates guest memory. It doesn't do extra operations on the
underlying technology-specific initialization [2]. For example,
CoCo-related operations won't be performed. Concretely for TDX, this API
won't invoke TDH.MEM.PAGE.ADD() or TDH.MR.EXTEND(). Vendor-specific APIs
are required for such operations.

The key point is to adapt of vcpu ioctl instead of VM ioctl. First,
populating guest memory requires vcpu. If it is VM ioctl, we need to pick
one vcpu somehow. Secondly, vcpu ioctl allows each vcpu to invoke this
ioctl in parallel. It helps to scale regarding guest memory size, e.g.,
hundreds of GB.

[1] https://lore.kernel.org/kvm/Zbrj5WKVgMsUFDtb@xxxxxxxxxx/
[2] https://lore.kernel.org/kvm/Ze-TJh0BBOWm9spT@xxxxxxxxxx/

Suggested-by: Sean Christopherson <seanjc@xxxxxxxxxx>
Signed-off-by: Isaku Yamahata <isaku.yamahata@xxxxxxxxx>
---
v2:
- Make flags reserved for future use. (Sean, Michael)
- Clarified the supposed use case. (Kai)
- Dropped source member of struct kvm_memory_mapping. (Michael)
- Change the unit from pages to bytes. (Michael)
---
Documentation/virt/kvm/api.rst | 52 ++++++++++++++++++++++++++++++++++
1 file changed, 52 insertions(+)

diff --git a/Documentation/virt/kvm/api.rst b/Documentation/virt/kvm/api.rst
index f0b76ff5030d..6ee3d2b51a2b 100644
--- a/Documentation/virt/kvm/api.rst
+++ b/Documentation/virt/kvm/api.rst
@@ -6352,6 +6352,58 @@ a single guest_memfd file, but the bound ranges must not overlap).

See KVM_SET_USER_MEMORY_REGION2 for additional details.

+4.143 KVM_MAP_MEMORY
+------------------------
+
+:Capability: KVM_CAP_MAP_MEMORY
+:Architectures: none
+:Type: vcpu ioctl
+:Parameters: struct kvm_memory_mapping (in/out)
+:Returns: 0 on success, < 0 on error
+
+Errors:
+
+ ========== =============================================================
+ EINVAL invalid parameters
+ EAGAIN The region is only processed partially. The caller should
+ issue the ioctl with the updated parameters when `size` > 0.
+ EINTR An unmasked signal is pending. The region may be processed
+ partially.
+ EFAULT The parameter address was invalid. The specified region
+ `base_address` and `size` was invalid. The region isn't
+ covered by KVM memory slot.
+ EOPNOTSUPP The architecture doesn't support this operation. The x86 two
+ dimensional paging supports this API. the x86 kvm shadow mmu
+ doesn't support it. The other arch KVM doesn't support it.
+ ========== =============================================================
+
+::
+
+ struct kvm_memory_mapping {
+ __u64 base_address;
+ __u64 size;
+ __u64 flags;
+ };
+
+KVM_MAP_MEMORY populates guest memory with the range, `base_address` in (L1)
+guest physical address(GPA) and `size` in bytes. `flags` must be zero. It's
+reserved for future use. When the ioctl returns, the input values are updated
+to point to the remaining range. If `size` > 0 on return, the caller should
+issue the ioctl with the updated parameters.
+
+Multiple vcpus are allowed to call this ioctl simultaneously. It's not
+mandatory for all vcpus to issue this ioctl. A single vcpu can suffice.
+Multiple vcpus invocations are utilized for scalability to process the
+population in parallel. If multiple vcpus call this ioctl in parallel, it may
+result in the error of EAGAIN due to race conditions.
+
+This population is restricted to the "pure" population without triggering
+underlying technology-specific initialization. For example, CoCo-related
+operations won't perform. In the case of TDX, this API won't invoke
+TDH.MEM.PAGE.ADD() or TDH.MR.EXTEND(). Vendor-specific uAPIs are required for
+such operations.
+
+
5. The kvm_run structure
========================

--
2.43.2