Re: [PATCH RFC 2/3] KVM: guest_memfd: support folio migration for non-confidential VMs

From: David Hildenbrand (Arm)

Date: Mon Jun 15 2026 - 14:35:51 EST


On 6/11/26 15:05, Shivank Garg wrote:
> guest_memfd folios are currently marked unmmovable, so the kernel
> cannot perform NUMA-balancing, memory compaction, etc.
> This is unavoidable for confidential VMs (SEV-SNP, TDX),
> since memory is encrypted and copying it need firmware assistance.
> However, for non-cofidential VMs (like firecracker), we can migrate
> the folios.
>
> Mark non-confidential VMs as movable and implement
> kvm_gmem_migrate_folio() using filemap_migrate_folio().
>
> This lays the ground work for migrating cofidential guest_memfd
> later. Once the firmware-assisted copying support is available,
> those VMs can be made movable. The confidential folio content can
> be copied separately, and the destination folio can be marked with
> FOLIO_CONTENT_COPIED so __migrate_folio() skips the host-side
> folio_mc_copy().
>
> Signed-off-by: Shivank Garg <shivankg@xxxxxxx>
> ---
> virt/kvm/guest_memfd.c | 50 +++++++++++++++++++++++++++++++++++++++++++++-----
> 1 file changed, 45 insertions(+), 5 deletions(-)
>
> diff --git a/virt/kvm/guest_memfd.c b/virt/kvm/guest_memfd.c
> index 806a42f0e031a1c7729f53c786316d2502532553..e4470106fc7792f328bce5275419683328c8b4ab 100644
> --- a/virt/kvm/guest_memfd.c
> +++ b/virt/kvm/guest_memfd.c
> @@ -487,13 +487,45 @@ static struct file_operations kvm_gmem_fops = {
> .fallocate = kvm_gmem_fallocate,
> };
>
> +#ifdef CONFIG_MIGRATION
> static int kvm_gmem_migrate_folio(struct address_space *mapping,
> struct folio *dst, struct folio *src,
> enum migrate_mode mode)
> {
> - WARN_ON_ONCE(1);
> - return -EINVAL;
> + struct inode *inode = mapping->host;
> + pgoff_t start, end;
> + int ret;
> +
> + if (!filemap_invalidate_trylock_shared(mapping))
> + return -EAGAIN;
> +
> + start = src->index;
> + end = start + folio_nr_pages(src);
> +
> + kvm_gmem_invalidate_begin(inode, start, end);
> +
> + /*
> + * For non-confidential guest_memfd the folio is host-readable,
> + * so filemap_migrate_folio() can copy the contents itself via
> + * folio_mc_copy().
> + *
> + * This is also the hook point for confidential VMs (SEV-SNP, TDX) once
> + * they are made movable: the host cannot copy encrypted/private memory,
> + * so a firmware-assisted copy would run here.
> + * Idea: https://lore.kernel.org/r/20260428155043.39251-8-shivankg@xxxxxxx
> + * Mark the @dst->migrate_info field with FOLIO_CONTENT_COPIED, so
> + * __migrate_folio() skip folio_mc_copy() for confidential VMs.
> + */
> + ret = filemap_migrate_folio(mapping, dst, src, mode);
> +
> + kvm_gmem_invalidate_end(inode, start, end);
> +
> + filemap_invalidate_unlock_shared(mapping);
> + return ret;
> }
> +#else
> +#define kvm_gmem_migrate_folio NULL
> +#endif
>
> static int kvm_gmem_error_folio(struct address_space *mapping, struct folio *folio)
> {
> @@ -592,9 +624,17 @@ static int __kvm_gmem_create(struct kvm *kvm, loff_t size, u64 flags)
> inode->i_size = size;
> mapping_set_gfp_mask(inode->i_mapping, GFP_HIGHUSER);
> mapping_set_inaccessible(inode->i_mapping);
> - mapping_set_unmovable(inode->i_mapping);
> - /* Unmovable mappings are supposed to be marked unevictable as well. */
> - WARN_ON_ONCE(!mapping_unevictable(inode->i_mapping));
> +
> + /*
> + * Confidential VMs (SEV-SNP, TDX) bind encryption to the physical
> + * address and require firmware assisted copy, so their folios cannot
> + * be migrated yet.
> + */
> + if (kvm_arch_has_private_mem(kvm)) {
> + mapping_set_unmovable(inode->i_mapping);
> + /* Unmovable mappings are supposed to be marked unevictable as well. */
> + WARN_ON_ONCE(!mapping_unevictable(inode->i_mapping));

We would still want our movable mappings to be flagged unevictable.

> + }
>

As discussed, for guest_memfd instances that support page migration, we would
want to also allocate the pages in for guest_memfd as GFP_HIGHUSER_MOVABLE.

That is, handle the mapping_set_gfp_mask() call as well.

It will unlock access to areas reserved for movable allocations (CMA/
ZONE_MOVABLE) and properly let the page allocator group pages by mobility
(MOVABLE vs. UNMOVABLE vs. RECLAIMABLE).

--
Cheers,

David