Re: [PATCH v2 0/8] Introduce a huge-page pre-zeroing mechanism
From: David Hildenbrand (Red Hat)
Date: Thu Jan 15 2026 - 15:22:34 EST
On 1/15/26 21:16, dan.j.williams@xxxxxxxxx wrote:
> David Hildenbrand (Red Hat) wrote:
> [..]
>>> Give me a list of 1Gig pages and this stuff becomes much more efficient
>>> than anything the CPU can do.
>>
>> Right, and ideally we'd implement any such mechanisms in a way that more
>> parts of the kernel can benefit, and not just an unloved in-memory
>> file-system that most people just want to get rid of as soon as we can :)
>
> CPUs have tended to eat the value of simple DMA offload operations like
> copy/zero over time.
>
> In the case of this patch there is no async-offload benefit because
> userspace is already charged with spawning more threads if it wants more
> parallelism.
In this subthread we're discussing handling that in the kernel, similar to
init_on_free: when user space frees a hugetlb folio (or, in the future,
another similarly gigantic folio from a different allocator), we'd zero it.

If user space were freeing multiple such folios, we could batch them up and
hand them to a DMA engine to zero for us (concurrently? asynchronously?
I don't know :) )
--
Cheers
David