[RFC 1/1] mm/pagewalk: don't split device-backed huge pfnmaps
From: Max Boone
Date: Mon Mar 09 2026 - 13:56:58 EST
Don't split and descend into special PMDs/PUDs, which are generally
device-backed huge pfnmaps such as those vfio uses for BAR mappings.
Such an entry can be faulted back in after the split and before the
walker descends, so the walker can race with the fault and end up
performing an illegal read.
Signed-off-by: Max Boone <mboone@xxxxxxxxxx>
Signed-off-by: Max Tottenham <mtottenh@xxxxxxxxxx>
---
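For review, the branch as changed below can be modelled in userspace
roughly as follows. This is only a sketch of the decision logic, not
kernel code: struct model_entry, enum walk_action and
model_walk_decision() are hypothetical names made up for illustration,
and the three booleans stand in for the pmd_/pud_ present, leaf and
special helpers.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical userspace model of a PMD/PUD entry; not a kernel type. */
struct model_entry {
	bool present;	/* stands in for pmd_present()/pud_present() */
	bool leaf;	/* stands in for pmd_leaf()/pud_leaf() */
	bool special;	/* stands in for pmd_special()/pud_special() */
};

/* What the walker does with the entry after the entry callback has run. */
enum walk_action {
	WALK_SKIP,		/* "continue": do not descend */
	WALK_SPLIT_THEN_DESCEND,	/* split_huge_*() and walk the lower level */
	WALK_DESCEND,		/* no VMA: walk the lower level without splitting */
};

/*
 * Mirrors the patched if/else chain. With a VMA, special entries are
 * skipped before any split is attempted; everything else goes through
 * the split call (assumed here to do nothing for non-huge entries).
 * Without a VMA, the existing leaf/not-present check is unchanged.
 */
static enum walk_action model_walk_decision(const struct model_entry *e,
					    bool have_vma)
{
	assert(e);

	if (have_vma) {
		if (e->special)
			return WALK_SKIP;
		return WALK_SPLIT_THEN_DESCEND;
	}

	if (e->leaf || !e->present)
		return WALK_SKIP;

	return WALK_DESCEND;
}
```

In this model, an entry with special set and a VMA present always
yields WALK_SKIP, so the window between split and descend is never
opened for it. All other paths behave as they did before the change.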
mm/pagewalk.c | 24 ++++++++++++++++++++----
1 file changed, 20 insertions(+), 4 deletions(-)
diff --git a/mm/pagewalk.c b/mm/pagewalk.c
index a94c401ab..d1460dd84 100644
--- a/mm/pagewalk.c
+++ b/mm/pagewalk.c
@@ -147,10 +147,18 @@ static int walk_pmd_range(pud_t *pud, unsigned long addr, unsigned long end,
continue;
}
- if (walk->vma)
+ if (walk->vma) {
+ /*
+ * Don't descend into device-backed pfnmaps,
+ * they might refault the PMD entry.
+ */
+ if (unlikely(pmd_special(*pmd)))
+ continue;
+
split_huge_pmd(walk->vma, pmd, addr);
- else if (pmd_leaf(*pmd) || !pmd_present(*pmd))
+ } else if (pmd_leaf(*pmd) || !pmd_present(*pmd)) {
continue; /* Nothing to do. */
+ }
err = walk_pte_range(pmd, addr, next, walk);
if (err)
@@ -213,10 +221,18 @@ static int walk_pud_range(p4d_t *p4d, unsigned long addr, unsigned long end,
continue;
}
- if (walk->vma)
+ if (walk->vma) {
+ /*
+ * Don't descend into device-backed pfnmaps,
+ * they might refault the PUD entry.
+ */
+ if (unlikely(pud_special(*pud)))
+ continue;
+
split_huge_pud(walk->vma, pud, addr);
- else if (pud_leaf(*pud) || !pud_present(*pud))
+ } else if (pud_leaf(*pud) || !pud_present(*pud)) {
continue; /* Nothing to do. */
+ }
if (pud_none(*pud))
goto again;
--
2.34.1