Re: [PATCH] mm: vmscan: fix dirty folios throttling on cgroup v1 for MGLRU

Next message: Xu Yilun: "[PATCH v2 18/31] iommu/vt-d: Cache max domain ID to avoid redundant calculation"
Previous message: Shrikanth Hegde: "Re: [PATCH 0/4] sched/fair: SMT-aware asymmetric CPU capacity"
In reply to: Baolin Wang: "Re: [PATCH] mm: vmscan: fix dirty folios throttling on cgroup v1 for MGLRU"
Next in thread: Kairui Song: "Re: [PATCH] mm: vmscan: fix dirty folios throttling on cgroup v1 for MGLRU"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

From: Johannes Weiner

Date: Fri Mar 27 2026 - 12:41:48 EST

On Fri, Mar 27, 2026 at 06:21:08PM +0800, Baolin Wang wrote:
> The balance_dirty_pages() won't do the dirty folios throttling on cgroupv1.
> See commit 9badce000e2c ("cgroup, writeback: don't enable cgroup writeback
> on traditional hierarchies").
>
> Moreover, after commit 6b0dfabb3555 ("fs: Remove aops->writepage"), we no
> longer attempt to write back filesystem folios through reclaim.
>
> On large memory systems, the flusher may not be able to write back quickly
> enough. Consequently, MGLRU will encounter many folios that are already
> under writeback. Since we cannot reclaim these dirty folios, the system
> may run out of memory and trigger the OOM killer.
>
> Hence, for cgroup v1, let's throttle reclaim after waking up the flusher,
> which is similar to commit 81a70c21d917 ("mm/cgroup/reclaim: fix dirty
> pages throttling on cgroup v1"), to avoid unnecessary OOM.

This fix for cgroup1 makes sense to me. For cgroup2, MGLRU shares the
shrink_node() reclaim throttling.

Acked-by: Johannes Weiner <hannes@xxxxxxxxxxx>

The remaining notable difference is global reclaim. I don't see any
equivalent throttling in the lru_gen_shrink_node() path. What prevents
premature OOMs at the system level?