Re: [PATCH 6/9] mm: don't avoid high-priority reclaim on memcg limit reclaim
From: Michal Hocko
Date: Wed Mar 01 2017 - 10:53:54 EST
On Tue 28-02-17 16:40:04, Johannes Weiner wrote:
> 246e87a93934 ("memcg: fix get_scan_count() for small targets") sought
> to avoid high reclaim priorities for memcg by forcing it to scan a
> minimum amount of pages when lru_pages >> priority yielded nothing.
> This was done at a time when reclaim decisions like dirty throttling
> were tied to the priority level.
>
> Nowadays, the only meaningful thing still tied to priority dropping
> below DEF_PRIORITY - 2 is gating whether laptop_mode=1 is generally
> allowed to write. But that is from an era where direct reclaim was
> still allowed to call ->writepage, and kswapd nowadays avoids writes
> until it's scanned every clean page in the system. Potential changes
> to how quick sc->may_writepage could trigger are of little concern.
>
> Remove the force_scan stuff, as well as the ugly multi-pass target
> calculation that it necessitated.
I _really_ like this, I hated the multi-pass part. One thig that I am
worried about and changelog doesn't mention it is what we are going to
do about small (<16MB) memcgs. On one hand they were already ignored in
the global reclaim so this is nothing really new but maybe we want to
preserve the behavior for the memcg reclaim at least which would reduce
side effect of this patch which is a great cleanup otherwise. Or at
least be explicit about this in the changelog.
Btw. why cannot we simply force scan at least SWAP_CLUSTER_MAX
unconditionally?
> + /*
> + * If the cgroup's already been deleted, make sure to
> + * scrape out the remaining cache.
Also make sure that small memcgs will not get
unnoticed during the memcg reclaim
> + */
> + if (!scan && !mem_cgroup_online(memcg))
if (!scan && (!mem_cgroup_online(memcg) || !global_reclaim(sc)))
> + scan = min(size, SWAP_CLUSTER_MAX);
>
--
Michal Hocko
SUSE Labs