[PATCH v5 2/6] memcg: stop vmscan when enough done.

From: KAMEZAWA Hiroyuki
Date: Tue Aug 09 2011 - 06:16:55 EST

memcg :avoid node fallback scan if possible.

Now, try_to_free_pages() scans all zonelist because the page allocator
should visit all zonelists...but that behavior is harmful for memcg.
Memcg just scans memory because it hits limit...no memory shortage
in pased zonelist.

For example, with following unbalanced nodes

Node 0 Node 1
File 1G 0
Anon 200M 200M

memcg will cause swap-out from Node1 at every vmscan.

Another example, assume 1024 nodes system.
With 1024 node system, memcg will visit 1024 nodes
pages per vmscan... This is overkilling.

This is why memcg's victim node selection logic doesn't work
as expected.

This patch is a help for stopping vmscan when we scanned enough.

Signed-off-by: KAMEZAWA Hiroyuki <kamezawa.hiroyu@xxxxxxxxxxxxxx>
mm/vmscan.c | 10 ++++++++++
1 file changed, 10 insertions(+)

Index: mmotm-Aug3/mm/vmscan.c
--- mmotm-Aug3.orig/mm/vmscan.c
+++ mmotm-Aug3/mm/vmscan.c
@@ -2124,6 +2124,16 @@ static void shrink_zones(int priority, s

shrink_zone(priority, zone, sc);
+ if (!scanning_global_lru(sc)) {
+ /*
+ * When we do scan for memcg's limit, it's bad to do
+ * fallback into more node/zones because there is no
+ * memory shortage. We quit as much as possible when
+ * we reache target.
+ */
+ if (sc->nr_to_reclaim <= sc->nr_reclaimed)
+ break;
+ }

To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/