Re: [patch rfc] memcg: correctly order reading PCG_USED andpc->mem_cgroup

From: Daisuke Nishimura
Date: Wed Jan 19 2011 - 20:57:36 EST


On Wed, 19 Jan 2011 13:03:19 +0100
Johannes Weiner <hannes@xxxxxxxxxxx> wrote:

> The placement of the read-side barrier is confused: the writer first
> sets pc->mem_cgroup, then PCG_USED. The read-side barrier has to be
> between testing PCG_USED and reading pc->mem_cgroup.
>
> Signed-off-by: Johannes Weiner <hannes@xxxxxxxxxxx>
> ---
> mm/memcontrol.c | 27 +++++++++------------------
> 1 files changed, 9 insertions(+), 18 deletions(-)
>
> I am a bit dumbfounded as to why this has never had any impact. I see
> two scenarios where charging can race with LRU operations:
>
> One is shmem pages on swapoff. They are on the LRU when charged as
> page cache, which could race with isolation/putback. This seems
> sufficiently rare.
>
> The other case is a swap cache page being charged while somebody else
> had it isolated. mem_cgroup_lru_del_before_commit_swapcache() would
> see the page isolated and skip it. The commit then has to race with
> putback, which could see PCG_USED but not pc->mem_cgroup, and crash
> with a NULL pointer dereference. This does sound a bit more likely.
>
> Any idea? Am I missing something?
>
pc->mem_cgroup is not cleared even when the page is freed, so NULL pointer
dereference can happen only when it's the first time the page is used.
But yes, even if it's not the first time, this means pc->mem_cgroup may be wrong.

Anyway, I welcome this patch.

Acked-by: Daisuke Nishimura <nishimura@xxxxxxxxxxxxxxxxx>


Thanks,
Daisuke Nishimura.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/