Re: [PATCH 1/4] memcg, mm: introduce lowlimit reclaim
From: Johannes Weiner
Date: Tue May 06 2014 - 11:21:32 EST
On Tue, May 06, 2014 at 04:32:42PM +0200, Michal Hocko wrote:
> On Tue 06-05-14 09:29:32, Johannes Weiner wrote:
> > On Fri, May 02, 2014 at 06:00:56PM -0400, Johannes Weiner wrote:
> > > On Fri, May 02, 2014 at 06:49:30PM +0200, Michal Hocko wrote:
> > > > On Fri 02-05-14 11:58:05, Johannes Weiner wrote:
> > > > > This is not even guarantees anymore, but rather another reclaim
> > > > > prioritization scheme with best-effort semantics. That went over
> > > > > horribly with soft limits, and I don't want to repeat this.
> > > > >
> > > > > Overcommitting on guarantees makes no sense, and you even agree you
> > > > > are not interested in it. We also agree that we can always add a knob
> > > > > later on to change semantics when an actual usecase presents itself,
> > > > > so why not start with the clear and simple semantics, and the simpler
> > > > > implementation?
> > > >
> > > > So you are really preferring an OOM instead? That was the original
> > > > implementation posted at the end of last year and some people
> > > > had concerns about it. This is the primary reason I came up with a
> > > > weaker version which fallbacks rather than OOM.
> > >
> > > I'll dig through the archives on this then, thanks.
> >
> > The most recent discussion on this I could find was between you and
> > Greg, where the final outcome was (excerpt):
> >
> > ---
> >
> > From: Greg Thelen <gthelen@xxxxxxxxxx>
> > To: Michal Hocko <mhocko@xxxxxxx>
> > Cc: linux-mm@xxxxxxxxx, Johannes Weiner <hannes@xxxxxxxxxxx>, Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx>, KAMEZAWA Hiroyuki <kamezawa.hiroyu@xxxxxxxxxxxxxx>, LKML <linux-kernel@xxxxxxxxxxxxxxx>, Ying Han <yinghan@xxxxxxxxxx>, Hugh Dickins <hughd@xxxxxxxxxx>, Michel Lespinasse <walken@xxxxxxxxxx>, KOSAKI Motohiro <kosaki.motohiro@xxxxxxxxxxxxxx>, Tejun Heo <tj@xxxxxxxxxx>
> > Subject: Re: [RFC 0/4] memcg: Low-limit reclaim
> > References: <1386771355-21805-1-git-send-email-mhocko@xxxxxxx>
> > <xr93sis6obb5.fsf@xxxxxxxxxxxxxxxxxxxxxxxxxxx>
> > <20140130123044.GB13509@xxxxxxxxxxxxxx>
> > <xr931tzphu50.fsf@xxxxxxxxxxxxxxxxxxxxxxxxxxx>
> > <20140203144341.GI2495@xxxxxxxxxxxxxx>
> > Date: Mon, 03 Feb 2014 17:33:13 -0800
> > Message-ID: <xr93zjm7br1i.fsf@xxxxxxxxxxxxxxxxxxxxxxxxxxx>
> > List-ID: <linux-mm.kvack.org>
> >
> > On Mon, Feb 03 2014, Michal Hocko wrote:
> >
> > > On Thu 30-01-14 16:28:27, Greg Thelen wrote:
> > >> But this soft_limit,priority extension can be added later.
> > >
> > > Yes, I would like to have the strong semantic first and then deal with a
> > > weaker form. Either by a new limit or a flag.
> >
> > Sounds good.
> >
> > ---
> >
> > So I think everybody involved in the discussions so far are preferring
> > a hard guarantee, and then later, if needed, to either add a knob to
> > make it a soft guarantee or to actually implement a usable soft limit.
>
> I am afraid the most of that discussion happened off-list :( Sadly not
> much of a discussion happened on the list.
Time to do it now, then :)
> Sorry I should have been specific and mention that the discussions
> happened at LSF and partly at the KS.
>
> The strongest point was made by Rik when he claimed that memcg is not
> aware of memory zones and so one memcg with lowlimit larger than the
> size of a zone can eat up that zone without any way to free it.
But who actually cares if an individual zone can be reclaimed?
Userspace allocations can fall back to any other zone. Unless there
are hard bindings, but hopefully nobody binds a memcg to a node that
is smaller than that memcg's guarantee. And while the pages are not
reclaimable, they are still movable, so the NUMA balancer is free to
correct any allocation mistakes later on.
As to kernel allocations, watermarks and lowmem protection prevent any
single zone from filling up with userspace pages, regardless of their
reclaimability.
> This can cause additional troubles (permanent reclaim on that zone
> and OOM in an extreme situations).
We have protection against wasting CPU cycles on unreclaimable zones.
So how is it different than anonymous/shared memory without swap? Or
mlocked memory?
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/