Re: [PATCH 09/10] mm, oom_reaper: do not attempt to reap a task more than twice

From: Michal Hocko
Date: Thu Jun 16 2016 - 02:28:28 EST


On Wed 15-06-16 16:48:35, Oleg Nesterov wrote:
> On 06/09, Michal Hocko wrote:
> >
> > @@ -556,8 +556,27 @@ static void oom_reap_task(struct task_struct *tsk)
> > schedule_timeout_idle(HZ/10);
> >
> > if (attempts > MAX_OOM_REAP_RETRIES) {
> > + struct task_struct *p;
> > +
> > pr_info("oom_reaper: unable to reap pid:%d (%s)\n",
> > task_pid_nr(tsk), tsk->comm);
> > +
> > + /*
> > + * If we've already tried to reap this task in the past and
> > + * failed it probably doesn't make much sense to try yet again
> > + * so hide the mm from the oom killer so that it can move on
> > + * to another task with a different mm struct.
> > + */
> > + p = find_lock_task_mm(tsk);
> > + if (p) {
> > + if (test_and_set_bit(MMF_OOM_NOT_REAPABLE, &p->mm->flags)) {
> > + pr_info("oom_reaper: giving up pid:%d (%s)\n",
> > + task_pid_nr(tsk), tsk->comm);
> > + set_bit(MMF_OOM_REAPED, &p->mm->flags);
>
> But why do we need MMF_OOM_NOT_REAPABLE? We set MMF_OOM_REAPED, oom_reap_task()
> should not see this task again, at least too often.

We set MMF_OOM_REAPED only when actually reaping something in
__oom_reap_task. We might have failed the mmap_sem read lock. The
purpose of this patch is to not encounter such a task for ever and do
not back off too easily. I guess we could set the flag unconditionally
after the first failure and can do that eventually when running out of
MMF flags but thiw way it looks like an easy trade off to me.

--
Michal Hocko
SUSE Labs