Re: [PATCH] proc: fix pagemap_read() error case (was Re: [PATCH] proc: put check_mem_permission before __get_free_page in mem_read)
From: KOSAKI Motohiro
Date: Wed Apr 27 2011 - 21:45:20 EST
> Great, thank you. Though I see Steve has cleverly spotted another:
> yes, that should be worth looking into, but I've not studied it.
>
> I confess that when I wrote yesterday, I thought there were a lot more
> such instances in fs/proc. I thought /proc/pid/smaps was vulnerable,
> for example, but I cannot see that now: maybe I just confused it with
> /proc/pid/pagemap.
>
> I was worrying about this a couple of months ago, when David had an
> OOM dump with OOM-killed threads failing to exit because they could
> not skb_alloc in proc_exit_connector()'s cn_netlink_send() - connector
> needs to allocate skb upfront (I'm being vague!), not when thread exits.
>
> But it was a mystery why memory was still unavailable at that late
> stage: my unsubstantiated suspicion was that something else was
> holding on to the mm-to-be-freed while trying to allocate memory.
Hi
Thank you for the explanation.
Today, I revisit get_task_mm() users. So, I found four suspect codes
for google usecase.
o CPUSET
cpuset_change_nodemask() and cpuset_attach() call do_migrate_pages()
under holding mm reference. That said, If you try to attach crazy large
application, you can face the "OOM-killed threads failing to exit" issue.
o /proc/{pid}/cmdline and /proc/{pid}/environ
proc_pid_cmdline() and environ_read() call get_user_pages() under holding
mm reference. That said, If you run system monitor process and it periodically
read the above interface, oom-kill can't gurantte to work.
So, I'll second that Linus said we should remove TIF_MEMDIE and PF_EXITING check
from select_bad_process(). Or at least, We have to make timeout and try to kill
next process if timeout is happen.
And, as Oleg said at past, maybe we have to make MMF_OOM_KILLED and page fault
should detect it.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/