Re: oomkillers gone wild.
From: David Rientjes
Date:  Fri Jun 08 2012 - 15:57:35 EST
On Tue, 5 Jun 2012, Dave Jones wrote:
>   OBJS ACTIVE  USE OBJ SIZE  SLABS OBJ/SLAB CACHE SIZE NAME 
> 142524 142420  99%    9.67K  47510	  3   1520320K task_struct
> 142560 142417  99%    1.75K   7920	 18    253440K signal_cache
> 142428 142302  99%    1.19K   5478	 26    175296K task_xstate
> 306064 289292  94%    0.36K   6956	 44    111296K debug_objects_cache
> 143488 143306  99%    0.50K   4484	 32     71744K cred_jar
> 142560 142421  99%    0.50K   4455       32     71280K task_delay_info
> 150753 145021  96%    0.45K   4308	 35     68928K kmalloc-128
> 
> Why so many task_structs ? There's only 128 processes running, and most of them
> are kernel threads.
> 
Do you have CONFIG_OPROFILE enabled?
> /sys/kernel/slab/task_struct/alloc_calls shows..
> 
>  142421 copy_process.part.21+0xbb/0x1790 age=8/19929576/48173720 pid=0-16867 cpus=0-7
> 
> I get the impression that the oom-killer hasn't cleaned up properly after killing some of
> those forked processes.
> 
> any thoughts ?
> 
If we're leaking task_struct's, meaning that put_task_struct() isn't 
actually freeing them when the refcount goes to 0, then it's certainly not 
because of the oom killer which only sends a SIGKILL to the selected 
process.
Have you tried kmemleak?
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/