Q: css_task_iter_advance() && dying_tasks
From: Oleg Nesterov
Date: Mon Jun 10 2024 - 06:55:26 EST
I never understood the code in kernel/cgroup/ even remotely, most probably
I missed something, but let me ask a couple of stupid questions anyway.
cgroup_exit() does
css_set_move_task(tsk, cset, NULL, false);
list_add_tail(&tsk->cg_list, &cset->dying_tasks);
but unless I am totally confused css_task_iter_advance() always ignores
the "dying" sub-threads, so perhaps it should do, say,
css_set_move_task(tsk, cset, NULL, false);
if (delay_group_leader(tsk))
list_add_tail(&tsk->cg_list, &cset->dying_tasks);
and then cgroup_release() can check list_empty(cg_list) before it takes
css_set_lock.
No ?
Or, perhaps we can do even better? Can't cgroup_exit() do something like
// group_dead should be passed from do_exit()
css_set_move_task(tsk, cset, NULL, false);
if (thread_group_leader(tsk) && !group_dead)
list_add_tail(&tsk->cg_list, &cset->dying_tasks);
else if (!thread_group_leader(tsk) && group_dead) {
leader = tsk->group_leader;
if (!list_empty(leader->cg_list) {
css_set_skip_task_iters(task_css_set(leader), leader);
list_del_init(&leader->cg_list);
}
}
and then
- kill the atomic_read(&task->signal->live)) check in
css_task_iter_advance()
- kill the code under css_set_lock in cgroup_release()
Oleg.