Re: [PATCH] fix de_thread() vs do_coredump() deadlock

From: Roland McGrath
Date: Tue Apr 11 2006 - 03:27:47 EST


> So, de_thread() sets SIGNAL_GROUP_EXEC and sends SIGKILL to other thereads.
>
> Sub-thread receives the signal, and calls get_signal_to_deliver->do_group_exit.
> do_group_exit() calls zap_other_threads(SIGNAL_GROUP_EXIT) because there is no
> SIGNAL_GROUP_EXIT set. zap_other_threads() notices SIGNAL_GROUP_EXEC, wakes up
> execer, and changes ->signal->flags to SIGNAL_GROUP_EXIT.
>
> de_thread() re-locks sighand, sees !SIGNAL_GROUP_EXEC and goes to 'dying:'.

That is what I intend. The exec'ing thread backs out and processes its SIGKILL.
It sounds like you are calling this scenario a problem, but I don't know why.

> Another problem. de_thread() sets '->group_exit_task = current' _only_ if
> 'atomic_read(&sig->count) > count', so wake_up_process(->group_exit_task)
> in zap_other_threads() is unsafe.

Ah, this is indeed a problem when the only other thread is the old leader.
I think it's easy enough just to set it unconditionally and clear it at the
end, after the leader too is gone. i.e., this on top of the previous patch:

--- a/fs/exec.c
+++ b/fs/exec.c
@@ -659,8 +659,8 @@ static int de_thread(struct task_struct
goto dying;
}
}
+ sig->group_exit_task = current;
while (atomic_read(&sig->count) > count) {
- sig->group_exit_task = current;
sig->notify_count = count;
__set_current_state(TASK_UNINTERRUPTIBLE);
spin_unlock_irq(lock);
@@ -675,7 +675,6 @@ static int de_thread(struct task_struct
goto dying;
}
}
- sig->group_exit_task = NULL;
sig->notify_count = 0;
spin_unlock_irq(lock);

@@ -774,6 +773,7 @@ static int de_thread(struct task_struct
* but it's safe to stop telling the group to kill themselves.
*/
sig->flags = 0;
+ sig->group_exit_task = NULL;

no_thread_group:
exit_itimers(sig);



Thanks,
Roland
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/