Re: [PATCH v9 11/11] seccomp: implement SECCOMP_FILTER_FLAG_TSYNC

From: Kees Cook
Date: Thu Jul 10 2014 - 05:18:01 EST

On Wed, Jul 9, 2014 at 11:05 AM, Oleg Nesterov <oleg@xxxxxxxxxx> wrote:
> First of all, sorry for delay ;)
> So far I quickly glanced at this series and everything look fine, but
> I am confused by the signal_group_exit() check,
> On 06/27, Kees Cook wrote:
>> To make sure that de_thread() is actually able
>> to kill other threads during an exec, any sighand holders need to check
>> if they've been scheduled to be killed, and to give up on their work.
> Probably this connects to that check below? I can't understand this...
>> + /*
>> + * Make sure we cannot change seccomp or nnp state via TSYNC
>> + * while another thread is in the middle of calling exec.
>> + */
>> + if (flags & SECCOMP_FILTER_FLAG_TSYNC &&
>> + mutex_lock_killable(&current->signal->cred_guard_mutex))
>> + goto out_free;
> -EINVAL looks a bit confusing in this case, but this is cosemtic because
> userspace won't see this error-code anyway.

Happy to use whatever since, as you say, it's cosmetic. Perhaps -EAGAIN?

>> spin_lock_irq(&current->sighand->siglock);
>> + if (unlikely(signal_group_exit(current->signal))) {
>> + /* If thread is dying, return to process the signal. */
> OK, this doesn't hurt, but why?
> You could check __fatal_signal_pending() with the same effect. And since
> we hold this mutex, exec (de_thread) can be the source of that SIGKILL.
> We take this mutex specially to avoid the race with exec.
> So why do we need to abort if we race with kill() or exit_grouo() ?

In my initial code inspection that we could block waiting for the
cred_guard mutex, with exec holding it, exec would schedule death in
de_thread, and then once it released, the tsync thread would try to
keep running.

However, in looking at this again, now I'm concerned this produces a
dead-lock in de_thread, since it waits for all threads to actually
die, but tsync will be waiting with the killable mutex.

So I think I got too defensive when I read the top of de_thread where
it checks for pending signals itself.

It seems like I can just safely remove the singal_group_exit checks?
The other paths (non-tsync seccomp_set_mode_filter, and
seccomp_set_mode_strict) would just run until it finished the syscall,
and then died. I can't decide which feels cleaner: just letting stuff
clean up naturally on death or to short-circuit after taking

What do you think?


Kees Cook
Chrome OS Security
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at
Please read the FAQ at