Re: linux-next: umh: fix processed error when UMH_WAIT_PROC is used seems to break linux bridge on s390x (bisected)
From: Luis Chamberlain
Date: Thu Jul 02 2020 - 15:47:03 EST
On Thu, Jul 02, 2020 at 01:26:53PM +0900, Tetsuo Handa wrote:
> On 2020/07/02 0:38, Luis Chamberlain wrote:
> > @@ -156,6 +156,18 @@ static void call_usermodehelper_exec_sync(struct subprocess_info *sub_info)
> > */
> > if (KWIFEXITED(ret))
> > sub_info->retval = KWEXITSTATUS(ret);
> > + /*
> > + * Do we really want to be passing the signal, or do we pass
> > + * a single error code for all cases?
> > + */
> > + else if (KWIFSIGNALED(ret))
> > + sub_info->retval = KWTERMSIG(ret);
>
> No, this is bad. Caller of usermode helper is unable to distinguish exit(9)
> and e.g. SIGKILL'ed by the OOM-killer.
Right, the question is: do we care?
> Please pass raw exit status value.
>
> I feel that caller of usermode helper should not use exit status value.
> For example, call_sbin_request_key() is checking
>
> test_bit(KEY_FLAG_USER_CONSTRUCT, &key->flags) || key_validate(key) < 0
>
> condition (if usermode helper was invoked) in order to "ignore any errors from
> userspace if the key was instantiated".
For those not familiar with this code path, or if you cannot decipher
the above, the code path in question was:
static int call_sbin_request_key(struct key *authkey, void *aux)
{
...
/* do it */
ret = call_usermodehelper_keys(request_key, argv, envp, keyring,
UMH_WAIT_PROC);
kdebug("usermode -> 0x%x", ret);
if (ret >= 0) {
/* ret is the exit/wait code */
if (test_bit(KEY_FLAG_USER_CONSTRUCT, &key->flags) ||
key_validate(key) < 0)
ret = -ENOKEY;
/* ignore any errors from userspace if the key was
* instantiated */
ret = 0;
}
...
}
And the umh patch "umh: fix processed error when UMH_WAIT_PROC is used"
changed this to:
- if (ret >= 0) {
+ if (ret != 0) {
Prior to the patch negative return values from userspace were still
being captured, and likewise signals, but the error value was not
raw, not the actual value. After the patch, since we check for ret != 0
we still upkeep the sanity check for any error, correct the error value,
but as you noted signals were ignored as I made the wrong assumption
we would ignore them. The umh sub_info->retval is set after my original
patch only if KWIFSIGNALED(ret)), and ignored signals, and so that
would be now capitured with the additional KWIFSIGNALED(ret)) check.
The question still stands:
Do we want to open code all these checks or simply wrap them up in
the umh. If we do the later, as you note exit(9) and a SIGKILL will
be the same to the inspector in the kernel. But do we care?
Do we really want umh callers differntiatin between signals and exit values?
The alternative to making a compromise is using generic wrappers for
things which make sense and letting the callers use those.
Luis
> > + /* Same here */
> > + else if (KWIFSTOPPED((ret)))
> > + sub_info->retval = KWSTOPSIG(ret);
> > + /* And are we really sure we want this? */
> > + else if (KWIFCONTINUED((ret)))
> > + sub_info->retval = 0;
> > }
> >
> > /* Restore default kernel sig handler */
> >
>