Re: [PATCH 2/5] cifs: eliminate usage of kthread_stop for cifsd

From: Jeff Layton
Date: Wed Oct 15 2008 - 06:33:48 EST


On Tue, 14 Oct 2008 20:29:31 -0500
"Steve French" <smfrench@xxxxxxxxx> wrote:

> On Tue, Oct 14, 2008 at 7:41 PM, Jeff Layton <jlayton@xxxxxxxxxx> wrote:
> > When cifs_demultiplex_thread was converted to a kthread based kernel
> > thread, great pains were taken to make it so that kthread_stop would be
> > used to bring it down. This just added unnecessary complexity since we
> > needed to use a signal anyway to break out of kernel_recvmsg.
> >
> > Also, cifs_demultiplex_thread does a bit of cleanup as it's exiting, and
> > we need to be certain that this gets done. It's possible for a kthread
> > to exit before its main function is ever run if kthread_stop is called
> > soon after its creation. While I'm not sure that this is a real problem
> > with cifsd now, it could be at some point in the future if cifs_mount is
> > ever changed to bring down the thread quickly.
> >
> > The upshot here is that using kthread_stop to bring down the thread just
> > adds extra complexity with no real benefit. This patch changes the code
> > to use the original method to bring down the thread, but still leaves it
> > so that the thread is actually started with kthread_run.
> >
> > This seems to fix the deadlock caused by the reproducer in this bug
> > report:
> >
> > https://bugzilla.samba.org/show_bug.cgi?id=5720
>
> I agree with what Jeff is suggesting in this patch, and have no
> problem merging the patch, but want to make sure that those who wanted
> cifs to switch all of cifs threads to kthread usage are also ok with
> this.
>
> I don't like the patch 1 (disabling a section of code code) and 5
> (basically reenabling similar code fixing some problems) though - has
> to be a better way to do this by rewriting the function once.
>

The main argument for the way that I've proposed is that once you take
kthread_stop out, it becomes much easier to hit the other races.

The other races are pretty dangerous -- you're likely to oops, but may
also corrupt memory in use-after-free situations. My suggestion would
be to take the patchset as is. While it'll temporarily break multiuser
mounts if someone bisects in the middle of the series, that person
shouldn't be vulnerable to the other problems.

If you feel strongly about it though, we could eliminate patch 1, roll
similar changes into patch 5, and move it to the beginning of the
series.

--
Jeff Layton <jlayton@xxxxxxxxxx>
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/