Re: [PATCH v9 06/12] kthread: Add kthread_drain_worker()
From: Petr Mladek
Date: Fri Jun 24 2016 - 05:08:37 EST
On Fri 2016-06-24 09:05:15, Peter Zijlstra wrote:
> On Thu, Jun 23, 2016 at 05:32:58PM -0400, Tejun Heo wrote:
> > Hello,
> >
> > On Wed, Jun 22, 2016 at 10:54:45PM +0200, Peter Zijlstra wrote:
> > > > + * The caller is responsible for blocking all users of this kthread
> > > > + * worker from queuing new works. Also it is responsible for blocking
> > > > + * the already queued works from an infinite re-queuing!
> > >
> > > This, I really dislike that. And it makes the kthread_destroy_worker()
> > > from the next patch unnecessarily fragile.
> > >
> > > Why not add a kthread_worker::blocked flag somewhere and refuse/WARN
> > > kthread_queue_work() when that is set.
> >
> > It's the same logic from workqueue counterpart.
>
> So ? Clearly it (the kthread workqueue) can be improved here.
>
> > For workqueue, nothing can make it less fragile as the workqueue
> > struct itself is freed on destruction. If its users fail to stop
> > issuing work items, it'll lead to use-after-free.
>
> Right, but this kthread thingy does not, so why not add a failsafe?
The struct kthread_worker is freed in kthread_destroy_worker().
So kthread_worker is the same situation as workqueues.
The allocation/freeing has been added in v2. It helped
to make it clear when the structure was initialized. Note that we
still need the crate/destroy functions to start/stop the kthread.
See the discussion at
https://lkml.kernel.org/g/20150728172657.GC5322@xxxxxxxxxxxxxxx
I personally do not have strong opinion about it.
On one hand, it makes the code more complex because we need strong
synchronization between queueing/canceling/destroying. There are cases
where it is not that important, for example the hugepage daemon or
hung task. It does not matter if the next round will be done or not.
Well, it is strange if someting gets queued and it is not proceed.
On the other hand, there are situations where the work must be
done, e.g. some I/O operation. They need the strong syncronization.
We could print a warning when queueing a work for a destroyed
(stoped) kthread_worker to catch potential problems. But then we will need
the strong synchronization in all cases to avoid "false" alarms.
After all, the blocked flag will not necessarily make the usage
less hairy. Or did I miss something?
Best Regards,
Petr