Re: sched: hang in migrate_swap

From: Kirill Tkhai
Date: Wed May 14 2014 - 06:21:23 EST




14.05.2014, 14:14, "Peter Zijlstra" <peterz@xxxxxxxxxxxxx>:
> On Wed, May 14, 2014 at 01:42:32PM +0400, Kirill Tkhai wrote:
>
>>  Peter, do we have to queue stop works orderly?
>>
>>  Is there is not a possibility, when two pair of works queued different on
>>  different cpus?
>>
>>   kernel/stop_machine.c | 10 ++++++++--
>>   1 file changed, 8 insertions(+), 2 deletions(-)
>>  diff --git a/kernel/stop_machine.c b/kernel/stop_machine.c
>>  index b6b67ec..29e221b 100644
>>  --- a/kernel/stop_machine.c
>>  +++ b/kernel/stop_machine.c
>>  @@ -250,8 +250,14 @@ struct irq_cpu_stop_queue_work_info {
>>   static void irq_cpu_stop_queue_work(void *arg)
>>   {
>>           struct irq_cpu_stop_queue_work_info *info = arg;
>>  - cpu_stop_queue_work(info->cpu1, info->work1);
>>  - cpu_stop_queue_work(info->cpu2, info->work2);
>>  +
>>  + if (info->cpu1 < info->cpu2) {
>>  + cpu_stop_queue_work(info->cpu1, info->work1);
>>  + cpu_stop_queue_work(info->cpu2, info->work2);
>>  + } else {
>>  + cpu_stop_queue_work(info->cpu2, info->work2);
>>  + cpu_stop_queue_work(info->cpu1, info->work1);
>>  + }
>>   }
>
> I'm not sure, we already send the IPI to the first cpu of the pair, so
> supposing we have 4 cpus, and get 4 pairs like:
>
> 0,1 1,2 2,3 3,0
>
> That would result in IPIs to 0, 1, 2, and 0 again, and since the IPI
> function is serialized I don't immediately see a way for this to
> deadlock.

It's about stop_two_cpus(), I have a distrust about other users of stop task:

queue_stop_cpus_work() queues work consequentially:

0 1 2 4

stop_two_cpus() may queue:

1 0

Looks like, stop thread on 0th and on 1th are waiting for wrong works.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/