Re: Random panic in load_balance() with 3.16-rc

From: Michel Dänzer
Date: Tue Jul 22 2014 - 23:53:35 EST


On 22.07.2014 15:13, Michel Dänzer wrote:
> On 18.07.2014 18:29, Michel Dänzer wrote:
>> On 17.07.2014 16:58, Peter Zijlstra wrote:
>>> On Thu, Jul 17, 2014 at 04:31:04PM +0900, Michel Dänzer wrote:
>>>>
>>>> I've been running into the panic captured in the attached picture (hope
>>>> it's legible) randomly while running 3.16-rc4 and -rc5. I haven't
>>>> noticed any pattern as to when it happens; at least once it happened
>>>> while the box was basically sitting idle.
>>>>
>>>> dmesg, .config and /proc/cpuinfo attached as well; let me know if you
>>>> need anything else.
>>>
>>> Does lkml.kernel.org/r/20140716145546.GA6922@xxxxxxxx cure things?
>>
>> Yes, adding back
>>
>> cpumask_clear(sched_group_cpus(sg));
>>
>> seems to do the trick, thanks.
>
> I'm afraid it happened again with 3.16-rc5 plus the above change. It
> seemed to last longer than before, but maybe that was just luck.
>
> Going to try 3.16-rc6 now.

Just happened again with the same change on top of 3.16-rc6.

Are there any other potential fixes yet?

I hope this problem is on the radar as a showstopper for 3.16.


--
Earthling Michel Dänzer | http://www.amd.com
Libre software enthusiast | Mesa and X developer

Attachment: signature.asc
Description: OpenPGP digital signature