Re: frequent softlockups with 3.10rc6.

From: Pavel Machek
Date: Mon Jul 01 2013 - 08:49:41 EST


On Sat 2013-06-29 19:44:49, Dave Jones wrote:
> On Sat, Jun 29, 2013 at 03:23:48PM -0700, Linus Torvalds wrote:
>
> > > So with that patch, those two boxes have now been fuzzing away for
> > > over 24hrs without seeing that specific sync related bug.
> >
> > Ok, so at least that confirms that yes, the problem is the excessive
> > contention on inode_sb_list_lock.
> >
> > Ugh. There's no way we can do that patch by DaveC for 3.10. Not only
> > is it scary, Andi pointed out that it's actively buggy and will miss
> > inodes that need writeback due to moving things to private lists.
> >
> > So I suspect we'll have to do 3.10 with this starvation issue in
> > place, and mark for stable backporting whatever eventual fix we find.
>
> Given I'm the only person who seems to have been bitten by this,
> I suspect it's not going to be a big deal. Worst case we can tell
> people "yeah, just disable the soft watchdog until this is fixed".

Actually... I don't think you are alone. I was doing big dd's in
attempt to debug the bad sectors (on 3.10-rc), and got soft-lockups
too... by stuff as simple as "read the disk in the background and try
to work" and "write zeros to disk in the background and try to work".

But as machine survived, I figured out I was simply loading machine
too much.
Pavel
--
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/