Re: processes hung after sys_renameat, and 'missing' processes
From: Dave Jones
Date: Sun Jun 03 2012 - 20:01:02 EST
On Sun, Jun 03, 2012 at 04:07:35PM -0700, Linus Torvalds wrote:
> On Sun, Jun 3, 2012 at 3:36 PM, Dave Jones <davej@xxxxxxxxxx> wrote:
> >
> > It's still up and in this state if you want me to gather any further info
> > before I reboot it.
>
> You seem to have lockdep enabled, since I see the lock information,
> but presumably you didn't get a lockdep splat before this?
Nothing.
> Because yes, it looks like you have a boattload of trinity processes
> stuck on an inode mutex. Looks like every single one of them is in
> 'lock_rename()'. It *shouldn't* be an ABBA deadlock, since lockdep
> should have noticed that, but who knows.
another data point: the core dumps that were being deleted were in the
directory that trinity runs from. It's feasible that some of the processes
were using those dumps as input data for random syscalls, before they
got deleted.
> It looks like the lock information is somewhat truncated. I suspect
> the dmesg buffer had filled up with all the task data.
>
> Can you do just "sysrq-d" for the lock information (and maybe
> separately "sysrq-w" for the blocked tasks)? The non-truncated lock
> data might tell us more.
sysrq-w: http://fpaste.org/WUd9/
sysrq-d: http://fpaste.org/ow9O/
Dave
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/