Re: Apparent Deadlock with nfsd/jfs on 2.6.21.1 under bonnie.

From: Dave Kleikamp
Date: Tue May 29 2007 - 14:30:32 EST


On Tue, 2007-05-29 at 13:14 -0500, Roger Heflin wrote:
> Dave Kleikamp wrote:
> > On Tue, 2007-05-29 at 12:16 -0500, Roger Heflin wrote:
> >
> >> Dave,
> >>
> >> Apparently there appears to be another different similar lockup,
> >> The MTBF has risen from 1-2 hours without that patch to >100 hours,
> >> so I am fairly sure the patch did correct the original lockup, or
> >> at the very least make it a lot less likely.
> >>
> >> I hit the machine across NFS for 5 days before it deadlocked, before
> >> the patch I could only get an hour or two (2-4 different tries).
> >>
> >> Given that pdflush is "D" it does not appear to be an NFS issue.
> >>
> >> Included is the sysrq-t.
> >>
> >> This is with 2.6.21.1 + the JFSIO patch.
> >
> > Is the system still in this state? Can you cat /proc/fs/jfs/TxAnchor
> > (if CONFIG_JFS_DEBUG is defined) and /proc/fs/jfs/txstats (if
> > CONFIG_JFS_STATISTICS is defined)?
> >
> > Thanks,
> > Shaggy
>
> Yes, the machine is still in that state.
>
> Apparently I don't have either of those configured.
>
> Anything else that we can collect before I rebuild the kernel with
> those options setup?

No. I think I may have found something to explain the hang. I need to
look a bit closer. Go ahead and rebuild the kernel with those options
in case I'm mistaken.

Thanks,
Shaggy

>
> Roger
>
--
David Kleikamp
IBM Linux Technology Center

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/