2.4.20 NFS server lock-up (SMP)

From: Jan Kasprzak (kas@informatics.muni.cz)
Date: Wed Jan 29 2003 - 07:34:34 EST


        Hello, world!\n

        I have a problem on Linux 2.4.20 with NFS server - my NFS
server from time to time (currently about once a day) stops responding
to NFS requests. Apart from that the system is OK, I can log in via SSH,
and I can run "/sbin/reboot -n -f" to reboot it. Also filesystem operations
seem to be OK, even Samba server is responding. When this happens,
I see all nfsd processes and lockd process to be stuck in the "D" state.

        The server is dual athlon with large (700GB) LVM volume
on six IDE drives, RedHat 8.0, 2.4.20 (also tried 2.4.21-pre1 and pre3),
ext3fs. It serves about 2000 subdirs in this volume (= export list
has ~2000 lines) to about 100 NFS clients (various Linuxes, IRIX 6.5,
Solaris 2.7 and 2.8), and runs Samba for ~50 Windows clients.
It has 1GB of RAM (CONFIG_HIGHMEM=y, but no CONFIG_HIGHIO).

        Does anybody have similar problem on a big NFS server?
The server with those kernel was stable with ~20 NFS clients
and 150 exported subdirs on the same volume, so I think this
problem is some race condition triggered only by bigger load.
However, the server still has load average <1, so it is not
overloaded.

        Another question connected to this: How can I do
an equivalent of AltGr+ScrollLock remotely? I want to get
a call trace of the nfsd processes, but it is difficult for me
to go to the machine physically.

        Thanks,

-Yenya

-- 
| Jan "Yenya" Kasprzak  <kas at {fi.muni.cz - work | yenya.net - private}> |
| GPG: ID 1024/D3498839      Fingerprint 0D99A7FB206605D7 8B35FCDE05B18A5E |
| http://www.fi.muni.cz/~kas/   Czech Linux Homepage: http://www.linux.cz/ |
|-- If you start doing things because you hate others and want to screw  --|
|-- them over the end result is bad.   --Linus Torvalds to the BBC News  --|
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Fri Jan 31 2003 - 22:00:22 EST