PROBLEM: reproduceable 2.4.21 NFS client Freeze

From: Sven Geggus (sven@gegg.us)
Date: Wed Jun 25 2003 - 03:05:49 EST


Hello,

looks like my previous Bug-Report has been somewhat unspecific. So here is a
version which has the form suggested in the REPORTING-BUGS documentation.

[1.] reproduceable 2.4.21 NFS client Freeze

[2.] When an NFS Volume mounted from a Windows 2000 Machine (Hummingbird NFS
     Maestro Server, Version unknown) is accessed using something like "ls
     /mountpoint" the machine freezes immediately. The state is not
     completely frozen, but a state where anything but ICMP-echo responces
     seem to be frozen.

[3.] Kernelmode NFS client

[4.] Linux version 2.4.21 (root@neptun)
     (gcc version 2.95.4 20011002 (Debian prerelease))
     #1 Mon Jun 16 13:43:25 CEST 2003
     This is a vanilla Kernel!

[5.] There is no oops

[6.] a simple "ls /mountpoint" triggers the Problem. Here is what the Kernel
     reports with rpc_debug and nfs_debug enabled in between the time the
     "ls /mountpoint" is started and the freeze:

RPC: 24 call_start nfs3 proc 1 (sync)
RPC: 24 call_reserve
RPC: 24 reserved req df16d078 xid 81c60017
RPC: 24 xprt_reserve returns 0
RPC: 24 call_reserveresult (status 0)
RPC: 24 call_allocate (status 0)
RPC: 24 call_encode (status 0)
RPC: 24 call_transmit (status 0)
RPC: 24 xprt_transmit(81c60017)
RPC: 24 xprt_cwnd_limited cong = 0 cwnd = 512
RPC: xprt_sendmsg(0) = 108
RPC: udp_data_ready...
RPC: udp_data_ready client df16d000
RPC: 24 received reply
RPC: cong 256, cwnd was 512, now 512
RPC: 24 has input (112 bytes)
RPC: 24 xmit complete
RPC: 24 call_status (status 112)
RPC: 24 call_decode (status 112)
RPC: 24 call_decode result 0
RPC: 24 release request df16d078
RPC: rpc_release_client(dfd6b480, 2)
NFS: refresh_inode(7/395120 ct=1 info=0x6)
RPC: 25 call_start nfs3 proc 4 (sync)
RPC: 25 call_reserve
RPC: 25 reserved req df16d078 xid 81c60018
RPC: 25 xprt_reserve returns 0
RPC: 25 call_reserveresult (status 0)
RPC: 25 call_allocate (status 0)
RPC: 25 call_encode (status 0)
RPC: 25 call_transmit (status 0)
RPC: 25 xprt_transmit(81c60018)
RPC: 25 xprt_cwnd_limited cong = 0 cwnd = 512
RPC: xprt_sendmsg(0) = 112
RPC: 25 xmit complete
RPC: udp_data_ready...
RPC: udp_data_ready client df16d000
RPC: 25 received reply
RPC: cong 256, cwnd was 512, now 512
RPC: 25 has input (36 bytes)
RPC: 25 call_status (status 36)
RPC: 25 call_decode (status 36)
RPC: 25 call_decode result 0
RPC: 25 release request df16d078
RPC: rpc_release_client(dfd6b480, 2)
NFS: readdir_search_pagecache() searching for cookie 0
NFS: find_dirent_page() searching directory page 0
NFS: nfs_readdir_filler() reading cookie 0 into page 0.
RPC: 26 call_start nfs3 proc 16 (sync)
RPC: 26 call_reserve
RPC: 26 reserved req df16d078 xid 81c60019
RPC: 26 xprt_reserve returns 0
RPC: 26 call_reserveresult (status 0)
RPC: 26 call_allocate (status 0)
RPC: 26 call_encode (status 0)
RPC: 26 call_transmit (status 0)
RPC: 26 xprt_transmit(81c60019)
RPC: 26 xprt_cwnd_limited cong = 0 cwnd = 512
RPC: xprt_sendmsg(0) = 128
RPC: 26 xmit complete
RPC: udp_data_ready...
RPC: udp_data_ready client df16d000
RPC: 26 received reply
RPC: cong 256, cwnd was 512, now 512
RPC: 26 has input (760 bytes)
RPC: 26 call_status (status 760)
RPC: 26 call_decode (status 760)

[7.] Environment

This is just one machine I randomly picked, but the Problem exists on any
machine we are running 2.4.21 on! The Problem does not exist with 2.4.20.

[7.1.] Output of ver_linux:

Linux neptun 2.4.21 #1 Mon Jun 16 13:43:25 CEST 2003 i686 unknown
 
Gnu C 2.95.4
Gnu make 3.79.1
util-linux 2.11n
mount 2.11n
modutils 2.4.15
e2fsprogs 1.27
Linux C Library 2.2.5
Dynamic linker (ldd) 2.2.5
Procps 2.0.7
Net-tools 1.60
Console-tools 0.2.3
Sh-utils 2.0.11

[7.2.] /proc/cpuinfo

processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 8
model name : Pentium III (Coppermine)
stepping : 10
cpu MHz : 928.181
cache size : 256 KB
fdiv_bug : no
hlt_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 2
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 mmx fxsr sse
bogomips : 1848.11

[7.3.] This machine is using a monolithic Kernel

[8.] The bug is not specific to one particular machine, but see attached
.config of the machine I picked.

Regards

Sven

-- 
"Those who do not understand Unix are condemned to reinvent it, poorly"
(Henry Spencer)

/me is giggls@ircnet, http://sven.gegg.us/ on the Web


- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Mon Jun 30 2003 - 22:00:18 EST