Hello All,
I am running a FreeBSD-4.2 NFS server with dozens of FreeBSD-4.2 NFS
clients on 100BaseTX LAN. Recently I found that when the NFS server
receives a lot of requests in a short period (e.g., 2 clients start X
with gnome desktop simultaneously), all nfsd server processes hang in
inode state.
UID PID PPID CPU PRI NI VSZ RSS WCHAN STAT TT TIME COMMAND
0 440 1 0 2 0 360 132 accept Is ?? 0:00.00 nfsd: master (nfsd)
0 441 440 0 -14 0 352 124 inode D ?? 0:03.49 nfsd: server (nfsd)
0 442 440 0 -14 0 352 124 inode D ?? 0:00.17 nfsd: server (nfsd)
0 443 440 0 -14 0 352 124 inode D ?? 0:00.02 nfsd: server (nfsd)
0 444 440 0 -14 0 352 124 inode D ?? 0:00.01 nfsd: server (nfsd)
I cannot kill or restart them. The consoles of the clients print ``NFS
server not responding'' and I should restart the server. This occurs
about once a week.
I tried
(1) increasing the number of nfsd processes (4 -> 8, 20)
(2) replacing the server HDD (SCSI) with another ATA33 HDD
(3) changing mount_nfs options (tried removing tcp, adding soft,dumbtimer)
but all failed to solve the problem.
Perhaps this problem is the one reported previously in
http://www.FreeBSD.org/cgi/getmsg.cgi?fetch=1913802+1916627+/usr/local/www/db/text/2000/freebsd-questions/20001008.freebsd-questions
The reporter avoided this problem by switching to NetBSD, but I prefer FreeBSD.
Do you have any idea to solve or circumvent this problem?
Any suggestions would be appreciated.
Many thanks in advance,
S. Natori
+-------------------------------------------------------------------------
|More detailed information about the NFS server/client.
+-------------------------------------------------------------------------
Server spec:
CPU: Pentium III 800MHz
MEM: 256MB
SCSI: Adaptec 29160 Ultra160 SCSI adapter
HDD: da0: <IBM DDYS-T18350N S80D> Fixed Direct Access SCSI-3 device
da0: 160.000MB/s transfers (80.000MHz, offset 63, 16bit), Tagged Queueing
Enabled
da0: 17501MB (35843670 512 byte sectors: 255H 63S/T 2231C)
NIC: Intel Pro 100+
Client fstab:
server:/home /home nfs rw,tcp,intr 0 0
server:/usr/local /usr/local nfs rw,tcp,intr 0 0
server:/usr/X11R6 /usr/X11R6 nfs rw,tcp,intr 0 0
server:/usr/compat /usr/compat nfs rw,tcp,intr 0 0
server:/usr/share /usr/share nfs rw,tcp,intr 0 0
+-------------------------------------------------------------------------
|Much more detailed information about the hanged nfsds
+-------------------------------------------------------------------------
USER PID %CPU %MEM ACFLG CPU F INBLK JOBC KTRACE KTRACEP LIM LOGIN
STARTED MAJFLT MINFLT MSGRCV MSGSND NI NIVCSW NSIGS NSWAP
NVCSW WCHAN OUBLK P_RU PADDR PAGEIN PGID PPID PRI RE RGID RLINK RSS RSZ
RTPRIO RUID RUSER SESS PENDING CAUGHT IGNORED BLOCKED SL STARTED STAT SVGID
SVUID TDEV TIME TPGID TSESS TSIZ TT TTY UCOMM UID UPR VSZ
WCHAN XSTAT COMMAND
root 440 0.0 0.1 3 0 84 0 0 0 0 - root
Sat 4/ 7 15:16:03 2001 0 20 4 4 0 58 0 0
58 cdae0f6 0 0 e855440 0 440 1 2 5662 0 3ab3e0 132 132
normal 0 root 1808880 0 80800 1840c007 0 525 3:16PM Is 0
0 ?? 0:00.00 0 0 156 ?? ?? nfsd 0 50 360
accept 0 nfsd: master (nfsd)
root 441 0.0 0.1 3 0 4 4322 0 0 0 - root
Sat 4/ 7 15:16:03 2001 0 23 0 100282 0 0 0 0
104888 17fb700 1569 0 e855ac0 0 440 440 -14 5662 0 3ab298 124
124 normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D
0 0 ?? 0:03.49 0 0 156 ?? ?? nfsd 0 50
352 inode 0 nfsd: server (nfsd)
root 442 0.0 0.1 3 0 4 494 0 0 0 - root
Sat 4/ 7 15:16:03 2001 0 7 0 4108 0 0 0 0
4634 17fb700 18 0 e855920 0 440 440 -14 5662 0 e855ac0 124 124
normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D
0 0 ?? 0:00.17 0 0 156 ?? ?? nfsd 0 50
352 inode 0 nfsd: server (nfsd)
root 443 0.0 0.1 3 0 4 58 0 0 0 - root
Sat 4/ 7 15:16:03 2001 0 12 0 581 0 0 0 0
696 17fb700 22 0 e855780 0 440 440 -14 5662 0 e855920 124 124
normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D 0
0 ?? 0:00.02 0 0 156 ?? ?? nfsd 0 50 352
inode 0 nfsd: server (nfsd)
root 444 0.0 0.1 3 0 4 17 0 0 0 - root
Sat 4/ 7 15:16:03 2001 0 7 0 198 0 0 0 0
271 188fe00 0 0 d4470c0 0 440 440 -14 5662 0 3ab4d0 124 124
normal 0 root 1808880 0 80800 1840c007 0 738 3:16PM D 0
0 ?? 0:00.01 0 0 156 ?? ?? nfsd 0 50 352
inode 0 nfsd: server (nfsd)
To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-hackers" in the body of the message