Hi Melbogia,

On 09/03/10 18:08 -0800, melbogia wrote:
> Hello,
> We have a zfs filesystem exported as read-only to a subnet. Some of the 
> clients are able to mount the nfs share and read the data, others can mount 
> but when i do an 'ls' command it just sits there. I checked the server and 
> 'dmesg' shows lots of nfs errors like so
> 
> Mar  9 06:06:24 fortress nfssrv: [ID 939466 kern.warning] WARNING: nfsauth: 
> mountd has not established door
> Mar  9 06:12:49 fortress last message repeated 6 times
> Mar  9 06:13:54 fortress nfssrv: [ID 939466 kern.warning] WARNING: nfsauth: 
> mountd has not established door
> Mar  9 06:19:15 fortress last message repeated 5 times
> Mar  9 06:20:19 fortress nfssrv: [ID 939466 kern.warning] WARNING: nfsauth: 
> mountd has not established door
> Mar  9 06:25:40 fortress last message repeated 5 times
> Mar  9 06:26:45 fortress nfssrv: [ID 939466 kern.warning] WARNING: nfsauth: 
> mountd has not established door
> 
> I noticed that there are lots of mountd coredumps in / which I assume are 
> related to my nfs issue. I ran pstack on one of those files and got this.
> 
> root at fortress:/# pstack core.mountd.1268185359
> core 'core.mountd.1268185359' of 1877:  /usr/lib/nfs/mountd
> -----------------  lwp# 1 / thread# 1  --------------------
>  feef1687 __pollsys (80e8c28, 8, 0, 0, 0, feb4c000) + 7
>  fee97824 poll     (80e8c28, 8, ffffffff, feafbbaf) + 4c
>  feafbd21 _svc_run_mt (feb4da88, feb4da98, feb4daa8, feb4daa8, 8054ba3, 
> feec8868) + 1bd
>  feafb7c3 svc_run  (1, 2328, 1, 0, 8047e18, feffb7b4) + 77
>  08054ba3 main     (1, 8047e5c, 8047e64, feffb7b4) + 4af
>  0805457d _start   (1, 8047ef8, 0, 8047f0c, 8047f22, 8047f3a) + 7d
> -----------------  lwp# 2 / thread# 2  --------------------
>  feef1667 __pause  (8, 200, 8, 5, fe000, fef7f000) + 7
>  08054677 nfsauth_svc (0, fef7f000, fed5efec, feeecd1e) + 3b
>  feeecd56 _thrp_setup (fee00a00) + 7e
>  feeecfe0 _lwp_start (fee00a00, 0, 0, feeecd1e, 0, 0)
> -----------------  lwp# 3 / thread# 3  --------------------
>  feef1667 __pause  (8, 200, 8, 6, fe000, fef7f000) + 7
>  080546d3 cmd_svc  (0, fef7f000, fec5ffec, feeecd1e) + 3b
>  feeecd56 _thrp_setup (fee01200) + 7e
>  feeecfe0 _lwp_start (fee01200, 0, 0, feeecd1e, 0, 0)
> -----------------  lwp# 4 / thread# 4  --------------------
>  080562f1 in_access_list (fe99eadc, 0, 80cdf56, 0) + 69
>  08056cc7 check_client_new (80e79b8, fe99eadc, 0, 1) + d3
>  080583e6 nfsauth_access (fe99ed34, fe99ed20, 44, 1) + 13a
>  08058518 nfsauth_func (0, fe99edbc, 44, 0, 0, 805847c) + 9c
>  feef1ff2 __door_return () + 52
> -----------------  lwp# 5 / thread# 5  --------------------
>  feef1fc1 __door_return (0, 0, 0, 0) + 21
>  feed8d87 door_create_func (0, fef7f000, fe89ffec, feeecd1e) + 2f
>  feeecd56 _thrp_setup (fee02200) + 7e
>  feeecfe0 _lwp_start (fee02200, 0, 0, feeecd1e, 0, 0)
> 
> 
> I am not sure where to go from here and how to get my problem resolved. Any 
> help will be greatly appreciated.

I suspect that mountd dumps core when executing code in in_access_list()
(as one of the stacks shows). Which nevada build is it? Which options
did you use for sharing the zfs filesystem? (grep <exported-file-system>
/etc/dfs/sharetab)

Also output of '::stack' and '::status' mdb commands on core dump
could help.

cheers
jan

Reply via email to