I am assuming you are mounting the nfs mounts with the nordirplus
mount option. If not, that is known to deadlock a nfsd thread leading
to what you are seeing.

There are two possible reasons for this error. One is a dlm issue.
Other is a local deadlock like above.

To see if the dlm is the cause for the hang, run scanlocks2.
http://oss.oracle.com/~smushran/.dlm/scripts/scanlocks2

This will dump the busy lock resources. Run it a few times. If
a lock resource comes up regularly, then it indicates a dlm problem.

Then dump the fs and dlm lock state on that node.
debugfs.ocfs2 -R "fs_locks LOCKNAME" /dev/sdX
debugfs.ocfs2 -R "dlm_locks LOCKNAME" /dev/sdX

The dlm lock will tell you the master node. Repeat the two dumps
on the master node. The dlm lock on the master node will point
to the current holder. Repeat the same on that node. Email all that
to me asap.

michael.a.jaqu...@verizon.com wrote:
> All,
>
> I've seen a few posts about this issue in the past, but not a resolution.  I 
> have a 3 node cluster sharing ocfs2 volumes to app nodes via nfs.  On 
> occasion, one of our db nodes will have nfs go into an uninterruptable sleep 
> state.  The nfs daemon is completely useless at this point.  The db node has 
> to be rebooted to resolve.  It seems that nfs is waiting on 
> ocfs2_wait_for_mask.  Any suggestions on a resolution would be appreciated.
>
> root     18387  0.0  0.0      0     0 ?        S<   Mar15   0:00 [nfsd4]
> root     18389  0.0  0.0      0     0 ?        D    Mar15   0:10 [nfsd]
> root     18390  0.0  0.0      0     0 ?        D    Mar15   0:10 [nfsd]
> root     18391  0.0  0.0      0     0 ?        D    Mar15   0:10 [nfsd]
> root     18392  0.0  0.0      0     0 ?        D    Mar15   0:13 [nfsd]
> root     18393  0.0  0.0      0     0 ?        D    Mar15   0:08 [nfsd]
> root     18394  0.0  0.0      0     0 ?        D    Mar15   0:09 [nfsd]
> root     18395  0.0  0.0      0     0 ?        D    Mar15   0:12 [nfsd]
> root     18396  0.0  0.0      0     0 ?        D    Mar15   0:13 [nfsd] 
>
> 18387 nfsd4           worker_thread
> 18389 nfsd            ocfs2_wait_for_mask
> 18390 nfsd            ocfs2_wait_for_mask
> 18391 nfsd            ocfs2_wait_for_mask
> 18392 nfsd            ocfs2_wait_for_mask
> 18393 nfsd            ocfs2_wait_for_mask
> 18394 nfsd            ocfs2_wait_for_mask
> 18395 nfsd            ocfs2_wait_for_mask
> 18396 nfsd            ocfs2_wait_for_mask
>  
>
> -Mike Jaquays
> _______________________________________________
> Ocfs2-users mailing list
> Ocfs2-users@oss.oracle.com
> http://oss.oracle.com/mailman/listinfo/ocfs2-users
>   


_______________________________________________
Ocfs2-users mailing list
Ocfs2-users@oss.oracle.com
http://oss.oracle.com/mailman/listinfo/ocfs2-users

Reply via email to