Re: [Ocfs2-devel] [PATCH] Fix waiting status race condition in dlm recovery

2012-05-25 Thread srinivas eeda
comments inline On 5/24/2012 10:53 PM, xiaowei...@oracle.com wrote: From: Xiaowei.Huxiaowei...@oracle.com when the master requested locks ,but one/some of the live nodes died, after it received the request msg and before send out the locks packages, the recovery will fall into endless

Re: [Ocfs2-devel] [PATCH] Fix waiting status race condition in dlm recovery

2012-05-25 Thread Xiaowei
Thanks Srini , This sounds good, I tried to use dlm_is_node_dead in this patch , but this function can't report another node is dead if this node already in recovery process. It was blocked to set the bit in domain_map, but the live_nodes_map could always reflect the really live nodes. I will

[Ocfs2-devel] [PATCH] Fix waiting status race condition in dlm recovery V2

2012-05-25 Thread xiaowei . hu
From: Xiaowei.Hu xiaowei...@oracle.com when the master requested locks ,but one/some of the live nodes died, after it received the request msg and before send out the locks packages, the recovery will fall into endless loop,waiting for the status changed to finalize NodeA