The first node that gets the lock will do the actual recovery. The others will get the lock afterwards, see a clean journal, and skip the recovery. A thread should never error out if it fails to get the lock. It should keep retrying until it succeeds.
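
To make that concrete, here is a rough userspace sketch of the behaviour I am describing. It is not the actual ocfs2 code: `struct cluster`, `try_super_lock()`, `super_unlock()` and `recover_dead_node()` are hypothetical stand-ins, and the "lock" is just a flag that fails a fixed number of times to imitate a transient error.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdio.h>

/* Hypothetical stand-in for cluster-wide state; not real ocfs2 structures. */
struct cluster {
    bool lock_held;       /* is the super lock currently taken? */
    bool journal_dirty;   /* does the dead node's journal still need replay? */
    int  fail_budget;     /* how many upcoming lock attempts fail transiently */
    int  recoveries_done; /* how many times the journal was actually replayed */
};

/* Simulated lock attempt: fails while fail_budget > 0 or the lock is held. */
static bool try_super_lock(struct cluster *c)
{
    if (c->fail_budget > 0) {
        c->fail_budget--;
        return false;
    }
    if (c->lock_held)
        return false;
    c->lock_held = true;
    return true;
}

static void super_unlock(struct cluster *c)
{
    assert(c->lock_held);
    c->lock_held = false;
}

/*
 * One node's recovery pass for the dead node.
 * Returns the number of lock attempts it took.
 */
static int recover_dead_node(struct cluster *c, int node)
{
    int attempts = 0;

    /* Never error out on a failed lock: retry until it is granted.
     * (A real thread would sleep or back off between attempts.) */
    do {
        attempts++;
    } while (!try_super_lock(c));

    if (c->journal_dirty) {
        /* First node to get the lock replays the journal. */
        c->journal_dirty = false;
        c->recoveries_done++;
        printf("node %d: replayed journal\n", node);
    } else {
        /* Later nodes see a clean journal and skip the recovery. */
        printf("node %d: journal clean, nothing to do\n", node);
    }

    super_unlock(c);
    return attempts;
}
```

Calling `recover_dead_node()` once per surviving node against the same `struct cluster` shows the pattern: only the first caller replays the journal, the rest find it clean, and a caller whose lock attempts fail simply takes more attempts rather than exiting early.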

On May 17, 2013, at 11:27 PM, Joseph Qi <[email protected]> wrote:

> Hi,
> Once there is node down in the cluster, ocfs2_recovery_thread will be
> triggered on each node. These threads then do the down node recovery by
> get super lock.
> I have several questions on this:
> 1) Why each node has to run such a thread? We know at last one node can
> get the super lock and do the actual recovery.
> 2) If this thread is running but something error occurred, take
> ocfs2_super_lock failed for example, the thread will exit without
> clearing recovery map, will it cause other threads still waiting for
> recovery in ocfs2_wait_for_recovery?
