The network connect is failing. Could be because of a firewall, or bad ip address, some switch issue.
Mount the volume on node 2. Then enable tracing and tail messages file. # debugfs.ocfs2 -l TCP allow # tail -f /var/log/messages Then from node 4, ping node 2 using netcat. # nc -z 192.168.1.2 7777 If it succeeds, then you should see: Connection to 192.168.1.2 7777 port [tcp/cbt] succeeded! Additionally, you will see a message on node 2 "attempt to connect from node...". If not, then look at your network setup. Remember to disable tracing on node 2. #debugfs.ocfs2 -l TCP off Sunil Chris Clonch wrote: > We are testing clustering and I am having issues getting all of my > nodes to mount. I have 4 nodes. I am using iSCSI to share 1 target > with 2 luns. All 4 nodes can are accessing the target; I can run > fdisk -l against the block devices. Initially I had all 4 nodes > mounting the share but brought the cluster down to add an additional > NIC. Presently nodes 2 and 3 can mount the shares, 1 and 4 can not. > Previously I had node 1 mounted and nodes 2, 3 and 4 could not. > > Any help is appreciated! > > Nodes 2 & 3: > > # service o2cb status > Driver for "configfs": Loaded > Filesystem "configfs": Mounted > Driver for "ocfs2_dlmfs": Loaded > Filesystem "ocfs2_dlmfs": Mounted > Checking O2CB cluster ocfs2: Online > Heartbeat dead threshold = 31 > Network idle timeout: 30000 > Network keepalive delay: 2000 > Network reconnect delay: 2000 > Checking O2CB heartbeat: Active > > > Nodes 1 & 4: > > # service o2cb status > Driver for "configfs": Loaded > Filesystem "configfs": Mounted > Driver for "ocfs2_dlmfs": Loaded > Filesystem "ocfs2_dlmfs": Mounted > Checking O2CB cluster ocfs2: Online > Heartbeat dead threshold = 31 > Network idle timeout: 30000 > Network keepalive delay: 2000 > Network reconnect delay: 2000 > Checking O2CB heartbeat: Not active > > > All nodes: > > # mounted.ocfs2 -d > Device FS UUID Label > /dev/sda1 ocfs2 fea0a398-a696-414f-bd9f-d7aa84bd6b77 ocu01 > /dev/sdb1 ocfs2 26e82fa7-ec91-4a81-a965-571ed4223ab0 > oracluster > > # mounted.ocfs2 -f > Device FS Nodes > /dev/sda1 ocfs2 ocnode2, ocnode3 > /dev/sdb1 ocfs2 ocnode2, ocnode3 > > > dmesg snippet from node 4: > > o2net: connected to node ocnode2 (num 2) at 192.168.1.2:7777 > <http://192.168.1.2:7777> > (4145,0):o2net_connect_expired:1664 ERROR: no connection established > with node 3 after 30.0 seconds, giving up and returning errors. > (4176,0):dlm_request_join:1036 ERROR: status = -107 > (4176,0):dlm_try_to_join_domain:1210 ERROR: status = -107 > (4176,0):dlm_join_domain:1488 ERROR: status = -107 > (4176,0):dlm_register_domain:1754 ERROR: status = -107 > (4176,0):ocfs2_dlm_init:2723 ERROR: status = -107 > (4176,0):ocfs2_mount_volume:1437 ERROR: status = -107 > ocfs2: Unmounting device (8,17) on (node 4) > o2net: no longer connected to node ocnode2 (num 2) at 192.168.1.2:7777 > <http://192.168.1.2:7777> _______________________________________________ Ocfs2-users mailing list Ocfs2-users@oss.oracle.com http://oss.oracle.com/mailman/listinfo/ocfs2-users