Hello all, maybe someone can help me adding a node in my test cluster.
I have a working three-node test Cluster in my VirtualBox, using OEL7.2 and now I try to add another node but I have no plan and I did some kind of try & error which does not work. This is my three-node cluster: # pcs status Cluster name: zweiter Last updated: Thu Aug 11 12:53:55 2016 Last change: Thu Aug 11 12:53:47 2016 by root via cibadmin on knoten01-hb Stack: corosync Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum 3 nodes and 12 resources configured Online: [ knoten01-hb knoten02-hb knoten03-hb ] Full list of resources: Clone Set: ResO2CB-clone [ResO2CB] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ResOCFS2_00-clone [ResOCFS2_00] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ClusterIP-clone [ClusterIP] (unique) ClusterIP:0 (ocf::heartbeat:IPaddr2): Started knoten01-hb ClusterIP:1 (ocf::heartbeat:IPaddr2): Started knoten02-hb ClusterIP:2 (ocf::heartbeat:IPaddr2): Started knoten03-hb Clone Set: WebSite-clone [WebSite] Started: [ knoten01-hb knoten02-hb knoten03-hb ] PCSD Status: knoten01-hb: Online knoten02-hb: Online knoten03-hb: Online Daemon Status: corosync: active/disabled pacemaker: active/disabled pcsd: active/enabled I setup another host with name knoten04-hb, pcsd is started by systemd. My first idea was modifying /etc/corosync/corosync.conf on all four hosts, adding the new host. When done, without any stop/start action, pcs shows me this: # pcs status Cluster name: zweiter Last updated: Thu Aug 11 13:02:22 2016 Last change: Thu Aug 11 12:53:47 2016 by root via cibadmin on knoten01-hb Stack: corosync Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum 3 nodes and 12 resources configured Online: [ knoten01-hb knoten02-hb knoten03-hb ] Full list of resources: Clone Set: ResO2CB-clone [ResO2CB] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ResOCFS2_00-clone [ResOCFS2_00] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ClusterIP-clone [ClusterIP] (unique) ClusterIP:0 (ocf::heartbeat:IPaddr2): Started knoten01-hb ClusterIP:1 (ocf::heartbeat:IPaddr2): Started knoten02-hb ClusterIP:2 (ocf::heartbeat:IPaddr2): Started knoten03-hb Clone Set: WebSite-clone [WebSite] Started: [ knoten01-hb knoten02-hb knoten03-hb ] PCSD Status: knoten01-hb: Online knoten02-hb: Online knoten03-hb: Online knoten04-hb: Online Daemon Status: corosync: active/disabled pacemaker: active/disabled pcsd: active/enabled PCSD Status shows the new node, everything else is unchanged. I guess the corosync part is OK. Next step is getting pacemaker configured. I tried: # cibadmin --query > add_knoten04-hb_cfg # vi add_knoten04-hb_cfg # cibadmin --replace --xml-file add_knoten04-hb_cfg The only entry I changed was this: <nodes> <node id="1" uname="knoten01-hb"/> <node id="2" uname="knoten02-hb"/> <node id="3" uname="knoten03-hb"/> <node id="4" uname="knoten04-hb"/> <============== </nodes> Again, this looks good: # pcs status Cluster name: zweiter Last updated: Thu Aug 11 13:08:19 2016 Last change: Thu Aug 11 13:08:11 2016 by root via cibadmin on knoten01-hb Stack: corosync Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum 4 nodes and 13 resources configured Online: [ knoten01-hb knoten02-hb knoten03-hb ] OFFLINE: [ knoten04-hb ] Full list of resources: Clone Set: ResO2CB-clone [ResO2CB] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ResOCFS2_00-clone [ResOCFS2_00] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ClusterIP-clone [ClusterIP] (unique) ClusterIP:0 (ocf::heartbeat:IPaddr2): Started knoten01-hb ClusterIP:1 (ocf::heartbeat:IPaddr2): Started knoten02-hb ClusterIP:2 (ocf::heartbeat:IPaddr2): Started knoten03-hb Clone Set: WebSite-clone [WebSite] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Stopped: [ knoten04-hb ] PCSD Status: knoten01-hb: Online knoten02-hb: Online knoten03-hb: Online knoten04-hb: Online Daemon Status: corosync: active/disabled pacemaker: active/disabled pcsd: active/enabled knoten04-hb is known but offline, for sure. OK, let's start it: # pcs cluster start knoten04-hb knoten04-hb: Starting Cluster... The funny thing: pcs status on the old nodes tell me: [root@knoten01 ~]# pcs status Cluster name: zweiter Last updated: Thu Aug 11 13:10:45 2016 Last change: Thu Aug 11 13:08:11 2016 by root via cibadmin on knoten01-hb Stack: corosync Current DC: knoten01-hb (version 1.1.13-10.el7-44eb2dd) - partition with quorum 4 nodes and 13 resources configured Online: [ knoten01-hb knoten02-hb knoten03-hb ] OFFLINE: [ knoten04-hb ] Full list of resources: Clone Set: ResO2CB-clone [ResO2CB] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ResOCFS2_00-clone [ResOCFS2_00] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Clone Set: ClusterIP-clone [ClusterIP] (unique) ClusterIP:0 (ocf::heartbeat:IPaddr2): Started knoten01-hb ClusterIP:1 (ocf::heartbeat:IPaddr2): Started knoten02-hb ClusterIP:2 (ocf::heartbeat:IPaddr2): Started knoten03-hb Clone Set: WebSite-clone [WebSite] Started: [ knoten01-hb knoten02-hb knoten03-hb ] Stopped: [ knoten04-hb ] PCSD Status: knoten01-hb: Online knoten02-hb: Online knoten03-hb: Online knoten04-hb: Online Daemon Status: corosync: active/disabled pacemaker: active/disabled pcsd: active/enabled And pcs on the new node tells me that: [root@knoten04 ~]# pcs status Cluster name: zweiter Last updated: Thu Aug 11 13:10:25 2016 Last change: Thu Aug 11 12:40:38 2016 by hacluster via crmd on knoten04-hb Stack: corosync Current DC: knoten04-hb (version 1.1.13-10.el7-44eb2dd) - partition WITHOUT quorum 4 nodes and 13 resources configured Online: [ knoten04-hb ] OFFLINE: [ knoten01-hb knoten02-hb knoten03-hb ] Full list of resources: Clone Set: ResO2CB-clone [ResO2CB] Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ] Clone Set: ResOCFS2_00-clone [ResOCFS2_00] Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ] Clone Set: ClusterIP-clone [ClusterIP] (unique) ClusterIP:0 (ocf::heartbeat:IPaddr2): Stopped ClusterIP:1 (ocf::heartbeat:IPaddr2): Stopped ClusterIP:2 (ocf::heartbeat:IPaddr2): Stopped Clone Set: WebSite-clone [WebSite] Stopped: [ knoten01-hb knoten02-hb knoten03-hb knoten04-hb ] PCSD Status: knoten01-hb: Online knoten02-hb: Online knoten03-hb: Online knoten04-hb: Online Daemon Status: corosync: active/disabled pacemaker: active/disabled pcsd: active/enabled This is obviously no valid cluster. So what am I doing wrong? How to add the node, getting a working four-node cluster? Thanks in advance for any help. Thomas
signature.asc
Description: This is a digitally signed message part.
_______________________________________________ Users mailing list: Users@clusterlabs.org http://clusterlabs.org/mailman/listinfo/users Project Home: http://www.clusterlabs.org Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf Bugs: http://bugs.clusterlabs.org