On 29/05/2019 21:04, Ken Gaillot wrote:
> On Wed, 2019-05-29 at 17:28 +0100, lejeczek wrote:
>> hi guys,
>>
>> I have a 3-nodes cluster but one node is a freaking mystery to me.
>> I see this:
>>
>> May 29 17:21:45 [51617] rider.private pacemakerd: info:
>> pcmk_cpg_membership: Node 3 still member of group pacemakerd
>> (peer=rider.private, counter=0.2)
>> May 29 17:21:45 [51617] rider.private pacemakerd: info:
>> mcp_cpg_deliver: Ignoring process list sent by peer for local node
>> May 29 17:21:45 [51617] rider.private pacemakerd: info:
>> mcp_cpg_deliver: Ignoring process list sent by peer for local node
> These are harmless and unrelated.
>
>> and I wonder if it in any way relates to the fact that the node says:
>>
>> $ crm_mon --one-shot
>> Connection to cluster failed: Transport endpoint is not connected
>> $ pcs status --all
>> Error: cluster is not currently running on this node
> What user are you running as? That's expected if the user isn't either
> root or in the haclient group.

I'm running as root.

I pasted the systemctl status -l pacemaker output earlier because it
looks interestingly different from the output on the other two working
nodes, e.g.:

$ systemctl status -l pacemaker
● pacemaker.service - Pacemaker High Availability Cluster Manager
   Loaded: loaded (/etc/systemd/system/pacemaker.service; enabled; vendor preset: disabled)
  Drop-In: /etc/systemd/system/pacemaker.service.d
           └─override.conf
   Active: active (running) since Fri 2019-05-10 15:29:26 BST; 2 weeks 5 days ago
     Docs: man:pacemakerd
           https://clusterlabs.org/pacemaker/doc/en-US/Pacemaker/1.1/html-single/Pacemaker_Explained/index.html
 Main PID: 1164197 (pacemakerd)
    Tasks: 7
   Memory: 79.8M
   CGroup: /system.slice/pacemaker.service
           ├─1164197 /usr/sbin/pacemakerd -f
           ├─1164198 /usr/libexec/pacemaker/cib
           ├─1164199 /usr/libexec/pacemaker/stonithd
           ├─1164200 /usr/libexec/pacemaker/lrmd
           ├─1164201 /usr/libexec/pacemaker/attrd
           ├─1164202 /usr/libexec/pacemaker/pengine
           └─1164203 /usr/libexec/pacemaker/crmd

Where should I start troubleshooting this? I've already removed and
re-added the node. I've even removed the rpm packages, so I believe I
started from a clean slate.

many thanks, L.

>> and:
>> $ systemctl status -l pacemaker.service
>> ● pacemaker.service - Pacemaker High Availability Cluster Manager
>>    Loaded: loaded (/usr/lib/systemd/system/pacemaker.service;
>> disabled; vendor preset: disabled)
>>    Active: active (running) since Wed 2019-05-29 17:21:45 BST; 7s ago
>>      Docs: man:pacemakerd
>>
>> https://clusterlabs.org/pacemaker/doc/en-US/Pacemaker/1.1/html-single/Pacemaker_Explained/index.html
>>  Main PID: 51617 (pacemakerd)
>>     Tasks: 1
>>    Memory: 3.3M
>>    CGroup: /system.slice/pacemaker.service
>>            └─51617 /usr/sbin/pacemakerd -f
>>
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Tracking
>> existing pengine process (pid=51528)
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Tracking
>> existing lrmd process (pid=51542)
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Tracking
>> existing stonithd process (pid=51558)
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Tracking
>> existing attrd process (pid=51559)
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Tracking
>> existing cib process (pid=51560)
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Tracking
>> existing crmd process (pid=51566)
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Quorum
>> acquired
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Node
>> whale.private state is now member
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Node
>> swir.private state is now member
>> May 29 17:21:45 rider.private pacemakerd[51617]: notice: Node
>> rider.private state is now member
