On 26.06.2023 21:14, Priyanka Balotra wrote:
Hi All,
We are seeing an issue where we replaced no-quorum-policy=ignore with other
options in corosync.conf order to simulate the same behaviour :
* wait_for_all: 0*
* last_man_standing: 1 last_man_standing_window: 20000*
There was another property (auto-tie-breaker) tried but couldn't configure
it as crm did not recognise this property.
But even after using these options, we are seeing that system is not
quorate if at least half of the nodes are not up.
Some properties from crm config are as follows:
*primitive stonith-sbd stonith:external/sbd \ params
pcmk_delay_base=5s.*
*.property cib-bootstrap-options: \ have-watchdog=true \
dc-version="2.1.2+20211124.ada5c3b36-150400.2.43-2.1.2+20211124.ada5c3b36"
\ cluster-infrastructure=corosync \ cluster-name=FILE \
stonith-enabled=true \ stonith-timeout=172 \
stonith-action=reboot \ stop-all-resources=false \
no-quorum-policy=ignorersc_defaults build-resource-defaults: \
resource-stickiness=1rsc_defaults rsc-options: \
resource-stickiness=100 \ migration-threshold=3 \
failure-timeout=1m \ cluster-recheck-interval=10minop_defaults
op-options: \ timeout=600 \ record-pending=true*
On a 4-node setup when the whole cluster is brought up together we see
error logs like:
*2023-06-26T11:35:17.231104+00:00 FILE-1 pacemaker-schedulerd[26359]:
warning: Fencing and resource management disabled due to lack of quorum*
*2023-06-26T11:35:17.231338+00:00 FILE-1 pacemaker-schedulerd[26359]:
warning: Ignoring malformed node_state entry without uname*
*2023-06-26T11:35:17.233771+00:00 FILE-1 pacemaker-schedulerd[26359]:
warning: Node FILE-2 is unclean!*
*2023-06-26T11:35:17.233857+00:00 FILE-1 pacemaker-schedulerd[26359]:
warning: Node FILE-3 is unclean!*
*2023-06-26T11:35:17.233957+00:00 FILE-1 pacemaker-schedulerd[26359]:
warning: Node FILE-4 is unclean!*
According to this output FILE-1 lost connection to three other nodes, in
which case it cannot be quorate.
Kindly help correct the configuration to make the system function normally
with all resources up, even if there is just one node up.
Please let me know if any more info is needed.
Thanks
Priyanka
_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users
ClusterLabs home: https://www.clusterlabs.org/
_______________________________________________
Manage your subscription:
https://lists.clusterlabs.org/mailman/listinfo/users
ClusterLabs home: https://www.clusterlabs.org/