[ClusterLabs] Current DC becomes None suddenly
Hi,

I have set up an ACTIVE/PASSIVE HA setup.

*Issue 1)*

My *corosync.conf* file is:

    # Please read the openais.conf.5 manual page

    totem {
        version: 2

        # How long before declaring a token lost (ms)
        token: 1

        # How many token retransmits before forming a new configuration
        token_retransmits_before_loss_const: 20

        # How long to wait for join messages in the membership protocol (ms)
        join: 1

        # How long to wait for consensus to be achieved before starting a new round of membership configuration (ms)
        consensus: 12000

        # Turn off the virtual synchrony filter
        vsftype: none

        # Number of messages that may be sent by one processor on receipt of the token
        max_messages: 20

        # Limit generated nodeids to 31-bits (positive signed integers)
        clear_node_high_bit: yes

        # Disable encryption
        secauth: off

        # How many threads to use for encryption/decryption
        threads: 0

        # Optionally assign a fixed node id (integer)
        # nodeid: 1234

        # This specifies the mode of redundant ring, which may be none, active, or passive.
        rrp_mode: none

        interface {
            # The following values need to be set based on your environment
            ringnumber: 0
            bindnetaddr: 192.168.101.0
            mcastport: 5405
        }
        transport: udpu
    }

    amf {
        mode: disabled
    }

    quorum {
        # Quorum for the Pacemaker Cluster Resource Manager
        provider: corosync_votequorum
        expected_votes: 1
    }

    nodelist {
        node {
            ring0_addr: 192.168.101.73
        }
        node {
            ring0_addr: 192.168.101.74
        }
    }

    aisexec {
        user: root
        group: root
    }

    logging {
        fileline: off
        to_stderr: yes
        to_logfile: yes
        to_syslog: yes
        syslog_facility: daemon
        logfile: /var/log/corosync/corosync.log
        debug: off
        timestamp: on
        logger_subsys {
            subsys: AMF
            debug: off
            tags: enter|leave|trace1|trace2|trace3|trace4|trace6
        }
    }

I have added 5 resources: 1 is a VIP and 4 are upstart jobs. The node names are sc-node-1 (ACTIVE) and sc-node-2 (PASSIVE). Resources are running on the ACTIVE node.

Default cluster properties -

But sometimes, after 2-3 migrations from ACTIVE to STANDBY and then from STANDBY to ACTIVE, both nodes become OFFLINE and Current DC becomes None. I have disabled the stonith property, and quorum is ignored as well.

    root@sc-node-2:/usr/lib/python2.7/dist-packages/sc# crm status
    Last updated: Sat Oct 3 00:01:40 2015
    Last change: Fri Oct 2 23:38:28 2015 via crm_resource on sc-node-1
    Stack: corosync
    Current DC: NONE
    2 Nodes configured
    5 Resources configured

    OFFLINE: [ sc-node-1 sc-node-2 ]

What is going wrong here? What is the reason for Current DC becoming None suddenly? Is corosync.conf okay? Are the default cluster properties fine? Help will be appreciated.

*Issue 2)*

The command used to add an upstart job is:

    crm configure primitive service upstart:service meta allow-migrate=true migration-threshold=5 failure-timeout=30s op monitor interval=15s timeout=60s

But sometimes I still see the fail count going to INFINITY. Why? How can we avoid it? The resource should have migrated as soon as it reached the migration threshold.

    * Node sc-node-2:
       service: migration-threshold=5 fail-count=100 last-failure='Fri Oct 2 23:38:53 2015'
       service1: migration-threshold=5 fail-count=100 last-failure='Fri Oct 2 23:38:53 2015'

    Failed actions:
        service_start_0 (node=sc-node-2, call=-1, rc=1, status=Timed Out, last-rc-change=Fri Oct 2 23:38:53 2015, queued=0ms, exec=0ms): unknown error
        service1_start_0 (node=sc-node-2, call=-1, rc=1, status=Timed Out, last-rc-change=Fri Oct 2 23:38:53 2015, queued=0ms, exec=0ms

--
Thanks and Regards,
Pritam Kharat.

___
Users mailing list: Users@clusterlabs.org
http://clusterlabs.org/mailman/listinfo/users

Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://bugs.clusterlabs.org
Re: [ClusterLabs] Small bug in RA heartbeat/syslog-ng
On 23/09/15 08:17 +0200, Ulrich Windl wrote:
> Lars Ellenberg wrote on 22.09.2015 at 14:30 in message <20150922123054.GA26107@soda.linbit>:
>> So, unless you happen to have an explicitly set to the empty string
>> OCF_RESKEY_syslog_ng_binary in your environment, things work just fine.
>> And if you do, then that's the bug.
>
> You are right: if X is unset ${X-Y} will do (in addition to ${X:-Y}), but if X
> has no value only the latter will do:
>
> v04:~> X=1
> v04:~> echo ${X-Y}
> 1
> v04:~> X=
> pv04:~> echo ${X-Y}
>
> v04:~> unset X
> v04:~> echo ${X-Y}
> Y
> pv04:~> X=1
> v04:~> echo ${X:-Y}
> 1
> v04:~> X=
> v04:~> echo ${X:-Y}
> Y
> v04:~> unset X
> pv04:~> echo ${X:-Y}
> Y
> v04:~>
>
> Still I think it's wrong, simply because it's not documented.

In fact it is, at least since bash 2.0 (~1996):
http://git.savannah.gnu.org/cgit/bash.git/tree/doc/bash.1?id=ccc6cda#n1821

> omitting the colon results in a test only for a parameter that is unset

But don't worry, I was trapped on this for a bit, too :)

--
Jan (Poki)
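The difference discussed in this thread can be reproduced with a short script. This is only a minimal sketch: X and Y are the placeholder names from the quoted session, and the function names are made up for illustration, not taken from the resource agent.

```shell
#!/bin/sh
# Compare the two default-value expansions for the three states of X.
# X and Y are the placeholders used in the thread; the function names
# below are illustrative only.

expand_unset_only() {
    # ${X-Y}: falls back to Y only when X is unset
    printf '%s' "${X-Y}"
}

expand_unset_or_empty() {
    # ${X:-Y}: falls back to Y when X is unset OR set to the empty string
    printf '%s' "${X:-Y}"
}

report() {
    # $1 is a label describing the current state of X
    printf '%-12s ${X-Y}=[%s]  ${X:-Y}=[%s]\n' \
        "$1" "$(expand_unset_only)" "$(expand_unset_or_empty)"
}

X=1;     report "X=1"
X=;      report "X= (empty)"
unset X; report "X unset"
```

The two forms differ only in the middle case: with X set to the empty string, ${X-Y} expands to nothing while ${X:-Y} expands to Y. That is the situation Lars describes for an explicitly empty OCF_RESKEY_syslog_ng_binary.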
Re: [ClusterLabs] Adding and removing a node dynamically
On 10/02/2015 05:36 AM, Vijay Partha wrote:
> could someone help me out with this please? i am making use of cman and
> pacemaker. pcs cluster node add/remove is not working as it throws
> pcsd service is not running on .

pcs relies on pcsd running on all nodes. Make sure pcs is installed on all nodes, and pcsd is enabled to start at boot (via service or systemctl depending on which you are using). Then set a password (same on all nodes) for the hacluster user. Finally, run "pcs cluster auth " on the machine you want to run pcs from, and give it the hacluster user/pass.

You may want to review the Clusters From Scratch documentation to see other common configuration that needs to be done (firewall, SELinux, hostnames, etc.):
http://clusterlabs.org/doc/en-US/Pacemaker/1.1-pcs/html-single/Clusters_from_Scratch/index.html

> On Fri, Oct 2, 2015 at 1:17 PM, Vijay Partha wrote:
>
>> Hi,
>>
>> I would like to add and remove a node dynamically in pacemaker. What
>> commands are to be given for this to be done.
>>
>> Thanking you
>>
>> --
>> With Regards
>> P.Vijay
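The steps in the reply above could be scripted roughly as follows. This is a hedged sketch, not a tested procedure: the node names (node1, node2, node3) are hypothetical, the service/chkconfig commands assume a SysV-init system such as a cman-based install, and the -u option assumes a 0.9-series pcs. The script only prints the commands unless DRY_RUN=0 is set.

```shell
#!/bin/sh
# Sketch of the pcsd preparation steps. Dry-run by default: commands are
# printed, not executed. Set DRY_RUN=0 to really run them (as root).

run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '+ %s\n' "$*"
    else
        "$@"
    fi
}

# Run on EVERY node. Assumption: SysV init (service/chkconfig); on a
# systemd host use "systemctl enable pcsd" and "systemctl start pcsd".
prepare_node() {
    run chkconfig pcsd on
    run service pcsd start
    run passwd hacluster    # interactive; use the same password on all nodes
}

# Run ONCE, on the machine you will run pcs from.
# Arguments: the cluster node names.
authorize_nodes() {
    run pcs cluster auth "$@" -u hacluster
}

# Add or remove a single node once authentication works.
add_node()    { run pcs cluster node add "$1"; }
remove_node() { run pcs cluster node remove "$1"; }

# Hypothetical node names, for illustration only.
prepare_node
authorize_nodes node1 node2
add_node node3
```

Keeping the dry-run default makes it possible to review the exact commands before anything touches the cluster.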
Re: [ClusterLabs] Adding and removing a node dynamically
Could someone help me out with this, please? I am using cman and pacemaker. pcs cluster node add/remove is not working: it reports that the pcsd service is not running on .

On Fri, Oct 2, 2015 at 1:17 PM, Vijay Partha wrote:

> Hi,
>
> I would like to add and remove a node dynamically in pacemaker. What
> commands are to be given for this to be done.
>
> Thanking you
>
> --
> With Regards
> P.Vijay

--
With Regards
P.Vijay
Re: [ClusterLabs] Adding and removing a node dynamically
Hi,

what do you mean by add or remove? Do you want to remove a node from a cluster completely, so that it is no longer a cluster member? Or do you want to remove it only temporarily, for maintenance?

Regards,
sven

On 02.10.2015 at 09:47, Vijay Partha wrote:
>
> Hi,
>
> I would like to add and remove a node dynamically in pacemaker. What commands
> are to be given for this to be done.
>
> Thanking you
>
> --
> With Regards
> P.Vijay
[ClusterLabs] Adding and removing a node dynamically
Hi,

I would like to add and remove a node dynamically in pacemaker. Which commands should I use to do this?

Thanking you

--
With Regards
P.Vijay