[Pacemaker] Error while performing crm_attribute operation and cl_status commands
Hi All,

I have configured an HA cluster consisting of two nodes and one MySQL multi-state resource.

crm_attribute --type rsc_location --node mysql2 --get-value score
non-option ARGV-elements: score
scope=rsc_location value=(null)
Error performing operation: cib object missing

I get the error above when executing this command. How can I get the score value and the status of a node from cib.xml?

1) I stopped the HA process on node 1 and executed the command

cl_status nodestatus mysql3

For a while the status is still reported as active, even when the HA process is started on node 2. If I run the same command some time after stopping the HA process, the status is reported as dead.

How can we get the dead status of a node from cib.xml once we stop the HA process on node 1?

Please find the cib.xml for the above HA configuration:

<cib validate-with="pacemaker-1.0" crm_feature_set="3.0.1" have-quorum="1" admin_epoch="0" epoch="1751" dc-uuid="87b8b88e-3ded-4e34-8708-46f7afe62935" num_updates="20" cib-last-written="Mon May 16 06:54:24 2011">
  <configuration>
    <crm_config>
      <cluster_property_set id="cib-bootstrap-options">
        <nvpair id="cib-bootstrap-options-dc-version" name="dc-version" value="1.0.9-89bd754939df5150de7cd76835f98fe90851b677"/>
        <nvpair id="cib-bootstrap-options-cluster-infrastructure" name="cluster-infrastructure" value="Heartbeat"/>
        <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
        <nvpair id="cib-bootstrap-options-no-quorum-policy" name="no-quorum-policy" value="ignore"/>
        <nvpair id="cib-bootstrap-options-last-lrm-refresh" name="last-lrm-refresh" value="1300787402"/>
      </cluster_property_set>
    </crm_config>
    <nodes>
      <node id="87b8b88e-3ded-4e34-8708-46f7afe62935" type="normal" uname="mysql3"/>
      <node id="6317f856-e57b-4a03-acf1-ca81af4f19ce" type="normal" uname="cisco-demomsf">
        <instance_attributes id="nodes-6317f856-e57b-4a03-acf1-ca81af4f19ce">
          <nvpair id="nodes-6317f856-e57b-4a03-acf1-ca81af4f19ce-standby" name="standby" value="false"/>
          <nvpair id="nodes-6317f856-e57b-4a03-acf1-ca81af4f19ce-kernel" name="kernel" value="2.6.22.10-vs2.2.0.5-cisco"/>
        </instance_attributes>
      </node>
    </nodes>
    <resources>
      <master id="MS_Mysql">
        <meta_attributes id="MS_Mysql-meta_attributes">
          <nvpair id="MS_Mysql-meta_attributes-notify" name="notify" value="true"/>
          <nvpair id="MS_Mysql-meta_attributes-target-role" name="target-role" value="Started"/>
        </meta_attributes>
        <primitive class="ocf" id="Mysql" provider="msf" type="mysql">
          <instance_attributes id="Mysql-instance_attributes">
            <nvpair id="Mysql-instance_attributes-binary" name="binary" value="/usr/bin/mysqld_safe"/>
            <nvpair id="Mysql-instance_attributes-config" name="config" value="/etc/my.cnf"/>
            <nvpair id="Mysql-instance_attributes-datadir" name="datadir" value="/var/lib/mysql"/>
            <nvpair id="Mysql-instance_attributes-user" name="user" value="mysql"/>
            <nvpair id="Mysql-instance_attributes-pid" name="pid" value="/var/lib/mysql/mysql.pid"/>
            <nvpair id="Mysql-instance_attributes-socket" name="socket" value="/var/lib/mysql/mysql.sock"/>
            <nvpair id="Mysql-instance_attributes-test_passwd" name="test_passwd" value="slavepass"/>
            <nvpair id="Mysql-instance_attributes-test_table" name="test_table" value="msfha.conn"/>
            <nvpair id="Mysql-instance_attributes-test_user" name="test_user" value="repl"/>
            <nvpair id="Mysql-instance_attributes-replication_user" name="replication_user" value="repl"/>
            <nvpair id="Mysql-instance_attributes-replication_passwd" name="replication_passwd" value="slavepass"/>
            <nvpair id="Mysql-instance_attributes-node_list" name="node_list" value="mysql3 cisco-demomsf"/>
            <nvpair id="Mysql-instance_attributes-scoresChange_script" name="scoresChange_script" value="/usr/lib/ocf/resource.d/msf/scoresChange"/>
          </instance_attributes>
          <operations>
            <op id="Mysql-start-0" interval="0" name="start" timeout="120s"/>
            <op id="Mysql-stop-0" interval="0" name="stop" timeout="120s"/>
            <op id="Mysql-monitor-10s" interval="10s" name="monitor" role="Master" timeout="8s"/>
            <op id="Mysql-monitor-12s" interval="12s" name="monitor" timeout="8s"/>
          </operations>
        </primitive>
      </master>
    </resources>
    <constraints>
      <rsc_location id="L_Master" rsc="MS_Mysql">
        <rule id="L_Master-rule1" role="Master" score="INFINITY">
          <expression attribute="#uname" id="L_Master-expression" operation="eq" value="mysql3"/>
        </rule>
        <rule id="L_Master-rule2" role="Master" score="100">
          <expression attribute="#uname" id="L_Master-expression-0" operation="eq" value="cisco-demomsf"/>
        </rule>
      </rsc_location>
    </constraints>
    <rsc_defaults>
      <meta_attributes id="rsc-options">
        <nvpair id="rsc-options-resource-stickiness" name="resource-stickiness" value="100"/>
      </meta_attributes>
    </rsc_defaults>
    <op_defaults/>
  </configuration>
  <status>
    <node_state
Re: [Pacemaker] [pacemaker] Error while executing crm_attribute command from command line
Andrew Beekhof andrew@... writes:

> # CIB_file=./cib.xml crm_attribute -N mysql3 -n master-Mysql:0 -l forever -v 3 -V
> crm_attribute[23703]: 2011/05/10_08:42:59 info: Invoked: crm_attribute -N mysql3 -n master-Mysql:0 -l forever -v 3 -V
> crm_attribute[23703]: 2011/05/10_08:42:59 ERROR: Type ID doesn't allow value 'nodes-87b8b88e-3ded-4e34-8708-46f7afe62935-master-Mysql:0'
>
> So no colons allowed I'm afraid.

Hi Andrew,

Thank you very much for replying. The colons are created by HA itself:

<nvpair name="master-Mysql:0" id="status-87b8b88e-3ded-4e34-8708-46f7afe62935-master-Mysql:0" value="3601"/>
<nvpair id="status-6317f856-e57b-4a03-acf1-ca81af4f19ce-master-Mysql:1" name="master-Mysql:1" value="1"/>

Regards
Rakesh
___
Pacemaker mailing list: Pacemaker@oss.clusterlabs.org
http://oss.clusterlabs.org/mailman/listinfo/pacemaker
Project Home: http://www.clusterlabs.org
Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
Bugs: http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
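The "Type ID doesn't allow value" error quoted above is a schema check on the generated nvpair id: with -l forever the attribute goes to the nodes section, where the id built from the attribute name (nodes-<uuid>-master-Mysql:0) has to be a valid XML ID, and an XML ID may not contain a colon. The following is a minimal Python sketch of that check. It uses a simplified, ASCII-only approximation of the XML name rule; the function name and the regular expression are illustrative assumptions, not Pacemaker's own code.

```python
import re

# Simplified, ASCII-only approximation of the lexical form required for an
# XML ID: a letter or underscore, followed by letters, digits, '.', '-'
# or '_'. A colon is never allowed.
NCNAME_ASCII = re.compile(r"[A-Za-z_][A-Za-z0-9._-]*")


def is_valid_xml_id(value: str) -> bool:
    """Return True if value looks like a valid XML ID (ASCII subset only)."""
    return NCNAME_ASCII.fullmatch(value) is not None


node_uuid = "87b8b88e-3ded-4e34-8708-46f7afe62935"
rejected = f"nodes-{node_uuid}-master-Mysql:0"    # id from the error message

print(is_valid_xml_id(rejected))                   # the colon makes it invalid
print(is_valid_xml_id(rejected.replace(":", "-")))
```

The status entries quoted in the reply do carry colons in their ids, which suggests that the status section is not validated in the same way; that is an inference from the output shown here, not something the thread confirms.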
[Pacemaker] [pacemaker] Error while executing crm_attribute command from command line
Hi All,

When I execute the crm_attribute command from the command line, it throws the following error:

bash-3.2# crm_attribute -N mysql3 -n master-Mysql:0 -l forever -v 3
Error setting master-Mysql:0=3 (section=nodes, set=nodes-87b8b88e-3ded-4e34-8708-46f7afe62935): Update does not conform to the configured schema/DTD
Error performing operation: Update does not conform to the configured schema/DTD

Please find my cib.xml file:

<cib validate-with="pacemaker-1.0" crm_feature_set="3.0.1" have-quorum="1" admin_epoch="0" epoch="1194" num_updates="23" cib-last-written="Fri May 6 03:46:57 2011" dc-uuid="87b8b88e-3ded-4e34-8708-46f7afe62935">
  <configuration>
    <crm_config>
      <cluster_property_set id="cib-bootstrap-options">
        <nvpair id="cib-bootstrap-options-dc-version" name="dc-version" value="1.0.9-89bd754939df5150de7cd76835f98fe90851b677"/>
        <nvpair id="cib-bootstrap-options-cluster-infrastructure" name="cluster-infrastructure" value="Heartbeat"/>
        <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
        <nvpair id="cib-bootstrap-options-no-quorum-policy" name="no-quorum-policy" value="ignore"/>
        <nvpair id="cib-bootstrap-options-last-lrm-refresh" name="last-lrm-refresh" value="1300787402"/>
      </cluster_property_set>
    </crm_config>
    <nodes>
      <node id="87b8b88e-3ded-4e34-8708-46f7afe62935" type="normal" uname="mysql3"/>
      <node id="6317f856-e57b-4a03-acf1-ca81af4f19ce" type="normal" uname="cisco-demomsf"/>
    </nodes>
    <resources>
      <primitive class="ocf" id="MysqlVIP" provider="heartbeat" type="IPaddr3">
        <instance_attributes id="MysqlVIP-instance_attributes">
          <nvpair id="MysqlVIP-instance_attributes-ip" name="ip" value="172.21.52.150"/>
          <nvpair id="MysqlVIP-instance_attributes-eth_num" name="eth_num" value="eth0:3"/>
          <nvpair id="MysqlVIP-instance_attributes-vip_cleanup_file" name="vip_cleanup_file" value="/var/run/bigha.pid"/>
        </instance_attributes>
        <operations>
          <op id="MysqlVIP-start-0" interval="0" name="start" timeout="120s"/>
          <op id="MysqlVIP-monitor-30s" interval="30s" name="monitor"/>
        </operations>
        <meta_attributes id="MysqlVIP-meta_attributes">
          <nvpair id="MysqlVIP-meta_attributes-target-role" name="target-role" value="Started"/>
        </meta_attributes>
      </primitive>
      <master id="MS_Mysql">
        <meta_attributes id="MS_Mysql-meta_attributes">
          <nvpair id="MS_Mysql-meta_attributes-notify" name="notify" value="true"/>
          <nvpair id="MS_Mysql-meta_attributes-target-role" name="target-role" value="Started"/>
        </meta_attributes>
        <primitive class="ocf" id="Mysql" provider="heartbeat" type="mysql">
          <instance_attributes id="Mysql-instance_attributes">
            <nvpair id="Mysql-instance_attributes-binary" name="binary" value="/usr/bin/mysqld_safe"/>
            <nvpair id="Mysql-instance_attributes-config" name="config" value="/etc/my.cnf"/>
            <nvpair id="Mysql-instance_attributes-datadir" name="datadir" value="/var/lib/mysql"/>
            <nvpair id="Mysql-instance_attributes-user" name="user" value="mysql"/>
            <nvpair id="Mysql-instance_attributes-pid" name="pid" value="/var/lib/mysql/mysql.pid"/>
            <nvpair id="Mysql-instance_attributes-socket" name="socket" value="/var/lib/mysql/mysql.sock"/>
            <nvpair id="Mysql-instance_attributes-test_passwd" name="test_passwd" value="slavepass"/>
            <nvpair id="Mysql-instance_attributes-test_table" name="test_table" value="msfha.conn"/>
            <nvpair id="Mysql-instance_attributes-test_user" name="test_user" value="repl"/>
            <nvpair id="Mysql-instance_attributes-replication_user" name="replication_user" value="repl"/>
            <nvpair id="Mysql-instance_attributes-replication_passwd" name="replication_passwd" value="slavepass"/>
          </instance_attributes>
          <operations>
            <op id="Mysql-start-0" interval="0" name="start" timeout="120s"/>
            <op id="Mysql-stop-0" interval="0" name="stop" timeout="120s"/>
            <op id="Mysql-monitor-10s" interval="10s" name="monitor" role="Master" timeout="8s"/>
            <op id="Mysql-monitor-12s" interval="12s" name="monitor" timeout="8s"/>
          </operations>
        </primitive>
      </master>
    </resources>
    <constraints>
      <rsc_location id="L_Master" rsc="MS_Mysql">
        <rule id="L_Master-rule1" role="Master" score="100">
          <expression attribute="#uname" id="L_Master-expression-0" operation="eq" value="mysql3"/>
        </rule>
        <rule id="L_Master-rule2" role="Master" score="100">
          <expression attribute="#uname" id="L_Master-expression" operation="eq" value="cisco-demomsf"/>
        </rule>
      </rsc_location>
      <rsc_colocation id="Mysql-with-ip" rsc="MysqlVIP" score="INFINITY" with-rsc="MS_Mysql" with-rsc-role="Master"/>
      <rsc_order first="MysqlVIP" id="Mysql-after-ip" score="INFINITY" then="MS_Mysql"/>
    </constraints>
    <rsc_defaults>
      <meta_attributes id="rsc-options">
        <nvpair id="rsc-options-resource-stickiness" name="resource-stickiness" value="100"/>
      </meta_attributes>
    </rsc_defaults>
    <op_defaults/>
  </configuration>
  <status>
    <node_state id="87b8b88e-3ded-4e34-8708-46f7afe62935" uname="mysql3" ha="active"
Re: [Pacemaker] [pacemaker] unable to execute commands like crm configure show, crm resource stop
Dejan Muhamedagic dejanmm@... writes:

> Hi,
>
> On Wed, Mar 16, 2011 at 10:58:16AM +0530, rakesh k wrote:
> > Hi All,
> >
> > I have installed Pacemaker and Heartbeat on my Unix machine and created an SSH user who can access the machine remotely. I gave that user permissions by editing the sudoers file so that he can perform all the operations related to crm. The configuration I used in the sudoers file is as follows:
> >
> > username ALL=NOPASSWD:/usr/sbin
> >
> > since commands like crm_mon, crm_verify and cibadmin reside in the /usr/sbin directory. But when my SSH user executes the following operations
> >
> > -Bash-3.2# sudo /usr/sbin/crm configure show
> > -Bash-3.2# sudo /usr/sbin/crm resource stop
> >
> > the following error pops up:
> >
> > sudo /usr/sbin/crm configure show
> > cibadmin not available, check your installation
> >
> > Please suggest a solution for this kind of situation.
>
> /usr/sbin is not in the PATH.
>
> Thanks,
> Dejan
>
> > Regards
> > Rakesh

Hi Dejan,

Thanks for providing the info. As you suggested, I created a bash profile in which I export the PATH variable. Here is my bash profile:

# .bash_profile
# User specific environment and startup programs
PATH=$PATH:/usr/sbin
export PATH

Even so, I am facing the same issue: when I execute the commands as the SSH user, I get the same error again and again. Is there any other way to execute these commands as an SSH user?

Regards
Rakesh
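The "cibadmin not available" message fits Dejan's diagnosis: crm finds its helper binaries through PATH, and a command started through sudo often runs with a PATH that sudo sets itself rather than the one exported in the user's .bash_profile. A minimal Python sketch of that lookup follows. The temporary directories stand in for /usr/sbin and for a directory without the tool, and the dummy cibadmin file is purely illustrative; how crm performs the lookup internally is an assumption here.

```python
import os
import shutil
import stat
import tempfile


def find_tool(name: str, path: str):
    """Look up an executable by name on an explicit PATH string."""
    return shutil.which(name, path=path)


# Stand-in for /usr/sbin: a temporary directory holding a dummy 'cibadmin'.
sbin_dir = tempfile.mkdtemp()
tool = os.path.join(sbin_dir, "cibadmin")
with open(tool, "w") as fh:
    fh.write("#!/bin/sh\nexit 0\n")
os.chmod(tool, os.stat(tool).st_mode | stat.S_IXUSR)

# Stand-in for a directory that does not contain the tool.
other_dir = tempfile.mkdtemp()

reset_path = other_dir                            # PATH without the sbin directory
fixed_path = other_dir + os.pathsep + sbin_dir    # PATH including it

print(find_tool("cibadmin", reset_path))          # None: the tool is not found
print(find_tool("cibadmin", fixed_path))          # full path to the dummy tool
```

On the real machine, comparing the output of `echo $PATH` with that of `sudo sh -c 'echo $PATH'` would show whether the two differ. If they do, the sudoers `secure_path` setting is one place to look; whether it applies on this installation is not known from the thread.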
Re: [Pacemaker] [pacemaker] Notification alerts when fail-over takes place from one node to other node in cluster.
Vadym Chepkov vchepkov@... writes:

> You have to create a MailTo resource for each resource or group you would like to be notified about, unfortunately. You can also run
>
> crm_mon -1f | grep -qi fail
>
> from either cron or from snmp. It's not perfect, but better than nothing. I also found the check_crm script on Nagios Exchange; it's not ideal, but again, since this functionality doesn't come with pacemaker yet, you would have to invent your own wheel ;)
>
> Cheers,
> Vadym
>
> On Apr 25, 2011 1:15 AM, Rakesh K rakirocker4...@gmail.com wrote:
> > Vadym Chepkov vchepkov at ... writes:
> > > You can colocate your resource with a MailTo pseudo resource:
> > >
> > > # crm ra meta MailTo
> > > Notifies recipients by email in the event of resource takeover (ocf:heartbeat:MailTo)
> > >
> > > Vadym
> >
> > Hi Vadym,
> >
> > Thanks for the reply. You said to co-locate the resource with the MailTo resource, which will notify the recipients by email as provided in the configuration. But I have configured 4 resources in a two-node cluster. What would be the best approach for this case?
> >
> > Regards
> > Rakesh

Hi Vadym Chepkov,

Thanks for the reply. As suggested, I am trying to configure the MailTo RA with Heartbeat. From the command line I used the following configuration:

primitive mail ocf:heartbeat:MailTo \
        params email=emailid \
        params subject=ClusterFailover

I then restarted the HA process using /etc/init.d/heartbeat restart. When I run crm_mon, the MailTo resource fails to start. I dug into the ha-debug file and found the related information below. Can you give me some pointers on this issue so that I can proceed further?

bash-3.2# cat ha-debug | grep MailTo
May 26 10:34:39 hatest-msf3 pengine: [18575]: notice: native_print: mail
MailTo[18614]: 2011/05/26_10:34:39 ERROR: Setup problem: Couldn't find utility
May 26 10:34:40 hatest-msf3 pengine: [18575]: notice: native_print: mail
May 26 10:34:40 hatest-msf3 pengine: [18575]: notice: native_print: mail
MailTo[18635]: 2011/05/26_10:34:40 ERROR: Setup problem: Couldn't find utility
May 26 10:34:44 hatest-msf3 pengine: [18575]: notice: native_print: mail

Regards
Rakesh
Re: [Pacemaker] [pacemaker] Notification alerts when fail-over takes place from one node to other node in cluster.
Vadym Chepkov vchepkov@... writes:

> You can colocate your resource with a MailTo pseudo resource:
>
> # crm ra meta MailTo
> Notifies recipients by email in the event of resource takeover (ocf:heartbeat:MailTo)
>
> Vadym

Hi Vadym,

Thanks for the reply. You said to co-locate the resource with the MailTo resource, which will notify the recipients by email as provided in the configuration. But I have configured 4 resources in a two-node cluster. What would be the best approach for this case?

Regards
Rakesh
Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources
Andrew Beekhof andrew@... writes:

Hi Andrew,

Thanks for your replies, and sorry for troubling you so frequently. Here is the output of crm configure show xml:

<?xml version="1.0" ?>
<cib admin_epoch="0" crm_feature_set="3.0.1" dc-uuid="87b8b88e-3ded-4e34-8708-46f7afe62935" epoch="1120" have-quorum="1" num_updates="35" validate-with="pacemaker-1.0">
  <configuration>
    <crm_config>
      <cluster_property_set id="cib-bootstrap-options">
        <nvpair id="cib-bootstrap-options-dc-version" name="dc-version" value="1.0.9-89bd754939df5150de7cd76835f98fe90851b677"/>
        <nvpair id="cib-bootstrap-options-cluster-infrastructure" name="cluster-infrastructure" value="Heartbeat"/>
        <nvpair id="cib-bootstrap-options-stonith-enabled" name="stonith-enabled" value="false"/>
        <nvpair id="cib-bootstrap-options-no-quorum-policy" name="no-quorum-policy" value="ignore"/>
        <nvpair id="cib-bootstrap-options-last-lrm-refresh" name="last-lrm-refresh" value="1300787402"/>
      </cluster_property_set>
    </crm_config>
    <rsc_defaults>
      <meta_attributes id="rsc-options">
        <nvpair id="rsc-options-resource-stickiness" name="resource-stickiness" value="100"/>
      </meta_attributes>
    </rsc_defaults>
    <op_defaults/>
    <nodes>
      <node id="6317f856-e57b-4a03-acf1-ca81af4f19ce" type="normal" uname="cisco-demomsf"/>
      <node id="87b8b88e-3ded-4e34-8708-46f7afe62935" type="normal" uname="mysql3"/>
    </nodes>
    <resources>
      <master id="MS_Mysql">
        <meta_attributes id="MS_Mysql-meta_attributes">
          <nvpair id="MS_Mysql-meta_attributes-notify" name="notify" value="true"/>
          <nvpair id="MS_Mysql-meta_attributes-target-role" name="target-role" value="Stopped"/>
        </meta_attributes>
        <primitive class="ocf" id="Mysql" provider="heartbeat" type="mysql">
          <instance_attributes id="Mysql-instance_attributes">
            <nvpair id="Mysql-instance_attributes-binary" name="binary" value="/usr/bin/mysqld_safe"/>
            <nvpair id="Mysql-instance_attributes-config" name="config" value="/etc/my.cnf"/>
            <nvpair id="Mysql-instance_attributes-datadir" name="datadir" value="/var/lib/mysql"/>
            <nvpair id="Mysql-instance_attributes-user" name="user" value="mysql"/>
            <nvpair id="Mysql-instance_attributes-pid" name="pid" value="/var/lib/mysql/mysql.pid"/>
            <nvpair id="Mysql-instance_attributes-socket" name="socket" value="/var/lib/mysql/mysql.sock"/>
            <nvpair id="Mysql-instance_attributes-test_passwd" name="test_passwd" value="slavepass"/>
            <nvpair id="Mysql-instance_attributes-test_table" name="test_table" value="msfha.conn"/>
            <nvpair id="Mysql-instance_attributes-test_user" name="test_user" value="repl"/>
            <nvpair id="Mysql-instance_attributes-replication_user" name="replication_user" value="repl"/>
            <nvpair id="Mysql-instance_attributes-replication_passwd" name="replication_passwd" value="slavepass"/>
          </instance_attributes>
          <operations>
            <op id="Mysql-start-0" interval="0" name="start" timeout="120s"/>
            <op id="Mysql-stop-0" interval="0" name="stop" timeout="120s"/>
            <op id="Mysql-monitor-10s" interval="10s" name="monitor" role="Master" timeout="8s"/>
            <op id="Mysql-monitor-12s" interval="12s" name="monitor" timeout="8s"/>
          </operations>
        </primitive>
      </master>
      <primitive class="ocf" id="Tomcat1VIP" provider="heartbeat" type="IPaddr3">
        <instance_attributes id="Tomcat1VIP-instance_attributes">
          <nvpair id="Tomcat1VIP-instance_attributes-ip" name="ip" value="172.21.52.140"/>
          <nvpair id="Tomcat1VIP-instance_attributes-eth_num" name="eth_num" value="eth0:2"/>
          <nvpair id="Tomcat1VIP-instance_attributes-vip_cleanup_file" name="vip_cleanup_file" value="/var/run/bigha.pid"/>
        </instance_attributes>
        <operations>
          <op id="Tomcat1VIP-start-0" interval="0" name="start" timeout="120s"/>
          <op id="Tomcat1VIP-monitor-30s" interval="30s" name="monitor"/>
        </operations>
        <meta_attributes id="Tomcat1VIP-meta_attributes">
          <nvpair id="Tomcat1VIP-meta_attributes-target-role" name="target-role" value="Started"/>
        </meta_attributes>
      </primitive>
      <primitive class="ocf" id="Tomcat1" provider="msf" type="tomcat">
        <instance_attributes id="Tomcat1-instance_attributes">
          <nvpair id="Tomcat1-instance_attributes-tomcat_name" name="tomcat_name" value="tomcat"/>
          <nvpair id="Tomcat1-instance_attributes-statusurl" name="statusurl" value="http://localhost:8080/dbtest/testtomcat.html"/>
          <nvpair id="Tomcat1-instance_attributes-java_home" name="java_home" value="/"/>
          <nvpair id="Tomcat1-instance_attributes-catalina_home" name="catalina_home" value="/home/msf/runtime/tomcat/apache-tomcat-6.0.18"/>
          <nvpair id="Tomcat1-instance_attributes-client" name="client" value="curl"/>
          <nvpair id="Tomcat1-instance_attributes-testregex" name="testregex" value="*&lt;/html&gt;"/>
        </instance_attributes>
        <operations>
          <op id="Tomcat1-start-0" interval="0" name="start" timeout="60s"/>
          <op id="Tomcat1-monitor-50s" interval="50s" name="monitor" timeout="50s"/>
          <op id="Tomcat1-stop-0" interval="0" name="stop"/>
        </operations>
[Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.
Hello Everybody,

How can we detect a network failure in a Pacemaker configuration? The two nodes in my cluster are two network routers, connected via a switch that acts as the mediator for communication.

How can we detect a network failure and stop the Heartbeat processes? When I shut down the interface I see a split-brain issue: Heartbeat is started on both nodes, and each one acts as a separate Heartbeat process.

I have configured the pingd resource that comes with Pacemaker as a clone. When there is a network failure, I still see the split-brain issue, with the Heartbeat processes starting separately on both nodes. My question is: is there any way to stop the Heartbeat process when pingd on a particular node reports that there is no communication between the interface and the node where HA is running?

Regards
Rakesh
Re: [Pacemaker] [pacemaker] need some help regarding network failure setup in pacemaker.
Jelle de Jong jelledejong@... writes:

Hi Jelle de Jong,

> On 20-04-11 11:44, rakesh k wrote:
> > How can we detect network failure in pacemaker configuration.
>
> http://www.clusterlabs.org/wiki/Pingd_with_resources_on_different_networks
> http://www.woodwose.net/thatremindsme/2011/04/the-pacemaker-ping-resource-agent/
> http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre
>
> crm configure help location
> crm ra info ocf:ping
>
> That should give you a jump start. You may need to increase the corosync token.
>
> Kind regards,
> Jelle de Jong

Thanks for the help. I went through the scripts and found that the ping_update method has a variable called ACTIVE, the number of nodes in host_list that are active. Based on this value, can we, in our scenario, stop the heartbeat/pacemaker process when the host node cannot ping any other node in the cluster? Please give me your suggestions so that they help us in our context.

Regards
Rakesh
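As a rough illustration of the ACTIVE value discussed above: the ping agent publishes a node attribute derived from how many hosts in host_list answered, and a location rule can then keep resources off a node whose attribute is missing or zero. The Python sketch below models only that arithmetic and decision. It assumes the attribute equals the number of reachable hosts times a multiplier, which is how the ping agent is commonly described (worth confirming with crm ra info); the function names are hypothetical. It moves resources away from an isolated node; it does not stop the heartbeat process itself.

```python
from typing import Optional

MINUS_INFINITY = float("-inf")


def ping_attribute(reachable_hosts: int, multiplier: int = 1) -> int:
    """Assumed attribute value: reachable hosts times the multiplier."""
    return reachable_hosts * multiplier


def location_score(pingd_value: Optional[int]) -> float:
    """Model of a rule that bans a node with no connectivity.

    Returns -INFINITY when the attribute is undefined or <= 0,
    and 0 (no preference either way) otherwise.
    """
    if pingd_value is None or pingd_value <= 0:
        return MINUS_INFINITY
    return 0.0


# Node that reaches both ping targets, with a multiplier of 100.
print(location_score(ping_attribute(2, 100)))
# Node that reaches none of them: resources would be banned from it.
print(location_score(ping_attribute(0, 100)))
```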
[Pacemaker] Ordering set of resources, problem in ordering chain of resources
Hi All,

I have configured Heartbeat and Pacemaker on a cluster of two VMs; both nodes run CentOS as the operating system. The cluster is configured with 8 resources and a defined order, using Pacemaker Explained as a reference. Please find the order and co-location constraints below, taken from cib.xml:

<rsc_colocation id="Httpd-with-ip" rsc="HttpdVIP" score="INFINITY" with-rsc="Httpd"/>
<rsc_colocation id="Tomcat1-with-ip" rsc="Tomcat1VIP" score="INFINITY" with-rsc="Tomcat1"/>
<rsc_colocation id="Mysql-with-ip" rsc="MysqlVIP" score="INFINITY" with-rsc="MS_Mysql" with-rsc-role="Master"/>
<rsc_order first="HttpdVIP" id="Httpd-after-op" score="INFINITY" then="Httpd"/>
<rsc_order first="MysqlVIP" id="Mysql-after-ip" score="INFINITY" then="MS_Mysql"/>
<rsc_order first="Tomcat1VIP" id="Tomcat1-after-ip" score="INFINITY" then="Tomcat1"/>
<rsc_order first="MS_Mysql" id="Tomcat1-after-MYSQL" score="INFINITY" then="Tomcat1VIP"/>
<rsc_colocation id="Tomcat2-with-Tomcat" rsc="Tomcat1" score="INFINITY" with-rsc="Tomcat2VIP"/>
<rsc_order first="Tomcat2VIP" id="Tomcat2-after-ip" score="INFINITY" then="Tomcat2"/>
<rsc_order first="Tomcat2" id="Httpd-after-Tomcat2" score="INFINITY" then="Httpd"/>
<rsc_colocation id="tomcat2-with-ip" rsc="Tomcat2VIP" score="INFINITY" with-rsc="Tomcat2"/>

When Heartbeat starts on both nodes, the resources are started in this order:

MysqlVIP--MSMysql--tomcat1VIP--Tomcat--Tomcat2VIP--tomcat2--HttpdVIP--Httpd

My question: when I stop the Mysql process, all the resources should stop because of the order constraints. But when I run crm_mon I still see the tomcat2, tomcat2vip, http and httpdVIP resources running on the cluster. Can you please tell me whether there is a flaw in how I defined the order or co-location constraints?

Regards
Rakesh
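One way to check which resources an ordering chain actually covers is to treat every rsc_order as an edge from "first" to "then" and follow the edges from the resource being stopped. The Python sketch below does that for the rsc_order entries as posted. It models ordering only, not colocation and not Pacemaker's policy engine, and the function name is illustrative.

```python
from collections import defaultdict, deque

# (first, then) pairs copied from the rsc_order constraints in the post.
ORDER = [
    ("HttpdVIP", "Httpd"),
    ("MysqlVIP", "MS_Mysql"),
    ("Tomcat1VIP", "Tomcat1"),
    ("MS_Mysql", "Tomcat1VIP"),
    ("Tomcat2VIP", "Tomcat2"),
    ("Tomcat2", "Httpd"),
]


def ordered_after(resource, order):
    """Return every resource ordered, directly or transitively, after `resource`."""
    graph = defaultdict(list)
    for first, then in order:
        graph[first].append(then)

    seen = set()
    queue = deque([resource])
    while queue:
        for nxt in graph[queue.popleft()]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


print(sorted(ordered_after("MS_Mysql", ORDER)))   # ['Tomcat1', 'Tomcat1VIP']
```

With the constraints as posted, no rsc_order links Tomcat1 to Tomcat2VIP (Tomcat2-with-Tomcat is a colocation, not an order), so only Tomcat1VIP and Tomcat1 are ordered after MS_Mysql. That would be consistent with Tomcat2VIP, Tomcat2, HttpdVIP and Httpd staying up, although the thread itself does not confirm this as the cause.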
Re: [Pacemaker] Ordering set of resources, problem in ordering chain of resources
Andrew Beekhof andrew@... writes:

Hi Andrew,

Thanks for the reply. The version of Pacemaker I am using is pacemaker-1.0.9.1.

Regards
Rakesh
Re: [Pacemaker] Unable to stop Multi state resource
Rakesh K rakirocker4236@... writes:

Hi Andrew,

FSR is a file system replication script that adheres to the OCF cluster framework. The script is similar to the MySQL OCF script, which is a multi-state resource: on the master an SSH server is running, and on the slave there are rsync scripts that synchronize the data between the master and the slave. The rsync script holds the master FSR location, so that the rsync tool frequently replicates the data from the FSR master location.

Here is the crm configure show output:

node $id=82a5281a-a069-49c1-9f57-d4a8f6eb3d72 prodmsf2
node $id=d8b6c2e7-d1c3-4a15-9411-ed4d710c8672 prodmsf
primitive FSR ocf:msf:fsr \
        params client_script=/home/msf/ha/scripts/ocf/rsyncClient source_dir=/home/msf/services/persistence/ dest_dir=/home/msf/services/persistence/ user=root pid=/var/run/fsr.pid rsync_binary=/usr/bin/rsync rsync_options=-az rsync_interval=1 config_file=/home/msf/ha/config/ocf/fsr.config status_dump=/home/msf/ha/status/rsync_client_dump \
        op start interval=0 timeout=120s \
        op stop interval=0 timeout=120s \
        op monitor interval=10s role=Master timeout=8s \
        op monitor interval=12s timeout=8s
primitive Httpd ocf:heartbeat:apache \
        params configfile=/etc/httpd/conf/httpd.conf httpd=/usr/sbin/httpd client=curl statusurl=http://localhost/img/test.html; testregex=*/html \
        op start interval=0 timeout=120s \
        op stop interval=0 timeout=120s \
        op monitor interval=50s timeout=50s
primitive HttpdVIP ocf:heartbeat:IPaddr3 \
        params ip=10.10.30.103 eth_num=eth0:1 vip_cleanup_file=/var/run/bigha.pid \
        op start interval=0 timeout=120s \
        op stop interval=0 timeout=120s \
        op monitor interval=30s \
        meta target-role=Started
primitive Mysql ocf:heartbeat:mysql \
        params binary=/usr/bin/mysqld_safe config=/etc/my.cnf datadir=/var/lib/mysql user=mysql pid=/var/lib/mysql/mysql.pid socket=/var/lib/mysql/mysql.sock test_passwd=slavepass test_table=test.conn test_user=repl replication_user=repl replication_passwd=slavepass \
        op start interval=0 timeout=120s \
        op stop interval=0 timeout=120s \
        op monitor interval=10s role=Master timeout=8s \
        op monitor interval=12s timeout=8s
primitive MysqlVIP ocf:heartbeat:IPaddr3 \
        params ip=10.10.30.105 eth_num=eth0:3 vip_cleanup_file=/var/run/bigha.pid \
        op start interval=0 timeout=120s \
        op stop interval=0 timeout=60s \
        op monitor interval=30s \
        meta target-role=Started
primitive Tomcat1 ocf:msf:tomcat1 \
        params tomcat_name=tomcat statusurl=http://localhost:8080/; java_home=/ catalina_home=/home/msf/runtime/tomcat/apache-tomcat-6.0.18 client=curl testregex=*/html \
        op start interval=0 timeout=120s \
        op monitor interval=50s timeout=50s \
        op stop interval=0 timeout=120s \
        meta target-role=Started
primitive Tomcat1VIP ocf:heartbeat:IPaddr3 \
        params ip=10.10.30.104 eth_num=eth0:2 vip_cleanup_file=/var/run/bigha.pid \
        op start interval=0 timeout=120s \
        op stop interval=0 timeout=120s \
        op monitor interval=30s \
        meta target-role=Started
ms MS_FSR FSR \
        meta notify=true target-role=Started
ms MS_Mysql Mysql \
        meta notify=true target-role=Started
colocation FSR-with-Tomcat inf: Tomcat1 MS_FSR:Master
colocation Httpd-with-ip inf: HttpdVIP Httpd
colocation Mysql-with-ip inf: MysqlVIP MS_Mysql:Master
colocation Tomcat1-with-ip inf: Tomcat1VIP Tomcat1
order FSR-after-tomcat inf: Tomcat1 MS_FSR
order Httpd-after-ip inf: HttpdVIP Httpd
order Httpd-after-tomcat inf: Tomcat1 HttpdVIP
order Mysql-after-ip inf: MysqlVIP MS_Mysql
order Tomcat1-after-MYSQL inf: MS_Mysql Tomcat1VIP
order Tomcat1-after-ip inf: Tomcat1VIP Tomcat1
property $id=cib-bootstrap-options \
        dc-version=1.0.9-89bd754939df5150de7cd76835f98fe90851b677 \
        cluster-infrastructure=Heartbeat \
        stonith-enabled=false \
        no-quorum-policy=ignore

Regards
Rakesh
[Pacemaker] [pacemaker] To start all the resources on one node when HA starts in 2 node configuration.
Hi All,

Here is my description of the issue. While configuring HA I used this CLI command:

crm configure location HTTPD Httpd rule id=HTTPD-rule 100: \#uname eq hatest1 rule id=HTTPD-rule1 200: \#uname eq hatest2

where Httpd is the resource, with a score of 100 for hatest1 and a score of 200 for the second node, hatest2. Similarly, there are three other resources for which I have given a score of 100 to the first node and 200 to the second node. When HA starts, it checks the scores and starts the processes on hatest2.

Is there a better way, such that Heartbeat/Pacemaker checks a node-level configuration rather than the per-resource location constraints?

Regards
Rakesh
[Pacemaker] unable to migrate resource between nodes in cluster
Hi All,

I am unable to migrate the apache resource between the nodes.

1) To monitor the apache resource I use the status URL with the regex pattern */html, which must match the content of the status URL. The apache script uses this for monitoring and fail-over.
2) On node1 I manually edited the status URL content and saw the resource migrate to node2 in the cluster.
3) I then restored the status URL text on node1 to its original content and emptied the file on node2. In this scenario I expected the resource to migrate back to node1, which is not happening.
4) When I went through the ha-debug file to find the reason, I found this message:

do_state_transition: Starting PEngine Recheck Timer

Please help me regarding this.

Regards
Rakesh
Re: [Pacemaker] build Issue while configuring the cluster glue in CENT OS
Larry Brigman larry.brigman@... writes:

Hi Larry,

Thanks for your suggestion. When I tried to install the e2fsprogs-libs-1.39-23.el5_5.1 rpm, it reported that the package is already up to date. I then tried again to build cluster glue using the make command, which threw the same error.

[root@low-house lib]# rpm -qf /lib/libuuid.so.1
e2fsprogs-libs-1.39-23.el5_5.1
[root@low-house lib]# yum install e2fsprogs-libs-1.39-23.el5_5.1
Loaded plugins: fastestmirror
Loading mirror speeds from cached hostfile
 * addons: mirror.nwresd.org
 * base: mirror.stanford.edu
 * extras: mirror.stanford.edu
 * updates: mirror.san.fastserv.com
Setting up Install Process
Package e2fsprogs-libs-1.39-23.el5_5.1.i386 already installed and latest version
Nothing to do
[root@low-house lib]#

Regards
Rakesh
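The rpm output in this message shows only the runtime library, /lib/libuuid.so.1. Linking with -luuid normally also needs the unversioned libuuid.so name (or a static libuuid.a) and the uuid/uuid.h header, which distributions usually ship in a separate development package. A rough sketch of that check follows. The function name check_uuid_dev and the directory arguments are illustrative assumptions, and this thread does not confirm which package provides these files on this CentOS release.

```shell
#!/bin/sh
# check_uuid_dev is a hypothetical helper: it reports whether the files the
# linker and compiler need for -luuid exist under the given directories.
# Usage: check_uuid_dev LIBDIR INCLUDEDIR
check_uuid_dev() {
    libdir=$1
    incdir=$2
    status=0
    # The linker resolves -luuid via the unversioned name, not libuuid.so.1.
    if [ ! -e "$libdir/libuuid.so" ] && [ ! -e "$libdir/libuuid.a" ]; then
        echo "missing: $libdir/libuuid.so (or libuuid.a)"
        status=1
    fi
    # The compiler needs the public header.
    if [ ! -e "$incdir/uuid/uuid.h" ]; then
        echo "missing: $incdir/uuid/uuid.h"
        status=1
    fi
    [ "$status" -eq 0 ] && echo "uuid development files found"
    return "$status"
}

# Example paths (assumptions); adjust to the system being checked.
check_uuid_dev /usr/lib /usr/include || true
```

If the check reports missing files, having libuuid.so.1 installed would not by itself be enough for the link step to succeed.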
Re: [Pacemaker] build Issue while configuring the cluster glue in CENT OS
Vladislav Bogdanov bubble@... writes:

Hi Vladislav,

When I listed all the packages on CentOS (using rpm -qa), I found that this package is already installed. Is there any alternative way to build cluster glue from the code base on CentOS? This is an urgent requirement. Sorry for posting questions so frequently.
Re: [Pacemaker] build Issue while configuring the cluster glue in CENT OS
Hi Dejan,

I am posting the steps I followed to install cluster glue from source, as described at http://clusterlabs.org/

I downloaded the cluster glue tarball with wget; the tarball is cluster-glue.tar.bz2. Extracting it gave me the directory Reusable-Cluster-Components-glue--be41a3ef5717. I ran configure there and got the output shown further below. When I went on to run make, I got the following messages:

... o -lbz2 -lxml2 -lc -lrt -ldl -lglib-2.0
./.libs/libplumb.so: undefined reference to `uuid_parse'
./.libs/libplumb.so: undefined reference to `uuid_generate'
./.libs/libplumb.so: undefined reference to `uuid_copy'
./.libs/libplumb.so: undefined reference to `uuid_is_null'
./.libs/libplumb.so: undefined reference to `uuid_unparse'
./.libs/libplumb.so: undefined reference to `uuid_clear'
./.libs/libplumb.so: undefined reference to `uuid_compare'
collect2: ld returned 1 exit status
gmake[2]: *** [ipctest] Error 1
gmake[2]: Leaving directory `/home/rakesh/Desktop/Pacemaker/Reusable-Cluster-Components-glue--be41a3ef5717/lib/clplumbing'
gmake[1]: *** [all-recursive] Error 1
gmake[1]: Leaving directory `/home/rakesh/Desktop/Pacemaker/Reusable-Cluster-Components-glue--be41a3ef5717/lib'
make: *** [all-recursive] Error 1

The configure output is as follows:

cluster-glue configuration:
  Version = 1.0.7 (Build: be41a3ef57175fcd35ffe0c4bc897f996cb447ba)
  Features =
  Prefix = /usr
  Executables = /usr/sbin
  Man pages = /usr/man
  Libraries = /usr/lib
  Header files = /usr/include
  Arch-independent files = /usr/share
  Documentation = /usr/share/doc
  State information = /usr/var
  System configuration = /usr/etc
  Use system LTDL = no
  HA group name = haclient
  HA user name = hacluster
  CFLAGS = -g -O2 -ggdb3 -O0 -fgnu89-inline -fstack-protector-all -Wall -Waggregate-return -Wbad-function-cast -Wcast-qual -Wcast-align -Wdeclaration-after-statement -Wendif-labels -Wfloat-equal -Wformat=2 -Wformat-security -Wformat-nonliteral -Winline -Wmissing-prototypes -Wmissing-declarations -Wmissing-format-attribute -Wnested-externs -Wno-long-long -Wno-strict-aliasing -Wpointer-arith -Wstrict-prototypes -Wwrite-strings -ansi -D_GNU_SOURCE -DANSI_ONLY -Werror
  Libraries = -lbz2 -lxml2 -lc -lrt -ldl -L/lib -lglib-2.0
  Stack Libraries =

I also tried to install libuuid-devel, but it reports that no such rpm was found. I can see the libuuid.so.1 file in /usr/lib. Please help me with this issue.

Regards
Raki

On Mon, Mar 7, 2011 at 6:24 PM, rakesh k rakirocker4...@gmail.com wrote:

Hi All,

I got errors like this while compiling cluster glue:

../.libs/libplumb.so: undefined reference to `uuid_parse'
../.libs/libplumb.so: undefined reference to `uuid_generate'
../.libs/libplumb.so: undefined reference to `uuid_copy'
../.libs/libplumb.so: undefined reference to `uuid_is_null'
../.libs/libplumb.so: undefined reference to `uuid_unparse'
../.libs/libplumb.so: undefined reference to `uuid_clear'
../.libs/libplumb.so: undefined reference to `uuid_compare'
collect2: ld returned 1 exit status
gmake[2]: *** [ipctest] Error 1
gmake[2]: Leaving directory `/root/Reusable-Cluster-Components-glue-1.0.5/lib/clplumbing'
gmake[1]: *** [all-recursive] Error 1
gmake[1]: Leaving directory `/root/Reusable-Cluster-Components-glue-1.0.5/lib'
make: *** [all-recursive] Error 1

and here is the output when I run ./configure:

cluster-glue configuration:
  Version = 1.0.7 (Build: be41a3ef57175fcd35ffe0c4bc897f996cb447ba)
  Features =
  Prefix = /usr
  Executables = /usr/sbin
  Man pages = /usr/man
  Libraries = /usr/lib
  Header files = /usr/include
  Arch-independent files = /usr/share
  Documentation = /usr/share/doc
  State information = /usr/var
  System configuration = /usr/etc
  Use system LTDL = no
  HA group name = haclient
  HA user name = hacluster
  CFLAGS = -g -O2 -ggdb3 -O0 -fgnu89-inline -fstack-protector-all -Wall -Waggregate-return -Wbad-function-cast -Wcast-qual -Wcast-align -Wdeclaration-after-statement -Wendif-labels -Wfloat-equal -Wformat=2 -Wformat-security -Wformat-nonliteral -Winline -Wmissing-prototypes -Wmissing-declarations -Wmissing-format-attribute -Wnested-externs -Wno-long-long -Wno-strict-aliasing -Wpointer-arith -Wstrict-prototypes -Wwrite-strings -ansi -D_GNU_SOURCE -DANSI_ONLY -Werror
  Libraries = -lbz2 -lxml2 -lc -lrt -ldl -L/lib -lglib-2.0
  Stack Libraries =

I also tried to update the libs directive in the make file to point the libraries to /lib/libuuid.so.1; still I am