Hi Folks,

i have a problem using stonith with suicide device.
First my use case:
2 SLES 11 nodes in active active configuration using clones.
The main approach is to load balance http requests to a application server.
There is a virtual ip adress which is used by both nodes.

I've try to keep the services alive and if any service didn't start the 
node should reboot for the first time.

Now here comes the problem.
I see in the logs that a request for fence (reboot) are rejected with 
the message
info: process_remote_stonith_query: Ignoring reply from srvhfosamno1, 
hosts are not permitted to commit suicide

So i think i have to add the nodes in the hostlist but the suicide 
stonith devices doesn't know any parameters.

The main question is this the right approach and is there a useful 
documentation about that. Is the sbd device an option?

I looked around some time but didn't find any helpful for me except that 
this suicide device is deprecated.

My Configuration at the moment and i know this is not perfect ;-)
node srvhfosamno1 \
     attributes standby="off"
node srvhfosamno2 \
     attributes standby="off"
primitive STONITH-1 stonith:suicide \
     op monitor interval="120s" \
     op start interval="0s" timeout="240s"
primitive apache ocf:heartbeat:apache \
     op monitor interval="30s" timeout="10s" on-fail="fence" \
     op start interval="0s" timeout="15s" \
     op stop interval="0s" timeout="15s"
primitive apache_ip ocf:heartbeat:IPaddr2 \
     params ip="141.90.235.144" \
     op monitor interval="5s" timeout="5s"
primitive jboss ocf:heartbeat:jboss \
     params jboss_home="/opt/cosmit/jboss-5.1.0.GA" 
java_home="/usr/java/default" user="jboss" run_opts="-c default -b 
127.0.0.1" \
     op monitor interval="30s" timeout="240s" on-fail="fence" \
     op start interval="0s" timeout="240s" \
     op stop interval="0s" timeout="120s"
primitive mailnotification ocf:heartbeat:MailTo \
     params email="[email protected]" subject="SAMURAJ Application 
Cluster"
primitive nfs ocf:heartbeat:Filesystem \
     params fstype="nfs" directory="/var/cosmit/repository/" 
device="141.90.235.149:/var/cosmit/repository/" \
     op monitor interval="30s" timeout="60s" \
     op start interval="0s" timeout="240s" \
     op stop interval="0s" timeout="120s"
clone STONITH-1_CLONE STONITH-1 \
     meta target-role="Started" globally-unique="false" clone-max="2" 
clone-node-max="1" interleave="true" order="true"
clone apache_clone apache \
     meta interleave="true" notifiy="true"
clone apache_ip_clone apache_ip \
     meta master-node-max="1" master-max="1" clone-max="2" 
clone-node-max="1" interleave="true" globally-unique="true"
clone jboss_clone jboss \
     meta interleave="true" target-role="Started"
clone mailnotification_clone mailnotification \
     meta interleave="true"
clone nfs_clone nfs \
     meta interleave="true"
order apache-after-all inf: mailnotification_clone jboss_clone 
apache_clone apache_ip_clone
property $id="cib-bootstrap-options" \
     dc-version="1.1.2-ecb1e2ea172ba2551f0bd763e557fccde68c849b" \
     cluster-infrastructure="openais" \
     no-quorum-policy="ignore" \
     expected-quorum-votes="2" \
     stonith-enabled="true"

so long
frank-matthias
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to