On May 13, 2010, at 4:40 PM, Brodie, Kent wrote:

> Greetings!   I’m hoping someone can enlighten me as to what conceptual thing 
> I’m doing wrong?
>  
> I have four services, defined as a colocated group.   Under normal 
> circumstances, things seem to work fine--  when I put a node in standby, 
> everything moves over as it should , and so on.  (By the way, this whole 
> openais/pacemaker setup is really cool, I’m enjoying this thoroughly).
>  
> The one problem I’m having is the situation where one particular service 
> dies- and keeps on dying.  (simulated, for example, by manually killing the 
> Apache httpd process, etc).   When I do this, the local failcount is 
> incremented (good)--   the service is restarted on the same host (good) – and 
> life goes on.   Until that is, I reach the migration threshold (as below, 5 
> attempts).   What I’m looking for is, “ok, after 5 failed attempts of this 
> service, fail over to the other node.      This works--  but only partially?  
>   That is, (using the apache example)—the apache service DOES fail over.    
> That much works.   But—nothing else goes along with it!    I thought the 
> “collocation” parameter is supposed to do this--  am I missing something else?
>  
>  
>  
> node bennett \
>                 attributes standby="off"
> node caffey \
>                 attributes standby="on"
> primitive Apache ocf:heartbeat:apache \
>                 params configfile="/usr/local/apache2/conf/httpd.conf" 
> port="8000" httpd="/usr/local/apache2/bin/httpd" 
> statusurl="http://127.0.0.1:8000/server-status"; \
>                 op monitor interval="30s" \
>                 meta target-role="Started" \
>                 meta migration-threshold="5"
> primitive ClusterIP ocf:heartbeat:IPaddr2 \
>                 params ip="141.136.224.243" cidr_netmask="24" \
>                 op monitor interval="30s" \
>                 meta target-role="Started"
> primitive asys ocf:heartbeat:asys \
>                 op monitor interval="30s" \
>                 meta target-role="Started" \
>                 meta migration-threshold="5"
> primitive glassfish ocf:heartbeat:glassfish \
>                 op monitor interval="30s" \
>                 op start interval="0" timeout="60s" \
>                 meta target-role="Started" \
>                 meta migration-threshold="5"
> colocation all_services inf: ClusterIP Apache glassfish asys
> order asys-after-glassfish inf: glassfish asys
> property $id="cib-bootstrap-options" \
>                 dc-version="1.0.8-9881a7350d6182bae9e8e557cf20a3cc5dac3ee7" \
>                 cluster-infrastructure="openais" \
>                 expected-quorum-votes="2" \
>                 stonith-enabled="false" \
>                 no-quorum-policy="ignore" \
>                 last-lrm-refresh="1273782512"
> rsc_defaults $id="rsc-options" \
>                 resource-stickiness="100"
>  

Hi,

You might find more help in pacemaker maillist:  [email protected].
But config doesn't look unreasonable. Have you checked how constraints actually 
look like?

 cibadmin -Q -o constraints

and as a workaround you can define a group instead of using collocation/order

group all_services ClusterIP Apache glassfish asys

Vadym

_______________________________________________
Openais mailing list
[email protected]
https://lists.linux-foundation.org/mailman/listinfo/openais

Reply via email to