Re: [Linux-HA] pacemaker restarts services (on the same node) when failed node returns

David Vossel Fri, 13 Dec 2013 07:09:07 -0800

----- Original Message -----
> From: "Peto Michalak" <[email protected]>
> To: [email protected], "linux-ha" <[email protected]>
> Sent: Wednesday, December 11, 2013 2:26:02 AM
> Subject: Re: [Linux-HA] pacemaker restarts services (on the same node) when 
> failed node returns
> 
> Hi David,
> 
> I've attached a crm_report which should show the restart of services, when
> failed node returns.
> 
> I will go through the report as well to see if I find something there.



The constraint in the xml below is what causes the restart. You are telling 
pacemaker to place PGServer on node drpg-02. When drpg-02 rejoins the cluster, 
PGServer restarts because it is being relocated to drpg-02. This is expected 
behavior.

      <rsc_location id="cli-prefer-PGServer" rsc="PGServer">
        <rule id="cli-prefer-rule-PGServer" score="INFINITY" boolean-op="and">
          <expression id="cli-prefer-expr-PGServer" attribute="#uname"
                      operation="eq" value="drpg-02" type="string"/>
        </rule>
      </rsc_location>
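
Constraints with a "cli-prefer-" id are normally left behind by an earlier
manual move of the resource (for example "crm resource migrate"), so it can
be worth checking the CIB for leftovers. The following is only a rough sketch
of such a check, not an official tool: the function name find_cli_constraints
and the CIB_SNIPPET sample are made up for illustration, and the
<constraints> wrapper around the rule is an assumption about how the
surrounding section looks.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample input: the constraint quoted above, wrapped in a
# <constraints> element (assumed layout of the enclosing CIB section).
CIB_SNIPPET = """
<constraints>
  <rsc_location id="cli-prefer-PGServer" rsc="PGServer">
    <rule id="cli-prefer-rule-PGServer" score="INFINITY" boolean-op="and">
      <expression id="cli-prefer-expr-PGServer" attribute="#uname"
                  operation="eq" value="drpg-02" type="string"/>
    </rule>
  </rsc_location>
</constraints>
"""


def find_cli_constraints(xml_text):
    """Return (constraint id, resource, score, node) tuples for location
    constraints whose id starts with "cli-prefer-" or "cli-standby-"."""
    root = ET.fromstring(xml_text)
    found = []
    for loc in root.iter("rsc_location"):
        cid = loc.get("id", "")
        if not cid.startswith(("cli-prefer-", "cli-standby-")):
            continue
        for rule in loc.iter("rule"):
            for expr in rule.iter("expression"):
                # Only report expressions that match on the node name.
                if expr.get("attribute") == "#uname":
                    found.append(
                        (cid, loc.get("rsc"), rule.get("score"), expr.get("value"))
                    )
    return found


if __name__ == "__main__":
    for cid, rsc, score, node in find_cli_constraints(CIB_SNIPPET):
        print(f"{cid}: {rsc} pinned to {node} with score {score}")
```

To run it against a live cluster you would feed it the constraints section
of the CIB (something like the output of "cibadmin -Q -o constraints")
instead of the sample string. If the constraint turns out to be unwanted,
"crm resource unmigrate PGServer" should remove it; check the exact command
name against your crm shell version.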


> 
> Thank you for your help.
> 
> Best Regards,
> -Peter
> 
> > Hello,
> >
> > I really searched for the answer before posting : ).
> >
> > I have a pacemaker setup + corosync + drbd in Active/Passive mode running
> > in 2 node cluster on Ubuntu 12.04.3.
> >
> > Everything works fine and on node failure the services are taken care of by
> > the other node (THANKS guys!), well the problem is that I've noticed, that
> > once the failed node comes back alive, the pacemaker restarts the
> > postgresql and virtual IP and that takes around 4-7 seconds (but keeps it
> > on the same node as I wanted, what's the point? :) ). Is this really
> > necessary, or I've messed up something in the configuration?
> 
> any chance you could provide us with a crm_report during the time frame of
> this unwanted restart?
> 
> -- Vossel
> 
> > My pacemaker config:
> >
> > node drpg-01 attributes standby="off"
> > node drpg-02 attributes standby="off"
> > primitive drbd_pg ocf:linbit:drbd \
> >         params drbd_resource="drpg" \
> >         op monitor interval="15" \
> >         op start interval="0" timeout="240" \
> >         op stop interval="0" timeout="120"
> > primitive pg_fs ocf:heartbeat:Filesystem \
> >         params device="/dev/drbd/by-res/drpg" directory="/db/pgdata" options="noatime,nodiratime" fstype="xfs" \
> >         op start interval="0" timeout="60" \
> >         op stop interval="0" timeout="120"
> > primitive pg_lsb lsb:postgresql \
> >         op monitor interval="30" timeout="60" \
> >         op start interval="0" timeout="60" \
> >         op stop interval="0" timeout="60"
> > primitive pg_vip ocf:heartbeat:IPaddr2 \
> >         params ip="10.34.2.60" iflabel="pgvip" \
> >         op monitor interval="5"
> > group PGServer pg_fs pg_lsb pg_vip
> > ms ms_drbd_pg drbd_pg \
> >         meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> > colocation col_pg_drbd inf: PGServer ms_drbd_pg:Master
> > order ord_pg inf: ms_drbd_pg:promote PGServer:start
> > property $id="cib-bootstrap-options" \
> >         dc-version="1.1.6-9971ebba4494012a93c03b40a2c58ec0eb60f50c" \
> >         cluster-infrastructure="openais" \
> >         expected-quorum-votes="2" \
> >         no-quorum-policy="ignore" \
> >         pe-warn-series-max="1000" \
> >         pe-input-series-max="1000" \
> >         pe-error-series-max="1000" \
> >         default-resource-stickiness=1000 \
> >         cluster-recheck-interval="5min" \
> >         stonith-enabled="false" \
> >         last-lrm-refresh="1385646505"
> >
> > Thank you.
> >
> > Best Regards,
> > -Peter
> > _______________________________________________
> > Linux-HA mailing list
> > [email protected]
> > http://lists.linux-ha.org/mailman/listinfo/linux-ha
> > See also: http://linux-ha.org/ReportingProblems
> >
> 
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
