Re: [Linux-HA] Master/Slave problems

Adrian Chapela Tue, 26 Feb 2008 03:45:41 -0800

Raoul Bhatia [IPAX] escribió:

Adrian Chapela wrote:
      <master_slave id="MySQL_Server">
[snip]
          <operations>
<op id="mysqld-child-monitor" name="monitor"interval="20s" timeout="19s" prereq="nothing"/>
            <op id="mysqld-child-start" name="start" prereq="nothing"/>
          </operations>
        </primitive>
      </master_slave>
I think that this line: <op id="mysqld-child-monitor"name="monitor" interval="20s" timeout="19s" prereq="nothing"/> is theline to config monitoring operations and the time to do that. In thisline I think interval is 20 seconds, but I am testing and I manuallymake an error in the Master MySQL server to test failover. I saw thatmonitoring operation isn't being executed and the error isn'tdetected by Heartbeat.
If I run the script manually the error is detected but Heartbeat isnot running the script in monitor mode and it don't know the problem.This is the crm_mon output:
[snip]

Yes, I already did this and now I am testing more options. Now, a Slaveserver is making failover well but I have some problems with my mysqlscript ( http://code.adrianchapela.net/heartbeat/mysql_slave_master ).One of them is the stop operation. After a failure, my mysql resource isstopped but MySQL monitor is always informing that the server is downand failed. Heartbeat knows the server is failed. When I am stoppingHeartbeart server, this can't stop well. It says this:

crmd[8531]: 2008/02/26_11:09:10 ERROR: verify_stopped: Resourcemysqld-child:0 was active at shutdown. You may ignore this error if itis unmanaged.crmd[8531]: 2008/02/26_11:09:10 info: process_client_disconnect:Received HUP from tengine:[-1]crmd[8531]: 2008/02/26_11:09:10 ERROR: verify_stopped: Resourcemysqld-child:0 was active at shutdown. You may ignore this error if itis unmanaged.


And this:

tengine[8566]: 2008/02/26_11:09:09 info: te_connect_stonith: Attemptingconnection to fencing daemon...crmd[8531]: 2008/02/26_11:09:09 info: stop_subsystem: Sent -TERM totengine: [8566]tengine[8566]: 2008/02/26_11:09:09 ERROR: stonithd_signon: Can'tinitiate connection to stonithdcrmd[8531]: 2008/02/26_11:09:09 info: do_shutdown: Waiting forsubsystems to exit

tengine[8566]: 2008/02/26_11:09:09 notice: Not currently connected.

crmd[8531]: 2008/02/26_11:09:09 info: do_shutdown: All subsystemsstopped, conti

I am searching information about this errors and How can I force thestop operation ? Stonith daemon should shutdown the server automatically ?

please refer to [1] and add more monitoring actions for all applicable
roles.

cheers,
raoul
[1]http://www.linux-ha.org/ClusterInformationBase/Actions#head-951a50aae161c116d73c95aa0659873ee7a2973b


_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] Master/Slave problems

Reply via email to