On Wednesday 03 March 2010 20:40:18 Brian Wolfe wrote: > I have a cluster setup with 2 dell servers, dual ethernet heartbeats, > and a single 8-port APCMaster PDU switch. The cluster works except > for one issue. The cloned stonithd interface refuses to make a call to > the apcmaster to power down the node that's "dead". Reading through > the logs I can see that during setup the stonithd asks the > apcmastersnmp module to check it's hosts list and it returns the > correct hostnames "tpc-dal-prlores3 tpc-dal-tcfs2". However when the > time comes for it to actually use the device I get the following > message from stonithd refusing to actually kill the other node.
hmm .... the outlet names of the PDU are also uppercase? Regards, Andreas > > Mar 3 13:00:36 TPC-DAL-TCFS2 crmd: [15805]: info: te_fence_node: > Executing poweroff fencing operation (24) on TPC-DAL-PRLORES3 > (timeout=60000) > Mar 3 13:00:36 TPC-DAL-TCFS2 crmd: [15805]: debug: waiting for the > stonith reply msg. > Mar 3 13:00:36 TPC-DAL-TCFS2 stonithd: [15800]: info: client tengine > [pid: 15805] requests a STONITH operation POWEROFF on node > TPC-DAL-PRLORES3 > Mar 3 13:00:36 TPC-DAL-TCFS2 stonithd: [15800]: info: we can't manage > TPC-DAL-PRLORES3, broadcast request to other nodes > Mar 3 13:00:36 TPC-DAL-TCFS2 stonithd: [15800]: debug: inserted > optype=POWEROFF, key=-2 > Mar 3 13:00:36 TPC-DAL-TCFS2 stonithd: [15800]: info: Broadcasting > the message succeeded: require others to stonith node > TPC-DAL-PRLORES3. > Mar 3 13:00:36 TPC-DAL-TCFS2 stonithd: [15800]: debug: > stonithd_node_fence: sent back a synchronous reply. > Mar 3 13:00:36 TPC-DAL-TCFS2 crmd: [15805]: debug: > stonithd_node_fence:574: stonithd's synchronous answer is ST_APIOK > > > The stonith is configured as follows: > > <clone id="fencing" > > <primitive class="stonith" id="apcstonith23" type="apcmastersnmp" > > <operations id="apcstonith23-operations" > > <op id="apcstonith23-op-monitor-15" interval="15" > name="monitor" start-delay="15" timeout="15" /> > </operations> > <instance_attributes id="apcstonith23-instance_attributes" > > <nvpair id="nvpair-604e339f-a400-4b30-82c0-f046de0ed663" > name="ipaddr" value="172.20.1.23" /> > <nvpair id="nvpair-ed611421-97a1-4091-a5cd-8159f1230096" name="port" > value="161" /> > <nvpair id="nvpair-997431e2-ea78-4065-b835-f9149bbcb596" > name="community" value="private" /> > </instance_attributes> > </primitive> > <meta_attributes id="fencing-meta_attributes" > > </meta_attributes> > </clone> > > > I can confirm the use of the stonith via the command "stonith -t > apcmastersnmp <params> tpc-dal-prlores3" and it'll switch off the > server. > > Any help would be appreciated. > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems > _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
