Hi, On Thu, Aug 20, 2009 at 07:55:07AM -0500, Terry L. Inzauro wrote: > Dejan Muhamedagic wrote: > > Hi, > > > > On Thu, Aug 20, 2009 at 11:01:43AM +0200, Thomas Glanzmann wrote: > >> Hello Terry, > >> > >>> What would cause the stonith 'start' operation to fail after it > >>> initially had succeeded? > >> if my understanding is correct (I wrote a stonith agent for vsphere > >> yesterday). Than it runs the status command of the stonith agent and > >> looks at the exist status, like that: > >> > >> (ha-01) [~] VI_SERVER=esx-03.glanzmann.de VI_USERNAME=root > >> /usr/lib/stonith/plugins/external/vsphere status; echo $? > >> Enter password: > >> 0 > > > > Right. The start operation includes a status. If the status > > operation fails, the start obviously fails too. > > > > Thanks, > > > > Dejan > > > >> Thomas > >> _______________________________________________ > > > Ok, I understand that, but why would it intermittently fail if > it initialy succeeds? These machines are not heavily loaded > and are by no means slow.
That obviously depends on your stonith device. Unless you think it works without problems in which case you must've found a well hidden bug. If so, please file a bugzilla with a hb_report report. BTW, external/ssh should be used _only_ for testing purposes. > I guess the better question would be: How to I track down the > culprit? Obviously, if a monitor or start command fail on the > stonith agent, then it will cause a "stonith reboot" operation No, that shouldn't cause a reboot of any kind, unless you specifically ask for it. Thanks, Dejan > of one or both of the nodes which shouldn't happen unless > theres a definite reason to do so. > > > > _______________________________________________ > Linux-HA mailing list > [email protected] > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list [email protected] http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems
