Hi Andrew,
> If the crmd dies, then I (IIRC) the lrmd cancels all existing resource > monitoring. > However, when the crmd is recovered, it should setup the resource > monitoring again. > > Is the second part not happening? There are some patterns in a problem. When crmd/stonithd restarts in a ACT node in a DC node, the monitor completely stops. When others of lrmd/mgmtd restart in the STB node of the DC node, the monitor completely stops. > > 1) When a process related to a monitor fell, a system reboots.(Emergency > > Reboot) > Thats the lazy way out. I think your opinion is right. If a monitor is reopened definitely, I think that the Emergency reboot is not necessary. Best Regards, Hideo Yamauchi. --- Andrew Beekhof <and...@beekhof.net> wrote: > 2009/7/17 <renayama19661...@ybb.ne.jp>: > > Hi Andrew, > > > >> What do you mean by monitor here? > >> Do you mean that pacemaker would no longer detect if those two processes > >> died? > > > > This monitor means the monitor of the resource. > > When these processes fall, the monitor of the resource of lrmd/stonithd > > stops. > > For example, the monitor of external/ssh and pgsql stops. > > > >> Do you mean the way heartbeat behaves with "crm on" instead of "crm > >> respawn" ? > > > > I do not understand this meaning well. > > That makes two of us :-) > I'm not sure I really understand the problem here. > > If the crmd dies, then I (IIRC) the lrmd cancels all existing resource > monitoring. > However, when the crmd is recovered, it should setup the resource > monitoring again. > > Is the second part not happening? > > > > > I think that the following approach is necessary. > > > > 1) When a process related to a monitor fell, a system reboots.(Emergency > > Reboot) > > Thats the lazy way out. > > > 2) When a process related to a monitor fell, a monitor does not stop even > > if it reboot. > > > > The first approach thinks that realization is simple. > > > > Best Regards, > > Hideo Yamauchi. > > > > --- Andrew Beekhof <and...@beekhof.net> wrote: > > > >> On Fri, Jul 17, 2009 at 3:34 AM, <renayama19661...@ybb.ne.jp> wrote: > >> > Hi, > >> > > >> > We began shift investigation to the combination of Pacemaker and > >> > corosync/openais now. > >> > > >> > We put Pacemaker and openais(whitetank) together and confirmed movement > >> > at the time of the > >> process > >> > trouble. > >> > (This is the function that a reboot emergency occurred by a combination > >> > with Heartbeat.) > >> > > >> > I let a process of Pacemaker break down. (kill -9 pid) > >> > The following behavior was seen then. > >> > > >> > * When crmd reboots on ACT node(Not DC), the monitor of the lrmd > >> > resource stops. > >> > \xA0And the monitor of the stonnith resource stops. > >> > >> What do you mean by monitor here? > >> Do you mean that pacemaker would no longer detect if those two processes > >> died? > >> > >> > * When stonithd reboots on ACT node(Not DC), the monitor of the stonnith > >> > resource stops. > >> > * When crmd reboots on STB node(DC), the monitor of the stonnith > >> > resource stops. > >> > * When pengine reboots on STB node(DC), the monitor of the stonnith > >> > resource stops. > >> > * And more.... > >> > > >> > We feel a problem for the stop of the monitor after the process reboot. > >> > When we combined openais/corosync, we hope a function such as the urgent > >> > reboot of > Heartbeat > >> to be > >> > included. > >> > >> Do you mean the way heartbeat behaves with "crm on" instead of "crm > >> respawn" ? > >> > > >> > Best Regards, > >> > Hideo Yamauchi. > >> > > >> > > >> > _______________________________________________ > >> > Pacemaker mailing list > >> > Pacemaker@oss.clusterlabs.org > >> > http://oss.clusterlabs.org/mailman/listinfo/pacemaker > >> > > >> > >> _______________________________________________ > >> Pacemaker mailing list > >> Pacemaker@oss.clusterlabs.org > >> http://oss.clusterlabs.org/mailman/listinfo/pacemaker > >> > > > > > > > > _______________________________________________ > > Pacemaker mailing list > > Pacemaker@oss.clusterlabs.org > > http://oss.clusterlabs.org/mailman/listinfo/pacemaker > > > > > > _______________________________________________ > Pacemaker mailing list > Pacemaker@oss.clusterlabs.org > http://oss.clusterlabs.org/mailman/listinfo/pacemaker >
_______________________________________________ Pacemaker mailing list Pacemaker@oss.clusterlabs.org http://oss.clusterlabs.org/mailman/listinfo/pacemaker