On 6/12/07, Anders Brownworth <[EMAIL PROTECTED]> wrote:
Hi,

I have a heartbeat v2 setup that I am trying to use to migrate OpenSER and
an IP address back and forth between 2 boxes. (box01 and box02) I wrote an
OCF for OpenSER and am using the heartbeat provided IPaddr to manage the IP
address. My OCF checks out with the ocf-tester script and my
cib.xml verifies with crm_verify -x /var/lib/heartbeat/crm/cib.xml.

When I start both nodes with exactly the same configuration, they fight
about the state of things, and the first ERROR I get is:

crmd: [3507]: ERROR: do_exit:control.c Could not recover from internal error

in /var/log/messages on box02, and neither service (IP or OpenSER) is
started. Both boxes give off a pile of info and warning messages that don't
seem to point me in a worthwhile direction.

Both boxes seem to get to "info: main:attrd.c Starting mainloop..." without
any issues. But once they try to decide on who has what, they start fighting
and constantly rewriting their cib.xml files, over and over, endlessly. (Is
this normal?) crmd starts complaining about pengine dying on a signal 14 and
everything pretty much goes to hell from there.

it _should_ be able to recover.

can you send your logs as attachments? (trying to read logs wrapped at
80 chars is hell)

you also didn't mention what version you're running
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
