On Mar 5, 2009, at 12:39 PM, Harakiri wrote:
>> YES it _is_.
>> The log messages above indicate the order heartbeat starts them in -
>> anything after that is up to the scheduler of your OS.
>> Regardless, the crmd and cib both have loops that retry opening
>> connections to the services they require - with the possible exception
>> of the cluster itself.
>
> But these loops don't work. As I said, on other systems like Debian
> the processes are executed in the right order, but not here.
> I can manually fix the opening of the pipes by adding a while loop to
> ipcsocket.c for when the pipe does not exist yet. If they loop by
> themselves to try again, why isn't it working? I don't see any loop
> around
> struct IPC_CHANNEL *
> socket_client_channel_new(GHashTable *ch_attrs)
> Where is it?

The loops I'm talking about are at a much higher level - I have no
knowledge of how the IPC code works.
E.g. do_cib_control() arranges for the crmd to try connecting to the
cib up to 30 times before giving up.
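
As a rough illustration of that kind of higher-level retry, a bounded loop in C could look something like the sketch below. This is not the actual do_cib_control() code; connect_with_retries(), connect_fn, fake_peer and fake_connect() are made-up names used only for the example.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical callback type: returns true once the connection is up. */
typedef bool (*connect_fn)(void *ctx);

/*
 * Call try_connect up to max_attempts times.
 * Returns the 1-based number of the attempt that succeeded,
 * or 0 if every attempt failed.
 */
int
connect_with_retries(connect_fn try_connect, void *ctx, int max_attempts)
{
    for (int attempt = 1; attempt <= max_attempts; attempt++) {
        if (try_connect(ctx)) {
            return attempt;
        }
        /* A real daemon would wait here (timer or sleep) before retrying. */
    }
    return 0;
}

/* Hypothetical stand-in for a peer that becomes ready after N failed calls. */
struct fake_peer {
    int calls_until_ready;
};

bool
fake_connect(void *ctx)
{
    struct fake_peer *peer = ctx;

    if (peer->calls_until_ready > 0) {
        peer->calls_until_ready--;
        return false;   /* peer not up yet: report the failure */
    }
    return true;
}
```

With max_attempts set to 30, a peer that only becomes ready on the fourth call is reached on attempt 4, and a peer that never comes up gives 0 after 30 tries. The loop only helps if each unsuccessful attempt really does return a failure to the caller.
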
It sounds like the Solaris equivalent of socket_client_channel_new()
isn't failing properly.
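
One way to check this on a given platform is a small test along these lines. It assumes the IPC channel is a UNIX-domain socket bound to a filesystem path. try_unix_connect() is a hypothetical helper, not a function from the heartbeat IPC library. The idea is that connecting to a path that does not exist yet should return an error straight away, rather than hang or appear to succeed.

```c
#include <errno.h>
#include <string.h>
#include <sys/socket.h>
#include <sys/un.h>
#include <unistd.h>

/*
 * Try to connect to a UNIX-domain stream socket at `path`.
 * Returns a connected fd on success, or -1 with errno set on failure.
 */
int
try_unix_connect(const char *path)
{
    struct sockaddr_un addr;
    int fd;

    /* Reject paths that would not fit into sun_path (incl. the NUL). */
    if (strlen(path) >= sizeof(addr.sun_path)) {
        errno = ENAMETOOLONG;
        return -1;
    }

    fd = socket(AF_UNIX, SOCK_STREAM, 0);
    if (fd < 0) {
        return -1;
    }

    memset(&addr, 0, sizeof(addr));
    addr.sun_family = AF_UNIX;
    strcpy(addr.sun_path, path);   /* safe: length checked above */

    if (connect(fd, (struct sockaddr *)&addr, sizeof(addr)) < 0) {
        int saved = errno;         /* close() may overwrite errno */
        close(fd);
        errno = saved;
        return -1;
    }
    return fd;
}
```

If a call like try_unix_connect("/path/that/does/not/exist") returns -1 with errno set to ENOENT, the low-level failure is visible to the caller and a higher-level retry loop has something to react to. If it blocks or returns a usable-looking descriptor instead, the retries would never be triggered.
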
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems