On Mar 5, 2009, at 12:39 PM, Harakiri wrote:

YES it _is_.
The log messages above indicate the order heartbeat starts
them in -
anything after that is up to the scheduler of your OS.

Regardless, the crmd and cib both have loops that retry
opening
connections to the services they require - with the
possible exception
of the cluster itself.

But these loops dont work - as i said on other systems like debian the processes are executed in the right order but not here.

I can manually fix the opening of pipes with adding a while loop ipcsocket.c when the pipe does not exist yet - if they would loop itself to try again - why isnt it working ? i dont see any reference to a loop to

struct IPC_CHANNEL *
socket_client_channel_new(GHashTable *ch_attrs)

where is it?

the loops i'm talking about are at a much higher level - i've no knowledge of how the IPC code works. eg. do_cib_control() arranges for the crmd to try connecting to the cib up to 30 times before giving up.

it sounds like the solaris equivalent of socket_client_channel_new() isnt failing properly.
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to