Hi Lars

[email protected] wrote on 08.09.2009 16:49:05:

> SLE_11 is more uptodate at the UNSTABLE repo:
>
> zypper ar http://download.opensuse.org/repositories/server:/ha-
> clustering:/UNSTABLE/SLE_11/ sle11-ha
> zypper ref
> zypper in pacemaker

After installing unfortunately I got everything else than a running
heartbeat. :-/ I tried it on one machine with an update and on another one
after de- and reinstallation. Both of them show something like this
in /var/log/messages:

...
Sep 16 10:35:00 sles11-master ccm: [3997]: info: Hostname: sles11-master
Sep 16 10:35:00 sles11-master stonithd: [4000]: WARN: Core dumps could be
lost if multiple dumps occur.
Sep 16 10:35:00 sles11-master stonithd: [4000]: WARN: Consider setting
non-default value in /proc/sys/kernel/core_pattern (or equivalent) for
maximum supportability
Sep 16 10:35:00 sles11-master stonithd: [4000]: WARN: Consider
setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum
supportability
Sep 16 10:35:00 sles11-master stonithd: [4000]: info:
G_main_add_SignalHandler: Added signal handler for signal 10
Sep 16 10:35:00 sles11-master stonithd: [4000]: info:
G_main_add_SignalHandler: Added signal handler for signal 12
Sep 16 10:35:00 sles11-master stonithd: [4000]: info: crm_cluster_connect:
Unsupported cluster stack: (null)
Sep 16 10:35:00 sles11-master stonithd: [4000]: ERROR: failed to connect to
cluster
Sep 16 10:35:00 sles11-master stonithd: [4000]:
ERROR: /usr/lib/heartbeat/stonithd abnormally abort.
Sep 16 10:35:00 sles11-master heartbeat: [3956]: WARN:
Managed /usr/lib/heartbeat/stonithd process 4000 exited with return code
100.
Sep 16 10:35:00 sles11-master heartbeat: [4001]: info: Starting
"/usr/lib/heartbeat/attrd" as uid 90  gid 90 (pid 4001)
Sep 16 10:35:00 sles11-master cib: [3998]: info: retrieveCib: Reading
cluster configuration from: /var/lib/heartbeat/crm/cib.xml
(digest: /var/lib/heartbeat/crm/cib.xml.sig)
Sep 16 10:35:00 sles11-master cib: [3998]: WARN: retrieveCib: Cluster
configuration not found: /var/lib/heartbeat/crm/cib.xml
Sep 16 10:35:00 sles11-master cib: [3998]: WARN: readCibXmlFile: Primary
configuration corrupt or unusable, trying backup...
Sep 16 10:35:00 sles11-master cib: [3998]: WARN: readCibXmlFile: Continuing
with an empty configuration.
Sep 16 10:35:00 sles11-master lrmd: [3999]: info: G_main_add_SignalHandler:
Added signal handler for signal 15
Sep 16 10:35:00 sles11-master attrd: [4001]: info:
Invoked: /usr/lib/heartbeat/attrd
Sep 16 10:35:00 sles11-master attrd: [4001]: info: main: Starting up
Sep 16 10:35:00 sles11-master attrd: [4001]: info: crm_cluster_connect:
Unsupported cluster stack: (null)
Sep 16 10:35:00 sles11-master attrd: [4001]: ERROR: main: HA Signon failed
Sep 16 10:35:00 sles11-master attrd: [4001]: info: main: Cluster connection
active
Sep 16 10:35:00 sles11-master attrd: [4001]: info: main: Accepting
attribute updates
Sep 16 10:35:00 sles11-master lrmd: [3999]: info: G_main_add_SignalHandler:
Added signal handler for signal 17
Sep 16 10:35:00 sles11-master lrmd: [3999]: WARN: Core dumps could be lost
if multiple dumps occur.
Sep 16 10:35:00 sles11-master lrmd: [3999]: WARN: Consider setting
non-default value in /proc/sys/kernel/core_pattern (or equivalent) for
maximum supportability
Sep 16 10:35:00 sles11-master lrmd: [3999]: WARN: Consider
setting /proc/sys/kernel/core_uses_pid (or equivalent) to 1 for maximum
supportability
Sep 16 10:35:00 sles11-master lrmd: [3999]: info: G_main_add_SignalHandler:
Added signal handler for signal 10
Sep 16 10:35:00 sles11-master lrmd: [3999]: info: G_main_add_SignalHandler:
Added signal handler for signal 12
Sep 16 10:35:00 sles11-master lrmd: [3999]: info: Started.
Sep 16 10:35:00 sles11-master attrd: [4001]: ERROR: main: Aborting startup
Sep 16 10:35:00 sles11-master heartbeat: [3956]: WARN:
Managed /usr/lib/heartbeat/attrd process 4001 exited with return code 100.
Sep 16 10:35:00 sles11-master cib: [3998]: info: startCib: CIB
Initialization completed successfully
Sep 16 10:35:00 sles11-master cib: [3998]: info: crm_cluster_connect:
Unsupported cluster stack: (null)
Sep 16 10:35:01 sles11-master cib: [3998]: CRIT: cib_init: Cannot sign in
to the cluster... terminating
Sep 16 10:35:01 sles11-master heartbeat: [3956]: WARN:
Managed /usr/lib/heartbeat/cib process 3998 exited with return code 100.
Sep 16 10:35:01 sles11-master heartbeat: [3956]: EMERG: Rebooting system.
Reason: /usr/lib/heartbeat/cib
Sep 16 10:35:01 sles11-master crmd: [4002]: info: do_cib_control: Could not
connect to the CIB service: connection failed
Sep 16 10:35:01 sles11-master crmd: [4002]: WARN: do_cib_control: Couldn't
complete CIB registration 1 times... pause and retry
Sep 16 10:35:01 sles11-master crmd: [4002]: info: crmd_init: Starting
crmd's mainloop

What's happening there? Which informations do you need to dig deeper?

Regards,

Yves Schumann
Softwareentwicklungsingenieur Security Solutions Division
IT-Koordinator
______________________________
Ascom (Schweiz) AG

"Walking on water and developing software from a specification are easy if
both are frozen" -- Edward V. Berard

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to