On 29/11/2007, Dejan Muhamedagic <[EMAIL PROTECTED]> wrote:
> Yes, very much so. For some reason the MCP (master control
> process) doesn't start the rest of the programs which are doing
> the real work. I really can't say why. Can you please attach the
> logs from this node?

Running pstree(1) on the working node shows quite vividly which
process is responsible for starting the others:

  |-heartbeat,18449
  |   |-attrd,18477
  |   |-ccm,18473
  |   |-cib,18474
  |   |-crmd,18478
  |   |   |-pengine,18505
  |   |   `-tengine,18504
  |   |-heartbeat,18452
  |   |-heartbeat,18453
  |   |-heartbeat,18454
  |   |-heartbeat,18455
  |   |-heartbeat,18456
  |   |-lrmd,18475 -r
  |   |-mgmtd,18479 -v
  |   `-stonithd,18476
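
To compare the two nodes without eyeballing the trees, something like the following sketch could list which daemons are missing under heartbeat. It assumes output in the "name,pid" shape shown above, and the EXPECTED set is simply copied from the working node's tree, not from any authoritative list; the function names are made up for illustration.

```python
import re

# Matches one tree line: drawing characters up to the first "-", then "name,pid".
NODE = re.compile(r"^([ |`]*-)([^,\s]+),(\d+)")

# Assumption: the daemons seen under heartbeat on the working node.
EXPECTED = {"attrd", "ccm", "cib", "crmd", "lrmd", "mgmtd", "stonithd"}

# The tree from the working node, as posted above.
WORKING_TREE = """
  |-heartbeat,18449
  |   |-attrd,18477
  |   |-ccm,18473
  |   |-cib,18474
  |   |-crmd,18478
  |   |   |-pengine,18505
  |   |   `-tengine,18504
  |   |-heartbeat,18452
  |   |-heartbeat,18453
  |   |-heartbeat,18454
  |   |-heartbeat,18455
  |   |-heartbeat,18456
  |   |-lrmd,18475 -r
  |   |-mgmtd,18479 -v
  |   `-stonithd,18476
"""


def children_of(tree_text, parent_name):
    """Return the names of the direct children of the first node
    called parent_name, in the order they appear."""
    parent_depth = None
    child_depth = None
    children = []
    for line in tree_text.splitlines():
        m = NODE.match(line)
        if not m:
            continue  # blank or unrecognised line
        depth, name = len(m.group(1)), m.group(2)
        if parent_depth is None:
            if name == parent_name:
                parent_depth = depth
            continue
        if depth <= parent_depth:
            break  # we have left the parent's subtree
        if child_depth is None:
            child_depth = depth  # first line below the parent is a direct child
        if depth == child_depth:
            children.append(name)
    return children


def missing_daemons(tree_text):
    """Return the expected daemons that are not direct children of heartbeat."""
    return EXPECTED - set(children_of(tree_text, "heartbeat"))


if __name__ == "__main__":
    print(sorted(missing_daemons(WORKING_TREE)))
```

Feeding it the pstree output from the broken node instead of WORKING_TREE would list the daemons that the master control process never started.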

Here are the logs again (from tonight):

      1 heartbeat[17481]: 2007/11/29_07:12:40 WARN: heartbeat: udp port 695 reserved for service "ieee-mms-ssl".
      2 heartbeat[17481]: 2007/11/29_07:12:40 info: Version 2 support: yes
      3 heartbeat[17481]: 2007/11/29_07:12:40 WARN: File /etc/ha.d/haresources exists.
      4 heartbeat[17481]: 2007/11/29_07:12:40 WARN: This file is not used because crm is enabled
      5 heartbeat[17481]: 2007/11/29_07:12:40 WARN: Logging daemon is disabled --enabling logging daemon is recommended
      6 heartbeat[17481]: 2007/11/29_07:12:40 info: **************************
      7 heartbeat[17481]: 2007/11/29_07:12:40 info: Configuration validated. Starting heartbeat 2.1.2
      8 heartbeat[17482]: 2007/11/29_07:12:40 info: heartbeat: version 2.1.2
      9 heartbeat[17482]: 2007/11/29_07:12:40 info: Heartbeat generation: 1196102397
     10 heartbeat[17482]: 2007/11/29_07:12:40 info: G_main_add_TriggerHandler: Added signal manual handler
     11 heartbeat[17482]: 2007/11/29_07:12:40 info: G_main_add_TriggerHandler: Added signal manual handler
     12 heartbeat[17482]: 2007/11/29_07:12:40 info: Removing /var/run/heartbeat/rsctmp failed, recreating.
     13 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
     14 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound send socket to device: eth0
     15 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound receive socket to device: eth0
     16 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: started on port 695 interface eth0 to 192.168.0.248
     17 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: write socket priority set to IPTOS_LOWDELAY on eth0
     18 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound send socket to device: eth0
     19 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: bound receive socket to device: eth0
     20 heartbeat[17482]: 2007/11/29_07:12:40 info: glib: ucast: started on port 695 interface eth0 to 192.168.0.249
     21 heartbeat[17482]: 2007/11/29_07:12:40 info: G_main_add_SignalHandler: Added signal handler for signal 17
     22 heartbeat[17482]: 2007/11/29_07:12:40 info: Local status now set to: 'up'
     23 heartbeat[17482]: 2007/11/29_07:12:41 info: Link drbd01.test.spammatters.local:eth0 up.
     24 heartbeat[17482]: 2007/11/29_07:12:41 info: Status update for node drbd01.test.spammatters.local: status up
     25 heartbeat[17482]: 2007/11/29_07:13:45 info: all clients are now paused
     26 heartbeat[17482]: 2007/11/29_07:13:45 debug: hist->ackseq =0
     27 heartbeat[17482]: 2007/11/29_07:13:45 debug: hist->lowseq =0, hist->hiseq=101
     28 heartbeat[17482]: 2007/11/29_07:13:45 debug: expecting from drbd01.test.spammatters.local
     29 heartbeat[17482]: 2007/11/29_07:13:45 debug: it's ackseq=0
     30 heartbeat[17482]: 2007/11/29_07:13:45 debug:

(The line numbers might come in handy when discussing them.)

The last five "debug:" lines repeat ad infinitum.
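
For trimming a log like this before posting, a rough sketch along these lines could drop the repeats of a trailing block while ignoring the timestamps. It assumes unwrapped lines of the shape "heartbeat[PID]: DATE_TIME LEVEL: message" without the leading line numbers, and that the caller already knows the length of the repeating block; both function names are hypothetical.

```python
import re

# Assumed prefix, based on the excerpt: "heartbeat[17482]: 2007/11/29_07:13:45 "
STAMP = re.compile(r"^\S+\[\d+\]: \d{4}/\d\d/\d\d_\d\d:\d\d:\d\d ")


def strip_stamp(line):
    """Remove the process tag and timestamp so repeats compare equal."""
    return STAMP.sub("", line.strip())


def collapse_trailing_cycle(lines, period):
    """Keep only the earliest copy of the final `period`-line block.

    Returns (kept_lines, repeat_count), where repeat_count is how many
    times the block occurred back to back at the end of the log.
    """
    if period <= 0 or len(lines) < period:
        return list(lines), 0
    keys = [strip_stamp(line) for line in lines]
    block = keys[-period:]
    end = len(keys)
    repeats = 0
    # Walk backwards one block at a time while the block keeps matching.
    while end - period >= 0 and keys[end - period:end] == block:
        repeats += 1
        end -= period
    return list(lines[:end + period]), repeats
```

Calling collapse_trailing_cycle(lines, 5) on the full log would keep everything up to and including the first copy of the five-line block and report how often it repeated.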

Thanks very much.

--Amos
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems
