Wasn't the start order on payloads changed so that clmna starts before immnd?
It doesn't look like it from this log.

/Bertil

From: Mathi Naickan [mailto:[email protected]]
Sent: den 19 mars 2014 10:13
To: [email protected]
Subject: [tickets] [opensaf:tickets] #814 CLMNA fails and logging is poor


  *   status: unassigned --> assigned
  *   assigned_to: Mathi Naickan
  *   Milestone: future --> 4.4.1

________________________________

[tickets:#814]<http://sourceforge.net/p/opensaf/tickets/814/> CLMNA fails and logging is poor

Status: assigned
Milestone: 4.4.1
Created: Wed Mar 19, 2014 08:07 AM UTC by Hans Feldt
Last Updated: Wed Mar 19, 2014 08:40 AM UTC
Owner: Mathi Naickan

MDS/TCP
Changeset:

parent: 5070:fc02663112d8
user: Lennart Lund [email protected]<mailto:[email protected]>
date: Tue Mar 18 17:07:48 2014 +0100
summary: logsv: Do not allow NULL pointers for string variables in OI validity check [#771]<http://sourceforge.net/p/opensaf/tickets/771/>

Mar 19 08:55:07 PL-3 osafdtmd[354]: Started
Mar 19 08:55:07 PL-3 osafimmnd[368]: Started
Mar 19 08:55:07 PL-3 osafimmnd[368]: NO Persistent Back-End capability configured, Pbe file:imm.db (suffix may get added)
Mar 19 08:55:07 PL-3 osafdtmd[354]: NO Established contact with 'SC-1'
Mar 19 08:55:07 PL-3 osafdtmd[354]: NO Established contact with 'PL-5'
Mar 19 08:55:07 PL-3 osafdtmd[354]: NO Established contact with 'SC-2'
Mar 19 08:55:07 PL-3 osafdtmd[354]: NO Established contact with 'PL-4'
Mar 19 08:55:07 PL-3 osafimmnd[368]: NO Fevs count adjusted to 1427 preLoadPid: 0
Mar 19 08:55:07 PL-3 osafimmnd[368]: NO SERVER STATE: IMM_SERVER_ANONYMOUS --> IMM_SERVER_CLUSTER_WAITING
Mar 19 08:55:07 PL-3 osafimmnd[368]: NO SERVER STATE: IMM_SERVER_CLUSTER_WAITING --> IMM_SERVER_LOADING_PENDING
Mar 19 08:55:07 PL-3 osafimmnd[368]: NO SERVER STATE: IMM_SERVER_LOADING_PENDING --> IMM_SERVER_SYNC_PENDING
Mar 19 08:55:07 PL-3 osafimmnd[368]: NO NODE STATE-> IMM_NODE_ISOLATED
Mar 19 08:55:08 PL-3 osafimmnd[368]: NO NODE STATE-> IMM_NODE_W_AVAILABLE
Mar 19 08:55:08 PL-3 osafimmnd[368]: NO SERVER STATE: IMM_SERVER_SYNC_PENDING --> IMM_SERVER_SYNC_CLIENT
Mar 19 08:55:13 PL-3 osafimmnd[368]: NO NODE STATE-> IMM_NODE_FULLY_AVAILABLE 2316
Mar 19 08:55:13 PL-3 osafimmnd[368]: NO RepositoryInitModeT is SA_IMM_KEEP_REPOSITORY
Mar 19 08:55:13 PL-3 osafimmnd[368]: NO Epoch set to 5 in ImmModel
Mar 19 08:55:13 PL-3 osafimmnd[368]: NO SERVER STATE: IMM_SERVER_SYNC_CLIENT --> IMM SERVER READY
Mar 19 08:55:13 PL-3 osafclmna[396]: Started
Mar 19 08:55:23 PL-3 osafclmna[396]: ER Exiting
Mar 19 08:55:23 PL-3 opensafd[336]: ER Failed #012 DESC:CLMNA
Mar 19 08:55:23 PL-3 opensafd[336]: ER Going for recovery
Mar 19 08:55:23 PL-3 opensafd[336]: ER Trying To RESPAWN /usr/local/lib/opensaf/clc-cli/osaf-clmna attempt #1
Mar 19 08:55:23 PL-3 opensafd[336]: ER Sending SIGKILL to CLMNA, pid=392
Mar 19 08:55:23 PL-3 osafclmna[396]: exiting for shutdown
Mar 19 08:55:38 PL-3 osafclmna[443]: Started
Mar 19 08:55:48 PL-3 osafclmna[443]: ER Exiting
Mar 19 08:55:48 PL-3 opensafd[336]: ER Could Not RESPAWN CLMNA
Mar 19 08:55:48 PL-3 opensafd[336]: ER Failed #012 DESC:CLMNA
Mar 19 08:55:48 PL-3 opensafd[336]: ER Trying To RESPAWN /usr/local/lib/opensaf/clc-cli/osaf-clmna attempt #2
Mar 19 08:55:48 PL-3 opensafd[336]: ER Sending SIGKILL to CLMNA, pid=439
Mar 19 08:55:48 PL-3 osafclmna[443]: exiting for shutdown
Mar 19 08:56:03 PL-3 osafclmna[493]: Started
Mar 19 08:56:06 PL-3 opensafd: Stopping OpenSAF Services
Mar 19 08:56:06 PL-3 opensafd: OpenSAF services successfully stopped

Note that clmna just says "Exiting" without giving any reason!
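
The start order and the lifetime of each clmna instance can be read off the PL-3 syslog above. The following is a minimal Python sketch, not part of the original report: the sample lines are copied from that log, the year 2014 is taken from the ticket header, and the helper names (parse, start_order, lifetimes) are made up for illustration.

```python
import re
from datetime import datetime

# Excerpt of the PL-3 syslog quoted in this ticket (only Started/Exiting lines).
SYSLOG = """\
Mar 19 08:55:07 PL-3 osafdtmd[354]: Started
Mar 19 08:55:07 PL-3 osafimmnd[368]: Started
Mar 19 08:55:13 PL-3 osafclmna[396]: Started
Mar 19 08:55:23 PL-3 osafclmna[396]: ER Exiting
Mar 19 08:55:38 PL-3 osafclmna[443]: Started
Mar 19 08:55:48 PL-3 osafclmna[443]: ER Exiting
Mar 19 08:56:03 PL-3 osafclmna[493]: Started
"""

# timestamp, node, daemon name, pid, message
LINE = re.compile(r"^(\w{3} +\d+ [\d:]{8}) (\S+) (\w+)\[(\d+)\]: (.*)$")


def parse(text, year=2014):
    """Yield (timestamp, daemon, pid, message) for each matching syslog line.

    Syslog lines carry no year; 2014 is an assumption taken from the ticket.
    """
    for raw in text.splitlines():
        m = LINE.match(raw)
        if m:
            stamp, _node, daemon, pid, msg = m.groups()
            when = datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")
            yield when, daemon, int(pid), msg


def start_order(text):
    """Return daemon names in the order of their first 'Started' line."""
    order = []
    for _when, daemon, _pid, msg in parse(text):
        if msg == "Started" and daemon not in order:
            order.append(daemon)
    return order


def lifetimes(text):
    """Return {pid: seconds between 'Started' and 'ER Exiting'}."""
    started, result = {}, {}
    for when, _daemon, pid, msg in parse(text):
        if msg == "Started":
            started[pid] = when
        elif msg == "ER Exiting" and pid in started:
            result[pid] = (when - started[pid]).total_seconds()
    return result


if __name__ == "__main__":
    print(start_order(SYSLOG))
    print(lifetimes(SYSLOG))
```

On this excerpt the order comes out as osafdtmd, osafimmnd, osafclmna, and both clmna instances that logged "ER Exiting" did so 10 seconds after they started.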

At the same time there is not much in the active controller syslog:

Mar 19 08:55:07 SC-1 osafdtmd[352]: NO Established contact with 'PL-3'
Mar 19 08:55:07 SC-1 osafimmd[392]: NO Extended intro from node 2030f
Mar 19 08:55:07 SC-1 osafimmd[392]: NO Node 2030f request sync sync-pid:368 epoch:0
Mar 19 08:55:08 SC-1 osafimmnd[403]: NO NODE STATE-> IMM_NODE_R_AVAILABLE
Mar 19 08:55:08 SC-1 osafimmd[392]: NO Successfully announced sync. New ruling epoch:5
Mar 19 08:55:13 SC-1 osafimmnd[403]: NO NODE STATE-> IMM_NODE_FULLY_AVAILABLE 15652
Mar 19 08:55:13 SC-1 osafimmnd[403]: NO Epoch set to 5 in ImmModel
Mar 19 08:55:13 SC-1 osafimmd[392]: NO ACT: New Epoch for IMMND process at node 2050f old epoch: 4 new epoch:5
Mar 19 08:55:13 SC-1 osafimmd[392]: NO ACT: New Epoch for IMMND process at node 2040f old epoch: 4 new epoch:5
Mar 19 08:55:13 SC-1 osafimmd[392]: NO ACT: New Epoch for IMMND process at node 2010f old epoch: 4 new epoch:5
Mar 19 08:55:13 SC-1 osafimmd[392]: NO ACT: New Epoch for IMMND process at node 2030f old epoch: 0 new epoch:5
Mar 19 08:55:13 SC-1 osafimmd[392]: NO ACT: New Epoch for IMMND process at node 2020f old epoch: 4 new epoch:5
Mar 19 08:56:06 SC-1 osafdtmd[352]: NO Lost contact with 'PL-3'
Mar 19 08:56:06 SC-1 osafimmnd[403]: NO Global discard node received for nodeId:2030f pid:368
Mar 19 08:56:07 SC-1 osafimmnd[403]: NO Global discard node received for nodeId:2040f pid:375
Mar 19 08:56:07 SC-1 osafamfd[465]: NO Node 'PL-4' left the cluster

MDS log on PL-3 says this:

Mar 19 8:55:13.065458 <396> NOTIFY |BEGIN MDS LOGGING| PID=396|ARCH=0|64bit=1
Mar 19 8:55:23.077557 <396> ERR |MDS_SND_RCV: Timeout or Error occured
Mar 19 8:55:23.077761 <396> ERR |MDS_SND_RCV: Timeout occured on sndrsp message
Mar 19 8:55:23.077823 <396> ERR |MDS_SND_RCV: Adest=<0x00000000,16>
Mar 19 8:55:38.195098 <443> NOTIFY |BEGIN MDS LOGGING| PID=443|ARCH=0|64bit=1
Mar 19 8:55:48.200218 <443> ERR |MDS_SND_RCV: Timeout or Error occured
Mar 19 8:55:48.200435 <443> ERR |MDS_SND_RCV: Timeout occured on sndrsp message
Mar 19 8:55:48.200505 <443> ERR |MDS_SND_RCV: Adest=<0x00000000,16>
Mar 19 8:56:03.337048 <493> NOTIFY |BEGIN MDS LOGGING| PID=493|ARCH=0|64bit=1
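
The MDS log above carries microsecond timestamps, so the delay between each clmna instance opening its MDS log and hitting the first MDS_SND_RCV error can be measured more precisely than from syslog. This is a rough Python sketch under stated assumptions: the sample lines are copied from the log above, the year 2014 comes from the ticket header, and the function name seconds_to_first_error is hypothetical.

```python
import re
from datetime import datetime

# MDS log lines from PL-3 as quoted in this ticket.
MDS_LOG = """\
Mar 19 8:55:13.065458 <396> NOTIFY |BEGIN MDS LOGGING| PID=396|ARCH=0|64bit=1
Mar 19 8:55:23.077557 <396> ERR |MDS_SND_RCV: Timeout or Error occured
Mar 19 8:55:23.077761 <396> ERR |MDS_SND_RCV: Timeout occured on sndrsp message
Mar 19 8:55:23.077823 <396> ERR |MDS_SND_RCV: Adest=<0x00000000,16>
Mar 19 8:55:38.195098 <443> NOTIFY |BEGIN MDS LOGGING| PID=443|ARCH=0|64bit=1
Mar 19 8:55:48.200218 <443> ERR |MDS_SND_RCV: Timeout or Error occured
Mar 19 8:55:48.200435 <443> ERR |MDS_SND_RCV: Timeout occured on sndrsp message
Mar 19 8:55:48.200505 <443> ERR |MDS_SND_RCV: Adest=<0x00000000,16>
Mar 19 8:56:03.337048 <493> NOTIFY |BEGIN MDS LOGGING| PID=493|ARCH=0|64bit=1
"""

# month, day, clock with microseconds, <pid>, level, message after the first '|'
ENTRY = re.compile(r"^(\w{3}) +(\d+) +(\d+:\d+:\d+\.\d+) <(\d+)> (\w+) +\|(.*)$")


def seconds_to_first_error(text, year=2014):
    """Return {pid: seconds from 'BEGIN MDS LOGGING' to the first ERR line}.

    The log has no year field; 2014 is an assumption taken from the ticket.
    """
    begin, delay = {}, {}
    for raw in text.splitlines():
        m = ENTRY.match(raw)
        if not m:
            continue
        mon, day, clock, pid, level, msg = m.groups()
        when = datetime.strptime(f"{year} {mon} {day} {clock}",
                                 "%Y %b %d %H:%M:%S.%f")
        pid = int(pid)
        if level == "NOTIFY" and msg.startswith("BEGIN MDS LOGGING"):
            begin[pid] = when
        elif level == "ERR" and pid in begin and pid not in delay:
            # Only the first error per pid is of interest here.
            delay[pid] = (when - begin[pid]).total_seconds()
    return delay


if __name__ == "__main__":
    for pid, secs in seconds_to_first_error(MDS_LOG).items():
        print(f"pid {pid}: first MDS error after {secs:.6f} s")
```

For pids 396 and 443 the delay is just over 10 seconds in both cases, which matches the gap between "Started" and "ER Exiting" in syslog. That would be consistent with a sndrsp timeout of roughly 10 seconds, although the logs do not state the configured value.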

________________________________

Sent from sourceforge.net because 
[email protected]<mailto:[email protected]>
 is subscribed to https://sourceforge.net/p/opensaf/tickets/

To unsubscribe from further messages, a project admin can change settings at 
https://sourceforge.net/p/opensaf/admin/tickets/options. Or, if this is a 
mailing list, you can unsubscribe from the mailing list.



---

** [tickets:#814] CLMNA fails and logging is poor**

**Status:** assigned
**Milestone:** 4.4.1
**Created:** Wed Mar 19, 2014 08:07 AM UTC by Hans Feldt
**Last Updated:** Wed Mar 19, 2014 09:13 AM UTC
**Owner:** Mathi Naickan

_______________________________________________
Opensaf-tickets mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-tickets