<This is the second attempt to post this, using the new hpe.com domain email address, which I’ve registered, but which the broken website has not allowed me to acknowledge.>
I am trying to bring up OpenSAF on a variant of Linux for the Machine from Keith Packard’s group: https://lwn.net/Articles/655437/

OpenSAF dies early on, after starting ntf but before hearing from clm (I think), which is next in line after ntf (see the log below). I tried turning on info-level logging and tracing in those two by changing the .conf files and triggering with the commands that usually work for this once OpenSAF is up:

ps -e | grep osafntfd | awk '{print $1;}' | xargs sudo kill -s SIGUSR2
ps -e | grep osafclmd | awk '{print $1;}' | xargs sudo kill -s SIGUSR2
ps -e | grep osafclmna | awk '{print $1;}' | xargs sudo kill -s SIGUSR2

It seems that I can’t hit the newly started process with SIGUSR2 in time to make it log and trace before it dies. I don’t think I would be able to catch it fast enough to debug it either.

Any suggestions as to how to get it to do this? Or ideas as to what might break clm during startup? I don’t have these problems on Fedora with the identical setup (ignore the user group, setgrent, and getgrouplist errors).

Charlie

…

Jan 19 12:19:14 metabox-l4tm-dl360g6-1 opensafd: Starting OpenSAF Services(4.7.0 - ) (Using TCP)
Jan 19 12:19:14 metabox-l4tm-dl360g6-1 osafdtmd[17978]: NO setgrent failed: Numerical result out of range
Jan 19 12:19:14 metabox-l4tm-dl360g6-1 osafdtmd[17978]: getgrouplist failed, uid=999 (Numerical result out of range). Continuing without supplementary groups.
Jan 19 12:19:14 metabox-l4tm-dl360g6-1 osafdtmd[17978]: Started
Jan 19 12:19:14 metabox-l4tm-dl360g6-1 osafrded[17993]: NO setgrent failed: Numerical result out of range
Jan 19 12:19:14 metabox-l4tm-dl360g6-1 osafrded[17993]: getgrouplist failed, uid=999 (Numerical result out of range). Continuing without supplementary groups.
Jan 19 12:19:14 metabox-l4tm-dl360g6-1 osafrded[17993]: Started
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafrded[17993]: NO No peer available => Setting Active role for this node
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osaffmd[18004]: NO setgrent failed: Numerical result out of range
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osaffmd[18004]: getgrouplist failed, uid=999 (Numerical result out of range). Continuing without supplementary groups.
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osaffmd[18004]: Started
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO setgrent failed: Numerical result out of range
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: getgrouplist failed, uid=999 (Numerical result out of range). Continuing without supplementary groups.
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: Started
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: Initialization Success, role ACTIVE
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO setgrent failed: Numerical result out of range
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmnd[18023]: getgrouplist failed, uid=999 (Numerical result out of range). Continuing without supplementary groups.
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmnd[18023]: Started
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmnd[18023]: Initialization Success
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO SERVER STATE: IMM_SERVER_ANONYMOUS --> IMM_SERVER_CLUSTER_WAITING
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO New IMMND process is on ACTIVE Controller at 2010f
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: IN 4.4 intro pbeEnabled adjusted to be zero for node 2010f
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO First SC IMMND (OpenSAF 4.4 or later) attached 2010f
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO First IMMND at SC to attach is NOT configured for PBE
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO First IMMND on SC found at 2010f this IMMD at 2010f. Cluster is loading, *not* 2PBE => designating that IMMND as coordinator
Jan 19 12:19:16 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO This IMMND is now the NEW Coord
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO SERVER STATE: IMM_SERVER_CLUSTER_WAITING --> IMM_SERVER_LOADING_PENDING
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO SERVER STATE: IMM_SERVER_LOADING_PENDING --> IMM_SERVER_LOADING_SERVER
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO Successfully announced loading. New ruling epoch:1
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO NODE STATE-> IMM_NODE_LOADING
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmloadd: NO Load starting
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmloadd: NO ***** Loading from XML file imm.xml at /etc/opensaf *****
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmloadd: NO The class OpensafImm has been created since it was missing from the imm.xml load file
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmloadd: IN Class OsafImmPbeRt created
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmloadd: NO The class OsafImmPbeRt has been created since it was missing from the imm.xml load file
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmloadd: NO The opensafImm=opensafImm,safApp=safImmService object of class OpensafImm has been created since it was missing from the imm.xml load file
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO Ccb 1 COMMITTED (IMMLOADER)
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO Closing admin owner IMMLOADER id(1), loading of IMM done
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO NODE STATE-> IMM_NODE_FULLY_AVAILABLE 2653
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO RepositoryInitModeT is SA_IMM_INIT_FROM_FILE
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: WA IMM Access Control mode is DISABLED!
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO opensafImmNostdFlags changed to: 0x76
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO Epoch set to 2 in ImmModel
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO ACT: New Epoch for IMMND process at node 2010f old epoch: 1 new epoch:2
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmd[18013]: NO Ruling epoch changed to:2
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmloadd: NO Load ending normally
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO SERVER STATE: IMM_SERVER_LOADING_SERVER --> IMM_SERVER_READY
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: NO setgrent failed: Numerical result out of range
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: getgrouplist failed, uid=999 (Numerical result out of range). Continuing without supplementary groups.
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: Started
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: NO LOGSV_DATA_GROUPNAME not found
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: NO LOG root directory is: "/var/log/opensaf/saflog"
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: NO LOG data group is: ""
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: NO LGS_MBCSV_VERSION = 5
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO Implementer connected: 1 (safLogService) <2, 2010f>
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO implementer for class 'OpenSafLogConfig' is safLogService => class extent is safe.
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO implementer for class 'SaLogStreamConfig' is safLogService => class extent is safe.
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: IN Create runtime object 'logConfig=currentConfig,safApp=safLogService' by Impl id: 1
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafntfd[18048]: NO setgrent failed: Numerical result out of range
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafntfd[18048]: getgrouplist failed, uid=999 (Numerical result out of range). Continuing without supplementary groups.
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafntfd[18048]: Started
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaffmd[18004]: exiting for shutdown
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmd[18013]: exiting for shutdown
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osafimmnd[18023]: NO No IMMD service => cluster restart, exiting
Jan 19 12:19:19 metabox-l4tm-dl360g6-1 osaflogd[18038]: exiting for shutdown
Jan 19 12:19:20 metabox-l4tm-dl360g6-1 osafntfd[18048]: exiting for shutdown
Jan 19 12:19:20 metabox-l4tm-dl360g6-1 osafrded[17993]: exiting for shutdown
Jan 19 12:19:20 metabox-l4tm-dl360g6-1 opensafd: Starting OpenSAF failed
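
For the SIGUSR2 timing problem described in the message, one possible approach is to start a polling loop before launching OpenSAF, so the signal is sent as soon as the daemon appears rather than by hand afterwards. The following is only a sketch under stated assumptions: the function name signal_when_up, the 0.05 s poll interval, and the timeout values are all hypothetical, not part of OpenSAF, and it has not been tried on the system above.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of OpenSAF): poll for a daemon by exact
# process name and send it a signal as soon as it shows up.
# Usage: signal_when_up NAME [SIGNAL] [TIMEOUT_SECONDS]
signal_when_up() {
    local name=$1 sig=${2:-USR2} timeout=${3:-30}
    local deadline=$((SECONDS + timeout)) pids
    while [ "$SECONDS" -lt "$deadline" ]; do
        # pgrep -x matches the process name exactly (replaces ps | grep | awk)
        pids=$(pgrep -x "$name") || true
        if [ -n "$pids" ]; then
            # Intentionally unquoted so each PID becomes its own argument
            kill -s "$sig" $pids && return 0
        fi
        sleep 0.05   # assumed poll interval; fractional sleep needs GNU sleep
    done
    echo "timed out waiting for $name" >&2
    return 1
}
```

It would be run as root in a second shell, for example "signal_when_up osafclmd USR2 60 &", before starting opensafd. This narrows the race but does not remove it: the default action of SIGUSR2 is to terminate the process, so a signal that arrives before the daemon has installed its handler would kill it instead of enabling tracing.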
