Hello,

I have a bunch of machines, were OMSA from DSU 16.04.00 is no longer starting 
or throwing segfaults.

For example, on a Dell PowerEdge R430 with Scientific Linux 6 (yes, not 
"supported"...), some services could not be started:

# /opt/dell/srvadmin/sbin/srvadmin-services.sh start
Starting Systems Management Device Drivers:
Starting dell_rbu:                                         [  OK  ]
Starting ipmi driver: 
Already started                                            [  OK  ]
Starting Systems Management Data Engine:
Starting dsm_sa_datamgrd:                                  [FAILED]
Starting dsm_sa_eventmgrd:                                 [FAILED]
Starting DSM SA Shared Services:                           [  OK  ]

# rpm -qa srvadmin\* | sort
srvadmin-base-8.3.0-1908.9058.el6.x86_64
srvadmin-cm-8.3.0-1908.9058.el6.x86_64
srvadmin-deng-8.3.0-1908.9058.el6.x86_64
srvadmin-hapi-8.3.0-1908.9058.el6.x86_64
srvadmin-isvc-8.3.0-1908.9058.el6.x86_64
srvadmin-nvme-8.3.0-1908.9058.el6.x86_64
srvadmin-omacore-8.3.0-1908.9058.el6.x86_64
srvadmin-omacs-8.3.0-1908.9058.el6.x86_64
srvadmin-omcommon-8.3.0-1908.9058.el6.x86_64
srvadmin-omilcore-8.3.0-1908.9058.el6.x86_64
srvadmin-ominst-8.3.0-1908.9058.el6.x86_64
srvadmin-realssd-8.3.0-1908.9058.el6.x86_64
srvadmin-server-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-smcommon-8.3.0-1908.9058.el6.x86_64
srvadmin-storage-8.3.0-1908.9058.el6.x86_64
srvadmin-storage-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-storageservices-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-storelib-8.3.0-1908.9058.el6.x86_64
srvadmin-storelib-sysfs-8.3.0-1908.9058.el6.x86_64
srvadmin-sysfsutils-8.3.0-1908.9058.el6.x86_64
srvadmin-xmlsup-8.3.0-1908.9058.el6.x86_64

Afterwards, omreport does not recognize any components to monitor. There is no 
error message logged in /var/log/messages or anywhere else.

On a different PowerEdge R430 (same OS), OMSA is just segfaulting:

# /opt/dell/srvadmin/sbin/srvadmin-services.sh status
/etc/init.d/instsvcdrv: line 1589: 788703 Segmentation fault      
${ISVCDD_SBIN_DIR}/${ISVCDD_DCHCFG_EXE} command=getsystype > /dev/null 2>&1
dcdbas (module) is running
dell_rbu (module) is running
/etc/init.d/instsvcdrv: line 1589: 788726 Segmentation fault      
${ISVCDD_SBIN_DIR}/${ISVCDD_DCHCFG_EXE} command=getsystype > /dev/null 2>&1
dsm_sa_datamgrd is stopped
dsm_sa_eventmgrd is stopped
dsm_om_shrsvcd (pid 719966) is running

At least an error message is logged in /var/log/messages:
kernel: dchcfg[788726]: segfault at 0 ip 000000393f5336bf sp 00007ffccea3cf78 
error 4 in libc-2.12.so[393f400000+18a000]

Any idea how to fix this and get proper hardware monitoring back?

Regards,
Stefan

--
------------------------------------------------------------------------
Stefan Dietrich            Deutsches Elektronen-Synchrotron (IT-Systems)
                        Ein Forschungszentrum der Helmholtz-Gemeinschaft
                                                            Notkestr. 85
phone:  +49-40-8998-4696                                   22607 Hamburg
e-mail: [email protected]                                  Germany
------------------------------------------------------------------------

_______________________________________________
Linux-PowerEdge mailing list
[email protected]
https://lists.us.dell.com/mailman/listinfo/linux-poweredge

Reply via email to