Hello,
I have a bunch of machines, were OMSA from DSU 16.04.00 is no longer starting
or throwing segfaults.
For example, on a Dell PowerEdge R430 with Scientific Linux 6 (yes, not
"supported"...), some services could not be started:
# /opt/dell/srvadmin/sbin/srvadmin-services.sh start
Starting Systems Management Device Drivers:
Starting dell_rbu: [ OK ]
Starting ipmi driver:
Already started [ OK ]
Starting Systems Management Data Engine:
Starting dsm_sa_datamgrd: [FAILED]
Starting dsm_sa_eventmgrd: [FAILED]
Starting DSM SA Shared Services: [ OK ]
# rpm -qa srvadmin\* | sort
srvadmin-base-8.3.0-1908.9058.el6.x86_64
srvadmin-cm-8.3.0-1908.9058.el6.x86_64
srvadmin-deng-8.3.0-1908.9058.el6.x86_64
srvadmin-hapi-8.3.0-1908.9058.el6.x86_64
srvadmin-isvc-8.3.0-1908.9058.el6.x86_64
srvadmin-nvme-8.3.0-1908.9058.el6.x86_64
srvadmin-omacore-8.3.0-1908.9058.el6.x86_64
srvadmin-omacs-8.3.0-1908.9058.el6.x86_64
srvadmin-omcommon-8.3.0-1908.9058.el6.x86_64
srvadmin-omilcore-8.3.0-1908.9058.el6.x86_64
srvadmin-ominst-8.3.0-1908.9058.el6.x86_64
srvadmin-realssd-8.3.0-1908.9058.el6.x86_64
srvadmin-server-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-smcommon-8.3.0-1908.9058.el6.x86_64
srvadmin-storage-8.3.0-1908.9058.el6.x86_64
srvadmin-storage-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-storageservices-cli-8.3.0-1908.9058.el6.x86_64
srvadmin-storelib-8.3.0-1908.9058.el6.x86_64
srvadmin-storelib-sysfs-8.3.0-1908.9058.el6.x86_64
srvadmin-sysfsutils-8.3.0-1908.9058.el6.x86_64
srvadmin-xmlsup-8.3.0-1908.9058.el6.x86_64
Afterwards, omreport does not recognize any components to monitor. There is no
error message logged in /var/log/messages or anywhere else.
On a different PowerEdge R430 (same OS), OMSA is just segfaulting:
# /opt/dell/srvadmin/sbin/srvadmin-services.sh status
/etc/init.d/instsvcdrv: line 1589: 788703 Segmentation fault
${ISVCDD_SBIN_DIR}/${ISVCDD_DCHCFG_EXE} command=getsystype > /dev/null 2>&1
dcdbas (module) is running
dell_rbu (module) is running
/etc/init.d/instsvcdrv: line 1589: 788726 Segmentation fault
${ISVCDD_SBIN_DIR}/${ISVCDD_DCHCFG_EXE} command=getsystype > /dev/null 2>&1
dsm_sa_datamgrd is stopped
dsm_sa_eventmgrd is stopped
dsm_om_shrsvcd (pid 719966) is running
At least an error message is logged in /var/log/messages:
kernel: dchcfg[788726]: segfault at 0 ip 000000393f5336bf sp 00007ffccea3cf78
error 4 in libc-2.12.so[393f400000+18a000]
Any idea how to fix this and get proper hardware monitoring back?
Regards,
Stefan
--
------------------------------------------------------------------------
Stefan Dietrich Deutsches Elektronen-Synchrotron (IT-Systems)
Ein Forschungszentrum der Helmholtz-Gemeinschaft
Notkestr. 85
phone: +49-40-8998-4696 22607 Hamburg
e-mail: [email protected] Germany
------------------------------------------------------------------------
_______________________________________________
Linux-PowerEdge mailing list
[email protected]
https://lists.us.dell.com/mailman/listinfo/linux-poweredge