Hi,
While starting one SU with SA aware components I see the command "amf-adm -t
120 unlock SU" returns success(0) even though I see the CSI assignments are
not complete.
Because of a persistent failure in the node, the component and hence the SU
is expected to fail/restart many times and during such scenario, the amf-adm
commands simply returns
without a timeout or error code other than 0. This gives a wrong perception
to the user that the SU has successfully started while it not. Underlying it
is still restarting/doing recovery/error
escalation trying to make it up.
/var/log/messages (One failure instance from the continuous restart logs):
Feb 24 16:48:53 node2 osafamfnd[4351]: NO
'safSu=node2.SU,safSg=node2.SU,safApp=TestApp' Presence State TERMINATING =>
INSTANTIATED
Feb 24 16:48:53 node2 osafamfnd[4351]: NO Assigning
'safSi=node2.SU,safApp=TestApp' ACTIVE to
'safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
Feb 24 16:40:18 node2.xxx.lab [node2.SU-comp1] [AMF] ERROR - 90011 - Failed
to start application.com.xxxx.xx.dao.exception.DaoException: A failure
occurred selecting from the database
Feb 24 16:40:18 node2 osafamfnd[4351]: NO
'safComp=node2.SU.comp1,safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
faulted due to 'errorReport' : Recovery is 'suFailover'
Feb 24 16:40:18 node2 osafamfnd[4351]: NO
'safSu=node2.SU,safSg=node2.SU,safApp=TestApp' Presence State INSTANTIATED
=> TERMINATING
Feb 24 16:40:18 node2 osafamfnd[4351]: NO
'safComp=node2.SU.comp1,safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
faulted due to 'errorReport' : Recovery is 'suFailover'
Feb 24 16:40:18 node2 osafamfnd[4351]: NO Assigned
'safSi=node2.SU,safApp=TestApp' QUIESCED to
'safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
Feb 24 16:40:18 node2 startSAAmber: Running the CLC-CLI script for
component: comp1
Feb 24 16:40:18 node2 osafamfnd[4351]: NO Removed
'safSi=node2.SU,safApp=TestApp' from
'safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
Any help is much appreciated.
--
Best Regards,
Santosh
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users