Hi,
   While starting one SU with SA aware components I see the command "amf-adm -t
   120 unlock SU" returns success(0) even though I see the CSI assignments are
   not complete.
   Because of a persistent failure in the node, the component and hence the SU
   is expected to fail/restart many times and during such scenario, the amf-adm
   commands simply returns
   without a timeout or error code other than 0. This gives a wrong perception
   to the user that the SU has successfully started while it not. Underlying it
   is still restarting/doing recovery/error
   escalation trying to make it up.
   /var/log/messages (One failure instance from the continuous restart logs):
   Feb 24 16:48:53 node2 osafamfnd[4351]: NO
   'safSu=node2.SU,safSg=node2.SU,safApp=TestApp' Presence State TERMINATING =>
   INSTANTIATED
   Feb     24    16:48:53    node2    osafamfnd[4351]:    NO    Assigning
   'safSi=node2.SU,safApp=TestApp' ACTIVE to
   'safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
   Feb 24 16:40:18 node2.xxx.lab [node2.SU-comp1] [AMF] ERROR - 90011 - Failed
   to start application.com.xxxx.xx.dao.exception.DaoException: A failure
   occurred selecting from the database
   Feb 24 16:40:18 node2 osafamfnd[4351]: NO
   'safComp=node2.SU.comp1,safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
   faulted due to 'errorReport' : Recovery is 'suFailover'
   Feb 24 16:40:18 node2 osafamfnd[4351]: NO
   'safSu=node2.SU,safSg=node2.SU,safApp=TestApp' Presence State INSTANTIATED
   => TERMINATING
   Feb 24 16:40:18 node2 osafamfnd[4351]: NO
   'safComp=node2.SU.comp1,safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
   faulted due to 'errorReport' : Recovery is 'suFailover'
   Feb     24     16:40:18    node2    osafamfnd[4351]:    NO    Assigned
   'safSi=node2.SU,safApp=TestApp' QUIESCED to
   'safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
   Feb  24  16:40:18  node2  startSAAmber: Running the CLC-CLI script for
   component: comp1
   Feb     24     16:40:18     node2    osafamfnd[4351]:    NO    Removed
   'safSi=node2.SU,safApp=TestApp' from
   'safSu=node2.SU,safSg=node2.SU,safApp=TestApp'
   Any help is much appreciated.
   --
   Best Regards,
   Santosh
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users

Reply via email to