In the AMF code, there is a check in unlock operation of SG where it 
sets  the readiness state of SU to inservice if it is instantiated.
But at the same time there is a issue #493 which says that "Sis are 
assigned to uninstantiated SU".
Please upload the traces for amfd and amfnd.


Thanks,
Praveen

On 08-Apr-14 3:54 AM, Alex Jones wrote:
> I've tracked this down to a race condition when using the ADMIN API.  I
> don't think this has anything to do with dynamically created CSIs.
>
> The problem seems to be in amfnd.
>
> If you have a component that takes a while to instantiate (in our case
> ~9 seconds), and issue the "UNLOCK-INSTANTIATE" admin command for the
> SG, and then immediately issue "UNLOCK" admin command (while the
> component is still instantiating), the amfnd "curr_assign_state" of
> avnd_comp_csi_rec on the STANDBY never get set to
> AVND_COMP_CSI_ASSIGN_STATE_ASSIGNED, even though amfd has created all
> the proper SaAmfCSIAssignments in IMM.
>
> Then when you issue "SHUTDOWN" from the admin API, amfnd on the standby
> doesn't remove the CSIs because it doesn't think it ever assigned them.
>
> Does this ring any bells with anyone?
>
> Alex
>
> On 04/04/2014 06:22 PM, Alex Jones wrote:
>> I'm seeing a problem when doing an administrative shutdown on an SG
>> with dynamically created CSIs.
>>
>> I'm assuming that dynamically creating CSIs is supported...
>>
>> I have an N+1 service group with one component entirely in the imm.xml
>> config file including CSIs.  There is another component in this SG
>> with all its config in the imm.xml except the SaAmfCSI description.
>> These are added dynamically after the system is up and running.
>>
>> Adding these CSIs dynamically seems to be working OK.  The active and
>> standby assignments are given.  I can administratively shutdown the
>> active SU, and failover occurs properly, etc.
>>
>> The problem comes when I try to admin shutdown the SG.  All the active
>> components are correctly handled, but the standby fails.  I see this
>> in the log for the standby SU:
>>
>> Apr  4 21:16:57 linux osafamfnd[4542]: NO Removing 'all (5) SIs' from
>> 'safSu=DDD-SU1,safSg=DDD-Np1,safApp=DDDApp'
>> Apr  4 21:16:57 linux osafamfnd[4542]: NO Removing
>> 'safSi=DDD-Np1-SI-1,safApp=DDDApp' from
>> 'safSu=DDD-SU1,safSg=DDD-Np1,safApp=DDDApp'
>> Apr  4 21:16:57 linux osafamfnd[4542]: NO Removing
>> 'safSi=DDD-Np1-SI-2,safApp=DDDApp' from
>> 'safSu=DDD-SU1,safSg=DDD-Np1,safApp=DDDApp'
>> Apr  4 21:16:57 linux osafamfnd[4542]: NO Removing
>> 'safSi=DDD-Np1-SI-3,safApp=DDDApp' from
>> 'safSu=DDD-SU1,safSg=DDD-Np1,safApp=DDDApp'
>> Apr  4 21:16:57 linux osafamfnd[4542]: NO Removing
>> 'safSi=DDD-Np1-SI-4,safApp=DDDApp' from
>> 'safSu=DDD-SU1,safSg=DDD-Np1,safApp=DDDApp'
>> Apr  4 21:16:57 linux osafamfnd[4542]: NO Removing
>> 'safSi=DDD-Np1-SI-5,safApp=DDDApp' from
>> 'safSu=DDD-SU1,safSg=DDD-Np1,safApp=DDDApp'
>>
>> But, I never see "Removed".  It looks like it never finishes. Then any
>> other admin operation on this SG results in:
>>
>> Apr  4 21:19:58 linux osafamfd[4761]: WA SG not in STABLE state
>> (safSg=DDD-Np1,safApp=DDDApp)
>>
>> After doing the admin shutdown the SaAmfCSIAssignment entries for the
>> standby SU that failed still exist in IMM.  All other
>> SaAmfCSIAssignment entries for the active SUs have been deleted, and I
>> see this in the osafamfd log on the active controller.  I never see
>> deletes for the standby.
>>
>> Apr  4 21:16:57.143902 osafamfd [4761:avd_csi.c:1042] TR Deleting
>> safCSIComp=safComp=DddManager\,safSu=DDD-SU5\,safSg=DDD-Np1\,safApp=DDDApp,safCsi=DddManager,safSi=DDD-Np1-SI-4,safApp=DDDApp
>>
>>
>> It seems like it's a problem with the standby SU.
>>
>> I know that the components are getting the csiRemove callback, so it
>> doesn't seem to be a component issue.
>>
>> I'll continue looking into this,  but any help would be appreciated.
>>
>> Thanks!
>>
>> Alex
>>
>>
>
>
> ------------------------------------------------------------------------------
> Put Bad Developers to Shame
> Dominate Development with Jenkins Continuous Integration
> Continuously Automate Build, Test & Deployment
> Start a new project now. Try Jenkins in the cloud.
> http://p.sf.net/sfu/13600_Cloudbees
> _______________________________________________
> Opensaf-devel mailing list
> Opensaf-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/opensaf-devel


------------------------------------------------------------------------------
Put Bad Developers to Shame
Dominate Development with Jenkins Continuous Integration
Continuously Automate Build, Test & Deployment 
Start a new project now. Try Jenkins in the cloud.
http://p.sf.net/sfu/13600_Cloudbees
_______________________________________________
Opensaf-devel mailing list
Opensaf-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/opensaf-devel

Reply via email to