Hi ,

Comments inline.

/Neel.

On Tuesday 21 October 2014 04:41 AM, Shu Wang wrote:
> The IMM documentation states:
>
> Applications that intend to add their own imm classes and imm objects need to 
> be aware that capacity is limited. OpenSAF4.1 has been system tested with up 
> to 350 000 objects of average size 300 bytes. It is not advisable to generate 
> larger imm-contents than that.

>
> What is the definition of an object?
The 300 bytes is the size of each object, (which is the accumulated size 
of the assigned attributes for a class). The size of the object depends 
upon the number of  attributes and different type of attributes of a class.
>
> We have a cluster defined across 6 nodes with a total of 12 SGs, a total of 
> 64 SUs, and a total of 292 components.  We can start OpenSAF successfully 
> across the nodes and unlock all SUs with no problems.
>
> The cluster definition was increased to 6 nodes, 15 SGs, a total of 56 SUs, 
> and a total of 388 components.  We are able to start OpenSAF on all nodes 
> successfully but as soon as a little over 300 components have been unlocked, 
> things start to fall apart.  The opensaf processes start to die and the 
> cluster is no longer usable.
The SU's, SG's and components are internally objects for IMM. Incresing 
to "6 nodes, 15 SGs, a total of 56 SUs, and a total of 388 components" 
should not have caused any IMM related problems.

> Oct 19 16:29:06 colobus osafamfnd[3649]: NO Assigned 
> 'safSi=amfSDFSISI1.3,safApp=olcApp' ACTIVE to 
> 'safSu=amfSDFSISU1.4,safSg=amfSDFSISG1,safApp=olcApp'
> Oct 19 16:29:07 colobus osafamfnd[3649]: NO 
> 'safComp=amfOELCElfComp2.3.1,safSu=amfOELCSU2.3,safSg=amfOELCSG2,safApp=olcApp'
>  faulted due to 'avaDown' : Recovery is 'componentRestart'
> Oct 19 16:29:09 colobus osafamfnd[3649]: NO 
> 'safComp=amfOELCElfComp2.3.1,safSu=amfOELCSU2.3,safSg=amfOELCSG2,safApp=olcApp'
>  faulted due to 'avaDown' : Recovery is 'componentRestart'
> Oct 19 16:29:09 colobus osafntfd[3587]: ER ntfs_mds_msg_send FAILED
> Oct 19 16:29:09 colobus osafntfd[3587]: ER ntfs_mds_msg_send to ntfa failed 
> rc: 2
> Oct 19 16:29:09 colobus osafntfd[3587]: ER ntfs_mds_msg_send FAILED
> Oct 19 16:29:09 colobus osafntfd[3587]: ER ntfs_mds_msg_send FAILED
> ....
> Oct 19 16:33:24 colobus ntpd_initres[2608]: host name not found: 
> 0.rhel.pool.ntp.org
> ....
> Oct 19 16:35:18 colobus osafimmnd[3549]: NO Implementer disconnected 14 <0, 
> 22b0f> (MsgQueueService142095)
> Oct 19 16:35:19 colobus osafclmd[3602]: NO proc_initialize_msg: send failed. 
> dest:22b0f00007a77
> Oct 19 16:35:19 colobus osafimmnd[3549]: NO Global discard node received for 
> nodeId:22b0f pid:31333
> Oct 19 16:35:20 colobus osafamfnd[3649]: NO 
> 'safComp=amfOELCElfComp2.3.1,safSu=amfOELCSU2.3,safSg=amfOELCSG2,safApp=olcApp'
>  faulted due to 'avaDown' : Recovery is 'componentRestart'
> Oct 19 16:35:22 colobus osafamfnd[3649]: NO 
> 'safComp=amfOELCElfComp2.3.1,safSu=amfOELCSU2.3,safSg=amfOELCSG2,safApp=olcApp'
>  faulted due to 'avaDown' : Recovery is 'componentRestart'
> Oct 19 16:35:24 colobus osafamfnd[3649]: NO 
> 'safComp=amfOELCElfComp2.3.1,safSu=amfOELCSU2.3,safSg=amfOELCSG2,safApp=olcApp'
>  faulted due to 'avaDown' : Recovery is 'componentRestart'
> Oct 19 16:35:26 colobus osafimmnd[3549]: NO Implementer connected: 16 
> (MsgQueueService142095) <12021, 2280f>
> Oct 19 16:35:26 colobus osafimmnd[3549]: NO Implementer locally disconnected. 
> Marking it as doomed 16 <12021, 2280f> (MsgQueueService142095)
> Oct 19 16:35:26 colobus osafimmnd[3549]: NO Implementer disconnected 16 
> <12021, 2280f> (MsgQueueService142095)
> Oct 19 16:35:26 colobus osafamfd[3631]: NO Node 'bedrazzas.monkey.lab' left 
> the cluster
>
> Have we reached a max of the number of SUs/Components that can be started 
> within a single OpenSAF cluster?
OpenSAF 4.4/4.5 is tested for 70 nodes.
>
> We have tried the above with OpenSAF 4.4 and OpenSAF 4.5 and with both TCP 
> and TIPC, all fail similarly.
This should have been an application problem or adjustments related to 
timeouts. Please share the syslog messages of all the nodes.
> Thank you!
>
> Shu Wang | Senior Analyst | +1(407)708-5117 or x3917| www.NetCracker.com
> Proven Partner to Communications Service Providers
>
>
>
>
> ________________________________
> The information transmitted herein is intended only for the person or entity 
> to which it is addressed and may contain confidential, proprietary and/or 
> privileged material. Any review, retransmission, dissemination or other use 
> of, or taking of any action in reliance upon, this information by persons or 
> entities other than the intended recipient is prohibited. If you received 
> this in error, please contact the sender and delete the material from any 
> computer.
> ------------------------------------------------------------------------------
> Comprehensive Server Monitoring with Site24x7.
> Monitor 10 servers for $9/Month.
> Get alerted through email, SMS, voice calls or mobile push notifications.
> Take corrective actions from your mobile device.
> http://p.sf.net/sfu/Zoho
> _______________________________________________
> Opensaf-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/opensaf-users


------------------------------------------------------------------------------
Comprehensive Server Monitoring with Site24x7.
Monitor 10 servers for $9/Month.
Get alerted through email, SMS, voice calls or mobile push notifications.
Take corrective actions from your mobile device.
http://p.sf.net/sfu/Zoho
_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users

Reply via email to