[devel] checkpoint problems

2014-01-03 Thread Alex Jones
Hello All, I'm experimenting with the checkpoint service, and some things don't appear to work. The saCkptActiveReplicaSet and saCkptCheckpointSynchronize[Async] don't appear to work when the checkpoint has section numbers greater than around 5500. I've created a checkpoint with

Re: [devel] checkpoint problems

2014-01-06 Thread Alex Jones
, Alex Jones wrote: Hello All, I'm experimenting with the checkpoint service, and some things don't appear to work. The saCkptActiveReplicaSet and saCkptCheckpointSynchronize[Async] don't appear to work when the checkpoint has section numbers greater than around 5500. I've created

Re: [devel] checkpoint problems

2014-01-07 Thread Alex Jones
with SA_TIME_ONE_DAY ? -AVM On 1/7/2014 1:17 AM, Alex Jones wrote: AVM, I've been playing around with your test program, and have gotten it to fail. I made the following changes: 1. Change init_dataX to be 1024k bytes, so that you are initializing the section to be 1024k. 2

Re: [devel] checkpoint problems

2014-01-07 Thread Alex Jones
to take a deeper look at this, but I'm mentioning it in case it rings any bells. I am using TIPC as the transport. Alex On 01/07/2014 07:24 AM, Alex Jones wrote: AVM, I get SA_AIS_ERR_TIMEOUT even when I pass SA_TIME_END as the timeout value. Is this not a bug? the synchronous

Re: [devel] checkpoint problems

2014-01-09 Thread Alex Jones
((MDS_DIRECT_BUF_MAXSIZEMDTM_NORMAL_MSG_FRAG_SIZE)? \ (MDS_DIRECT_BUF_MAXSIZE+SUM_MDS_HDR_PLUS_MDTM_HDR_PLUS_LEN):(MDTM_NORMAL_MSG_FRAG_SIZE+SUM_MDS_HDR_PLUS_MDTM_HDR_PLUS_LEN)) == -AVM On 1/8/2014 10:42 PM, Alex Jones wrote

Re: [devel] checkpoint problems

2014-01-10 Thread Alex Jones
)) == -AVM On 1/8/2014 10:42 PM, Alex Jones wrote: Hi Hans, Changing rmem_default and rmem_max has no effect on the problem. I even tried up to 2M to no avail. However, after looking

[devel] checkpoint section create performance

2014-01-14 Thread Alex Jones
AVM, In my 5+1 setup, when I have the standby node open all the checkpoints and read from them, as well as open the hot-standby callback, the section creates done on the other active nodes can take a very long time. (For 40k sections, it can sometimes take over 2 minutes). Once the

Re: [devel] checkpoint section create performance

2014-01-14 Thread Alex Jones
the SA_CKPT_WR_ACTIVE_REPLICA or SA_CKPT_WR_ACTIVE_REPLICA_WEAK property by making this asynchronous. Any chance we can change this behaviour, and make it like saCkptCheckpointWrite? Alex On 01/14/2014 03:16 PM, Alex Jones wrote: AVM, In my 5+1 setup, when I have the standby node open all

Re: [devel] checkpoint section create performance

2014-01-17 Thread Alex Jones
to do it. Alex On 01/14/2014 05:13 PM, Alex Jones wrote: 3.7.1 saCkptSectionCreate() If the checkpoint was created with the SA_CKPT_WR_ALL_REPLICAS property, the section is created in all of the checkpoint replicas when the invocation returns; otherwise, the section has been created

Re: [devel] checkpoint section create performance

2014-02-04 Thread Alex Jones
Hello Mikhail! I have the work done, but I don't have the patch file created yet. I was going to create a ticket, and attach the patch to it. I will try to do it today. I will let you know. Alex On 02/04/2014 09:24 AM, Domrachev, Mikhail wrote: Hi, Alex. Currently I'm fixing that issue

[devel] SaAmfSIDependency problem

2014-03-04 Thread Alex Jones
Hello All, I'm seeing a problem with SaAmfSIDependency when it is used between SIs in different SGs of different types. <object class="SaAmfSIDependency"> <dn>safDepend=safSi=Management-2N\,safApp=ManagementApp,safSi=Management-NoRed2,safApp=ManagementApp</dn> </object> In

Re: [devel] SaAmfSIDependency problem

2014-03-05 Thread Alex Jones
: Please specify which version of OpenSAF is being used. Also please tell the steps performed. Thanks, Praveen On 04-Mar-14 5:40 PM, Alex Jones wrote: Hello All, I'm seeing a problem with SaAmfSIDependency when it is used between SIs in different SGs of different types

Re: [devel] SaAmfSIDependency problem

2014-03-07 Thread Alex Jones
Praveen, Can you either post or send me privately the final imm.xml file? Alex From: praveen malviya [praveen.malv...@oracle.com] Sent: Friday, March 07, 2014 3:28 AM To: Alex Jones Cc: opensaf-devel@lists.sourceforge.net Subject: Re: [devel

[devel] problem doing admin shutdown on SG with dynamically created CSIs

2014-04-04 Thread Alex Jones
I'm seeing a problem when doing an administrative shutdown on an SG with dynamically created CSIs. I'm assuming that dynamically creating CSIs is supported... I have an N+1 service group with one component entirely in the imm.xml config file including CSIs. There is another component in this

Re: [devel] problem doing admin shutdown on SG with dynamically created CSIs

2014-04-07 Thread Alex Jones
SaAmfCSIAssignments in IMM. Then when you issue SHUTDOWN from the admin API, amfnd on the standby doesn't remove the CSIs because it doesn't think it ever assigned them. Does this ring any bells with anyone? Alex On 04/04/2014 06:22 PM, Alex Jones wrote: I'm seeing a problem when doing an administrative

[devel] changing saAmfSGNumPrefActiveSUs for N+1 SG w/out bringing it down

2014-04-11 Thread Alex Jones
I notice that dynamically changing saAmfSGNumPrefActiveSUs for an N+1 service group is not supported, unless the service group is not unlocked. This prevents in-service capacity addition which really should be supported. Let's say you have a 6-slot ATCA chassis, but you only want to use 3

Re: [devel] changing saAmfSGNumPrefActiveSUs for N+1 SG w/out bringing it down

2014-04-22 Thread Alex Jones
everything through, and might not expect it. How would we pick which one to move? By rank? Alex On 04/22/2014 02:47 AM, Hans Feldt wrote: Please do, that would be great. Thanks, Hans On 11 April 2014 17:37, Alex Jones ajo...@genband.com wrote: I notice that dynamically changing

[devel] [PATCH 0 of 1] Review Request for amfd: infinite loop when processing CLM tracking validate #877

2014-04-29 Thread Alex Jones
Other n Comments (indicate scope for each y above): - changeset 9e1708d7a66517fcff52d6af094000feb1897143 Author: Alex Jones ajo...@genband.com Date: Mon, 28 Apr 2014 15:56:46 -0400 amfd: fix infinite loop when processing CLM

Re: [devel] [PATCH 1 of 1] amfd: fix infinite loop when processing CLM node tracking [#877]

2014-04-29 Thread Alex Jones
Actually, it is a PLM Deactivation request, which CLM translates into NODE_LEFT. Alex On 04/29/2014 10:15 AM, Hans Feldt wrote: What is the test case, CLM node lock? /Hans -Original Message- From: Alex Jones [mailto:ajo...@genband.com] Sent: den 29 april 2014 16:04 To: Hans Feldt

Re: [devel] [PATCH 1 of 1] amfd: fix infinite loop when processing CLM node tracking [#877]

2014-04-30 Thread Alex Jones
Guys, Gentle reminder to ack... Alex On 04/29/2014 10:21 AM, Alex Jones wrote: Actually, it is a PLM Deactivation request, which CLM translates into NODE_LEFT. Alex On 04/29/2014 10:15 AM, Hans Feldt wrote: What is the test case, CLM node lock? /Hans -Original Message

[devel] [PATCH 0 of 1] Review Request for amfnd: fix process of proxy/proxied termination on OpenSAF shutdown [#629]

2014-05-02 Thread Alex Jones
Other n Comments (indicate scope for each y above): - changeset e14b3510889d13784714b33648d104ec218a64f4 Author: Alex Jones ajo...@genband.com Date: Fri, 02 May 2014 11:59:41 -0400 amfnd: fix process of proxy/proxied

[devel] [PATCH 1 of 1] amfnd: fix process of proxy/proxied termination on OpenSAF shutdown [#629]

2014-05-02 Thread Alex Jones
osaf/services/saf/amf/amfnd/clc.cc | 27 +++ 1 files changed, 27 insertions(+), 0 deletions(-) May 2 15:22:11 linux osafamfnd[2990]: NO 'safSu=Management-SU1,safSg=Management-2N,safApp=ManagementApp' Presence State INSTANTIATED => TERMINATION_FAILED May 2 15:22:38

[devel] [PATCH 0 of 1] Review Request for amfd: fix SU in-service macro [#493]

2014-05-02 Thread Alex Jones
Comments (indicate scope for each y above): - changeset 3e3f3f22c8d18430a807cc33a0c89617b71f6891 Author: Alex Jones ajo...@genband.com Date: Fri, 02 May 2014 15:02:32 -0400 amfd: fix SU in-service macro [#493] May 2 18:56:32 linux

[devel] [PATCH 1 of 1] amfd: fix SU in-service macro [#493]

2014-05-02 Thread Alex Jones
osaf/services/saf/amf/amfd/include/su.h | 13 + 1 files changed, 9 insertions(+), 4 deletions(-) May 2 18:56:32 linux osafamfnd[12420]: NO Assigned 'safSi=Dataplane-Np1-SI-1,safApp=DataplaneApp' STANDBY to 'safSu=Dataplane-SU1,safSg=Dataplane-Np1,safApp=DataplaneApp' May 2

Re: [devel] [PATCH 1 of 1] amfd: fix SU in-service macro [#493]

2014-05-06 Thread Alex Jones
state, SU operational state, and SU presence state. Thanks, Hans -Original Message- From: Alex Jones [mailto:ajo...@genband.com] Sent: den 2 maj 2014 21:27 To: Hans Feldt; praveen.malv...@oracle.com; nagendr...@oracle.com Cc: opensaf-devel@lists.sourceforge.net Subject: [PATCH 1

Re: [devel] [PATCH 1 of 1] amfd: fix SU in-service macro [#493]

2014-05-06 Thread Alex Jones
(avd_sg_2n_su_chose_asgn() for 2N model) also. Thanks, Praveen On 03-May-14 12:57 AM, Alex Jones wrote: osaf/services/saf/amf/amfd/include/su.h | 13 + 1 files changed, 9 insertions(+), 4 deletions(-) May 2 18:56:32 linux osafamfnd[12420]: NO Assigned 'safSi=Dataplane-Np1-SI-1,safApp

Re: [devel] [PATCH 1 of 1] amfd: fix SU in-service macro [#493]

2014-05-07 Thread Alex Jones
for INSTANTIATED has been added. What do you think? Thanks, Hans -Original Message- From: Alex Jones [mailto:ajo...@genband.com] Sent: den 6 maj 2014 18:07 To: Hans Feldt; praveen.malv...@oracle.com; nagendr...@oracle.com Cc: opensaf-devel@lists.sourceforge.net Subject: Re: [PATCH

Re: [devel] [PATCH 0 of 1] Review Request for AMF #493

2014-05-07 Thread Alex Jones
Ack. Alex On 05/07/2014 07:03 AM, Hans Feldt wrote: Summary: Fix SU in-service calculation Review request for Trac Ticket(s): 493 Peer Reviewer(s): Alex Nags Pull request to: LIST THE PERSON WITH PUSH ACCESS HERE Affected branch(es): all Development branch: 4.3

[devel] [PATCH 1 of 1] amfd: allow change to saAmfSGNumPrefActiveSUs for N+M SG while UNLOCKED [#871]

2014-05-12 Thread Alex Jones
osaf/services/saf/amf/amfd/sg.cc | 39 +++ 1 files changed, 39 insertions(+), 0 deletions(-) ccb_completed_modify_hdlr: Attribute 'saAmfSGNumPrefActiveSUs' cannot be modified when SG is unlocked OpenSAF disallows changing saAmfSGNumPrefActiveSUs while the

[devel] [PATCH 0 of 1] Review Request for amfd: allow change to saAmfSGNumPrefActiveSUs for N+M SG while UNLOCKED [#871]

2014-05-12 Thread Alex Jones
Other n Comments (indicate scope for each y above): - changeset e5f045244159cdd5c70e76d600c7caf92cdbd058 Author: Alex Jones ajo...@genband.com Date: Mon, 12 May 2014 14:57:28 -0400 amfd: allow change

Re: [devel] [PATCH 1 of 1] amfd: allow change to saAmfSGNumPrefActiveSUs for N+M SG while UNLOCKED [#871]

2014-05-13 Thread Alex Jones
= sg-saAmfSGNumPrefActiveSUs; + else Thanks -Nagu -Original Message- From: Alex Jones [mailto:ajo...@genband.com] Sent: 13 May 2014 00:36 To: hans.fe...@ericsson.com; Nagendra Kumar; Praveen Malviya Cc: opensaf-devel@lists.sourceforge.net Subject: [PATCH 1

[devel] [PATCH 0 of 1] Review Request for amfd: allow increase of saAmfSGNumPrefActiveSUs for N+M SG while UNLOCKED [#871]

2014-05-19 Thread Alex Jones
bae2d2e9601f2e1f09e4ac8d60d4e118b448cd20 Author: Alex Jones ajo...@genband.com Date: Mon, 19 May 2014 17:40:58 -0400 amfd: allow increase of saAmfSGNumPrefActiveSUs for N+M SG while UNLOCKED [#871] ccb_completed_modify_hdlr: Attribute 'saAmfSGNumPrefActiveSUs' cannot

[devel] [PATCH 0 of 1] Review Request for test

2014-05-20 Thread Alex Jones
Samples n Tests n Other n Comments (indicate scope for each y above): - EXPLAIN/COMMENT THE PATCH SERIES HERE changeset 6f9c8080657af783462c52d81137e41038aee6a6 Author: Alex Jones ajo...@genband.com Date

[devel] [PATCH 0 of 1] Review Request for cpnd: increase performance when creating large numbers of sections [#770]

2014-05-20 Thread Alex Jones
n Comments (indicate scope for each y above): - Please ignore the previous fat-finger submission. changeset 6f9c8080657af783462c52d81137e41038aee6a6 Author: Alex Jones ajo...@genband.com Date: Tue, 20 May 2014 17:03:25 -0400 cpnd

[devel] [PATCH 1 of 1] amfnd: fix process of proxy/proxied cleanup on OpenSAF shutdown [#629]

2014-05-21 Thread Alex Jones
osaf/services/saf/amf/amfnd/clc.cc | 20 1 files changed, 20 insertions(+), 0 deletions(-) May 2 15:22:11 linux osafamfnd[2990]: NO 'safSu=Management-SU1,safSg=Management-2N,safApp=ManagementApp' Presence State INSTANTIATED => TERMINATION_FAILED May 2 15:22:38 linux

Re: [devel] [PATCH 0 of 1] Review Request for cpnd: increase performance when creating large numbers of sections [#770]

2014-05-21 Thread Alex Jones
/testing the complete patch (database a C++ STL map for fast access), for now my initial comment is: On 5/21/2014 2:57 AM, Alex Jones wrote: make MAX_SYNC_TRANSFER_SIZE much smaller: 3M instead of 30M. Considering the below fragmentation changes (changesets 5279-5282), I hope
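
The review comment above refers to keeping the section database in a C++ STL map for fast access. As a rough illustration only, the sketch below shows why that helps when a checkpoint holds tens of thousands of sections: lookup by section id in a `std::map` is logarithmic rather than a linear walk. `SectionStore`, `populate`, and the `sec-<n>` naming are hypothetical names invented for this example; they are not the cpnd data structures from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a checkpoint section table; not the cpnd code.
class SectionStore {
 public:
  // Create a section; returns false if a section with this id already exists.
  bool create(const std::string& id, std::vector<unsigned char> data) {
    return sections_.emplace(id, std::move(data)).second;
  }

  // O(log n) lookup by id; returns nullptr when the id is unknown.
  const std::vector<unsigned char>* find(const std::string& id) const {
    auto it = sections_.find(id);
    return it == sections_.end() ? nullptr : &it->second;
  }

  std::size_t size() const { return sections_.size(); }

 private:
  std::map<std::string, std::vector<unsigned char>> sections_;
};

// Fill the store with `count` small sections named "sec-0", "sec-1", ...
void populate(SectionStore& store, std::size_t count) {
  for (std::size_t i = 0; i < count; ++i) {
    bool created = store.create("sec-" + std::to_string(i),
                                std::vector<unsigned char>(16, 0));
    assert(created);
    (void)created;  // silence unused-variable warnings when NDEBUG is set
  }
}
```

With 40k sections (the figure mentioned earlier in this thread), each `find` touches on the order of 16 tree nodes instead of scanning up to 40k list entries.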

[devel] [PATCH 0 of 1] Review Request for osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-06-02 Thread Alex Jones
08c4e042835b7dbf45876c10fd3b1db5ba066319 Author: Alex Jones ajo...@genband.com Date: Mon, 02 Jun 2014 17:03:09 -0400 osaf: port existing daemonization and nid process to systemd new-style daemon [#455] Jun 2 16:55:57 linux-po6q osafrded[16114]: Started Jun 2 16:55:57 linux-po6q osafrded

[devel] [PATCH 1 of 1] osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-06-02 Thread Alex Jones
Makefile.common | 1 + configure.ac | 16 ++ contrib/plmc/Makefile.common | 1 + contrib/plmc/config/plmcd.conf | 4 +

Re: [devel] [PATCH 0 of 1] Review Request for osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-06-10 Thread Alex Jones
one is more strange, though. Do you get this message on your system? Do you get the same log message? Maybe this configuration option has changed in recent versions of systemd? / Anders Widell On 06/02/2014 11:12 PM, Alex Jones wrote: Summary: osaf: port existing daemonization and nid

[devel] [PATCH 0 of 1] Review Request for osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-06-17 Thread Alex Jones
it. No attempt has been made to convert the individual daemons to systemd, except for plmcd, which has been converted to a full-blown systemd daemon. I've built and run this for Fedora 19, OpenSUSE 12.2, and OpenSUSE 13.1. changeset b14cec02030e81cbeaa1404849971205291f148b Author: Alex Jones ajo

[devel] [PATCH 1 of 1] osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-06-17 Thread Alex Jones
Makefile.common | 2 + configure.ac| 18 +++ contrib/plmc/Makefile.common| 1 + contrib/plmc/config/Makefile.am | 6 +

Re: [devel] [PATCH 0 of 1] Review Request for osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-06-17 Thread Alex Jones
versions, maybe another option is needed? / Anders Widell On 06/10/2014 05:30 PM, Alex Jones wrote: Oh great... After a quick perusal of the systemd changelogs, it looks like they removed support for ControlGroup in systemd-205, replacing it with different attributes. ControlGroup

[devel] [PATCH 0 of 1] Review Request for PLM: add support for ATCA entity paths [#881]

2014-06-20 Thread Alex Jones
scope for each y above): - changeset 00bdfc09aa2fc605cdbc7f9a90d19de3e1c95853 Author: Alex Jones ajo...@genband.com Date: Fri, 20 Jun 2014 14:28:49 -0400 plm: add support for ATCA-specific entity paths [#881] Jun 20 14:17:18 linux

[devel] [PATCH 1 of 1] plm: add support for ATCA-specific entity paths [#881]

2014-06-20 Thread Alex Jones
osaf/services/saf/plmsv/plms/hpi_intf/plms_epath_util.c | 32 +--- 1 files changed, 26 insertions(+), 6 deletions(-) Jun 20 14:17:18 linux-po6q osafplmd[20561]: ER epath_matched: 0, he_type_idr_matched: 0#011#011HE. ent_path:

[devel] [PATCH 1 of 1] plm: plm does not always get correct inventory RDR from HPI [#880]

2014-06-20 Thread Alex Jones
osaf/services/saf/plmsv/plms/hpi_intf/plms_hrb.c | 36 +++ osaf/services/saf/plmsv/plms/hpi_intf/plms_hsm.c | 2 +- osaf/services/saf/plmsv/plms/plms_he_pres_fsm.c | 2 +- 3 files changed, 31 insertions(+), 9 deletions(-) Jun 20 13:47:58 linux-po6q osafplmd[4923]: ER

[devel] [PATCH 0 of 1] Review Request for PLM: plm does not always get correct inventory RDR from HPI [#880]

2014-06-20 Thread Alex Jones
n Comments (indicate scope for each y above): - This patch addresses some bugs using HPI. changeset 77393bd4f86585391e7829b082214c6afcb9af4a Author: Alex Jones ajo...@genband.com Date: Fri, 20 Jun 2014 15:45:09 -0400 plm: plm does not always

Re: [devel] [PATCH 1 of 1] osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-07-03 Thread Alex Jones
Responses inline... Alex On 07/02/2014 03:39 AM, Mathivanan Naickan Palanivelu wrote: Alex, Good start to the migration to systemd. After reading the reviews on LWN, my early impression is that even though there are no direct benefits for us, e.g. I haven't seen much of

Re: [devel] [PATCH 1 of 1] osaf: port existing daemonization and nid process to systemd new-style daemon [#455]

2014-07-03 Thread Alex Jones
Responses inline: On 07/03/2014 03:26 PM, Mathivanan Naickan Palanivelu wrote: Comments inline: -Original Message- From: Alex Jones [mailto:ajo...@genband.com] Sent: Friday, July 04, 2014 12:07 AM To: Mathivanan Naickan Palanivelu Cc: opensaf-devel@lists.sourceforge.net Subject

[devel] [PATCH 1 of 1] plm: use SAHPI_HS_ACTION_XXX for activating/deactivating HE if Full-5 HS model [#943]

2014-07-14 Thread Alex Jones
osaf/services/saf/plmsv/plms/hpi_intf/plms_hrb.c | 35 --- osaf/services/saf/plmsv/plms/plms_he_pres_fsm.c | 8 ++-- 2 files changed, 22 insertions(+), 21 deletions(-) Jul 14 16:56:02 linux-po6q osafplmd[32627]: ER plms_hrb_req FAILED. Ent:

[devel] [PATCH 0 of 1] Review Request for plm: use SAHPI_HS_ACTION_XXX for activating/deactivating HE if Full-5 HS model [#943]

2014-07-14 Thread Alex Jones
n Comments (indicate scope for each y above): - changeset 49a80c50cd5c7e8579106e3c328cc42f6fb14b56 Author: Alex Jones ajo...@genband.com Date: Mon, 14 Jul 2014 17:15:35 -0400 plm: use SAHPI_HS_ACTION_XXX for activating

Re: [devel] [PATCH 0 of 5] Review Request for replace localtime with thread safe localtime_r [#958]

2014-07-16 Thread Alex Jones
One comment. From the man page of localtime_r: According to POSIX.1-2004, localtime() is required to behave as though tzset(3) was called, while localtime_r() does not have this requirement. For portable code tzset(3) should be called before local-
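
The man page excerpt above can be sketched roughly as follows, assuming a POSIX system. The helper names `set_timezone` and `format_local` are made up for this illustration and do not come from the patch series under review. The point is only that time zone state is initialised explicitly with `tzset()` before any call to `localtime_r()`.

```cpp
#include <cassert>
#include <cstddef>
#include <ctime>
#include <string>
#include <stdlib.h>  // setenv (POSIX)
#include <time.h>    // tzset, localtime_r (POSIX)

// Set TZ and re-read it. Intended to run once at startup (or whenever TZ
// changes), before worker threads start calling localtime_r().
void set_timezone(const char* tz) {
  int rc = setenv("TZ", tz, 1);
  assert(rc == 0);
  (void)rc;
  tzset();
}

// Thread-safe local-time formatting. localtime_r() writes into a caller-owned
// struct tm, but it is not required to call tzset() itself.
std::string format_local(std::time_t when) {
  struct tm broken_down;
  struct tm* result = localtime_r(&when, &broken_down);
  assert(result != nullptr);
  (void)result;
  char buf[32];
  std::size_t n =
      std::strftime(buf, sizeof buf, "%Y-%m-%d %H:%M:%S", &broken_down);
  return std::string(buf, n);
}
```

A daemon would typically call `tzset()` (here via `set_timezone`) once during initialisation rather than before every conversion.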

[devel] [PATCH 0 of 1] Review Request for PLM: Get inventory data if SAHPI_HS_STATE_INACTIVE event is received [#976]

2014-08-04 Thread Alex Jones
n Comments (indicate scope for each y above): - EXPLAIN/COMMENT THE PATCH SERIES HERE changeset 52c97ed2c36dcbf5d0a3e2135e0134a73670f09a Author: Alex Jones ajo...@genband.com Date: Mon, 04 Aug 2014 17:06:02 -0400 plmc: Get inventory

[devel] [PATCH 1 of 1] plmc: Get inventory data if SAHPI_HS_STATE_INACTIVE event is received [#976]

2014-08-04 Thread Alex Jones
osaf/services/saf/plmsv/plms/hpi_intf/plms_hsm.c | 6 +- osaf/services/saf/plmsv/plms/plms_he_pres_fsm.c | 4 +++- 2 files changed, 8 insertions(+), 2 deletions(-) PLM Presence State for a FRU that is in M1 (present, but inactive) shows up as SA_PLM_HE_PRESENCE_NOT_PRESENT, when PLM

[devel] [PATCH 1 of 1] plm: Add function to state machine to handle NOT_PRESENT-INACTIVE [#976]

2014-08-05 Thread Alex Jones
osaf/services/saf/plmsv/plms/plms_he_pres_fsm.c | 49 - 1 files changed, 48 insertions(+), 1 deletions(-) saPlmHEPresenceState still showing NOT_PRESENT when HPI first starts, and a FRU is PRESENT, but not ACTIVE. The HE is first created with NOT_PRESENT state. When

[devel] [PATCH 0 of 1] Review Request for PLM: Add function to state machine to handle NOT_PRESENT-INACTIVE [#976]

2014-08-05 Thread Alex Jones
n Comments (indicate scope for each y above): - Forgot to submit this code in the original review. changeset b5e399fd0bb8e8fa67bbd7db9156c09b29cb889b Author: Alex Jones ajo...@genband.com Date: Tue, 05 Aug 2014 10:51:14 -0400 plm: Add

[devel] [PATCH 1 of 1] plm: change plms standby behaviour to wait until contact with HPI is made [#977]

2014-08-05 Thread Alex Jones
osaf/libs/common/plmsv/include/plms.h| 1 + osaf/services/saf/plmsv/plms/plms_imm.c | 15 +++ osaf/services/saf/plmsv/plms/plms_main.c | 13 - 3 files changed, 28 insertions(+), 1 deletions(-) SMF rolling upgrade with reboot fails for standby controller when HPI

[devel] [PATCH 0 of 1] Review Request for PLM: increase plmd timeout and add blocking HPI call during plms init [#977]

2014-08-06 Thread Alex Jones
4b0c14f493672ebfe2ebb28b755710f9279f1c65 Author: Alex Jones ajo...@genband.com Date: Wed, 06 Aug 2014 13:17:30 -0400 plm: increase plmd timeout and add blocking HPI call during plms init [#977] The previous patch didn't quite fix the entire problem. Even though the SessionOpen

[devel] [PATCH 1 of 1] plm: increase plmd timeout and add blocking HPI call during plms init [#977]

2014-08-06 Thread Alex Jones
osaf/services/infrastructure/nid/config/nodeinit.conf.controller | 2 +- osaf/services/saf/plmsv/plms/hpi_intf/plms_hsm.c | 40 +++--- 2 files changed, 29 insertions(+), 13 deletions(-) The previous patch didn't quite fix the entire problem. Even though the SessionOpen

[devel] [PATCH 1 of 1] ckptnd: increase performance when creating and writing large numbers of sections [#770]

2014-08-07 Thread Alex Jones
osaf/libs/common/cpsv/include/Makefile.am |1 + osaf/libs/common/cpsv/include/cpnd.h |1 + osaf/libs/common/cpsv/include/cpnd_cb.h | 14 +- osaf/libs/common/cpsv/include/cpnd_init.h | 15 +- osaf/libs/common/cpsv/include/cpnd_sec.h | 60

[devel] [PATCH 0 of 1] Review Request for cpnd: increase performance when creating and writing large numbers of sections [#770]

2014-08-07 Thread Alex Jones
n Comments (indicate scope for each y above): - This patch should fully replace the patch posted on May 20. It contains updates from comments received. changeset 4ede43c2720f9d9599da83b3cd6d93b29b5f3c6d Author: Alex Jones ajo

Re: [devel] [PATCH 1 of 1] ckptnd: increase performance when creating and writing large numbers of sections [#770]

2014-08-11 Thread Alex Jones
; +} = -AVM On 8/7/2014 9:08 PM, Alex Jones wrote: osaf/libs/common/cpsv/include/Makefile.am |1 + osaf/libs/common/cpsv/include/cpnd.h |1 + osaf/libs/common/cpsv/include/cpnd_cb.h | 14 +- osaf/libs/common/cpsv/include/cpnd_init.h | 15 +- osaf

[devel] [PATCH 1 of 1] plm: during admin RESET of HE return response to IMM immediately if there is no EE [#978]

2014-08-15 Thread Alex Jones
osaf/services/saf/plmsv/plms/plms_adm_fsm.c | 29 - 1 files changed, 20 insertions(+), 9 deletions(-) Doing PLM admin RESET on an HE with no EE never returns. The code assumes that there is always an EE associated with an HE, which is not true. It always waits for

[devel] [PATCH 0 of 1] Review Request for PLM: during admin RESET of HE return response to IMM immediately if there is no EE [#978]

2014-08-15 Thread Alex Jones
n Other n Comments (indicate scope for each y above): - EXPLAIN/COMMENT THE PATCH SERIES HERE changeset d2e9dbb3cb18c9b52c51edff4760b79fb329bfb6 Author: Alex Jones ajo...@genband.com Date: Fri, 15 Aug 2014 11:06:13 -0400

[devel] [PATCH 1 of 1] plm: clear HE operational state when HE is removed [#1011]

2014-08-22 Thread Alex Jones
osaf/services/saf/plmsv/plms/hpi_intf/plms_hsm.c | 24 +++- osaf/services/saf/plmsv/plms/plms_utils.c| 11 ++- 2 files changed, 25 insertions(+), 10 deletions(-) Aug 22 21:14:18 linux osafplmd[10076]: ER Entity safHE=XXX,safDomain=xxxDomain can not be

[devel] [PATCH 0 of 1] Review Request for PLM: ignore MDS events from other nodes [#1026]

2014-08-27 Thread Alex Jones
Comments (indicate scope for each y above): - EXPLAIN/COMMENT THE PATCH SERIES HERE changeset 80c9d0759c9cc818e8416f1d4fdd9aee60598b6f Author: Alex Jones ajo...@genband.com Date: Wed, 27 Aug 2014 17:39:09 -0400 plm: ignore MDS events from other

Re: [devel] [PATCH 1 of 1] plm: ignore MDS events from other nodes [#1026]

2014-08-28 Thread Alex Jones
Yes, much better solution. I will resubmit the patch. Alex On 08/28/2014 01:55 AM, Nagendra Kumar wrote: Hi Alex, I would suggest to change subscription to NCSMDS_SCOPE_INTRANODE instead. Thanks -Nagu -Original Message- From: Alex Jones [mailto:ajo

[devel] [PATCH 0 of 1] Review Request for PLM: change PLMS_HRB MDS subscription to INTRANODE [#1026]

2014-08-28 Thread Alex Jones
93890d241919770f80a9f431fcb56398d076e819 Author: Alex Jones ajo...@genband.com Date: Thu, 28 Aug 2014 10:17:37 -0400 plm: change PLMS_HRB MDS subscription to INTRANODE [#1026] Aug 27 0:32:50.265452 osafplmd [12007:plms_common_mds.c:0081] ER INVALID MSG FORMAT IN ENC When

Re: [devel] [PATCH 1 of 1] plm: change PLMS_HRB MDS subscription to INTRANODE [#1026]

2014-08-28 Thread Alex Jones
Yes, it is always reproducible. I think this came out due to checkin 5581 (ticket #977). In order to handle a fast switchover during SMF upgrade, we decided to start the HPI thread on the standby at init time and wait for the inventory to be there before responding. Alex On 08/28/2014 10:48

[devel] [PATCH 0 of 1] Review Request for plm: don't reset self during plms_ee_instantiate [#1031]

2014-09-03 Thread Alex Jones
Comments (indicate scope for each y above): - EXPLAIN/COMMENT THE PATCH SERIES HERE changeset 98bb99f081bbdc340a151425e94a418071390d59 Author: Alex Jones ajo...@genband.com Date: Tue, 02 Sep 2014 13:20:02 -0400 plm: don't reset self during

[devel] [PATCH 1 of 1] plm: don't reset self during plms_ee_instantiate [#1031]

2014-09-03 Thread Alex Jones
# HG changeset patch # User Alex Jones ajo...@genband.com # Date 1409678402 14400 # Tue Sep 02 13:20:02 2014 -0400 # Node ID 98bb99f081bbdc340a151425e94a418071390d59 # Parent 12251687a7e6ade049c5de9b3e4536b98e5b2091 plm: don't reset self during plms_ee_instantiate [#1031] Active

[devel] [PATCH 1 of 1] plm: don't reset self during plms_ee_instantiate [#1031]

2014-09-09 Thread Alex Jones
# HG changeset patch # User Alex Jones ajo...@genband.com # Date 1410281601 14400 # Tue Sep 09 12:53:21 2014 -0400 # Node ID 1e666e56246f4f18063be688cbadb48465ee4cbd # Parent 12251687a7e6ade049c5de9b3e4536b98e5b2091 plm: don't reset self during plms_ee_instantiate [#1031] Active
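
The `# Date` header in these Mercurial patches carries two integers: seconds since the Unix epoch, and the author's offset in seconds west of UTC. The human-readable comment that follows it can be derived from that pair. A minimal sketch of the arithmetic, assuming POSIX `gmtime_r`; the function name `format_hg_date` is invented for this example:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdio>
#include <cstdlib>
#include <ctime>
#include <string>
#include <time.h>  // gmtime_r (POSIX)

// Render Mercurial's "<unix seconds> <seconds west of UTC>" pair as the
// author's wall-clock time followed by a +hhmm / -hhmm zone suffix.
std::string format_hg_date(long long unix_seconds, long west_of_utc_seconds) {
  // Shift to the author's wall clock, then break it down as if it were UTC.
  std::time_t shifted =
      static_cast<std::time_t>(unix_seconds - west_of_utc_seconds);
  struct tm parts;
  struct tm* ok = gmtime_r(&shifted, &parts);
  assert(ok != nullptr);
  (void)ok;

  char stamp[32];
  std::size_t n =
      std::strftime(stamp, sizeof stamp, "%Y-%m-%d %H:%M:%S", &parts);

  // A positive "west" offset prints with a minus sign: 14400 -> -0400.
  long east = -west_of_utc_seconds;
  long magnitude = std::labs(east);
  char zone[32];
  std::snprintf(zone, sizeof zone, "%c%02ld%02ld", east < 0 ? '-' : '+',
                magnitude / 3600, (magnitude % 3600) / 60);

  return std::string(stamp, n) + " " + zone;
}
```

For the two submissions of this patch, `format_hg_date(1409678402, 14400)` gives `2014-09-02 13:20:02 -0400` and `format_hg_date(1410281601, 14400)` gives `2014-09-09 12:53:21 -0400`, matching the comments in the patch headers.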

[devel] [PATCH 0 of 1] Review Request for plm: don't reset self during plms_ee_instantiate [#1031]

2014-09-09 Thread Alex Jones
n Comments (indicate scope for each y above): - changeset 1e666e56246f4f18063be688cbadb48465ee4cbd Author: Alex Jones ajo...@genband.com Date: Tue, 09 Sep 2014 12:53:21 -0400 plm: don't reset self during plms_ee_instantiate [#1031

Re: [devel] [PATCH 1 of 1] plm: don't reset self during plms_ee_instantiate [#1031]

2014-09-18 Thread Alex Jones
Hi Mathi, This was flagged by valgrind as an overlapping buffer copy. It's just copying itself onto itself! And then in the next line this same structure gets filled in by ioctl(). It's really just bad, dead code, so I removed it. I will take a look at 1073. Alex On 09/18/2014

[devel] OpenSAF in the cloud

2014-10-16 Thread Alex Jones
Hey Anders and Mathi, Do you have a pointer to your presentation that you can share, for those of us who were not able to make it? Has anyone written a document which offers best practices on using OpenSAF in the cloud? For instance, I see that container/contained is not yet

[devel] N+M standby assignment to two different SUs

2014-10-23 Thread Alex Jones
Hi Guys, I'm chasing an issue with N+M, and before I dig deeper to fix it I wanted to see if any of you know about it. Here's what I do: 1. si-swap of OpenSAF-2N controller 2. AMF node lock of now standby controller This (now OpenSAF standby) controller has an active

Re: [devel] N+M standby assignment to two different SUs

2014-10-24 Thread Alex Jones
Message- From: Nagendra Kumar Sent: 25 October 2014 00:11 To: Alex Jones; Hans Feldt; Praveen Malviya Cc: opensaf-devel@lists.sourceforge.net Subject: Re: [devel] N+M standby assignment to two different SUs My mistake, I had given red as Nway. But please confirm the release. Thanks -Nagu

Re: [devel] N+M standby assignment to two different SUs

2014-10-24 Thread Alex Jones
=AmfDemo2,safSi=AmfDemo1,safApp=AmfDemo2 saAmfSISUHAState=ACTIVE(1) Here safSi=AmfDemo,safApp=AmfDemo2 only has Standby assignment with no Act. Thanks -Nagu -Original Message- From: Alex Jones [mailto:ajo...@genband.com] Sent: 25 October 2014 00:33 To: Nagendra Kumar; Hans

[devel] [PATCH 1 of 1] plm: allow dynamic modification of saPlmDependency [#1030]

2014-10-27 Thread Alex Jones
osaf/services/saf/plmsv/plms/plms_imm.c | 101 + osaf/services/saf/plmsv/plms/plms_utils.c | 14 +++- 2 files changed, 87 insertions(+), 28 deletions(-) Attempting to change saPlmDependency returns SA_AIS_ERR_BAD_OPERATION. The code forbids modifying

[devel] [PATCH 0 of 1] Review Request for PLM: allow dynamic modification of saPlmDependency [#1030]

2014-10-27 Thread Alex Jones
(indicate scope for each y above): - changeset 3cb7fd545a9ded536b48acdfb5e4eb8717811099 Author: Alex Jones ajo...@genband.com Date: Mon, 27 Oct 2014 15:37:26 -0400 plm: allow dynamic modification of saPlmDependency [#1030] Attempting

[devel] [PATCH 0 of 1] Review Request for PLM: allow dynamic

2014-11-18 Thread Alex Jones
Comments (indicate scope for each y above): - Same patch as before, but with extra log message removed. changeset e713dc3adcbf158c8da1ee86d9a8632b93ad1fe0 Author: Alex Jones ajo...@genband.com Date: Tue, 18 Nov 2014 11:10:41 -0500 plm: allow

[devel] [PATCH 1 of 1] plm: fix memory corruption in plms [#1244]

2015-01-22 Thread Alex Jones
osaf/services/saf/plmsv/plms/hpi_intf/plms_epath_util.c | 2 +- osaf/services/saf/plmsv/plms/plms_adm_fsm.c | 4 ++-- 2 files changed, 3 insertions(+), 3 deletions(-) Symptoms of the problem include PLMS crashing, and the node resetting because of the crash. Valgrind shows memory

Re: [devel] [PATCH 1 of 1] amfd: fix interaction with plm/clm for node left [#1264]

2015-03-16 Thread Alex Jones
Mathi, I am performing a PLM_ADMIN_RESET of the active controller. The problem is that amfd on the active controller is forcing a failover before the node left is received on the standby. This results in all the core OpenSAF services on the standby failing at active assignment because their

[devel] [PATCH 0 of 1] Review Request for amfd: fix interaction between plm/clm for node left [#1264]

2015-03-17 Thread Alex Jones
n Comments (indicate scope for each y above): - changeset 46c53346ea5deab1e3cd5059c1c9f6ffeb40d76f Author: Alex Jones ajo...@genband.com Date: Tue, 17 Mar 2015 11:48:29 -0400 amfd: fix interaction with plm/clm for node left [#1264

[devel] [PATCH 1 of 1] amfd: fix interaction with plm/clm for node left [#1264]

2015-03-17 Thread Alex Jones
osaf/services/saf/amf/amfd/clm.cc | 6 ++ 1 files changed, 6 insertions(+), 0 deletions(-) When active controller is hard reset, standby reboots, too, because the OpenSAF services all fail in active assignment because saImmOiClassImplementerSet fails. When amfd on the active controller

Re: [devel] [PATCH 3 of 3] plm: Fix GCC 5.1.0 compiler warning [#1340]

2015-05-05 Thread Alex Jones
Ack, not tested. Alex On 05/05/2015 02:47 AM, Mathivanan Naickan Palanivelu wrote: Including Alex. Mathi. - anders.wid...@ericsson.com wrote: osaf/services/saf/plmsv/plms/hpi_intf/plms_hsm.c | 2 ++ 1 files changed, 2 insertions(+), 0 deletions(-) The following warning was

[devel] [PATCH 1 of 1] plm: fix blade extraction mechanism [#1378]

2015-06-02 Thread Alex Jones
osaf/services/saf/plmsv/plms/hpi_intf/plms_hsm.c | 204 +++--- osaf/services/saf/plmsv/plms/plms_amf.c | 19 +-- 2 files changed, 71 insertions(+), 152 deletions(-) Blade extraction does not work consistently. If you extract a blade, and AMF should reject it (because

[devel] [PATCH 0 of 1] Review Request for PLM: fix blade extraction mechanism

2015-06-02 Thread Alex Jones
n Comments (indicate scope for each y above): - EXPLAIN/COMMENT THE PATCH SERIES HERE changeset 1d5402442516a05d4ade8a4323796683128d1a83 Author: Alex Jones ajo...@genband.com Date: Tue, 02 Jun 2015 16:33:43 -0400 plm: fix blade

Re: [devel] saAmfNodeAdminState different between IMM and amfd

2015-05-27 Thread Alex Jones
clm_node_exit_start finishes? 2. And then add the below code in avd_node_down_mw_susi_failover? Alex On 05/27/2015 09:09 AM, praveen malviya wrote: On 27-May-15 2:58 AM, Alex Jones wrote: Praveen/Nagu, I'm seeing an issue where the node admin state is different between IMM and amfd. I

[devel] discrepancies with released tar file and Mercurial

2015-08-07 Thread Alex Jones
Guys, I'm seeing a huge leak in cpnd. There is something very strange with the released software, though. Here is the leak from the released tar file (official 4.5.1.tar.gz and 4.6.0.tar.gz both have this): cpnd_proc.c:272 send_evt.info.cpa.info.arr_msg.ckpt_data = NULL; This

Re: [devel] AMF/CLM validation question

2015-07-23 Thread Alex Jones
There are a few inconsistencies here, I believe. Firstly, we don't prevent an AMF node lock of the active controller, but we do prevent a CLM node lock of the active controller. Why the difference? Secondly, I'm also thinking of blade extraction as a use case here. If the operator wants to

Re: [devel] AMF/CLM validation question

2015-07-23 Thread Alex Jones
Disregard the first question. I realized after I hit the send button that AMF node lock doesn't affect the internal OpenSAF daemons. But I am still curious about the second. Alex On 07/23/2015 10:56 AM, Alex Jones wrote: There are a few inconsistencies here, I believe. Firstly, we don't

Re: [devel] [PATCH 1 of 1] clmd: don't always checkpoint nodes in plm readiness callback completed state [#1416]

2015-07-23 Thread Alex Jones
Comments inline: On 07/23/2015 12:41 PM, Mathivanan Naickan Palanivelu wrote: Comments inline: - ajo...@genband.com wrote: osaf/services/saf/clmsv/clms/clms_plm.c | 12 +--- 1 files changed, 9 insertions(+), 3 deletions(-) At active controller switchover, the newly active

[devel] [PATCH 0 of 1] Review Request for plm: don't restart EE in opensaf_reboot unless it is instantiated [#1415]

2015-07-14 Thread Alex Jones
Other n Comments (indicate scope for each y above): - changeset 94ecad7888f6dc9b0d609602c507b27873e902f8 Author: Alex Jones ajo...@genband.com Date: Mon, 13 Jul 2015 15:13:04 -0400 plm: don't restart EE in opensaf_reboot

[devel] [PATCH 1 of 1] amfd: fix N+M identification of standby SU at controller failover [#1414]

2015-07-13 Thread Alex Jones
osaf/services/saf/amf/amfd/sg_npm_fsm.cc | 5 - 1 files changed, 4 insertions(+), 1 deletions(-) An N+M SU, with an active assignment, can also get a standby assignment at the same time. Details for reproducing this are in the ticket. The SUSI returned from avd_sg_npm_su_othr() is not

[devel] [PATCH 1 of 1] clmd: don't always checkpoint nodes in plm readiness callback completed state [#1416]

2015-07-14 Thread Alex Jones
osaf/services/saf/clmsv/clms/clms_plm.c | 12 +--- 1 files changed, 9 insertions(+), 3 deletions(-) At active controller switchover, the newly active sends node down events for nodes that are not down. There is a race condition on the standby between receiving the node_down checkpoint

[devel] [PATCH 0 of 1] Review Request for clmd: don't always checkpoint nodes in plm readiness callback completed state [#1416]

2015-07-14 Thread Alex Jones
n Comments (indicate scope for each y above): - changeset 7105a8f6ef5c0ba8928b66388058a852bb6e874c Author: Alex Jones ajo...@genband.com Date: Tue, 14 Jul 2015 14:38:59 -0400 clmd: don't always checkpoint nodes in plm readiness

[devel] [PATCH 0 of 1] Review Request for lcknd: fix deadlock when master lcknd is rebooted [#1488]

2015-09-17 Thread Alex Jones
n Comments (indicate scope for each "y" above): - <> changeset 799bdb0e951ab07e9bc9f21ba20ffeaacb5c199c Author: Alex Jones <ajo...@genband.com> Date: Thu, 17 Sep 2015 12:31:00 -0400 lcknd: fix lock sync bug during

[devel] [PATCH 0 of 1] Review Request for CKPT: fix crash in cpnd when checkpoint open sync to active times out [#1510]

2015-10-01 Thread Alex Jones
Other n Comments (indicate scope for each "y" above): - <> changeset 923566e6c96312c15330b4e8ed0c81a80a2701f0 Author: Alex Jones <ajo...@genband.com> Date: Thu, 01 Oct 2015 12:56:53 -0400 ckptnd: fi

[devel] [PATCH 1 of 1] ckptnd: fix crash when checkpoint open sync to active times out [#1510]

2015-10-01 Thread Alex Jones
osaf/services/saf/cpsv/cpnd/cpnd_evt.c | 10 ++ 1 files changed, 10 insertions(+), 0 deletions(-) ckptnd core dumps with many different stack traces. When a collocated checkpoint replica is opened, and the active replica has large numbers of sections (~200k), the sync from the active

[devel] [PATCH 1 of 1] plm: fix deactivate for ATCA [#1289]

2015-09-18 Thread Alex Jones
osaf/services/saf/plmsv/plms/plms_he_pres_fsm.c | 6 ++ 1 files changed, 6 insertions(+), 0 deletions(-) When SA_PLM_ADMIN_DEACTIVATE is issued for an ATCA FRU, the HE presence state gets stuck in SA_PLM_HE_PRESENCE_DEACTIVATING, and never transitions to SA_PLM_HE_PRESENCE_INACTIVE. The
