[users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature

2016-10-10 Thread Jianfeng Dong
dify to enable the 'headless cluster' feature? Did we miss anything? Attached are the syslogs of the SC and the payload card from when this problem happened; we hope the log files help to find the root cause. Any comments are much appreciated,

Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature

2016-10-10 Thread Jianfeng Dong
/IMMND link among those nodes, but don't know how to prove it. From: Neelakanta Reddy [mailto:reddy.neelaka...@oracle.com] Sent: Monday, October 10, 2016 8:39 PM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "

Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature

2016-10-10 Thread Jianfeng Dong
2016 12:55 AM To: Jianfeng Dong ; Neelakanta Reddy ; opensaf-users@lists.sourceforge.net Subject: Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature There is a (probably not so well documented :-) assumption that the system controllers are co

Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature

2016-10-11 Thread Jianfeng Dong
: Anders Widell [mailto:anders.wid...@ericsson.com] Sent: Tuesday, October 11, 2016 2:30 PM To: Jianfeng Dong ; Neelakanta Reddy ; opensaf-users@lists.sourceforge.net Subject: Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature There is a one-to-one mapping bet

Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature

2016-10-11 Thread Jianfeng Dong
ility issues. Thanks, Jianfeng -Original Message- From: Anders Widell [mailto:anders.wid...@ericsson.com] Sent: Tuesday, October 11, 2016 4:10 PM To: Jianfeng Dong ; Neelakanta Reddy ; opensaf-users@lists.sourceforge.net Subject: Re: [users] OpenSAF release 5.0.1 can not promote SC after e

Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature

2016-10-11 Thread Jianfeng Dong
esday, October 11, 2016 5:59 PM To: Jianfeng Dong ; Neelakanta Reddy ; opensaf-users@lists.sourceforge.net Subject: Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature I can send you a patch within the next few days and let you try it out. r

Re: [users] OpenSAF release 5.0.1 can not promote SC after enable "headless cluster" feature

2016-10-12 Thread Jianfeng Dong
letely, now looking forward to your patch very much! Thanks in advance. Thanks, Jianfeng -Original Message- From: Jianfeng Dong Sent: Tuesday, October 11, 2016 8:48 PM To: 'Anders Widell' ; Neelakanta Reddy ; opensaf-users@lists.sourceforge.net Subject: RE: [users] OpenSAF

[users] Max timeout limit problem in the "headless cluster" feature

2016-10-18 Thread Jianfeng Dong
ebooting for a long enough time until the controller comes back, and this change should not be very risky. So, could you please save "IMMSV_SC_ABSENCE_ALLOWED" in a 32-bit unsigned variable so that the payload can keep running for more than 18 hours in "headless" mode? Than
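The 18-hour figure in the request above is consistent with a timeout counted in seconds and held in a 16-bit unsigned field, since 2^16 - 1 = 65535 s is roughly 18.2 hours. That the limit comes from a 16-bit field is an inference and is not stated in the snippet. The helper names below are hypothetical and the wrap-around behaviour is only an illustration of what a narrow field could do, not a description of how OpenSAF handles the value. A minimal sketch of the arithmetic:

```python
# Assumption: the timeout is a number of seconds, and the current limit
# comes from a 16-bit unsigned field. The snippet itself only mentions
# "more than 18 hours" and asks for a 32-bit unsigned variable.
# All function names here are hypothetical, for illustration only.

def max_timeout_seconds(bits: int) -> int:
    """Largest value an unsigned integer of the given width can hold."""
    return (1 << bits) - 1


def as_hours(seconds: int) -> float:
    """Convert a duration in seconds to hours."""
    return seconds / 3600


def stored_value(requested_seconds: int, bits: int) -> int:
    """Value left over if a request is truncated to the field width.

    This models plain wrap-around; a real implementation might clamp
    or reject the value instead.
    """
    return requested_seconds & max_timeout_seconds(bits)


if __name__ == "__main__":
    for bits in (16, 32):
        limit = max_timeout_seconds(bits)
        print(f"{bits}-bit limit: {limit} s = {as_hours(limit):.1f} h")

    one_day = 24 * 3600  # a timeout longer than the 16-bit limit
    print("one day in 16 bits:", stored_value(one_day, 16))
    print("one day in 32 bits:", stored_value(one_day, 32))
```

Under these assumptions a one-day timeout does not fit in 16 bits but fits easily in 32 bits, which is the effect the request is after.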

Re: [users] Max timeout limit problem in the "headless cluster" feature

2016-10-20 Thread Jianfeng Dong
Much appreciated! I notice the milestone is 5.0.2, but the fix for this ticket will go into 5.1.0 as well, right? Regards, Jianfeng From: Hung Nguyen [mailto:hung.d.ngu...@dektech.com.au] Sent: Thursday, October 20, 2016 6:48 PM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net

Re: [users] Max timeout limit problem in the "headless cluster" feature

2016-10-21 Thread Jianfeng Dong
Thanks! Regards, Jianfeng From: Hung Nguyen [mailto:hung.d.ngu...@dektech.com.au] Sent: Friday, October 21, 2016 11:11 AM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: Re: [users] Max timeout limit problem in the "headless cluster" feature Yes, it will be pus

[users] How to prevent system auto-reboot if "osafamfd/osafamfnd" is killed?

2016-10-22 Thread Jianfeng Dong
auto-reboot function for a little while at running time? Thanks! Regards, Jianfeng Dong

Re: [users] How to prevent system auto-reboot if "osafamfd/osafamfnd" is killed?

2016-10-22 Thread Jianfeng Dong
eback [mailto:hans.nordeb...@ericsson.com] Sent: Saturday, October 22, 2016 5:19 PM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: Re: [users] How to prevent system auto-reboot if "osafamfd/osafamfnd" is killed? Hi, you can uncomment AMFWDOG_TIMEOUT_MS in /etc/opensaf/amfwdog.

Re: [users] How to prevent system auto-reboot if "osafamfd/osafamfnd" is killed?

2016-10-24 Thread Jianfeng Dong
Thanks for clearing that up! So we have to change our design to resolve this issue. Regards, Jianfeng -Original Message- From: praveen malviya [mailto:praveen.malv...@oracle.com] Sent: Monday, October 24, 2016 7:56 PM To: Jianfeng Dong ; Hans Nordeback ; opensaf-users

Re: [users] Max timeout limit problem in the "headless cluster" feature

2016-10-25 Thread Jianfeng Dong
Got it, probably we will upgrade to the next release. Thanks a lot! Regards, Jianfeng -Original Message- From: Zoran Milinkovic [mailto:zoran.milinko...@ericsson.com] Sent: Tuesday, October 25, 2016 2:48 PM To: Jianfeng Dong ; Hung Duc Nguyen ; opensaf-users@lists.sourceforge.net

[users] How to detect if PLD is being in "SC Absence" mode?

2017-03-01 Thread Jianfeng Dong
Hi, We have enabled the new "SC Absence" feature of OpenSAF 5.x in our product, and it works well so far. Now we need to take some actions when a PLD enters or leaves "SC Absence" mode, so we have to find a way on the PLD to detect whether it is in "SC Absence" mode. So, does anyone know how to do it by

Re: [users] How to detect if PLD is being in "SC Absence" mode?

2017-03-10 Thread Jianfeng Dong
SAM(SC Absent Mode) Regards, Jianfeng -Original Message- From: praveen malviya [mailto:praveen.malv...@oracle.com] Sent: Friday, March 10, 2017 1:59 PM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: Re: [users] How to detect if PLD is being in "SC Ab

[users] Question about ticket 1617

2017-03-20 Thread Jianfeng Dong
Hi, We have a bug that seems related to ticket 1617 ( https://sourceforge.net/p/opensaf/tickets/1617/ ). We just found that OpenSAF 5.0 has resolved the issue, but we don't know how to reproduce it in our release with OpenSAF 4.5.2 so that we can confirm it is caused by ticket 1617 indee

Re: [users] Question about ticket 1617

2017-03-21 Thread Jianfeng Dong
our build environment. Thanks, Jianfeng -Original Message- From: Zoran Milinkovic [mailto:zoran.milinko...@ericsson.com] Sent: Monday, March 20, 2017 11:15 PM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: RE: Question about ticket 1617 Hi Jianfeng, Ticket

Re: [users] Question about ticket 1617

2017-03-22 Thread Jianfeng Dong
Thanks, I will take a try in this way. Have a good day! :-) Regards, Jianfeng -Original Message- From: Zoran Milinkovic [mailto:zoran.milinko...@ericsson.com] Sent: Wednesday, March 22, 2017 12:04 AM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: RE: Question about

[users] osafamfd coredump issue

2017-05-23 Thread Jianfeng Dong
Hi, We also got an 'osafamfd' coredump on our controller board; could someone please take a look at the issue? Thanks in advance. I have listed the backtrace info here but have not attached the coredump file (due to the email size limit), so please let me know if you need more information. root@scm1:/co

[users] osafimmnd coredump issue

2017-05-23 Thread Jianfeng Dong
Hi, We got an 'osafimmnd' core dump in our chassis; could someone please take a look at the issue? Thanks. I can't attach the coredump file here due to the OpenSAF email size limit policy, so please let me know if you need more information on the issue. atlas@scm1:/coredumps/ $ gdb /usr/lib64

Re: [users] osafimmnd coredump issue

2017-05-23 Thread Jianfeng Dong
that the problem was introduced by ticket #1848. > I'm working on this now. > > Can you reproduce the problem ? > If you can, can you provide the steps ? > > Thanks, > Zoran > > -Original Message- > From: Jianfeng Dong [mailto:jd...@juniper.net] > Sent: den 2

Re: [users] osafimmnd coredump issue

2017-05-23 Thread Jianfeng Dong
Hi, We got an 'osafimmnd' core dump in our chassis; could someone please take a look at the issue? Thanks. I can't attach the coredump file because it would make this email too big and get it blocked by the mail server, so I just list the backtrace info here; please let me know if you need more informatio

Re: [users] osafimmnd coredump issue

2017-05-23 Thread Jianfeng Dong
OpenSAF 5.1.0, and it will run with "SC Absent" feature enabled. Regards, Jianfeng -Original Message- From: Zoran Milinkovic [mailto:zoran.milinko...@ericsson.com] Sent: Tuesday, May 23, 2017 10:30 PM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: RE:

Re: [users] osafamfd coredump issue

2017-05-24 Thread Jianfeng Dong
avor to open a ticket for the issue? I just tried to register on SourceForge but failed; the registration page always complains with something like “Form security missing”. Thanks, Jianfeng -Original Message- From: praveen malviya [mailto:praveen.malv...@oracle.com] Sent: Wednesday, May 24,

Re: [users] osafimmnd coredump issue

2017-05-24 Thread Jianfeng Dong
Sending again because of the email size limit. From: Jianfeng Dong Sent: Wednesday, May 24, 2017 6:08 PM To: 'Zoran Milinkovic' ; 'opensaf-users@lists.sourceforge.net' Subject: RE: osafimmnd coredump issue Hi Zoran, It seems the issue is hard to repro, I checked the syslog an

Re: [users] osafamfd coredump issue

2017-05-25 Thread Jianfeng Dong
Thank you Praveen, I will upload those files asap. The help is much appreciated! Thanks, Jianfeng -Original Message- From: praveen malviya [mailto:praveen.malv...@oracle.com] Sent: Thursday, May 25, 2017 4:48 PM To: Jianfeng Dong Cc: opensaf-users@lists.sourceforge.net Subject

[users] Payload card reboot due to a short time network break

2018-03-08 Thread Jianfeng Dong
Hi, Several days ago we had a payload card reboot issue at a customer site: a PLD lost its connection with the SC for a little while (about 10 seconds), then the SC forced the PLD to reboot even though the PLD was going into “SC Absent mode”. System summary: our product is a system with 2 SC boards and at mo

Re: [users] Payload card reboot due to a short time network break

2018-03-09 Thread Jianfeng Dong
ction recovered quickly. Regards, Jianfeng -Original Message- From: Anders Widell [mailto:anders.wid...@ericsson.com] Sent: Thursday, March 8, 2018 8:38 PM To: Jianfeng Dong ; opensaf-users@lists.sourceforge.net Subject: Re: [users] Payload card reboot due to a short time network brea

Re: [users] Payload card reboot due to a short time network break

2018-03-13 Thread Jianfeng Dong
understand it definitely would not be easy to make such a change. Thanks, Jianfeng From: Anders Widell [mailto:anders.wid...@ericsson.com] Sent: Monday, March 12, 2018 7:52 PM To: Mathi N P ; Jianfeng Dong Cc: opensaf-users@lists.sourceforge.net Subject: Re: [users] Payload card reboot due to a

Re: [users] Payload card reboot due to a short time network break

2018-04-09 Thread Jianfeng Dong
From: Jianfeng Dong Sent: Tuesday, March 13, 2018 5:38 PM To: Anders Widell ; Mathi N P Cc: opensaf-users@lists.sourceforge.net Subject: RE: [users] Payload card reboot due to a short time network break Anders, As you can see in those logs we had set the TIPC link tolerance to 10 seconds, I’m

Re: [users] Payload card reboot due to a short time network break

2018-04-10 Thread Jianfeng Dong
: Jianfeng Dong Cc: opensaf-users@lists.sourceforge.net Subject: Re: [users] Payload card reboot due to a short time network break The only way to be sure if it is appropriate is to test under realistic conditions. I agree that it makes sense to increase it so that it is larger than the TIPC link