Thanks. /Ted



-------- Original message --------
From: A V Mahesh
Date:2015/04/13 10:07 PM (GMT+08:00)
To: Yao Cheng LIANG ,Mathivanan Naickan Palanivelu
Cc: [email protected]
Subject: Re: [users] how long it takes to detect node sudden power loss

Hi,

Uncomment the line below in /etc/opensaf/dtmd.conf to enable osafdtm tracing:

#args="--tracemask=0xffffffff"   ------>  args="--tracemask=0xffffffff"

Then run `export MDS_LOG_LEVEL=5` on both node consoles before
`/etc/init.d/opensafd restart`
to get debug-level MDS logs.
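
The dtmd.conf edit above could be scripted roughly as follows. This is only a sketch in Python: enable_args_line is a hypothetical helper (not part of OpenSAF), and it assumes the commented line has the exact '#args=' form shown above.

```python
import re


def enable_args_line(conf_text: str) -> str:
    """Uncomment the first '#args=...' line in dtmd.conf-style text.

    Hypothetical helper for illustration; all other lines are left untouched.
    """
    # Match an optional indent, '#', optional spaces, then the args= setting.
    pattern = re.compile(r'^([ \t]*)#[ \t]*(args=.*)$', re.MULTILINE)
    return pattern.sub(r'\1\2', conf_text, count=1)


# Sample text built only from lines quoted in this thread.
sample = 'DTM_TCP_KEEPIDLE_TIME=2\n#args="--tracemask=0xffffffff"\n'
print(enable_args_line(sample))
```

To apply it for real, read /etc/opensaf/dtmd.conf, pass the text through the function, write it back, and then restart opensafd as described above.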


-AVM


On 4/13/2015 11:52 AM, Yao Cheng LIANG wrote:
Dear AVM,

Thanks. But I need to add “args="--loglevel=info"” to dtmd.conf so that 
/var/log/opensaf/osafdtm and /var/log/opensaf/mds.log can be seen, right?

Ted


From: A V Mahesh [mailto:[email protected]]
Sent: Monday, April 13, 2015 1:03 PM
To: Yao Cheng LIANG; Mathivanan Naickan Palanivelu
Cc: [email protected]
Subject: Re: [users] how long it takes to detect node sudden power loss

Hi Ted,

On 4/10/2015 3:54 PM, Yao Cheng LIANG wrote:
I rebooted the “standby” node 30 times, and found that twice it took 1~2
minutes for the “active” node to detect it

Can you please share the following data from both nodes for the cases where
the “active” node took 1~2 minutes to detect the standby:

1) #/var/log/opensaf/osafdtm
2) #/var/log/opensaf/mds.log
3) #/var/log/messages ( syslog )

4) #top    (output at the time of detection)
5) /etc/opensaf/dtmd.conf

-AVM


On 4/10/2015 3:54 PM, Yao Cheng LIANG wrote:
I did some tests recently. I have two controllers; I reboot one and measure how
long the other takes to detect the failure of its peer. I rebooted the
“standby” node 30 times, and found that twice it took 1~2 minutes for the
“active” node to detect it. Could anyone tell me the reason and the solution?

Thanks.

Ted


From: Mathivanan Naickan Palanivelu
Sent: Thursday, April 9, 2015 7:39 PM
To: Yao Cheng LIANG
Cc: [email protected], 'A V Mahesh'

I think that, since these are TCP keepalive configuration values,
the connection loss would be detected immediately in the case
of an abrupt power shutdown or cable unplug.
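
For reference, the 4-second figure discussed in this thread follows from the usual keepalive arithmetic: idle time plus probe interval times probe count. Below is a minimal Python sketch, assuming the DTM_TCP_* settings map directly onto the kernel's TCP keepalive idle time, probe interval and probe count. The function names are hypothetical and not part of OpenSAF.

```python
def keepalive_detection_seconds(idle: int, interval: int, probes: int) -> int:
    """Approximate worst-case seconds before keepalive declares a silent peer dead.

    Assumes an otherwise idle connection: the first probe goes out after
    `idle` seconds, and the connection is dropped after `probes` unanswered
    probes sent `interval` seconds apart.
    """
    if idle < 0 or interval < 0 or probes < 0:
        raise ValueError("keepalive parameters must be non-negative")
    return idle + interval * probes


def parse_dtm_keepalive(conf_text: str) -> dict:
    """Extract the three DTM_TCP_* keepalive settings from KEY=VALUE text."""
    wanted = {
        "DTM_TCP_KEEPIDLE_TIME",
        "DTM_TCP_KEEPALIVE_INTVL",
        "DTM_TCP_KEEPALIVE_PROBES",
    }
    values = {}
    for raw in conf_text.splitlines():
        line = raw.strip()
        # Skip blanks, comments, and anything that is not KEY=VALUE.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() in wanted:
            values[key.strip()] = int(value.strip())
    return values


# The values from the original question in this thread.
conf = """
DTM_TCP_KEEPIDLE_TIME=2
DTM_TCP_KEEPALIVE_INTVL=1
DTM_TCP_KEEPALIVE_PROBES=2
"""
settings = parse_dtm_keepalive(conf)
print(keepalive_detection_seconds(
    settings["DTM_TCP_KEEPIDLE_TIME"],
    settings["DTM_TCP_KEEPALIVE_INTVL"],
    settings["DTM_TCP_KEEPALIVE_PROBES"],
))  # 2 + 1*2 = 4
```

Lowering any of the three values shortens this bound, at the cost of more probe traffic and a higher chance of false detection on a lossy link.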

Thanks,
Mathi.

----- [email protected] wrote:

> Is there any approach to hasten this detection, because 4 seconds is
> too long for some use cases?
>
> Br,
>
> Ted
>
> -----Original Message-----
> From: A V Mahesh [mailto:[email protected]]
> Sent: Monday, March 30, 2015 12:29 PM
> To: [email protected]
> Subject: Re: [users] how long it takes to detect node sudden power
> loss
>
> Hi,
>
>  >>Does that mean it needs 2 + 2*1 = 4s before the peer can detect the
> node connection loss if I suddenly unplug power supply of one node?
> Yes, when the connection goes down (cable disconnected or power supply
> unplugged), the loss of connection is detected within 4 seconds.
>
>   -AVM
>
> On 3/29/2015 7:11 PM, Yao Cheng LIANG wrote:
> > Dear all,
> >
> > When using TCP, the underlying dtms uses TCP keepalive to detect
> connection loss. If my dtmd.conf is as below:
> >
> > DTM_TCP_KEEPIDLE_TIME=2
> >
> > DTM_TCP_KEEPALIVE_INTVL=1
> >
> > DTM_TCP_KEEPALIVE_PROBES=2
> >
> > Does that mean it needs 2 + 2*1 = 4s before the peer can detect the
> node connection loss if I suddenly unplug power supply of one node?
> >
> > Thanks.
> >
> > Ted
> >
> >
> ----------------------------------------------------------------------
> > -------- Dive into the World of Parallel Programming The Go Parallel
>
> > Website, sponsored by Intel and developed in partnership with
> Slashdot
> > Media, is your hub for all things parallel software development,
> from
> > weekly thought leadership blogs to news, videos, case studies,
> > tutorials and more. Take a look and join the conversation now.
> > http://goparallel.sourceforge.net/
> > _______________________________________________
> > Opensaf-users mailing list
> > [email protected]<mailto:[email protected]>
> > https://lists.sourceforge.net/lists/listinfo/opensaf-users


