Also, I must point out the importance of having a redundant network 
connection between the nodes; otherwise it will be a single point of 
failure. Is your network duplicated?

/ Anders Widell

On 09/24/2015 12:21 PM, Mathivanan Naickan Palanivelu wrote:
> Hi,
>
> Note that FMS_PROMOTE_ACTIVE_TIMER and opensaf_reboot scripts are two 
> platform adaptation attributes in
> OpenSAF w.r.t failover and fencing. An OpenSAF user can customize these in 
> their deployments.
>
> Upon receiving connection loss indication with the active controller,
> the STANDBY controller starts this promote active timer (see 
> FMS_PROMOTE_ACTIVE_TIMER in /etc/opensaf/fmd.conf).
> This timer acts as a tolerance mechanism to handle or differentiate temporary 
> link-flaps and false-positives
> in your network.
> Upon expiry of this timer, the STANDBY invokes opensaf_reboot script (with 
> the intention to reboot
> the ACTIVE node) and subsequently promotes itself to ACTIVE.
>
> The opensaf_reboot script is an integration point for the OpenSAF user. So, 
> during failover
> when this opensaf_reboot script is invoked the node information (node_id, PLM 
> ee name) of the
> peer ACTIVE node is passed as input to this script.
> Inside this script, the user can modify so as to invoke 'commands' that will 
> perform remote reboots
> of the old ACTIVE node.
> The 'commands' here could be an IPMI command or any STONITH agent/command.
>
> Cheers,
> Mathi.
>
> ----- [email protected] wrote:
>
>> When a system gets into split-brain scenario, both controllers assume
>> active role. How does a payload node distinguish which controller it
>> is associated to? Is there a way that we find out which payload nodes
>> connect to which controller?
>>
>> Our cluster needs to provide service 24x7.  So restarting the cluster
>> is not possible when this situation occurs.  What is the best way to
>> correct a split-brain situation? If we stop and restart one of the
>> controller nodes to allow it to rejoin the other controller, should we
>> also restart the payload nodes associated to that controller? Those
>> payload nodes should be stopped before stopping their associated
>> controller node, correct?
>>
>> Shu Wang
>>
>>
>>
>>
>> ________________________________
>> The information transmitted herein is intended only for the person or
>> entity to which it is addressed and may contain confidential,
>> proprietary and/or privileged material. Any review, retransmission,
>> dissemination or other use of, or taking of any action in reliance
>> upon, this information by persons or entities other than the intended
>> recipient is prohibited. If you received this in error, please contact
>> the sender and delete the material from any computer.
>> ------------------------------------------------------------------------------
>> Monitor Your Dynamic Infrastructure at Any Scale With Datadog!
>> Get real-time metrics from all of your servers, apps and tools
>> in one place.
>> SourceForge users - Click here to start your Free Trial of Datadog
>> now!
>> http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140
>> _______________________________________________
>> Opensaf-users mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/opensaf-users
> ------------------------------------------------------------------------------
> Monitor Your Dynamic Infrastructure at Any Scale With Datadog!
> Get real-time metrics from all of your servers, apps and tools
> in one place.
> SourceForge users - Click here to start your Free Trial of Datadog now!
> http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140
> _______________________________________________
> Opensaf-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/opensaf-users
>


------------------------------------------------------------------------------
Monitor Your Dynamic Infrastructure at Any Scale With Datadog!
Get real-time metrics from all of your servers, apps and tools
in one place.
SourceForge users - Click here to start your Free Trial of Datadog now!
http://pubads.g.doubleclick.net/gampad/clk?id=241902991&iu=/4140
_______________________________________________
Opensaf-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/opensaf-users

Reply via email to