Re: [ofa-general] ofed 1.3.2 opensmd failover

2009-08-26 Thread PN
2009/8/26 Hal Rosenstock

> On 8/25/09, PN wrote:
>>
>> Hi,
>>
>> I can think of a situation in which all servers have dual-port IB cards
>> and need failover of OpenSM to achieve HA.
>> As far as I know, OpenSM can only bind to one port at a time,
>
> Yes.
>
>> so do I need to start 2 OpenSM on server A and 2 OpenSM on server B?
>
> That would be one valid configuration. I'm assuming all ports are connected
> to the same subnet.
>

In some cases, I will use IB bonding. In other cases, I may use one port
for computation and the other port to connect to the storage.
I'm not sure which configuration will provide better performance.


>> Will they use the same guid2lid file?
>
> Depends on how the OpenSM configuration is done.
>
>> Do I need to set something in the config file, or will they automatically
>> communicate with each other?
>
> What communication are you referring to? They all need to share the same
> subnet prefix.
>

I mean the handover mechanism. I remember that the previous OpenSM config
file (in OFED 1.2.x/1.3.x) had a field listing all the subnet managers in the
subnet, but this field is omitted in the new version. I wonder whether all
the OpenSM instances will automatically discover each other and perform the
handover correctly.

Thanks.

PN




-- 
Best Regards,
PN Lai
HPC Specialist
Galactic Computing Corp.
Tel: 86-755-26733939 ext 826
Mobile: 86-13823161729
Fax: 86-755-26733780
URL: http://www.galactic.com.hk
___
general mailing list
[email protected]
http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

Re: [ofa-general] ofed 1.3.2 opensmd failover

2009-08-26 Thread Hal Rosenstock
On 8/25/09, PN wrote:
>
> Hi,
>
> I can think of a situation in which all servers have dual-port IB cards and
> need failover of OpenSM to achieve HA.
> As far as I know, OpenSM can only bind to one port at a time,

Yes.

> so do I need to start 2 OpenSM on server A and 2 OpenSM on server B?

That would be one valid configuration. I'm assuming all ports are connected
to the same subnet.
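
[Editor's sketch] Since each OpenSM instance binds to a single port, the "two instances per server" layout amounts to one opensm invocation per local port GUID. The Python below only builds the command lines and executes nothing. The helper name `opensm_commands` and the GUID values are hypothetical; the `--guid`, `--priority` and `--daemon` options are taken from the opensm man page as I understand it and should be checked against the installed version. Log file and cache locations for multiple instances on one host are deliberately left out.

```python
def opensm_commands(port_guids, priority=0):
    """Build one opensm argv per local port GUID (nothing is executed).

    port_guids: iterable of integer port GUIDs (hypothetical values here).
    priority:   SM priority passed to every instance.
    """
    commands = []
    for guid in port_guids:
        commands.append([
            "opensm",
            "--guid", f"0x{guid:016x}",    # bind this instance to one port
            "--priority", str(priority),   # used when master/standby is decided
            "--daemon",                    # run in the background
        ])
    return commands


if __name__ == "__main__":
    # Two ports of one dual-port HCA (made-up GUIDs).
    for argv in opensm_commands([0x0002C9030000ABC1, 0x0002C9030000ABC2]):
        print(" ".join(argv))
```

The same list would be generated on server B with its own port GUIDs, giving four instances in total for the configuration discussed above.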

> Will they use the same guid2lid file?

Depends on how the OpenSM configuration is done.

> Do I need to set something in the config file, or will they automatically
> communicate with each other?

What communication are you referring to? They all need to share the same
subnet prefix.

> Do I need to run sldd.sh manually, or will it automatically sync with the
> other OpenSM instances?

You can manually copy the guid2lid file around to the appropriate places.
I'm not that familiar with sldd.sh, but I think it can either be run
manually or made to run automatically; I don't know the details.

-- Hal



Re: [ofa-general] ofed 1.3.2 opensmd failover

2009-08-25 Thread PN
Hi,

I can think of a situation in which all servers have dual-port IB cards and
need failover of OpenSM to achieve HA.
As far as I know, OpenSM can only bind to one port at a time, so do I need to
start 2 OpenSM on server A and 2 OpenSM on server B? Will they use the same
guid2lid file? Do I need to set something in the config file, or will they
automatically communicate with each other?
Do I need to run sldd.sh manually, or will it automatically sync with the
other OpenSM instances?

Thanks a lot.

Regards,
PN








Re: [ofa-general] ofed 1.3.2 opensmd failover

2009-08-25 Thread Hal Rosenstock
On 8/25/09, [email protected] wrote:
>
> Hi all,
>
> Quick question - is there a need to run anything except opensmd daemons to
> provide failover capability on an IB network in OFED 1.3?

In terms of SM failover, modulo bugs fixed relative to this feature since
OFED 1.3 (there are a couple of things here which may affect your
environment, if I recall correctly), you only need to run more than one SM
for this (one will become master, the other standby).
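
[Editor's sketch] To illustrate the master/standby outcome: under the InfiniBand architecture's election rule, the SM with the highest priority becomes master and ties go to the lowest port GUID. The Python below models only that ordering. The `SubnetManager` type, the `elect_master` helper, and the names and GUIDs are hypothetical, and the SMInfo exchange that OpenSM actually performs during handover is not shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubnetManager:
    name: str        # illustrative label only
    port_guid: int   # GUID of the port the SM is bound to
    priority: int    # SM priority (higher wins)


def elect_master(sms):
    """Return (master, standbys) for a non-empty list of SMs.

    Ordering: highest priority first, then lowest port GUID.
    """
    ranked = sorted(sms, key=lambda sm: (-sm.priority, sm.port_guid))
    return ranked[0], ranked[1:]


if __name__ == "__main__":
    a = SubnetManager("server-a", 0x0002C90300001001, priority=10)
    b = SubnetManager("server-b", 0x0002C90300002001, priority=5)
    master, standbys = elect_master([a, b])
    print("master:", master.name)
    print("standby:", [s.name for s in standbys])
```

If the current master disappears, running the same function over the remaining SMs gives the one expected to take over.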

> I'm aware that when the master manager dies the standby one comes in and
> manages the network, but that does not necessarily mean that LIDs are
> preserved, especially for nodes joining in. I used to run sldd.sh for
> distributing the LID list on OFED 1.2.5, but while this script seems to be
> in place, no one mentions the necessity for it.
>
> So subnet manager failover is provided by running a standby opensm.
>
> And how is LID preservation provided?

If you want LIDs to be preserved, the guid2lid file needs to be sync'd
(copied from the master SM, once it's fully assembled, to the node which is
running the standby SM). That's what the sldd.sh script does.
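
[Editor's sketch] The sync step described above can be pictured as "copy the master's guid2lid file to the standby whenever the mapping differs". The Python below does this between two local paths as a minimal sketch; it is not how sldd.sh itself is implemented. It assumes each line of the file holds three hexadecimal fields (port GUID, minimum LID, maximum LID), which should be checked against a real file. The function names and paths are hypothetical, and on a real cluster the copy would have to cross hosts.

```python
import os
import shutil
import tempfile


def parse_guid2lid(text):
    """Map port GUID -> (lid_min, lid_max).

    Assumes 'GUID LID_MIN LID_MAX' hex fields per line; blank lines,
    '#' comments and lines with another field count are skipped.
    """
    table = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 3 or fields[0].startswith("#"):
            continue
        guid, lid_min, lid_max = (int(f, 16) for f in fields)
        table[guid] = (lid_min, lid_max)
    return table


def sync_guid2lid(master_path, standby_path):
    """Copy the master's file over the standby's copy if the mappings differ.

    Returns True when the standby copy was (re)written.
    """
    with open(master_path) as f:
        master = parse_guid2lid(f.read())
    try:
        with open(standby_path) as f:
            standby = parse_guid2lid(f.read())
    except FileNotFoundError:
        standby = None  # no copy yet, so always sync
    if master == standby:
        return False
    # Write to a temporary file in the target directory, then rename, so a
    # reader never sees a half-written guid2lid file.
    dst_dir = os.path.dirname(os.path.abspath(standby_path))
    fd, tmp = tempfile.mkstemp(dir=dst_dir)
    os.close(fd)
    shutil.copyfile(master_path, tmp)
    os.replace(tmp, standby_path)
    return True
```

Run periodically (for example from cron), this keeps the standby's view of the GUID-to-LID assignments close to the master's, so a takeover can reuse the same LIDs.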

-- Hal

> Regards,
>
> Zdenek Kovlensky