[Lustre-discuss] Recommended failover software for Lustre

2012-07-16 Thread Christopher J.Walker
The configuring failover section in the Whamcloud release of the
Lustre manual seems rather out of date:

http://build.whamcloud.com/job/lustre-manual/lastSuccessfulBuild/artifact/lustre_manual.html#configuringfailover

The Oracle release says much the same thing:
http://wiki.lustre.org/manual/LustreManual20_HTML/ConfiguringFailover.html#50540588_50628

In section 11.1.1 Power management software, it says:

For more information about PowerMan, go to:
https://computing.llnl.gov/linux/powerman.html;

Which no longer exists. It should probably point at
http://code.google.com/p/powerman/


Then in section 11.2. Setting up High-Availability (HA) Software with
Lustre it mentions Red Hat Cluster Manager  and Pacemaker.

Red Hat Cluster Manager points to
http://wiki.lustre.org/index.php/Using_Red_Hat_Cluster_Manager_with_Lustre

which says In comparison with other HA solutions, RedHat Cluster as in
RHEL 5.5 is an old HA solution. We recommend using other HA solutions
like Pacemaker, if possible. 

The pacemaker link:
http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre

Although the title of this is Using Pacemaker with Lustre, it starts
off by saying In modern clusters, OpenAIS, or more specifically, its
communication stack corosync, is used for this task.


In summary:

1) The manual could do with some updating here.

2) I suspect I should be using corosync.

Chris



___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss


Re: [Lustre-discuss] Recommended failover software for Lustre

2012-07-16 Thread Cliff White
Thanks, we've created http://jira.whamcloud.com/browse/LUDOC-69 to track
the fixes to the manual.
cliffw


On Mon, Jul 16, 2012 at 4:23 AM, Christopher J.Walker c.j.wal...@qmul.ac.uk
 wrote:

 The configuring failover section in the Whamcloud release of the
 Lustre manual seems rather out of date:


 http://build.whamcloud.com/job/lustre-manual/lastSuccessfulBuild/artifact/lustre_manual.html#configuringfailover

 The Oracle release says much the same thing:

 http://wiki.lustre.org/manual/LustreManual20_HTML/ConfiguringFailover.html#50540588_50628

 In section 11.1.1 Power management software, it says:

 For more information about PowerMan, go to:
 https://computing.llnl.gov/linux/powerman.html;

 Which no longer exists. It should probably point at
 http://code.google.com/p/powerman/


 Then in section 11.2. Setting up High-Availability (HA) Software with
 Lustre it mentions Red Hat Cluster Manager  and Pacemaker.

 Red Hat Cluster Manager points to
 http://wiki.lustre.org/index.php/Using_Red_Hat_Cluster_Manager_with_Lustre

 which says In comparison with other HA solutions, RedHat Cluster as in
 RHEL 5.5 is an old HA solution. We recommend using other HA solutions
 like Pacemaker, if possible. 

 The pacemaker link:
 http://wiki.lustre.org/index.php/Using_Pacemaker_with_Lustre

 Although the title of this is Using Pacemaker with Lustre, it starts
 off by saying In modern clusters, OpenAIS, or more specifically, its
 communication stack corosync, is used for this task.


 In summary:

 1) The manual could do with some updating here.

 2) I suspect I should be using corosync.

 Chris



 ___
 Lustre-discuss mailing list
 Lustre-discuss@lists.lustre.org
 http://lists.lustre.org/mailman/listinfo/lustre-discuss




-- 
cliffw
Support Guy
WhamCloud, Inc.
www.whamcloud.com
___
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss