I'm not familiar with using FLR to tolerate OSS failures. My site uses HA pairs with shared storage instead. That approach is described, somewhat briefly, in the manual:
https://doc.lustre.org/lustre_manual.xhtml#configuringfailover
More Pacemaker-specific detail is available at https://wiki.lustre.org/Creating_a_Framework_for_High_Availability_with_Pacemaker and https://wiki.lustre.org/Creating_Pacemaker_Resources_for_Lustre_Storage_Services
