Which version of lustre are you using?  We initially has problem with this too 
when using failover with lustre 2.8 and 2.9.  We got a patch that fixed it and 
recent versions work fine for us.  We have a combined MGS/MDS so our scenario 
is a little different but this sounds very similar to our issue.

http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/2017-January/014125.html



From: lustre-discuss <[email protected]> on behalf of 
"Mannthey, Keith" <[email protected]>
Date: Wednesday, October 25, 2017 at 11:30 AM
To: Ravi Konila <[email protected]>, Lustre Discuss 
<[email protected]>
Subject: Re: [lustre-discuss] MGS is not working in HA

Kavi,
  You may want to open a jira ticket with this error.  It looks like the mount 
command is only trying only the first nid of the mount command.

Jira is https://jira.hpdd.intel.com “LU” project.

I have seen Lustre Servers first mount behave like this but not client mounts.  
It should try the first server, timeout and try the 2nd server.

Thanks,
Keith

From: lustre-discuss [mailto:[email protected]] On Behalf 
Of Ravi Konila
Sent: Wednesday, October 25, 2017 5:07 AM
To: Lustre Discuss <[email protected]>
Subject: [lustre-discuss] MGS is not working in HA

Hi
I have two servers for MGS/MDS and have configured it pacemaker for HA.
The command which I gave on first MGS/MDS mds01 is

mkfs.lustre --mgs --failnode 192.168.0.51@o2ib<mailto:192.168.0.51@o2ib> 
--backfstype=ldiskfs /dev/mapper/mpathd

Next I created lustre filesystem for MDT
mkfs.lustre --mdt --fsname lhome --index 0 --mgsnode 
192.168.0.50@o2ib<mailto:192.168.0.50@o2ib> --mgsnode 
192.168.0.51@o2ib<mailto:192.168.0.51@o2ib> --servicenode 
192.168.0.50@o2ib<mailto:192.168.0.50@o2ib> --servicenode 
192.168.0.51@o2ib<mailto:192.168.0.51@o2ib> --backfstype=ldiskfs 
/dev/mapper/mpathb

Now, in my client, If I give
mount –t lustre 
192.168.0.50@o2ib:192.168.0.51@o2ib:/lhome<mailto:192.168.0.50@o2ib:192.168.0.51@o2ib:/lhome>
 /home, it does not work and asks if MGS is running.
But if I give mount –t lustre 
192.168.0.50@o2ib:/lhome<mailto:192.168.0.50@o2ib:/lhome> /home it works fine.

Also when my first MDS (mds01) is down, my client is not mounting lustre from 
2nd MGS.
It says check if MGS is running?

Any help will be highly appreciated.

Regards
Ravi Konila
_______________________________________________
lustre-discuss mailing list
[email protected]
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to