Isaac, I followed your instructions, and now I can see the client on the second network on the MDS when I run:

lctl> network tcp1
lctl> peer_list
lctl> conn_list

However ... when I attempt to mount, I get this error on the client:

-- client --
mount -t lustre [EMAIL PROTECTED]:/lustre /mnt/lustre
mount.lustre: mount [EMAIL PROTECTED]:/lustre at /mnt/lustre failed: No such file or directory
Is the MGS specification correct?
Is the filesystem name correct?
If upgrading, is the copied client log valid? (see upgrade docs)
-- client --

This is what turns up on the MDS (via dmesg):

-- mds --
LustreError: 14098:0:(socklnd_cb.c:2167:ksocknal_recv_hello()) Error -104 reading HELLO from 172.16.128.100
LustreError: 11b-b: Connection to [EMAIL PROTECTED] at host 172.16.128.100 on port 988 was reset: is it running a compatible version of Lustre and is [EMAIL PROTECTED] one of its NIDs?
-- mds --

If I initially constructed my file system to use failover MDS, do I have to specify it in my mount command? Is there a way to query the creation-time flags and options set on a particular file system so I can see if I am indeed attempting to talk to the MGS as well?

thanks,
Klaus

On 11/15/07 1:11 AM, "Isaac Huang" <[EMAIL PROTECTED]> did etch on stone tablets:

> On Wed, Nov 14, 2007 at 06:23:36PM -0800, Klaus Steden wrote:
> [......]
>> And on the MDS side, here's what I see in the output of 'dmesg':
>>
>> -- mds --
>> LustreError: 120-3: Refusing connection from 172.16.128.100 for
>> [EMAIL PROTECTED]: No matching NI
>> -- mds --
>>
>> I was initially using this in my modprobe.conf:
>>
>> -- modprobe.conf --
>> options lnet networks=tcp0(eth0,bond0)
>> -- modprobe.conf --
>>
>
> This only gave the MDS one NID: [EMAIL PROTECTED], i.e. IP address of
> the 1st interface specified was used to generate the NID.
>
>> where 'eth0' is attached to 172.16.129.0/24, and 'bond0' is attached to
>> 172.16.128.0/24.
>>
>
> In your case, [EMAIL PROTECTED]
>
>> What's happening here, and where do I look for information on how to fix it?
>>
>
> When the client tried to reach the MDS at [EMAIL PROTECTED], the MDS
> refused the connection since [EMAIL PROTECTED] wasn't one of its NIDs.
>
> If they're two separate networks, just give the MDS two NIDs:
> options lnet networks='tcp0(eth0),tcp1(bond0)'
>
> And for clients on eth0's network:
> options lnet networks='tcp0(eth?)'
>
> At last clients on bond0's network:
> options lnet networks='tcp1(eth?)'
>
>
> HTH,
> Isaac
_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss
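
For anyone reading this thread later, here is a rough Python sketch of the rule Isaac describes: each network listed in the lnet "networks" option yields one NID, built from the IP of the first interface listed for that network. This is only a model of that rule, not LNet's actual code. The function names (parse_networks, derive_nids) and the two IP addresses are hypothetical, since the real MDS addresses are redacted in the archive. The sketch also writes the network name exactly as typed; LNet itself may print it differently (for example "tcp" for "tcp0").

```python
import re


def parse_networks(spec):
    """Split an lnet 'networks' string into [(network, [interfaces]), ...]."""
    entries = []
    for match in re.finditer(r"(\w+)\(([^)]*)\)", spec):
        net = match.group(1)
        ifaces = [name.strip() for name in match.group(2).split(",") if name.strip()]
        entries.append((net, ifaces))
    return entries


def derive_nids(spec, iface_ips):
    """Model of the rule in the thread: one NID per network, taken from the
    IP of the first interface listed for that network."""
    return [f"{iface_ips[ifaces[0]]}@{net}" for net, ifaces in parse_networks(spec)]


if __name__ == "__main__":
    # Hypothetical addresses: only the subnets (172.16.129.0/24 on eth0,
    # 172.16.128.0/24 on bond0) come from the thread.
    mds_ips = {"eth0": "172.16.129.1", "bond0": "172.16.128.1"}

    # Original setting: both interfaces under tcp0, so only one NID results.
    print(derive_nids("tcp0(eth0,bond0)", mds_ips))

    # Suggested setting: one network per interface, so two NIDs result.
    print(derive_nids("tcp0(eth0),tcp1(bond0)", mds_ips))
```

Under the original setting the model produces a single NID on eth0's subnet, which matches the "No matching NI" refusal seen when a client on bond0's subnet tried to connect.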
