Hi Amy,
You could try first unmount all ost, mgs, etc, and redo a tunefs on
each relevant disk:
tunefs.lustre --writeconf --mgs --mdt --fsname=lufs DISKNAME
tunefs.lustre --erase-param --mgsnode=10.0.38....@tcp0 --writeconf
DISKNAME
Best Regards,
Jiawei
On Aug 10, 2009, at 3:56 PM, Lee Amy wrote:
On Mon, Aug 10, 2009 at 9:32 AM, Lee Amy<openlinuxsou...@gmail.com>
wrote:
---------- Forwarded message ----------
From: Lee Amy <openlinuxsou...@gmail.com>
Date: Mon, Aug 10, 2009 at 9:32 AM
Subject: Re: [Lustre-discuss] Help: NIC Changed Error
To: Rhys McMurdo <r...@mcmurdo.id.au>
On Mon, Aug 10, 2009 at 6:14 AM, Rhys McMurdo<r...@mcmurdo.id.au>
wrote:
Hi Amy,
You may want to try the following options in your /etc/modprobe.conf
options lnet networks=tcp0(eth1)
Regards,
Rhys
2009/8/8 Lee Amy <openlinuxsou...@gmail.com>
Hi,
I'm a Lustre newbie. The server I set up is combined MGS/MDT file
system on a block device. And set up OST on a block device. I set
up
MGS/MDT and OST in the same machine by using 2 disks. The NID is
10.0.38....@tcp, and the address 10.0.38.102 was assigned to
eth0. One
day I noticed the eth0 is broken so I use another NIC eth1 then
assign
IP address 10.0.38.102 to this card.
Then I use client the mount the server Lustre FS by following
command.
mount -t lustre 10.0.38....@tcp:/ericlfs /mnt/foobar
It reported following error messages.
Lustre: Request x1310428982411274 sent from mgc10.0.38....@tcp to
NID
10.0.38....@tcp 5s ago has timed out (limit 5s).
LustreError: 4397:0:(client.c:792:ptlrpc_import_delay_req()) @@@
IMP_INVALID r...@ffff81002cb7d800 x1310428982411276/t0
o501->m...@mgc10.0.38.102@tcp_0:26/25 lens 264/432 e 0 to 1 dl 0
ref 1
fl Rpc:/0/0 rc 0/0
LustreError: 15c-8: mgc10.0.38....@tcp: The configuration from log
'ericlfs-client' failed (-108). This may be the result of
communication errors between this node and the MGS, a bad
configuration, or other errors. See the syslog for more
information.
LustreError: 4397:0:(llite_lib.c:1169:ll_fill_super()) Unable to
process log: -108
Lustre: client ffff81002bd17400 umount complete
mount.lustre: mount 10.0.38....@tcp:/ericlfs at /mnt failed: Cannot
send after transport endpoint shutdown
So I feel a little confused. Is this problem caused by I replace
the
NIC card? And furthermore, how do I fix that problem?
Thank you very much.
Best Regards,
Amy
_______________________________________________
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss
_______________________________________________
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss
Thanks very much. Anyway, my nid is 10.0.38....@tcp, not
10.0.38....@tcp0. If I add the above item in /etc/modprobe.conf I
don't know whether it will affect something wrong.
Could you tell me what's the difference between tcp and tcp?
Thank you very much.
Regards,
Amy
Hi,
It seems this method cannot solve my problem. My NID is
10.0.38....@tcp, and furthermore when I add the item
options lnet network=tcp0(eth1)
I still encountered the same problem and after this failure I change
this item back to
options lnet network=tcp
That still got failure. So I really feel very confused about that.
When I installed Lustre the NID is 10.0.68....@tcp. not tcp0 suffix.
Could someone tell me how to fix that problem?
Thank you very much.
Regards,
Amy
_______________________________________________
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss
_______________________________________________
Lustre-discuss mailing list
Lustre-discuss@lists.lustre.org
http://lists.lustre.org/mailman/listinfo/lustre-discuss