Thanks - tunefs worked fine on MDS/MGT combo server.
On OSS tunefs fails with "unsupported features" message. Both MDS/MGT
and OSS are running 2.6.9-42 kernel.
I'd expect tunefs to fail on both servers if the kernel is to old. I'm
using Lustre 1.5.95.
Tim
On OSS:
[EMAIL PROTECTED] ~]# umount /dev/sdc1
[EMAIL PROTECTED] ~]# tunefs.lustre --writeconf /dev/sdc1
checking for existing Lustre data
/dev/sdc1: Filesystem has unsupported feature(s) while opening
filesystem
In all likelihood, the 'unsupported feature' is 'extents', which
older debugfs does not understand.
Use e2fsprogs-1.38-cfs1 or later, available from
ftp://ftp.lustre.org/pub/lustre/other/e2fsprogs/
found Lustre data
tunefs.lustre: Unable to read CONFIGS/mountdata (No such file or
directory).
Contents of CONFIGS:
Trying last_rcvd
tunefs.lustre: Unable to read old data
tunefs.lustre FATAL: Failed to read previous Lustre data from
/dev/sdc1
[EMAIL PROTECTED] ~]# uname -a
Linux Redhat166 2.6.9-42.EL_lustre.1.5.95smp #1 SMP Thu Sep 28
06:36:13 MDT 2006 i686 i686 i386 GNU/Linux
[EMAIL PROTECTED] ~]# tunefs.lustre -h
tunefs.lustre v1.5.95
usage: tunefs.lustre <target types> [options] <device>
On MDS /MGT:
[EMAIL PROTECTED] ~]# umount /dev/sdb1
[EMAIL PROTECTED] ~]# tunefs.lustre --writeconf /dev/sdb1
checking for existing Lustre data
found Lustre data
Reading CONFIGS/mountdata
Read previous values:
Target: test-MDT0000
Index: 0
Lustre FS: test
Mount type: ldiskfs
Flags: 0x5
(MDT MGS )
Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
Parameters:
Permanent disk data:
Target: test-MDT0000
Index: 0
Lustre FS: test
Mount type: ldiskfs
Flags: 0x105
(MDT MGS writeconf )
Persistent mount opts: errors=remount-ro,iopen_nopriv,user_xattr
Parameters:
Writing CONFIGS/mountdata
[EMAIL PROTECTED] ~]# uname -a
Linux Redhat101 2.6.9-42.EL_lustre.1.5.95smp #1 SMP Thu Sep 28
06:36:13 MDT 2006 i686 i686 i386 GNU/Linux
-----Original Message-----
From: Nathaniel Rutman [mailto:[EMAIL PROTECTED]
Sent: Friday, August 03, 2007 7:55 PM
To: Snider, Tim
Cc: [email protected]
Subject: Re: [Lustre-discuss] Problems switching the OSS and getting
Lustre to restart correctly.
Assuming [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> is the old OSS,
the ptlrpc_expire_one_request()) @@@ timeout messages mean that the
client / MDT was trying and failing to talk to the old server.
You need to tell Lustre to regenerate the configuration logs using
'tunefs.lustre --writeconf' -- see
http://wiki.lustre.org/index.php?title=Mount_Conf#Changing_a_server_nid
Snider, Tim wrote:
> I have a simple configuration where I'd like to switch the OSS to a
> different server. The OST is on external storage and will remain the
> same. I'll switch cables to the storage between servers. The MDS, MGT
> and client remain the same. After rebooting all machines, Lustre
> seems to start correctly again on the MDS/MGT and OSS - no console
> messages. I can also mount the client without any console errors,
> however an ls command on the client mounted device hangs.
>
> entries in /var/log/messages on the MDS indicate there was an error
> from the old OSS - which isn't involved in the Lustre configuration at
> this point:
> Lustre: 5012:0:(peer.c:238:lnet_debug_peer())
> [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]> 2 up
> 8 8 8 8 7 0
> OLD OSS = 172.22.14.245 (not in use - but still on the network)
> Current OSS IP = 172.22.14.166
> MDS/MGT = 172.22.14.101
> Client = 172.22.14.100
>
> How do you properly switch out the OSS and restart using the same
OSTs?
> Thanks
> Tim
>
> Jul 31 17:54:58 Redhat101 kernel: Lustre:
> 4808:0:(module.c:382:init_libcfs_module()) maximum lustre stack 8192
> Jul 31 17:54:58 Redhat101 kernel: Lustre: OBD class driver,
> [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
> Jul 31 17:54:58 Redhat101 kernel: Lustre Version: 1.5.95
> Jul 31 17:54:58 Redhat101 kernel: Build Version:
> 1.5.95-19691231170000-PRISTINE-.testsuite.tmp.boulder.lbuild-boulder.B
> UILD.lustre-kernel-2.6.9.lustre.linux-2.6.9-42.EL_lustre.1.5.95smp
> Jul 31 17:54:58 Redhat101 kernel: Lustre: Added LNI [EMAIL PROTECTED]
> <mailto:[EMAIL PROTECTED]> [8/256] Jul 31 17:54:58 Redhat101 kernel:
> Lustre: Accept secure, port 988 Jul 31 17:54:58 Redhat101 kernel:
> Lustre: Lustre Client File System; [EMAIL PROTECTED]
> <mailto:[EMAIL PROTECTED]>
> Jul 31 17:54:58 Redhat101 kernel: Lustre: mount data:
> Jul 31 17:54:58 Redhat101 kernel: Lustre: device: /dev/sdb1
> Jul 31 17:54:58 Redhat101 kernel: Lustre: flags: 0
> Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval
> 5 seconds
> Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
> Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with
> ordered data mode.
> Jul 31 17:54:59 Redhat101 kernel: Lustre: disk data:
> Jul 31 17:54:59 Redhat101 kernel: Lustre: server: test-MDT0000 Jul 31
> 17:54:59 Redhat101 kernel: Lustre: uuid:
> Jul 31 17:54:59 Redhat101 kernel: Lustre: fs: test
> Jul 31 17:54:59 Redhat101 kernel: Lustre: index: 0000
> Jul 31 17:54:59 Redhat101 kernel: Lustre: config: 2
> Jul 31 17:54:59 Redhat101 kernel: Lustre: flags: 0x5
> Jul 31 17:54:59 Redhat101 kernel: Lustre: diskfs: ldiskfs Jul 31
> 17:54:59 Redhat101 kernel: Lustre: options:
> errors=remount-ro,iopen_nopriv,user_xattr
> Jul 31 17:54:59 Redhat101 kernel: Lustre: params:
> Jul 31 17:54:59 Redhat101 kernel: Lustre: comment:
> Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval
> 5 seconds
> Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
> Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with
> ordered data mode.
> Jul 31 17:54:59 Redhat101 kernel: Lustre: 0 UP mgs MGS MGS 5
> Jul 31 17:54:59 Redhat101 kernel: Lustre: 1 UP mgc
> [EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>
> 874c230c-bc4b-f2df-7498-9680ca5495c6 6
> Jul 31 17:54:59 Redhat101 kernel: Lustre: 2 UP mdt MDS MDS_uuid 3
> Jul 31 17:54:59 Redhat101 kernel: Lustre: 3 UP lov test-mdtlov
> test-mdtlov_UUID 4
> Jul 31 17:54:59 Redhat101 kernel: Lustre: 4 UP mds test-MDT0000
> test-MDT0000_UUID 4
> Jul 31 17:54:59 Redhat101 kernel: Lustre: 5 UP osc test-OST0000-osc
> test-mdtlov_UUID 5
> Jul 31 17:55:01 Redhat101 crond(pam_unix)[5124]: session opened for
> user root by (uid=0) Jul 31 17:55:02 Redhat101 crond(pam_unix)[5124]:
> session closed for user root Jul 31 17:55:04 Redhat101 kernel: Lustre:
> 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]
> <mailto:[EMAIL PROTECTED]> 2 up 8 8 8
> 8 7 0
> Jul 31 17:55:29 Redhat101 kernel: LustreError:
> 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at
> 1185918924, 5s ago) Jul 31 17:55:29 Redhat101 kernel: LustreError:
> 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 2 previous
> similar messages Jul 31 17:55:29 Redhat101 kernel: Lustre:
> 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]
> <mailto:[EMAIL PROTECTED]> 2 up 8 8 8
> 8 7 0
> Jul 31 17:55:54 Redhat101 kernel: LustreError:
> 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at
> 1185918949, 5s ago) Jul 31 17:55:54 Redhat101 kernel: LustreError:
> 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous
> similar message Jul 31 17:55:54 Redhat101 kernel: Lustre:
> 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]
> <mailto:[EMAIL PROTECTED]> 2 up 8 8 8
> 8 7 0
> Jul 31 17:56:19 Redhat101 kernel: LustreError:
> 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at
> 1185918974, 5s ago) Jul 31 17:56:19 Redhat101 kernel: LustreError:
> 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous
> similar message Jul 31 17:56:19 Redhat101 kernel: Lustre:
> 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]
> <mailto:[EMAIL PROTECTED]> 2 up 8 8 8
> 8 7 0
> ----------------------------------------------------------------------
> --
>
> _______________________________________________
> Lustre-discuss mailing list
> [email protected]
> https://mail.clusterfs.com/mailman/listinfo/lustre-discuss
>
_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss