I have a simple configuration in which I'd like to move the OSS to a
different server. The OST is on external storage and will stay the
same; I'll just move the storage cables from the old server to the new
one. The MDS, MGT and client also stay the same. After rebooting all
machines, Lustre appears to start correctly on the MDS/MGT and on the
OSS, with no console messages. I can also mount the client without any
console errors; however, an ls on the client mount point hangs.
 
Entries in /var/log/messages on the MDS indicate an error involving the
old OSS, which is no longer part of the Lustre configuration:
        Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]           2    up     8     8     8     8     7 0

Old OSS     = 172.22.14.245 (not in use, but still on the network)
Current OSS = 172.22.14.166
MDS/MGT     = 172.22.14.101
Client      = 172.22.14.100
 
How do you properly switch out the OSS and restart using the same OSTs?
Thanks
Tim
 
Jul 31 17:54:58 Redhat101 kernel: Lustre:
4808:0:(module.c:382:init_libcfs_module()) maximum lustre stack 8192
Jul 31 17:54:58 Redhat101 kernel: Lustre: OBD class driver,
[EMAIL PROTECTED]
Jul 31 17:54:58 Redhat101 kernel:         Lustre Version: 1.5.95
Jul 31 17:54:58 Redhat101 kernel:         Build Version: 1.5.95-19691231170000-PRISTINE-.testsuite.tmp.boulder.lbuild-boulder.BUILD.lustre-kernel-2.6.9.lustre.linux-2.6.9-42.EL_lustre.1.5.95smp
Jul 31 17:54:58 Redhat101 kernel: Lustre: Added LNI [EMAIL PROTECTED]
[8/256]
Jul 31 17:54:58 Redhat101 kernel: Lustre: Accept secure, port 988
Jul 31 17:54:58 Redhat101 kernel: Lustre: Lustre Client File System;
[EMAIL PROTECTED]
Jul 31 17:54:58 Redhat101 kernel: Lustre:   mount data:
Jul 31 17:54:58 Redhat101 kernel: Lustre: device:  /dev/sdb1
Jul 31 17:54:58 Redhat101 kernel: Lustre: flags:   0
Jul 31 17:54:59 Redhat101 kernel: kjournald starting.  Commit interval 5
seconds
Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with
ordered data mode.
Jul 31 17:54:59 Redhat101 kernel: Lustre:   disk data:
Jul 31 17:54:59 Redhat101 kernel: Lustre: server:  test-MDT0000
Jul 31 17:54:59 Redhat101 kernel: Lustre: uuid:
Jul 31 17:54:59 Redhat101 kernel: Lustre: fs:      test
Jul 31 17:54:59 Redhat101 kernel: Lustre: index:   0000
Jul 31 17:54:59 Redhat101 kernel: Lustre: config:  2
Jul 31 17:54:59 Redhat101 kernel: Lustre: flags:   0x5
Jul 31 17:54:59 Redhat101 kernel: Lustre: diskfs:  ldiskfs
Jul 31 17:54:59 Redhat101 kernel: Lustre: options:
errors=remount-ro,iopen_nopriv,user_xattr
Jul 31 17:54:59 Redhat101 kernel: Lustre: params:
Jul 31 17:54:59 Redhat101 kernel: Lustre: comment:
Jul 31 17:54:59 Redhat101 kernel: kjournald starting.  Commit interval 5
seconds
Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with
ordered data mode.
Jul 31 17:54:59 Redhat101 kernel: Lustre:   0 UP mgs MGS MGS 5
Jul 31 17:54:59 Redhat101 kernel: Lustre:   1 UP mgc
[EMAIL PROTECTED] 874c230c-bc4b-f2df-7498-9680ca5495c6 6
Jul 31 17:54:59 Redhat101 kernel: Lustre:   2 UP mdt MDS MDS_uuid 3
Jul 31 17:54:59 Redhat101 kernel: Lustre:   3 UP lov test-mdtlov
test-mdtlov_UUID 4
Jul 31 17:54:59 Redhat101 kernel: Lustre:   4 UP mds test-MDT0000
test-MDT0000_UUID 4
Jul 31 17:54:59 Redhat101 kernel: Lustre:   5 UP osc test-OST0000-osc
test-mdtlov_UUID 5
Jul 31 17:55:01 Redhat101 crond(pam_unix)[5124]: session opened for user
root by (uid=0)
Jul 31 17:55:02 Redhat101 crond(pam_unix)[5124]: session closed for user
root
Jul 31 17:55:04 Redhat101 kernel: Lustre:
5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]           2
up     8     8     8     8     7 0
Jul 31 17:55:29 Redhat101 kernel: LustreError:
5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at
1185918924, 5s ago)
Jul 31 17:55:29 Redhat101 kernel: LustreError:
5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 2 previous
similar messages
Jul 31 17:55:29 Redhat101 kernel: Lustre:
5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]           2
up     8     8     8     8     7 0
Jul 31 17:55:54 Redhat101 kernel: LustreError:
5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at
1185918949, 5s ago)
Jul 31 17:55:54 Redhat101 kernel: LustreError:
5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous
similar message
Jul 31 17:55:54 Redhat101 kernel: Lustre:
5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]           2
up     8     8     8     8     7 0
Jul 31 17:56:19 Redhat101 kernel: LustreError:
5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at
1185918974, 5s ago)
Jul 31 17:56:19 Redhat101 kernel: LustreError:
5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous
similar message
Jul 31 17:56:19 Redhat101 kernel: Lustre:
5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED]           2
up     8     8     8     8     7 0
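
One thing that can be read straight from the log above: the "sent at"
values in the timeout errors are Unix epoch seconds, so they show how
often the request is being retried. Below is a minimal Python sketch of
that check. The excerpt is copied from the log above with the wrapped
lines rejoined, and the helper names (sent_times, gaps) are made up for
illustration; they are not part of any Lustre tooling.

```python
import re
from datetime import datetime, timezone

# Timeout lines copied from the MDS log above, with wrapped lines rejoined.
LOG = """\
Jul 31 17:55:29 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918924, 5s ago)
Jul 31 17:55:54 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918949, 5s ago)
Jul 31 17:56:19 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918974, 5s ago)
"""

# Matches the "(sent at <epoch>, <n>s ago)" part of a timeout message.
SENT_AT = re.compile(r"timeout \(sent at (\d+), (\d+)s ago\)")


def sent_times(log):
    """Return the epoch 'sent at' values, in the order they appear."""
    return [int(m.group(1)) for m in SENT_AT.finditer(log)]


def gaps(times):
    """Return the number of seconds between consecutive timestamps."""
    return [later - earlier for earlier, later in zip(times, times[1:])]


times = sent_times(LOG)
for t in times:
    # Print each epoch value alongside its UTC time for comparison.
    print(t, datetime.fromtimestamp(t, tz=timezone.utc).isoformat())
print("seconds between timeouts:", gaps(times))
```

The three requests were sent 25 seconds apart, which matches the spacing
of the syslog timestamps on the error lines (17:55:29, 17:55:54,
17:56:19).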
