Assuming [EMAIL PROTECTED] is the old OSS, the "ptlrpc_expire_one_request() @@@ timeout" messages mean that the client / MDT was trying, and failing, to talk to the old server.

The configuration logs still point at the old server's NID, so you need to tell Lustre to regenerate them using 'tunefs.lustre --writeconf' -- see
http://wiki.lustre.org/index.php?title=Mount_Conf#Changing_a_server_nid
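
As a rough sketch of what that procedure usually looks like (a dry run that only prints the commands, not a tested recipe for your setup): the MDT device /dev/sdb1, the filesystem name "test" and the MDS/MGT address are taken from your message. The OST device, all three mount points and the @tcp network suffix are placeholders I am assuming for illustration, so substitute your own values and check the wiki page above before running anything for real.

```shell
#!/bin/sh
# Dry-run sketch of the writeconf sequence: it only PRINTS the commands.
# MDT_DEV, FSNAME and MGS_IP come from the original message; everything
# else is a hypothetical placeholder to replace with your own values.
MDT_DEV=/dev/sdb1        # MDT/MGS device, from the MDS log
FSNAME=test              # "fs: test" in the MDS log
MGS_IP=172.22.14.101     # MDS/MGT address
OST_DEV=/dev/sdX1        # hypothetical OST device as seen on the new OSS
MDT_MNT=/mnt/mdt         # hypothetical mount points
OST_MNT=/mnt/ost0
CLIENT_MNT=/mnt/lustre

writeconf_plan() {
    # 1. Stop everything: client first, then the OST, then the MDT.
    echo "client:  umount $CLIENT_MNT"
    echo "new OSS: umount $OST_MNT"
    echo "MDS:     umount $MDT_MNT"
    # 2. Regenerate the configuration logs: MDT first, then each OST.
    echo "MDS:     tunefs.lustre --writeconf $MDT_DEV"
    echo "new OSS: tunefs.lustre --writeconf $OST_DEV"
    # 3. Restart in order: MDT/MGS, then the OST, then the client.
    #    The @tcp suffix is an assumption about the LNET network in use.
    echo "MDS:     mount -t lustre $MDT_DEV $MDT_MNT"
    echo "new OSS: mount -t lustre $OST_DEV $OST_MNT"
    echo "client:  mount -t lustre ${MGS_IP}@tcp:/$FSNAME $CLIENT_MNT"
}

writeconf_plan
```

The ordering is the point here: everything is unmounted before the logs are rewritten, and the MDT/MGS comes back up before the OST so that the OST re-registers from its new server.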

Snider, Tim wrote:
I have a simple configuration where I'd like to switch the OSS to a different server. The OST is on external storage and will remain the same; I'll switch the cables to the storage between servers. The MDS, MGT and client remain the same. After rebooting all machines, Lustre seems to start correctly again on the MDS/MGT and OSS, with no console messages. I can also mount the client without any console errors; however, an ls command on the client-mounted device hangs. Entries in /var/log/messages on the MDS indicate there was an error from the old OSS, which isn't involved in the Lustre configuration at this point:
Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED] 2 up 8 8 8 8 7 0
Old OSS = 172.22.14.245 (not in use, but still on the network)
Current OSS IP = 172.22.14.166
MDS/MGT = 172.22.14.101
Client = 172.22.14.100
How do you properly switch out the OSS and restart using the same OSTs?
Thanks
Tim
Jul 31 17:54:58 Redhat101 kernel: Lustre: 4808:0:(module.c:382:init_libcfs_module()) maximum lustre stack 8192
Jul 31 17:54:58 Redhat101 kernel: Lustre: OBD class driver, [EMAIL PROTECTED]
Jul 31 17:54:58 Redhat101 kernel:         Lustre Version: 1.5.95
Jul 31 17:54:58 Redhat101 kernel: Build Version: 1.5.95-19691231170000-PRISTINE-.testsuite.tmp.boulder.lbuild-boulder.BUILD.lustre-kernel-2.6.9.lustre.linux-2.6.9-42.EL_lustre.1.5.95smp
Jul 31 17:54:58 Redhat101 kernel: Lustre: Added LNI [EMAIL PROTECTED] [8/256]
Jul 31 17:54:58 Redhat101 kernel: Lustre: Accept secure, port 988
Jul 31 17:54:58 Redhat101 kernel: Lustre: Lustre Client File System; [EMAIL PROTECTED]
Jul 31 17:54:58 Redhat101 kernel: Lustre:   mount data:
Jul 31 17:54:58 Redhat101 kernel: Lustre: device:  /dev/sdb1
Jul 31 17:54:58 Redhat101 kernel: Lustre: flags:   0
Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval 5 seconds
Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Jul 31 17:54:59 Redhat101 kernel: Lustre:   disk data:
Jul 31 17:54:59 Redhat101 kernel: Lustre: server:  test-MDT0000
Jul 31 17:54:59 Redhat101 kernel: Lustre: uuid:
Jul 31 17:54:59 Redhat101 kernel: Lustre: fs:      test
Jul 31 17:54:59 Redhat101 kernel: Lustre: index:   0000
Jul 31 17:54:59 Redhat101 kernel: Lustre: config:  2
Jul 31 17:54:59 Redhat101 kernel: Lustre: flags:   0x5
Jul 31 17:54:59 Redhat101 kernel: Lustre: diskfs:  ldiskfs
Jul 31 17:54:59 Redhat101 kernel: Lustre: options: errors=remount-ro,iopen_nopriv,user_xattr
Jul 31 17:54:59 Redhat101 kernel: Lustre: params:
Jul 31 17:54:59 Redhat101 kernel: Lustre: comment:
Jul 31 17:54:59 Redhat101 kernel: kjournald starting. Commit interval 5 seconds
Jul 31 17:54:59 Redhat101 kernel: LDISKFS FS on sdb1, internal journal
Jul 31 17:54:59 Redhat101 kernel: LDISKFS-fs: mounted filesystem with ordered data mode.
Jul 31 17:54:59 Redhat101 kernel: Lustre:   0 UP mgs MGS MGS 5
Jul 31 17:54:59 Redhat101 kernel: Lustre: 1 UP mgc [EMAIL PROTECTED] 874c230c-bc4b-f2df-7498-9680ca5495c6 6
Jul 31 17:54:59 Redhat101 kernel: Lustre:   2 UP mdt MDS MDS_uuid 3
Jul 31 17:54:59 Redhat101 kernel: Lustre: 3 UP lov test-mdtlov test-mdtlov_UUID 4
Jul 31 17:54:59 Redhat101 kernel: Lustre: 4 UP mds test-MDT0000 test-MDT0000_UUID 4
Jul 31 17:54:59 Redhat101 kernel: Lustre: 5 UP osc test-OST0000-osc test-mdtlov_UUID 5
Jul 31 17:55:01 Redhat101 crond(pam_unix)[5124]: session opened for user root by (uid=0)
Jul 31 17:55:02 Redhat101 crond(pam_unix)[5124]: session closed for user root
Jul 31 17:55:04 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED] 2 up 8 8 8 8 7 0
Jul 31 17:55:29 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918924, 5s ago)
Jul 31 17:55:29 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 2 previous similar messages
Jul 31 17:55:29 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED] 2 up 8 8 8 8 7 0
Jul 31 17:55:54 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918949, 5s ago)
Jul 31 17:55:54 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous similar message
Jul 31 17:55:54 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED] 2 up 8 8 8 8 7 0
Jul 31 17:56:19 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) @@@ timeout (sent at 1185918974, 5s ago)
Jul 31 17:56:19 Redhat101 kernel: LustreError: 5012:0:(client.c:950:ptlrpc_expire_one_request()) Skipped 1 previous similar message
Jul 31 17:56:19 Redhat101 kernel: Lustre: 5012:0:(peer.c:238:lnet_debug_peer()) [EMAIL PROTECTED] 2 up 8 8 8 8 7 0
------------------------------------------------------------------------

_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss
