Bernd Schubert wrote:
On Monday 25 June 2007 20:15:09 Nathaniel Rutman wrote:
Bernd Schubert wrote:
Hi,
I'm just testing how well the upgrades do work and somehow I have a
problem:
On Ost1:
========
[ 271.985901] LustreError: Trying to start OBD lustre-OST0001_UUID using
the wrong disk ost1_UUID. Were the /dev/ assignments rearranged?
Well, that's the problem. Note the uuid is missing from the tunefs on
the OST, but not the MDS.
This a safety check to make sure you're using the right disk; it should
have been found when you did the initial tunefs upgrade to 1.6.
You can erase the last_rcvd file manually out the of OST disk to get
around this.
I think I figured out how this problem came up at all. When I did run
tunefs.lustre for the first time it told I have to specify the index, since
it couldn't detect the index itself. Following the common ost-numeration I
told tunefs.lustre OST1 has index=1 and OST2 has index=2. Seems I should have
specified index=0 and index=1, respectively.
After the deleting the last_rcvd files I could mount on the servers and on the
clients, but on the clients the files didn't appear properly and dmesg told
me OST0000 is missing for a file.
For the archives if someone should run into this in the future:
To get around this I had to run "tunefs.lustre --writeconf" on all nodes and
to delete the last_rcvd files on the OST nodes. To make sure there's no
corruption I also did run e2fsck on all systems. Specifying --writeconf also
allows to correct an already given ost index.
I still do not understand why tunefs.lustre couldn't detect the indices itslf.
The filesystems have been created with tools from lustre-1.4.9 and with
kernel modules from lustre-1.4.10.
Did you get the message "ost with unknown index" at the first tunefs?
The logic for all this is in mkfs_lustre.c read_local_files(). It
should be able to identify the
ost index from the last_rcvd files created in Lustre 1.4.6 on. A reason
it might not is if the last_rcvd
file was originally created in an earlier version of Lustre and upgraded
to 1.4.6+. If you could send me the original
last_rcvd, I'd be glad to take a look -- too bad I told you to erase it
before :(
_______________________________________________
Lustre-discuss mailing list
[email protected]
https://mail.clusterfs.com/mailman/listinfo/lustre-discuss