Hi ,
Im trying to figure out what is the best way to recover a failed OST , basicly 
we have 10 OST's
each has DRBD + HA on top of raid 6 so its kind of redundent and suppose to be 
solid
just want to notice for the other post that asked of that configuration that 
its working ok and the
performance is fairly ok considering that redundency is more important then 
full speed of the cluster at list in this case .

Regarding the backup strategy , we make a client backups to tapes of all the 
important stuff
and also a seperate backup of the OST files only to a USB HD (daily on each OST)
that backup is made possible by mounting the OST with -t ldiskfs insted of 
lustre the by running rsync to the USB HD
so the main thing i dont understand is if an OST failed as in hardware problem 
then 
to avoid full file system recovery from tapes there is a need to restore the 
OST only data from the USB drive or tapes
to the new OST , then the lustre procedure e.g
e2fsck -n -v --mdsdb /tmp/ostdb /dev/{ostdev}
on all OST's then
lfsck -n -v ............. /mnt/mainfs
the only things i see possibly is to write zero holes on files that were 
changed for example seens the last backup 
of the OST file system itslef with rsync to the USB drive or tapes
so baicly what will happen to a mysql table file that has inconcitency on its 
tripes
how is possible to restore it the best way possible , i realize that it must 
suffer some kind of data lose
but its better then loading the entire lustre file system backup wich will take 
days is some cases .

Thanks for any help .


----------------------------------------------------------
Outgoing messages are virus free checked by NOD32 system 
_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Reply via email to