I posted this to the lustre-discuss mailling list, but i have not heard anything all day, just wonder if anyone here might have an ideas?
We had a problem in the datacenter this morning where a bunch of servers went down hard, this included my lustre filesystem and just about every other machine in the building When i try to bring the MDS/MGS back online, it does mount, but an 'lctl dl' shows that everything is there and UP, but i know its not, because i have not mounted the OSS's. Is this some junk left over? Can/Should it be cleared out? If i go through and mount all the OSS's they mount and i can mount the filesystem on the client, but no ls or df of the mountpoint works I've tried various methods of recovery that i know of, but i can't seem to get the MDS/MGS to come up in what appears to be a clean state is there some magic command or file that needs to be deleted to abort everything and restart? Thanks _______________________________________________ Beowulf mailing list, [email protected] sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf
