I have a small Luster setup that's worked pretty well for the last year. A few weeks ago I posted that the OSS locks when it mounts a paticular OST. The Dilger Procedure was suggested and mounting with "-o abort_recovery" worked. This morning it crashed again and remounting the same OST and one other hard lock the system. I'm able to mount the newest problem child with abort_recovery but not the original trouble maker. I've made several tries at truncating the last_rcvd file but no luck. The system has about 30 seconds after being mounted before locking up.
Suggestions? The dump mentions dirty journal meta data as a possible culprit. Dan _______________________________________________ Lustre-discuss mailing list [email protected] http://lists.lustre.org/mailman/listinfo/lustre-discuss
