Hi list,

We had a problem with one of our OSTs a few days ago (I also posted a question about it here). After rebuilding the RAID partition, we used drbd to re-synchronize the data from the active OST to this backup one. We simply started drbd on the OST whose RAID partition had been rebuilt and connected it to drbd on a working OST. Everything looked fine, and the synchronization completed without reporting any errors.

But when we mounted this backup OST in our system, some of our web clients could not connect to it (the MDS and some others could). After a short time, many errors like these appeared in the message log on that OST:

  Nov 19 19:59:36 OST6 kernel: LDISKFS-fs error (device drbd6): ldiskfs_lookup: unlinked inode 159588368 in dir #261333022
  Nov 19 19:59:36 OST6 kernel: LustreError: 3893:0:(filter_lvb.c:90:filter_lvbo_init()) lustre-OST0006: bad object 996598/0: rc -2
  Nov 19 19:59:36 OST6 kernel: LustreError: 3893:0:(filter_lvb.c:90:filter_lvbo_init()) Skipped 7 previous similar messages
  Nov 19 19:59:36 OST6 kernel: LustreError: 3893:0:(ldlm_resource.c:858:ldlm_resource_add()) lvbo_init failed for resource 996598: rc -2
  Nov 19 19:59:36 OST6 kernel: LustreError: 3893:0:(ldlm_resource.c:858:ldlm_resource_add()) Skipped 7 previous similar messages
  Nov 19 19:59:40 OST6 kernel: LDISKFS-fs error (device drbd6): ldiskfs_lookup: unlinked inode 261038084 in dir #261333008
  Nov 19 19:59:45 OST6 kernel: LDISKFS-fs error (device drbd6): ldiskfs_lookup: unlinked inode 229924884 in dir #261333024
  Nov 19 19:59:47 OST6 kernel: LDISKFS-fs error (device drbd6): ldiskfs_lookup: unlinked inode 228163899 in dir #261333024
  Nov 19 19:59:54 OST6 kernel: LDISKFS-fs error (device drbd6): ldiskfs_lookup: unlinked inode 165830658 in dir #261333015

I tried unmounting this drbd disk and checking it with e2fsck, which reported that the disk was clean, but after re-mounting it the errors came back.

For now, I have unmounted it and reformatted the disk like this:

  mkfs.lustre --reformat --verbose --writeconf --ost --mgsnode=192.168.1...@tcp:192.168.1...@tcp --failover=192.168.1...@tcp --index=6 /dev/sdd

It completed without any errors, and I am now resynchronizing this drbd disk with a working OST node. That will take quite a long time.

So the question I want to raise here is: in order to use drbd as a backup solution as described above, do we need to reformat the disk before synchronizing the data, or can we just sync it directly?

Could you please give me some advice or suggestions for my situation?

Thanks in advance.

Best regards
_______________________________________________
Lustre-discuss mailing list
http://lists.lustre.org/mailman/listinfo/lustre-discuss
