Thanks.  Can you tell me how to do the mapping back to the MDS inode?  
For example, is 1162976 in the list below the MDS inode?  May as well look.

I am following the directions in the recent threads to redo the OSTs 
(not the whole Lustre file system).  I will restore the indicated files.

bob

On 11/30/2010 4:17 PM, Andreas Dilger wrote:
> On 2010-11-30, at 11:17, Bob Ball wrote:
>> [r...@umdist03 d0]# ls -l
>> total 182976
>> -rw-rw-rw- 1 daits users 45002956 Jul  5 20:52 1162976
>> -rw-rw-rw- 1 daits users 44569036 Jul  7 02:53 1200608
>> -rw-rw-rw- 1 daits users 49108913 Jun 28 04:43 1218976
>> -rw-rw-rw- 1 daits users 48658429 Jul 16 13:29 1254176
>> -rwSrwSrw- 1 root  root         0 Sep  2 15:11 128
>> -rwSrwSrw- 1 root  root         0 Sep  2 15:11 9152
>> -rwSrwSrw- 1 root  root         0 Sep  2 15:11 9216
>> -rwSrwSrw- 1 root  root         0 Sep  2 15:11 9248
>>
>> Some time back we had an MDT issue, and upon running e2fsck, saw a LOT
>> of corrupted entries that were just deleted.  I suspect that these may
>> have been entries pointing to these files?
> Likely, yes.
>
>> "lfs find" comes up empty handed for this OST, indeed, there are 6 OST
>> here, each with about 10GB worth of files of this kind.  Are those 60GB
>> just lost?  Short of pawing through these, by hand, to see what we can
>> make of the content, is there a snowball's chance in Hades of identifying
>> these files?
> They can be mapped back to an MDS inode number, and the user/group 
> information is intact, but that doesn't help if the MDS inodes were deleted 
> by e2fsck since there will not be any file name available.
>
>> Can I simply copy them out of this "ldiskfs" mount of the file system,
>> back into some recovery directory in the real file system, so that users
>> can pick through them?
> Yes, just rsync the non-zero-length files from the ldiskfs-mounted OST 
> filesystem into a new "lost+found" directory created in the lustre mountpoint 
> on a client.  If you "chmod 1775 /path/to/lustre/lost+found" the owners of 
> the file will be able to read/delete their files, but others will not (like 
> /tmp).
>
>> After they are moved, the file system will be reformatted and returned to 
>> use.
> The whole Lustre filesystem, or the OST?  If you are replacing the OST, then 
> you should still do a backup of last_rcvd, CONFIGS/, and O/0/LAST_ID from the 
> OST, and then restore them to the after the OST is reformatted.  This process 
> was very recently discussed on this list.
>
>> On 11/30/2010 8:53 AM, Bob Ball wrote:
>>> OK, thanks.  Scary, to see errors out of lfs find.
>>>
>>> bob
>>>
>>> On 11/30/2010 1:47 AM, Andreas Dilger wrote:
>>>> On 2010-11-29, at 20:18, Bob Ball wrote:
>>>>> I have an odd problem.  I am trying to empty all files from a set of OST
>>>>> as indicated below, by making a list via lfs find and then sending that
>>>>> list to lfs_migrate.  However, I have just gotten this message back from
>>>>> the lfs find:
>>>>>
>>>>> llapi_semantic_traverse: Failed to open
>>>>> '/lustre/umt3/data13/daits/p15.6.3.10/prod/W1J_munu216465_simul': No
>>>>> such file or directory (2)
>>>>> error: find failed for umt3-OST0021.
>>>> This may mean that the file was deleted while "lfs find" was running.
>>>>
>>>>> On the OSS, I see this but not much else:
>>>>> LustreError: 5226:0:(ldlm_resource.c:861:ldlm_resource_add()) lvbo_init
>>>>> failed for resource 9101: rc -2
>>>>>
>>>>> Can someone give me an idea of what is wrong  here?  And what can be
>>>>> done about it, if anything?
>>>> This might mean that the file was deleted at the same time the MDS 
>>>> crashed, and the objects were removed but the MDS file was not.  It is 
>>>> possible to just delete this file using the "unlink" command - it does not 
>>>> contain any data in any case.
>>>>
>>>> Cheers, Andreas
>>>> --
>>>> Andreas Dilger
>>>> Lustre Technical Lead
>>>> Oracle Corporation Canada Inc.
>>>>
>>>>
>>>>
>>> _______________________________________________
>>> Lustre-discuss mailing list
>>> [email protected]
>>> http://lists.lustre.org/mailman/listinfo/lustre-discuss
>>>
>>>
>> _______________________________________________
>> Lustre-discuss mailing list
>> [email protected]
>> http://lists.lustre.org/mailman/listinfo/lustre-discuss
>
> Cheers, Andreas
> --
> Andreas Dilger
> Lustre Technical Lead
> Oracle Corporation Canada Inc.
>
>
>
_______________________________________________
Lustre-discuss mailing list
[email protected]
http://lists.lustre.org/mailman/listinfo/lustre-discuss

Reply via email to