Hi Bart,
Can you run pvfs2-stat on one of the files, and also send along the
fs.conf file? pvfs2-stat might be helpful because it shows the metadata
handle value. We can compare that value to the handle ranges in the
conf file to narrow down whether it is hitting a metadata object that
has just been corrupted somehow, or whether it really is hitting a
datafile handle.
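
A rough sketch of that comparison in Python, in case it saves some manual checking. It assumes the handle ranges in fs.conf appear as "Range <alias> <low>-<high>" lines inside <MetaHandleRanges> and <DataHandleRanges> blocks. The sample config text, the alias "server1", the range values, and the helper names parse_handle_ranges and classify_handle are all made up for illustration and would need adjusting to the real file.

```python
import re

# Hypothetical fs.conf excerpt; the alias and range values are invented.
SAMPLE_CONF = """
<FileSystem>
    <MetaHandleRanges>
        Range server1 4-2147483650
    </MetaHandleRanges>
    <DataHandleRanges>
        Range server1 2147483651-4294967297
    </DataHandleRanges>
</FileSystem>
"""

# ASSUMPTION: ranges live in these two block types, one "Range" line per server.
SECTION_RE = re.compile(r"<(MetaHandleRanges|DataHandleRanges)>(.*?)</\1>", re.S)
RANGE_RE = re.compile(r"Range\s+(\S+)\s+(\S+)")


def parse_handle_ranges(conf_text):
    """Return (kind, alias, low, high) tuples, where kind is 'meta' or 'data'."""
    ranges = []
    for section, body in SECTION_RE.findall(conf_text):
        kind = "meta" if section == "MetaHandleRanges" else "data"
        for alias, spec in RANGE_RE.findall(body):
            # A spec may hold several comma-separated "low-high" extents.
            for extent in spec.split(","):
                low, high = extent.split("-")
                ranges.append((kind, alias, int(low), int(high)))
    return ranges


def classify_handle(handle, ranges):
    """Return (kind, alias) for the range containing handle, or None."""
    for kind, alias, low, high in ranges:
        if low <= handle <= high:
            return kind, alias
    return None


if __name__ == "__main__":
    ranges = parse_handle_ranges(SAMPLE_CONF)
    # Substitute the handle value that pvfs2-stat reports for the bad file.
    print(classify_handle(238502937, ranges))
```

A result of None would mean the handle falls outside every configured range, which would itself be worth reporting.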
If pvfs2-stat fails to show any output, then maybe you can modify
pvfs2-stat.c to print the value of ref.handle right before the
sys_getattr() call.
thanks,
-Phil
On 10/07/2010 01:08 PM, Bart Taylor wrote:
Hey guys,
We are seeing an increasing number of files that cannot be removed on
our 2.6 file systems. When we run the pvfs2-lsplus tool, the output for
these files looks like this:
[E 15:14:05.798568] Invalid type 2 in readdirplus
---------- 1 root root 0 1969-12-31 18:00 File1
[E 15:14:17.712553] Invalid type 2 in readdirplus
---------- 1 root root 0 1969-12-31 18:00 File2
[E 15:14:24.799221] Invalid type 2 in readdirplus
[E 15:14:24.799257] Invalid type 2 in readdirplus
[E 15:14:24.799269] Invalid type 2 in readdirplus
---------- 1 root root 0 1969-12-31 18:00 File3.txt
---------- 1 root root 0 1969-12-31 18:00 File5.txt
---------- 1 root root 0 1969-12-31 18:00 File6.txt
The "Invalid type 2" message indicates that readdirplus is returning
datafile attributes mixed in with the directory entries. That might
explain why all of the attributes look like default values, but I am
not sure why those files are having problems in the first place.
We usually notice the problem when someone tries to update, create, or
append to one of these files. Most operations on them seem to return
"No such file or directory". A standard /bin/ls returns normally, but
long listings fail. pvfs2-ls, pvfs2-viewdist, and pvfs2-validate return
getattr failures. pvfs2-rm returns output like this:
[E 09:42:30.669413] Error: failed removing one or more datafiles associated with the meta handle 238502937
[E 09:42:30.669599] WARNING: PVFS_sys_remove() encountered an error which may lead
to inconsistent state: No such file or directory
[E 09:42:30.669614] WARNING: PVFS2 fsck (if available) may be needed.
Error: An error occurred while removing /mnt/pvfs2/file1.txt
PVFS_sys_remove: No such file or directory (error class: 0)
Removing these files is a manual process. These are the steps we follow:
- Track down the file(s) that are causing the problems
- pvfs2-stat on the directory where the file resides
- Grab the FSID and handle from the output
- pvfs2-remove-object using the file name, directory handle, and FSID
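
The last step above could be batched with something like the following Python sketch. It only builds and prints the command lines so they can be reviewed before anything is run. The option letters in ASSUMED_FLAGS are an assumption and are not taken from this thread, so check them against the usage message of pvfs2-remove-object on your build. The FSID 1234, the directory handle 1048576, and the helper names are placeholders.

```python
import shlex

# ASSUMPTION: these option letters are guesses, not confirmed by the thread;
# verify them against the pvfs2-remove-object usage output before use.
ASSUMED_FLAGS = {"fsid": "-f", "parent": "-p", "dirent": "-d"}


def build_remove_commands(fsid, parent_handle, names, tool="pvfs2-remove-object"):
    """Build one argv list per stuck directory entry (nothing is executed)."""
    commands = []
    for name in names:
        commands.append([
            tool,
            ASSUMED_FLAGS["fsid"], str(fsid),
            ASSUMED_FLAGS["parent"], str(parent_handle),
            ASSUMED_FLAGS["dirent"], name,
        ])
    return commands


def render(commands):
    """Render argv lists as shell-quoted lines for review before running."""
    return [" ".join(shlex.quote(arg) for arg in argv) for argv in commands]


if __name__ == "__main__":
    # Placeholder FSID and directory handle; take the real ones from the
    # pvfs2-stat output for the directory that holds the bad entries.
    for line in render(build_remove_commands(1234, 1048576, ["File1", "File2"])):
        print(line)
```

Keeping this as a dry run is deliberate: removing objects by handle is destructive, so it seems safer to inspect the generated lines first.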
As more of these files appear, this process is becoming slow and
painful. It would be great if we could work out why these files end up
in this state, but right now I think a utility that could efficiently
remove them without the legwork would be really helpful. Any idea what
might work based on what we are seeing?
I am not sure if the problem also exists in 2.8, but it may be related
to this issue mailed in by Jim in September:
http://www.beowulf-underground.org/pipermail/pvfs2-users/2010-September/003186.html
We are experiencing this issue as well.
Thanks,
Bart.
_______________________________________________
Pvfs2-developers mailing list
[email protected]
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers