That didn't fix it unfortunately. In fact, I've done a full rebalance after 
initially discovering the problem and after updating Gluster, but nothing was 
changed..
 
I don't know too much about how Gluster works internally; is it possible to 
compute the hash for each duplicate filename - figure out on which brick it 
belong is and find where it actually resides, then recreate the link file or 
update the linkto attribute? Assuming broken link files are the problem..
 
--------- Original Message --------- Subject: Re: [Gluster-users] Hundreds of 
duplicate files
From: "Joe Julian" <[email protected]>
Date: 12/27/14 1:55 pm
To: [email protected], [email protected]

I'm wondering if this is from a corrupted failed rebalance. In a directory that 
has duplicates, do "setfattr -n trusted.distribute.fix.layout -v 1 ."
 
 If that fixes it, do a rebalance...fix-layout

 On December 27, 2014 12:38:01 PM PST, [email protected] wrote:  Ok, I am 
really tearing my hair out here. I tried doing this manually for several other 
files just to be sure. And in these cases it removed the duplicate file from 
the directory listing, but the file can still not be read.. Reading directly 
from the brick works fine.
 
--------- Original Message --------- Subject: Re: [Gluster-users] Hundreds of 
duplicate files
From: "Joe Julian" <[email protected]>
Date: 12/27/14 12:01 pm
To: [email protected]

Should be safe.
 
 Here's what I've done in the past to clean up rogue dht link files (not that 
yours looked rogue though):
 
 find $brick_root -type f -size 0 -perm 1000 -exec /bin/rm {} \;
 
 On 12/27/2014 11:09 AM, [email protected] wrote:
 Moving the file with linkto attribute worked! Just one copy of the file is 
retained in the listing and can be read without problems.
I will write a script to remove these rogue link files from the bricks - any 
risks associated with this?
 
Thanks everyone for your help, of course if anyone could explain how this 
happened I would love to hear it..
 
Tom
 
--------- Original Message --------- Subject: Re: [Gluster-users] Hundreds of 
duplicate files
 From: "Vijay Bellur" <[email protected]>
 Date: 12/27/14 9:12 am
 To: [email protected], [email protected]
 
 On 12/27/2014 01:11 PM, [email protected] wrote:
 > Thanks for your continued help Joe.
 > A demonstration of the problem, in this case I was able to open the file
 > in vim (a text file) without any issues, however sometimes duplicated
 > text files open in vim as one line consisting of @ characters, and
 > binary data files can also not be opened correctly for reading.
 > However the duplicate listing is still an issue. Note that Dec 13 was
 > the date of a server crash.
 >
 > [root@jongoo ~]# ll /sar/complete/vancouver/refdem/tif2flt.pro*
 > -rw-rw-r-T 1 parwant users 1712 Dec 13 19:02 tif2flt.pro
 > -rw-rw-r-- 1 parwant users 1712 Jun 17 2010 tif2flt.pro
 >
 > A few minutes later doing the same listing.. sticky bit disappeared and
 > modification date changed
 >
 > [root@jongoo ~]# ll /sar/complete/vancouver/refdem/tif2flt.pro*
 > -rw-rw-r-- 1 parwant users 1712 Jun 17 2010
 > /sar/complete/vancouver/refdem/tif2flt.pro
 > -rw-rw-r-- 1 parwant users 1712 Jun 17 2010
 > /sar/complete/vancouver/refdem/tif2flt.pro
 >
 > [root@jongoo ~]# getfattr -m . -d -e hex
 > /data/glusterfs/safari/brick00/brick/complete/vancouver/refdem/tif2flt.pro
 > getfattr: Removing leading '/' from absolute path names
 > # file:
 > data/glusterfs/safari/brick00/brick/complete/vancouver/refdem/tif2flt.pro
 > system.posix_acl_access=0x0200000001000600ffffffff04000600ffffffff10000600ffffffff20000400ffffffff
 > trusted.SGI_ACL_FILE=0x0000000400000001ffffffff0006000000000004ffffffff0006000000000010ffffffff0006000000000020ffffffff00040000
 > trusted.gfid=0xdfe13dc088bf4a779488ef72f0a879cd
 > trusted.glusterfs.dht.linkto=0x7361666172692d636c69656e742d3300
 >
 > [root@ndovu ~]# getfattr -m . -d -e hex
 > /data/glusterfs/safari/brick03/brick/complete/vancouver/refdem/tif2flt.pro
 > getfattr: Removing leading '/' from absolute path names
 > # file:
 > data/glusterfs/safari/brick03/brick/complete/vancouver/refdem/tif2flt.pro
 > system.posix_acl_access=0x0200000001000600ffffffff04000600ffffffff10000600ffffffff20000400ffffffff
 > trusted.SGI_ACL_FILE=0x0000000400000001ffffffff0006000000000004ffffffff0006000000000010ffffffff0006000000000020ffffffff00040000
 > trusted.gfid=0xdfe13dc088bf4a779488ef72f0a879cd
 >
 
 Is rebalance running on this volume right now? If not, can you please 
 move out the file copy with "trusted.glusterfs.dht.linkto" attribute out 
 of the brick directory 
 (/data/glusterfs/safari/brick00/brick/complete/vancouver/refdem/tif2flt.pro) 
 to an alternate location & check the behavior?
 
 Thanks,
 Vijay
 

 _______________________________________________
 Gluster-users mailing list
 [email protected]
 http://www.gluster.org/mailman/listinfo/gluster-users  
_______________________________________________ Gluster-users mailing list 
[email protected] http://www.gluster.org/mailman/listinfo/gluster-users
 
  
Gluster-users mailing list
[email protected]
http://www.gluster.org/mailman/listinfo/gluster-users  

 -- 
 Sent from my Android device with K-9 Mail. Please excuse my brevity.
_______________________________________________
Gluster-users mailing list
[email protected]
http://www.gluster.org/mailman/listinfo/gluster-users

Reply via email to