On February 1, 2020 1:34:30 AM GMT+02:00, Jayme <[email protected]> wrote: >I have run into this exact issue before and resolved it by simply >syncing >over the missing files and running a heal on the volume (can take a >little >time to correct) > > >On Fri, Jan 31, 2020 at 7:05 PM Christian Reiss ><[email protected]> >wrote: > >> Hey folks, >> >> in our production setup with 3 nodes (HCI) we took one host down >> (maintenance, stop gluster, poweroff via ssh/ovirt engine). Once it >was >> up the gluster hat 2k healing entries that went down in a matter on >10 >> minutes to 2. >> >> Those two give me a headache: >> >> [root@node03:~] # gluster vol heal ssd_storage info >> Brick node01:/gluster_bricks/ssd_storage/ssd_storage >> <gfid:a121e4fb-0984-4e41-94d7-8f0c4f87f4b6> >> <gfid:6f8817dc-3d92-46bf-aa65-a5d23f97490e> >> Status: Connected >> Number of entries: 2 >> >> Brick node02:/gluster_bricks/ssd_storage/ssd_storage >> Status: Connected >> Number of entries: 0 >> >> Brick node03:/gluster_bricks/ssd_storage/ssd_storage >> <gfid:a121e4fb-0984-4e41-94d7-8f0c4f87f4b6> >> <gfid:6f8817dc-3d92-46bf-aa65-a5d23f97490e> >> Status: Connected >> Number of entries: 2 >> >> No paths, only gfid. We took down node2, so it does not have the >file: >> >> [root@node01:~] # md5sum >> >> >/gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> 75c4941683b7eabc223fc9d5f022a77c >> >> >/gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> >> [root@node02:~] # md5sum >> >> >/gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> md5sum: >> >/gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6: >> >> No such file or directory >> >> [root@node03:~] # md5sum >> >> >/gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> 75c4941683b7eabc223fc9d5f022a77c >> >> >/gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> >> The other two files are md5-identical. >> >> These flags are identical, too: >> >> [root@node01:~] # getfattr -d -m . -e hex >> >> >/gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> getfattr: Removing leading '/' from absolute path names >> # file: >> >> >gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> >> >security.selinux=0x73797374656d5f753a6f626a6563745f723a676c7573746572645f627269636b5f743a733000 >> trusted.afr.dirty=0x000000000000000000000000 >> trusted.afr.ssd_storage-client-1=0x0000004f0000000100000000 >> trusted.gfid=0xa121e4fb09844e4194d78f0c4f87f4b6 >> >> >trusted.gfid2path.d4cf876a215b173f=0x62653331383633382d653861302d346336642d393737642d3761393337616138343830362f38366461303238392d663734662d343230302d393238342d3637386537626437363139352e31323030 >> >> >trusted.glusterfs.mdata=0x010000000000000000000000005e349b1e000000001139aa2a000000005e349b1e000000001139aa2a000000005e34994900000000304a5eb2 >> >> getfattr: Removing leading '/' from absolute path names >> # file: >> >> >gluster_bricks/ssd_storage/ssd_storage/.glusterfs/a1/21/a121e4fb-0984-4e41-94d7-8f0c4f87f4b6 >> >> >security.selinux=0x73797374656d5f753a6f626a6563745f723a676c7573746572645f627269636b5f743a733000 >> trusted.afr.dirty=0x000000000000000000000000 >> trusted.afr.ssd_storage-client-1=0x0000004f0000000100000000 >> trusted.gfid=0xa121e4fb09844e4194d78f0c4f87f4b6 >> >> >trusted.gfid2path.d4cf876a215b173f=0x62653331383633382d653861302d346336642d393737642d3761393337616138343830362f38366461303238392d663734662d343230302d393238342d3637386537626437363139352e31323030 >> >> >trusted.glusterfs.mdata=0x010000000000000000000000005e349b1e000000001139aa2a000000005e349b1e000000001139aa2a000000005e34994900000000304a5eb2 >> >> Now, I dont dare simply proceeding withouth some advice. >> Anyone got a clue on who to resolve this issue? File #2 is identical >to >> this one, from a problem point of view. >> >> Have a great weekend! >> -Chris. >> >> -- >> with kind regards, >> mit freundlichen Gruessen, >> >> Christian Reiss >> _______________________________________________ >> Users mailing list -- [email protected] >> To unsubscribe send an email to [email protected] >> Privacy Statement: https://www.ovirt.org/site/privacy-policy/ >> oVirt Code of Conduct: >> https://www.ovirt.org/community/about/community-guidelines/ >> List Archives: >> >https://lists.ovirt.org/archives/list/[email protected]/message/FGIQFIRC6QYN4AYB3NRPM42KX4ENIF2A/ >>
There is an active thread in gluster-users , so it will be nice to mention this there. About the sync, you can find the paths via: 1. Mount mount -t glusterfs -o aux-gfid-mount vm1:test /mnt/testvol 2. Find the path of files: getfattr -n trusted.glusterfs.pathinfo -e text /mnt/testvol/.gfid/<GFID> I bet it's the same file that causes me problems. Just verify the contents and you will see that one of them is newer -> just rsync it to the bricks node01 & node03 and run 'gluster volume heal <volume> full.' Best Regards, Strahil Nikolov _______________________________________________ Users mailing list -- [email protected] To unsubscribe send an email to [email protected] Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/[email protected]/message/QQAA6OENXMNCRL5ZONDVLW7ICJZ2SOQT/

