Hello,

Last Friday I upgraded my GlusterFS 3.10.7 3-way replica (with arbitrer) 
cluster to 3.12.7 and this morning I got a warning that 9 files on one of my 
volumes are not synced. Ineeded checking that volume with a "volume heal info" 
shows that the third node (the arbitrer node) has 9 files to be healed but are 
not being healed automatically.

All nodes were always online and there was no network interruption so I am 
wondering if this might not really be a split-brain issue but something else.

I found some interesting log entries on the client log file 
(/var/log/glusterfs/myvol-private.log) which I have included below in this 
mail. It looks like some renaming has gone wrong because a directory is not 
empty.

For your information I have upgraded my GlusterFS in offline mode and the 
upgrade went smoothly.

What can I do to fix that issue?

Best regards,
Mabi


[2018-04-09 06:58:46.906089] I [MSGID: 109066] [dht-rename.c:1741:dht_rename] 
0-myvol-private-dht: renaming 
/dir1/dir2/dir3/dir4/dir5/dir6/dir7/dir8/dir9/dir10/dir11/azipfile.zip 
(hash=myvol-private-replicate-0/cache=myvol-private-replicate-0) => 
/dir1/di2/dir3/dir4/dir5/dir6/dir7/dir8/dir9/dir10/dir11/dir12_Archiv/azipfile.zip
 (hash=myvol-private-replicate-0/cache=<nul>)
[2018-04-09 06:58:53.692440] W [MSGID: 114031] 
[client-rpc-fops.c:670:client3_3_rmdir_cbk] 0-myvol-private-client-2: remote 
operation failed [Directory not empty]
[2018-04-09 06:58:53.714129] W [MSGID: 114031] 
[client-rpc-fops.c:2860:client3_3_lookup_cbk] 0-myvol-private-client-1: remote 
operation failed. Path: <gfid:13880e8c-13da-442f-8180-fa40b6f5327c> 
(13880e8c-13da-442f-8180-fa40b6f5327c) [No such file or directory]
[2018-04-09 06:58:53.714161] W [MSGID: 114031] 
[client-rpc-fops.c:2860:client3_3_lookup_cbk] 0-myvol-private-client-0: remote 
operation failed. Path: <gfid:13880e8c-13da-442f-8180-fa40b6f5327c> 
(13880e8c-13da-442f-8180-fa40b6f5327c) [No such file or directory]
[2018-04-09 06:58:53.715638] W [MSGID: 114031] 
[client-rpc-fops.c:670:client3_3_rmdir_cbk] 0-myvol-private-client-2: remote 
operation failed [Directory not empty]
[2018-04-09 06:58:53.750372] I [MSGID: 108026] 
[afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do] 
0-myvol-private-replicate-0: performing metadata selfheal on 
1cc6facf-eca5-481c-a905-7a39faa25156
[2018-04-09 06:58:53.757677] I [MSGID: 108026] 
[afr-self-heal-common.c:1656:afr_log_selfheal] 0-myvol-private-replicate-0: 
Completed metadata selfheal on 1cc6facf-eca5-481c-a905-7a39faa25156. 
sources=[2]  sinks=0 1 
[2018-04-09 06:58:53.775939] I [MSGID: 108026] 
[afr-self-heal-entry.c:887:afr_selfheal_entry_do] 0-myvol-private-replicate-0: 
performing entry selfheal on 1cc6facf-eca5-481c-a905-7a39faa25156
[2018-04-09 06:58:53.776237] I [MSGID: 108026] 
[afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do] 
0-myvol-private-replicate-0: performing metadata selfheal on 
13880e8c-13da-442f-8180-fa40b6f5327c
[2018-04-09 06:58:53.781762] I [MSGID: 108026] 
[afr-self-heal-common.c:1656:afr_log_selfheal] 0-myvol-private-replicate-0: 
Completed metadata selfheal on 13880e8c-13da-442f-8180-fa40b6f5327c. 
sources=[2]  sinks=0 1 
[2018-04-09 06:58:53.796950] I [MSGID: 108026] 
[afr-self-heal-common.c:1656:afr_log_selfheal] 0-myvol-private-replicate-0: 
Completed entry selfheal on 1cc6facf-eca5-481c-a905-7a39faa25156. sources=[2]  
sinks=0 1 
[2018-04-09 06:58:53.812682] I [MSGID: 108026] 
[afr-self-heal-entry.c:887:afr_selfheal_entry_do] 0-myvol-private-replicate-0: 
performing entry selfheal on 13880e8c-13da-442f-8180-fa40b6f5327c
[2018-04-09 06:58:53.879382] E [MSGID: 108008] 
[afr-read-txn.c:90:afr_read_txn_refresh_done] 0-myvol-private-replicate-0: 
Failing READ on gfid a4c46519-7dda-489d-9f5d-811ededd53f1: split-brain 
observed. [Input/output error]
[2018-04-09 06:58:53.881514] E [MSGID: 108008] 
[afr-read-txn.c:90:afr_read_txn_refresh_done] 0-myvol-private-replicate-0: 
Failing FGETXATTR on gfid a4c46519-7dda-489d-9f5d-811ededd53f1: split-brain 
observed. [Input/output error]
[2018-04-09 06:58:53.890073] W [MSGID: 108027] 
[afr-common.c:2798:afr_discover_done] 0-myvol-private-replicate-0: no read 
subvols for (null)
_______________________________________________
Gluster-users mailing list
Gluster-users@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-users

Reply via email to