[ovirt-users] Re: Gluster volumes not healing (perhaps after host maintenance?)

2021-05-18 Thread Marco Fais
Hi David,

just spotted this post from a couple of weeks ago -- I have the same
problem (Gluster volume not healing) since the upgrade from 7.x to 8.4.
Same exact errors on glustershd.log -- and same errors if I try to heal
manually.

Typically I can get the volume healed by killing the specific brick
processes manually and forcing a volume start (to restart the failed
bricks).

Just wondering if you've got any progress on your side?

I have also tried to upgrade to 9.1 in one of the clusters (I have three
different ones affected) but didn't solve the issue.

Regards.
Marco

On Mon, 26 Apr 2021 at 21:55, David White via Users  wrote:

> I did have my /etc/hosts setup on all 3 of the oVirt Hosts in the format
> you described, with the exception of the trailing "host1" and "host2". I
> only had the FQDN in there.
>
> I had an outage of almost an hour this morning that may or may not be
> related to this. An "ETL Service" started, at which point a lot of things
> broke down, and I saw a lot of storage-related errors. Everything came back
> on its own, though.
>
> See my other thread that I just started on that topic.
> As of now, there are NOT indications that any of the volumes or disks are
> out of sync.
>
>
> Sent with ProtonMail  Secure Email.
>
> ‐‐‐ Original Message ‐‐‐
> On Sunday, April 25, 2021 1:43 AM, Strahil Nikolov via Users <
> users@ovirt.org> wrote:
>
> A/ & PTR records are pretty important.
> As long as you setup your /etc/hosts jn the format like this you will be
> OK:
>
> 10.10.10.10 host1.anysubdomain.domain host1
> 10.10.10.11 host2.anysubdomain.domain host2
>
> Usually the hostname is defined for each peer in the
> /var/lib/glusterd/peers. Can you check the contents on all nodes ?
>
> Best Regards,
> Strahil Nikolov
>
> On Sat, Apr 24, 2021 at 21:57, David White via Users
>  wrote:
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/CYPYALTFM7ITZZENSI6R5E6ZNT7TRY5Y/
>
>
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/NU6PXEUVVSCHVUIYTJRFOO72ZCJBWGVG/
>
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/4YLOE6ZX4W4XXEY72Q5ZJIZDKMNPEDO2/


[ovirt-users] Re: Gluster volumes not healing (perhaps after host maintenance?)

2021-04-26 Thread David White via Users
I did have my /etc/hosts setup on all 3 of the oVirt Hosts in the format you 
described, with the exception of the trailing "host1" and "host2". I only had 
the FQDN in there.

I had an outage of almost an hour this morning that may or may not be related 
to this. An "ETL Service" started, at which point a lot of things broke down, 
and I saw a lot of storage-related errors. Everything came back on its own, 
though.

See my other thread that I just started on that topic.
As of now, there are NOT indications that any of the volumes or disks are out 
of sync.

Sent with ProtonMail Secure Email.

‐‐‐ Original Message ‐‐‐
On Sunday, April 25, 2021 1:43 AM, Strahil Nikolov via Users  
wrote:

> A/ & PTR records are pretty important.
> As long as you setup your /etc/hosts jn the format like this you will be OK:
> 

> 10.10.10.10 host1.anysubdomain.domain host1
> 10.10.10.11 host2.anysubdomain.domain host2
> 

> Usually the hostname is defined for each peer in the /var/lib/glusterd/peers. 
> Can you check the contents on all nodes ?
> 

> Best Regards,
> Strahil Nikolov
> 

> > On Sat, Apr 24, 2021 at 21:57, David White via Users
> >  wrote:
> > ___
> > Users mailing list -- users@ovirt.org
> > To unsubscribe send an email to users-le...@ovirt.org
> > Privacy Statement: https://www.ovirt.org/privacy-policy.html
> > oVirt Code of Conduct: 
> > https://www.ovirt.org/community/about/community-guidelines/
> > List Archives: 
> > https://lists.ovirt.org/archives/list/users@ovirt.org/message/CYPYALTFM7ITZZENSI6R5E6ZNT7TRY5Y/

publickey - dmwhite823@protonmail.com - 0x320CD582.asc
Description: application/pgp-keys


signature.asc
Description: OpenPGP digital signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/NU6PXEUVVSCHVUIYTJRFOO72ZCJBWGVG/


[ovirt-users] Re: Gluster volumes not healing (perhaps after host maintenance?)

2021-04-24 Thread Strahil Nikolov via Users
A/ & PTR records are pretty important.As long as you setup your /etc/hosts 
jn the format like this you will be OK:
10.10.10.10 host1.anysubdomain.domain host110.10.10.11 
host2.anysubdomain.domain host2
Usually the hostname is defined for each peer in the /var/lib/glusterd/peers. 
Can you check the contents on all nodes ?
Best Regards,Strahil Nikolov 
 
  On Sat, Apr 24, 2021 at 21:57, David White via Users wrote:  
 ___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CYPYALTFM7ITZZENSI6R5E6ZNT7TRY5Y/
  
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/WPTIX725OH43KE5FIK2I2G3H2FCMWEDH/


[ovirt-users] Re: Gluster volumes not healing (perhaps after host maintenance?)

2021-04-24 Thread David White via Users
As part of my troubleshooting earlier this morning, I gracefully shut down the 
ovirt-engine so that it would come up on a different host (can't remember if I 
mentioned that or not).

I just verified forward DNS on all 3 of the hosts.
All 3 resolve each other just fine, and are able to ping each other. The 
hostnames look good, too.

I'm fairly certain that this problem didn't exist prior to me shutting the host 
down and replacing the network card.

That said, I don't think I ever setup rdns / ptr records to begin with. I don't 
recall reading that rdns was a requirement, nor do I remember setting it up 
when I built the cluster a couple weeks ago. Is this a requirement?

I did setup forward dns entries into /etc/hosts on each server, though.

Sent with ProtonMail Secure Email.

‐‐‐ Original Message ‐‐‐
On Saturday, April 24, 2021 11:03 AM, Strahil Nikolov  
wrote:

> Hi David,
> 

> let's start with the DNS.
> Check that both nodes resolve each other (both A/ & PTR records).
> 

> If you set entries in /etc/hosts, check them out.
> 

> Also , check the output of 'hostname -s' & 'hostname -f' on both hosts.
> 

> Best Regards,
> Strahil Nikolov

publickey - dmwhite823@protonmail.com - 0x320CD582.asc
Description: application/pgp-keys


signature.asc
Description: OpenPGP digital signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CYPYALTFM7ITZZENSI6R5E6ZNT7TRY5Y/


[ovirt-users] Re: Gluster volumes not healing (perhaps after host maintenance?)

2021-04-24 Thread Strahil Nikolov via Users
Hi David,

let's start with the DNS.Check that both nodes resolve each other (both A/ 
& PTR records).
If you set entries in /etc/hosts, check them out.
Also , check the output of 'hostname -s' & 'hostname -f' on both hosts.

Best Regards,Strahil Nikolov___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/6S4SBF42ABLYDWJNBZEBBGNB3FLSL53W/