[ovirt-users] Fwd: Ovirt host GetGlusterVolumeHealInfoVDS failed events

2020-05-06 Thread Anton Marchukov
Forwarding to oVirt users list.

-- Forwarded message -
From: 
Date: Wed, May 6, 2020 at 12:01 PM
Subject: Ovirt host GetGlusterVolumeHealInfoVDS failed events
To: 


Hi,

We have a oVirt cluster with 4 hosts and hosted engine running on one of
them (all the nodes provide the storage with GlusterFS)
Currently there are 53 VMs running.
The version of the oVirt-Engine is 4.2.8.2-1.el7 and GlusterFS is 3.12.15.

>From past 1 week, we seem to have multiple events popping up on Ovirt-UI
about the GetGlusterVolumeHealInfoVDS from all the nodes randomly like one
ERROR event for every ~13minutes.

Sample Event dashboard example:
May 4, 2020, 2:32:14 PM - Status of host  was set to Up.
May 4, 2020, 2:32:11 PM - Manually synced the storage devices from host

May 4, 2020, 2:31:55 PM - Host  is not responding. Host cannot be
fenced automatically because power management for the host is disabled.
May 4, 2020, 2:31:55 PM - VDSM  command GetGlusterVolumeHealInfoVDS
failed: Message timeout which can be caused by communication issues

May 4, 2020, 2:19:14 PM - Status of host  was set to Up.
May 4, 2020, 2:19:12 PM - Manually synced the storage devices from host

May 4, 2020, 2:18:49 PM - Host  is not responding. Host cannot be
fenced automatically because power management for the host is disabled.
May 4, 2020, 2:18:49 PM - VDSM  command GetGlusterVolumeHealInfoVDS
failed: Message timeout which can be caused by communication issues

May 4, 2020, 2:05:55 PM - Status of host  was set to Up.
May 4, 2020, 2:05:54 PM - Manually synced the storage devices from host

May 4, 2020, 2:05:35 PM - Host  is not responding. Host cannot be
fenced automatically because power management for the host is disabled.
May 4, 2020, 2:05:35 PM - VDSM  command GetGlusterVolumeHealInfoVDS
failed: Message timeout which can be caused by communication issues

May 4, 2020, 1:52:45 PM - Status of host  was set to Up.
May 4, 2020, 1:52:44 PM - Manually synced the storage devices from host

May 4, 2020, 1:52:22 PM - Host  is not responding. Host cannot be
fenced automatically because power management for the host is disabled.
May 4, 2020, 1:52:22 PM - VDSM  command GetGlusterVolumeHealInfoVDS
failed: Message timeout which can be caused by communication issues

May 4, 2020, 1:39:11 PM - Status of host  was set to Up.
May 4, 2020, 1:39:11 PM - Manually synced the storage devices from host

May 4, 2020, 1:39:11 PM - Host  is not responding. Host cannot be
fenced automatically because power management for the host is disabled.
May 4, 2020, 1:39:11 PM - VDSM  command GetGlusterVolumeHealInfoVDS
failed: Message timeout which can be caused by communication issues

May 4, 2020, 1:26:29 PM - Status of host  was set to Up.
May 4, 2020, 1:26:28 PM - Manually synced the storage devices from host

May 4, 2020, 1:26:11 PM - Host  is not responding. Host cannot be
fenced automatically because power management for the host is disabled.
May 4, 2020, 1:26:11 PM - VDSM  command GetGlusterVolumeHealInfoVDS
failed: Message timeout which can be caused by communication issues

May 4, 2020, 1:13:10 PM - Status of host  was set to Up.
May 4, 2020, 1:13:08 PM - Manually synced the storage devices from host

May 4, 2020, 1:12:51 PM - Host  is not responding. Host cannot be
fenced automatically because power management for the host is disabled.
May 4, 2020, 1:12:51 PM - VDSM  command GetGlusterVolumeHealInfoVDS
failed: Message timeout which can be caused by communication issues
 and so on.

When I look at the Compute > Hosts dashboard, I see the host status to be
DOWN when VDSM event (GetGlusterVolumeHealInfoVDS failed) is popped and
automatically the host status is set to UP within no time.
FYI: when host status is DOWN, the VM's running on that host are not
migrating and everything is running perfectly fine.

This is happening all day. Is there something I can troubleshoot?
Appreciate your comments.
___
Infra mailing list -- in...@ovirt.org
To unsubscribe send an email to infra-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
List Archives:
https://lists.ovirt.org/archives/list/in...@ovirt.org/message/GNE3QC7GLEER4ZPHGP3H6M27DPSKCQO3/


-- 
Anton Marchukov
Associate Manager - RHV DevOps - Red Hat
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RGUTF3DUTK4XTG7N4MM3MM3LUAFIIPJE/


[ovirt-users] Fwd: Install of new ovirt baremetal system 4.3.9

2020-05-06 Thread Anton Marchukov
Forwarding to oVirt users list since it looks to be better suited there.

-- Forwarded message -
From: kelley bryan 
Date: Wed, May 6, 2020 at 12:02 PM
Subject: Install of new ovirt baremetal system 4.3.9
To: 


Engine deployment fails near end:
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": true, "cmd": "set -euo
pipefail && firewall-cmd --get-active-zones | grep -v \"^\\s*interfaces\"",
"delta": "0:00:00.352904", "end": "2020-05-05 22:28:01.561606", "msg":
"non-zero return code", "rc": 1, "start": "2020-05-05 22:28:01.208702",
"stderr": "", "stderr_lines": [], "stdout": "", "stdout_lines": []}

The system may not be provisioned according to the playbook results: please
check the logs for the issue, fix accordingly or re-deploy from scratch.\n"}

were does ovirt store logs?
___
Infra mailing list -- in...@ovirt.org
To unsubscribe send an email to infra-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
List Archives:
https://lists.ovirt.org/archives/list/in...@ovirt.org/message/XALRUKVRYFC2NFN42STINRAP3W6RRIKU/


-- 
Anton Marchukov
Associate Manager - RHV DevOps - Red Hat
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CVPOWRYJKS4QWDI3M5EPKCEXAEBNZSEW/


[ovirt-users] Re: Mirror oVirt content

2020-04-21 Thread Anton Marchukov
Hello Adrian.

Thanks for your help with mirroring. I have forwarded your messages to the
ticketing system our infra team uses. You should get back two links: one
for the ticket about our mirroring docs being outdated (we need to fix
them) and the second one is about your mirroring request. Our infra team
will pick them up from there.


On Tue, Apr 21, 2020 at 2:42 PM  wrote:

> Hi Barak,
> Thanks for the info, we have opened a request/email using that link since
> a few months ago, however no one has reached out back to us.
>
> Anything else we can do from our end?
>
> thanks,
>
> Adrian
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/AMBMC2U745ZOIVMQ6SAOU5PVODCVHZPE/
>


-- 
Anton Marchukov
Associate Manager - RHV DevOps - Red Hat
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/DLC2LWEDOQOMTXHWT263BPKIRIIUR5G6/


[ovirt-users] Fwd: Unable to Upgrade

2019-10-01 Thread Anton Marchukov
Forwarding to the users list.

> Begin forwarded message:
> 
> From: "Akshita Jain" 
> Subject: Unable to Upgrade
> Date: 1 October 2019 at 11:12:58 CEST
> To: in...@ovirt.org
> 
> After upgrading oVirt 4.3.4 to 4.3.6, the gluster is also upgrading from 5.6 
> to 6.5. But as soon as it upgrades gluster peer status shows disconnected.
> What is the correct method to upgrade oVirt with gluster HCI environment?
> ___
> Infra mailing list -- in...@ovirt.org
> To unsubscribe send an email to infra-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/in...@ovirt.org/message/24D6NKVLMYQA3LI2GBXJFMOHA3U42KHS/

-- 
Anton Marchukov
Associate Manager - RHV DevOps - Red Hat






___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/YO2RYKFDCSPQ7EWOVXVE6LIO7GMY36SA/


[ovirt-users] Re: Outage Notification: Jenkins and Resources Are Not Accessible

2019-09-26 Thread Anton Marchukov
Hello All.

The issue was related to the upgrade of the networking switch. It is fixed and 
the services are back to normal.

Please let us know if you see any further problems.


> On 26 Sep 2019, at 09:19, Anton Marchukov  wrote:
> 
> Hello All.
> 
> Please note that oVirt services hosted at PHX datacenter are not accessible. 
> Main services running there are:
> 
> jenkins.ovirt.org
> resources.ovirt.org
> 
> This looks to be network related and we are working with the relevant teams 
> to resolve it.
> 
> Anton.
> 
> -- 
> Anton Marchukov
> Associate Manager - RHV DevOps - Red Hat
> 
> 
> 
> 
> 
> 

-- 
Anton Marchukov
Associate Manager - RHV DevOps - Red Hat





___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/ERRULHTW4N3EMCKB42CBVDHRCKB67X6L/


[ovirt-users] Outage Notification: Jenkins and Resources Are Not Accessible

2019-09-26 Thread Anton Marchukov
Hello All.

Please note that oVirt services hosted at PHX datacenter are not accessible. 
Main services running there are:

jenkins.ovirt.org
resources.ovirt.org

This looks to be network related and we are working with the relevant teams to 
resolve it.

Anton.

-- 
Anton Marchukov
Associate Manager - RHV DevOps - Red Hat





___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CSDEMXX5XYAP4KYLI4SB35ZM2D5KLAUC/


[ovirt-users] Work-Around for Access Problem to resources.ovirt.org and jenkins.ovirt.org

2018-05-09 Thread Anton Marchukov
Hello All.

Please note that ovirt.org got HSTS setting enabled. The way this
change was rolled was not authorised by us. The setting contains
“includeSubDomains” parameter that ask browser to access all resources
inside *.ovirt.org using https://.

The problem is that not all resources on *.ovirt.org are
https-enabled. And this will prevent you from being able to access

http://resources.ovirt.org
http://jenkins.ovirt.org

This setting is cached in your browser and then applied with no
exceptions even if you explicitly specify http:// the browser will
rewrite to https and then connection to above mentioned resources will
fail.

The only immediate work-around for you is to clear the HSTS flag in
your browser, please find the instructions for Firefox and Chrome [1].
Please note that you will get that setting back by visiting ovirt.org
website until we clear it from there.

We are currently working on this setting to be soften or disabled till
all resources are https enabled that is also in progress.

This is only related to access via the web browser. Accessing using
yum or automated scripts should work fine as they usually do not
observe and cache HSTS.

Anton.

[1] https://www.thesslstore.com/blog/clear-hsts-settings-chrome-firefox/
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org


Re: [ovirt-users] Update on oVirt Mirroring Status

2015-11-12 Thread Anton Marchukov
Hello All.

We have got the issue solved. I am readding mirrors back to the mirror list
as soon as I get confirmation it is working for each mirror.

At the moment we already have few mirrors back there.

Anton.

On Wed, Nov 11, 2015 at 10:45 AM, Anton Marchukov 
wrote:

> Hello All.
>
> Following previous announcement, please note that we still face a problem
> with getting updates to our mirrors. This is firewalling problem and we are
> working with IT on resolving it.
>
> I have temporarily updated mirror list to include only resources.ovirt.org
> there. It will be changed back as soon as the mirroring is restored.
>
> Please note that if you are not using the mirrorlist and include URLs
> directly and you are interested in latest snapshoots, than you may need to
> use http://resources.ovirt.org/pub/ovirt-@VERSION@/rpm/@DIST@/ directly.
>
> Will reply on this message once I get an ETA. Please contact
> in...@ovirt.org in case you have any questions.
>
> Anton.
>
> --
> Anton Marchukov
> Senior Software Engineer - RHEV CI - Red Hat
>
>


-- 
Anton Marchukov
Senior Software Engineer - RHEV CI - Red Hat
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Update on oVirt Mirroring Status

2015-11-11 Thread Anton Marchukov
Hello All.

Following previous announcement, please note that we still face a problem
with getting updates to our mirrors. This is firewalling problem and we are
working with IT on resolving it.

I have temporarily updated mirror list to include only resources.ovirt.org
there. It will be changed back as soon as the mirroring is restored.

Please note that if you are not using the mirrorlist and include URLs
directly and you are interested in latest snapshoots, than you may need to
use http://resources.ovirt.org/pub/ovirt-@VERSION@/rpm/@DIST@/ directly.

Will reply on this message once I get an ETA. Please contact in...@ovirt.org
in case you have any questions.

Anton.

-- 
Anton Marchukov
Senior Software Engineer - RHEV CI - Red Hat
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Wishlist - Mix gluster and local storage in same data center

2015-11-11 Thread Anton Marchukov
Hello Liam.

Well. If you have a look on hook's code than you can make it persistent by
removing the code that recreates the storage when VM is started. However
than you have to make sure that you either do not relocate VM to another
host or purge the local storage image if you do. If you come up with
general solution for this or ideas, let me know.

So far we are experimenting with using that local storage for CI slaves. In
this case it is not a problem for it not be permanent.

Anton.

On Wed, Nov 11, 2015 at 12:02 AM, Liam Curtis  wrote:

> Thanks for this info! Would be great if the temporary storage could
> ultimately be made to be persistent
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>


-- 
Anton Marchukov
Senior Software Engineer - RHEV CI - Red Hat
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] resources.ovirt.org Migration Notification

2015-11-06 Thread Anton Marchukov
Hello All.

Please note that today resources.ovirt.org has been migrated to another
data centre to solve long lasting capacity problems. As part of this
migration, all the content is finally back to one single place (as some of
you may noted, older content was moved away to resources01 at some point,
but that is now deprecated).

If you use the correct DNS for the host (that is resources.ovirt.org) than
you do not need to do anything with regard to this migration.

Here is the list of know issues and possible breakages at the moment.

1. Mirrors will have a delay with updates. This issue is being worked on.
2. In case you used linode01.ovirt.org to connect, please, change this to
resources.ovirt.org as it is no longer the same host.
3. resources01.phx.ovirt.org is depricated. It is not down, but all the
content is finally back to resources.ovirt.org

Shall you have any other problems, feel free to drop us a note at
in...@ovirt.org list so we can fix it asap.

Anton.

-- 
Anton Marchukov
Senior Software Engineer - RHEV CI - Red Hat
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Using both shared and local storage

2015-08-31 Thread Anton Marchukov
Hello Damir.


> If I try to create local storage, it works, but then host is in separate
> Data Center.
> If I try to add POSIX FS to node in existing Data Center, then all other
> hosts stop working, since they can't access that local storage (kind of
> expected).
>
> So, long story short - does shared and local storage mix and how?
>

We faced the same need - to use a local storage for our CI and managed to
use vdsm scratchpad hook for that:

http://www.ovirt.org/VDSM-Hooks/scratchpad

it implements something similar to AWS ephemeral storage, so it mounts a
local disk that does not survive over reboot. This might work for you. Of
cause it will not be possible to migrate such a VM and the hook will fails
an attempt to do migration.

Please note that there is a patch https://gerrit.ovirt.org/#/c/42573/
without which it may not work.  This patch was merged, but most like is not
yet at the ovirt version you use.

Let me know if you have other questions with regards to that hook usage. I
may also share my unit files for systemd to format and mount the local
storage provided by the hook.

Anton.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users