[ovirt-users] Fwd: Ovirt host GetGlusterVolumeHealInfoVDS failed events
Forwarding to oVirt users list. -- Forwarded message - From: Date: Wed, May 6, 2020 at 12:01 PM Subject: Ovirt host GetGlusterVolumeHealInfoVDS failed events To: Hi, We have a oVirt cluster with 4 hosts and hosted engine running on one of them (all the nodes provide the storage with GlusterFS) Currently there are 53 VMs running. The version of the oVirt-Engine is 4.2.8.2-1.el7 and GlusterFS is 3.12.15. >From past 1 week, we seem to have multiple events popping up on Ovirt-UI about the GetGlusterVolumeHealInfoVDS from all the nodes randomly like one ERROR event for every ~13minutes. Sample Event dashboard example: May 4, 2020, 2:32:14 PM - Status of host was set to Up. May 4, 2020, 2:32:11 PM - Manually synced the storage devices from host May 4, 2020, 2:31:55 PM - Host is not responding. Host cannot be fenced automatically because power management for the host is disabled. May 4, 2020, 2:31:55 PM - VDSM command GetGlusterVolumeHealInfoVDS failed: Message timeout which can be caused by communication issues May 4, 2020, 2:19:14 PM - Status of host was set to Up. May 4, 2020, 2:19:12 PM - Manually synced the storage devices from host May 4, 2020, 2:18:49 PM - Host is not responding. Host cannot be fenced automatically because power management for the host is disabled. May 4, 2020, 2:18:49 PM - VDSM command GetGlusterVolumeHealInfoVDS failed: Message timeout which can be caused by communication issues May 4, 2020, 2:05:55 PM - Status of host was set to Up. May 4, 2020, 2:05:54 PM - Manually synced the storage devices from host May 4, 2020, 2:05:35 PM - Host is not responding. Host cannot be fenced automatically because power management for the host is disabled. May 4, 2020, 2:05:35 PM - VDSM command GetGlusterVolumeHealInfoVDS failed: Message timeout which can be caused by communication issues May 4, 2020, 1:52:45 PM - Status of host was set to Up. May 4, 2020, 1:52:44 PM - Manually synced the storage devices from host May 4, 2020, 1:52:22 PM - Host is not responding. Host cannot be fenced automatically because power management for the host is disabled. May 4, 2020, 1:52:22 PM - VDSM command GetGlusterVolumeHealInfoVDS failed: Message timeout which can be caused by communication issues May 4, 2020, 1:39:11 PM - Status of host was set to Up. May 4, 2020, 1:39:11 PM - Manually synced the storage devices from host May 4, 2020, 1:39:11 PM - Host is not responding. Host cannot be fenced automatically because power management for the host is disabled. May 4, 2020, 1:39:11 PM - VDSM command GetGlusterVolumeHealInfoVDS failed: Message timeout which can be caused by communication issues May 4, 2020, 1:26:29 PM - Status of host was set to Up. May 4, 2020, 1:26:28 PM - Manually synced the storage devices from host May 4, 2020, 1:26:11 PM - Host is not responding. Host cannot be fenced automatically because power management for the host is disabled. May 4, 2020, 1:26:11 PM - VDSM command GetGlusterVolumeHealInfoVDS failed: Message timeout which can be caused by communication issues May 4, 2020, 1:13:10 PM - Status of host was set to Up. May 4, 2020, 1:13:08 PM - Manually synced the storage devices from host May 4, 2020, 1:12:51 PM - Host is not responding. Host cannot be fenced automatically because power management for the host is disabled. May 4, 2020, 1:12:51 PM - VDSM command GetGlusterVolumeHealInfoVDS failed: Message timeout which can be caused by communication issues and so on. When I look at the Compute > Hosts dashboard, I see the host status to be DOWN when VDSM event (GetGlusterVolumeHealInfoVDS failed) is popped and automatically the host status is set to UP within no time. FYI: when host status is DOWN, the VM's running on that host are not migrating and everything is running perfectly fine. This is happening all day. Is there something I can troubleshoot? Appreciate your comments. ___ Infra mailing list -- in...@ovirt.org To unsubscribe send an email to infra-le...@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/in...@ovirt.org/message/GNE3QC7GLEER4ZPHGP3H6M27DPSKCQO3/ -- Anton Marchukov Associate Manager - RHV DevOps - Red Hat ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/RGUTF3DUTK4XTG7N4MM3MM3LUAFIIPJE/
[ovirt-users] Fwd: Install of new ovirt baremetal system 4.3.9
Forwarding to oVirt users list since it looks to be better suited there. -- Forwarded message - From: kelley bryan Date: Wed, May 6, 2020 at 12:02 PM Subject: Install of new ovirt baremetal system 4.3.9 To: Engine deployment fails near end: [ ERROR ] fatal: [localhost]: FAILED! => {"changed": true, "cmd": "set -euo pipefail && firewall-cmd --get-active-zones | grep -v \"^\\s*interfaces\"", "delta": "0:00:00.352904", "end": "2020-05-05 22:28:01.561606", "msg": "non-zero return code", "rc": 1, "start": "2020-05-05 22:28:01.208702", "stderr": "", "stderr_lines": [], "stdout": "", "stdout_lines": []} The system may not be provisioned according to the playbook results: please check the logs for the issue, fix accordingly or re-deploy from scratch.\n"} were does ovirt store logs? ___ Infra mailing list -- in...@ovirt.org To unsubscribe send an email to infra-le...@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/in...@ovirt.org/message/XALRUKVRYFC2NFN42STINRAP3W6RRIKU/ -- Anton Marchukov Associate Manager - RHV DevOps - Red Hat ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/CVPOWRYJKS4QWDI3M5EPKCEXAEBNZSEW/
[ovirt-users] Re: Mirror oVirt content
Hello Adrian. Thanks for your help with mirroring. I have forwarded your messages to the ticketing system our infra team uses. You should get back two links: one for the ticket about our mirroring docs being outdated (we need to fix them) and the second one is about your mirroring request. Our infra team will pick them up from there. On Tue, Apr 21, 2020 at 2:42 PM wrote: > Hi Barak, > Thanks for the info, we have opened a request/email using that link since > a few months ago, however no one has reached out back to us. > > Anything else we can do from our end? > > thanks, > > Adrian > ___ > Users mailing list -- users@ovirt.org > To unsubscribe send an email to users-le...@ovirt.org > Privacy Statement: https://www.ovirt.org/privacy-policy.html > oVirt Code of Conduct: > https://www.ovirt.org/community/about/community-guidelines/ > List Archives: > https://lists.ovirt.org/archives/list/users@ovirt.org/message/AMBMC2U745ZOIVMQ6SAOU5PVODCVHZPE/ > -- Anton Marchukov Associate Manager - RHV DevOps - Red Hat ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/DLC2LWEDOQOMTXHWT263BPKIRIIUR5G6/
[ovirt-users] Fwd: Unable to Upgrade
Forwarding to the users list. > Begin forwarded message: > > From: "Akshita Jain" > Subject: Unable to Upgrade > Date: 1 October 2019 at 11:12:58 CEST > To: in...@ovirt.org > > After upgrading oVirt 4.3.4 to 4.3.6, the gluster is also upgrading from 5.6 > to 6.5. But as soon as it upgrades gluster peer status shows disconnected. > What is the correct method to upgrade oVirt with gluster HCI environment? > ___ > Infra mailing list -- in...@ovirt.org > To unsubscribe send an email to infra-le...@ovirt.org > Privacy Statement: https://www.ovirt.org/site/privacy-policy/ > oVirt Code of Conduct: > https://www.ovirt.org/community/about/community-guidelines/ > List Archives: > https://lists.ovirt.org/archives/list/in...@ovirt.org/message/24D6NKVLMYQA3LI2GBXJFMOHA3U42KHS/ -- Anton Marchukov Associate Manager - RHV DevOps - Red Hat ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/YO2RYKFDCSPQ7EWOVXVE6LIO7GMY36SA/
[ovirt-users] Re: Outage Notification: Jenkins and Resources Are Not Accessible
Hello All. The issue was related to the upgrade of the networking switch. It is fixed and the services are back to normal. Please let us know if you see any further problems. > On 26 Sep 2019, at 09:19, Anton Marchukov wrote: > > Hello All. > > Please note that oVirt services hosted at PHX datacenter are not accessible. > Main services running there are: > > jenkins.ovirt.org > resources.ovirt.org > > This looks to be network related and we are working with the relevant teams > to resolve it. > > Anton. > > -- > Anton Marchukov > Associate Manager - RHV DevOps - Red Hat > > > > > > -- Anton Marchukov Associate Manager - RHV DevOps - Red Hat ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/ERRULHTW4N3EMCKB42CBVDHRCKB67X6L/
[ovirt-users] Outage Notification: Jenkins and Resources Are Not Accessible
Hello All. Please note that oVirt services hosted at PHX datacenter are not accessible. Main services running there are: jenkins.ovirt.org resources.ovirt.org This looks to be network related and we are working with the relevant teams to resolve it. Anton. -- Anton Marchukov Associate Manager - RHV DevOps - Red Hat ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/CSDEMXX5XYAP4KYLI4SB35ZM2D5KLAUC/
[ovirt-users] Work-Around for Access Problem to resources.ovirt.org and jenkins.ovirt.org
Hello All. Please note that ovirt.org got HSTS setting enabled. The way this change was rolled was not authorised by us. The setting contains “includeSubDomains” parameter that ask browser to access all resources inside *.ovirt.org using https://. The problem is that not all resources on *.ovirt.org are https-enabled. And this will prevent you from being able to access http://resources.ovirt.org http://jenkins.ovirt.org This setting is cached in your browser and then applied with no exceptions even if you explicitly specify http:// the browser will rewrite to https and then connection to above mentioned resources will fail. The only immediate work-around for you is to clear the HSTS flag in your browser, please find the instructions for Firefox and Chrome [1]. Please note that you will get that setting back by visiting ovirt.org website until we clear it from there. We are currently working on this setting to be soften or disabled till all resources are https enabled that is also in progress. This is only related to access via the web browser. Accessing using yum or automated scripts should work fine as they usually do not observe and cache HSTS. Anton. [1] https://www.thesslstore.com/blog/clear-hsts-settings-chrome-firefox/ ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org
Re: [ovirt-users] Update on oVirt Mirroring Status
Hello All. We have got the issue solved. I am readding mirrors back to the mirror list as soon as I get confirmation it is working for each mirror. At the moment we already have few mirrors back there. Anton. On Wed, Nov 11, 2015 at 10:45 AM, Anton Marchukov wrote: > Hello All. > > Following previous announcement, please note that we still face a problem > with getting updates to our mirrors. This is firewalling problem and we are > working with IT on resolving it. > > I have temporarily updated mirror list to include only resources.ovirt.org > there. It will be changed back as soon as the mirroring is restored. > > Please note that if you are not using the mirrorlist and include URLs > directly and you are interested in latest snapshoots, than you may need to > use http://resources.ovirt.org/pub/ovirt-@VERSION@/rpm/@DIST@/ directly. > > Will reply on this message once I get an ETA. Please contact > in...@ovirt.org in case you have any questions. > > Anton. > > -- > Anton Marchukov > Senior Software Engineer - RHEV CI - Red Hat > > -- Anton Marchukov Senior Software Engineer - RHEV CI - Red Hat ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
[ovirt-users] Update on oVirt Mirroring Status
Hello All. Following previous announcement, please note that we still face a problem with getting updates to our mirrors. This is firewalling problem and we are working with IT on resolving it. I have temporarily updated mirror list to include only resources.ovirt.org there. It will be changed back as soon as the mirroring is restored. Please note that if you are not using the mirrorlist and include URLs directly and you are interested in latest snapshoots, than you may need to use http://resources.ovirt.org/pub/ovirt-@VERSION@/rpm/@DIST@/ directly. Will reply on this message once I get an ETA. Please contact in...@ovirt.org in case you have any questions. Anton. -- Anton Marchukov Senior Software Engineer - RHEV CI - Red Hat ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] Wishlist - Mix gluster and local storage in same data center
Hello Liam. Well. If you have a look on hook's code than you can make it persistent by removing the code that recreates the storage when VM is started. However than you have to make sure that you either do not relocate VM to another host or purge the local storage image if you do. If you come up with general solution for this or ideas, let me know. So far we are experimenting with using that local storage for CI slaves. In this case it is not a problem for it not be permanent. Anton. On Wed, Nov 11, 2015 at 12:02 AM, Liam Curtis wrote: > Thanks for this info! Would be great if the temporary storage could > ultimately be made to be persistent > > ___ > Users mailing list > Users@ovirt.org > http://lists.ovirt.org/mailman/listinfo/users > > -- Anton Marchukov Senior Software Engineer - RHEV CI - Red Hat ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
[ovirt-users] resources.ovirt.org Migration Notification
Hello All. Please note that today resources.ovirt.org has been migrated to another data centre to solve long lasting capacity problems. As part of this migration, all the content is finally back to one single place (as some of you may noted, older content was moved away to resources01 at some point, but that is now deprecated). If you use the correct DNS for the host (that is resources.ovirt.org) than you do not need to do anything with regard to this migration. Here is the list of know issues and possible breakages at the moment. 1. Mirrors will have a delay with updates. This issue is being worked on. 2. In case you used linode01.ovirt.org to connect, please, change this to resources.ovirt.org as it is no longer the same host. 3. resources01.phx.ovirt.org is depricated. It is not down, but all the content is finally back to resources.ovirt.org Shall you have any other problems, feel free to drop us a note at in...@ovirt.org list so we can fix it asap. Anton. -- Anton Marchukov Senior Software Engineer - RHEV CI - Red Hat ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] Using both shared and local storage
Hello Damir. > If I try to create local storage, it works, but then host is in separate > Data Center. > If I try to add POSIX FS to node in existing Data Center, then all other > hosts stop working, since they can't access that local storage (kind of > expected). > > So, long story short - does shared and local storage mix and how? > We faced the same need - to use a local storage for our CI and managed to use vdsm scratchpad hook for that: http://www.ovirt.org/VDSM-Hooks/scratchpad it implements something similar to AWS ephemeral storage, so it mounts a local disk that does not survive over reboot. This might work for you. Of cause it will not be possible to migrate such a VM and the hook will fails an attempt to do migration. Please note that there is a patch https://gerrit.ovirt.org/#/c/42573/ without which it may not work. This patch was merged, but most like is not yet at the ovirt version you use. Let me know if you have other questions with regards to that hook usage. I may also share my unit files for systemd to format and mount the local storage provided by the hook. Anton. ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users