Re: [ovirt-users] live migration
Hi,

Every host has a different hostname, and the hostnames have not changed since the reinstall. Maybe node1 got a different UUID than before.

Does oVirt have an out-of-the-box live migration feature yet?

Thanks,
Tibor

On Aug 17, 2015, at 10:07, Matthew Lagoe matthew.la...@subrigo.net wrote:

> Are all the hostnames of the machines different? I've had it before where migrations fail because the hosts have the same hostname, or the same UUID for that matter.
>
> From: users-boun...@ovirt.org [mailto:users-boun...@ovirt.org] On Behalf Of Omer Frenkel
> Sent: Monday, August 17, 2015 01:02 AM
> To: Demeter Tibor
> Cc: users
> Subject: Re: [ovirt-users] live migration
>
>> On Sun, Aug 16, 2015 at 10:31 PM, Demeter Tibor tdeme...@itsmart.hu wrote:
>>
>>> Hi,
>>>
>>> I reinstalled one of my nodes (node1) because I had to replace my HDDs. I installed CentOS 6.6 minimal, but the procedure for re-adding the node installed newer qemu-kvm-rhev packages.
>>>
>>> Since the reinstall I can run VMs on this node and I can live migrate from this node to the others, but not back.
>>>
>>> I remember that maybe a year ago it was required to install Red Hat's version of the qemu-kvm-rhev package for this feature. Is that still necessary?
>>>
>>> My versions:
>>>
>>> node0 KVM: 0.12.1.2 - 2.415.el6_5.14, LIBVIRT: libvirt-0.10.2-46.el6_6.1
>>> node1 KVM: 0.12.1.2 - 2.448.el6_6.4, LIBVIRT: libvirt-0.10.2-54.el6
>>> node2 KVM: 0.12.1.2 - 2.448.el6_6.4, LIBVIRT: libvirt-0.10.2-46.el6_6.1
>>>
>>> Can I upgrade these hosts manually? I haven't restarted my hosts because node0 and node2 form a gluster replica.
>>>
>>> Thanks in advance,
>>> Tibor
>>
>> Not sure about the versions, but what is the error you see in the source host's vdsm.log when the migration fails?

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
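To make the version differences in the quoted message easier to see, here is a minimal Python sketch that groups the three hosts by the KVM and libvirt versions Tibor listed. The dictionary layout and the function name are illustrative assumptions; the version strings are copied from the thread. The sketch only shows which hosts differ. It does not establish whether the mismatch is what breaks migration towards node1.

```python
# Version strings copied from the quoted message; the data layout is hypothetical.
VERSIONS = {
    "node0": {"kvm": "0.12.1.2 - 2.415.el6_5.14", "libvirt": "libvirt-0.10.2-46.el6_6.1"},
    "node1": {"kvm": "0.12.1.2 - 2.448.el6_6.4", "libvirt": "libvirt-0.10.2-54.el6"},
    "node2": {"kvm": "0.12.1.2 - 2.448.el6_6.4", "libvirt": "libvirt-0.10.2-46.el6_6.1"},
}


def group_by_version(versions, component):
    """Map each distinct version of `component` to the hosts that run it."""
    groups = {}
    for host, packages in versions.items():
        groups.setdefault(packages[component], []).append(host)
    return groups


# Print one line per distinct version so outliers stand out.
for component in ("kvm", "libvirt"):
    for version, hosts in group_by_version(VERSIONS, component).items():
        print(f"{component:8} {version:28} {', '.join(hosts)}")
```

With this data, node0 is the only host on the older KVM build and node1 is the only host on the newer libvirt build, so no two hosts run exactly the same pair.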
Re: [ovirt-users] error adding node.
Hi Sandro,

Yes, we are indeed running an NFS data domain over gluster. But what do you mean by no-replica or replica 3?

From: Sandro Bonazzola [sbona...@redhat.com]
Sent: Monday, August 17, 2015 11:08 AM
To: Kruijf, Erik-Jan de
Cc: users@ovirt.org
Subject: Re: [ovirt-users] error adding node.

> So you're running an NFS data domain over gluster, not just gluster, right? In this case it may work, provided you use no-replica or replica 3.
Re: [ovirt-users] vlan-tagging on non-tagged network
Hi again,

I just found the reason for the loss of the packets. Either oVirt or CentOS installs some ebtables rules in the nat table. These rules filter ARP packets coming from vnet interfaces. Flushing the table solved my problems.

However, I have no idea what the purpose of these rules is, so there might be unwanted side effects to just flushing the rules away. But for now I am happy!

Thanks to Ido for your help!

Regards,
Felix

On 17.08.2015 at 13:20, Felix Pepinghege wrote:

> Hi Ido, hi everybody,
>
> Sorry that I kept you waiting for two months; I only just found the time to go back to this problem.
>
> You were completely right with your guess. The Ethernet frames do appear on the vnet interface, but not on the bridge. The dropped counter seems to be independent of these losses, though.
>
> However, while this tells me *where* the problem is, I still don't know *what* the problem is. I've done some research, but couldn't find anything particularly helpful.
>
> An interesting point may be that this problem is unidirectional. That is, the bridge happily passes VLAN-tagged frames from the Ethernet device to the vnet, but not the other way around. Untagged Ethernet frames make their way through the bridge, no matter where they come from.
>
> The guest OS is an up-to-date Debian Jessie, which should not matter, though, as the frames get lost on the doorstep of the bridge on the host.
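Before flushing the nat table, it can help to see which rules are attached to a guest's tap device. The Python sketch below parses a saved listing in the layout printed by `ebtables -t nat -L` and returns the rules that reference a given interface. The sample listing, the chain name `libvirt-I-vnet0` and the function names are illustrative assumptions, not output from Felix's host. One possible source of such rules, not confirmed in the thread, is libvirt's network filters.

```python
import re

# Hypothetical sample in the layout of `ebtables -t nat -L`; not taken from the thread.
SAMPLE_LISTING = """\
Bridge table: nat

Bridge chain: PREROUTING, entries: 1, policy: ACCEPT
-i vnet0 -j libvirt-I-vnet0

Bridge chain: POSTROUTING, entries: 0, policy: ACCEPT

Bridge chain: libvirt-I-vnet0, entries: 2, policy: ACCEPT
-p ARP -j ACCEPT
-j DROP
"""

CHAIN_HEADER = re.compile(r"^Bridge chain: (?P<name>[^,]+), entries: \d+, policy: \w+$")


def parse_chains(listing):
    """Return {chain name: [rule lines]} in the order the chains appear."""
    chains = {}
    current = None
    for raw in listing.splitlines():
        line = raw.strip()
        match = CHAIN_HEADER.match(line)
        if match:
            current = match.group("name")
            chains[current] = []
        elif line and current is not None:
            chains[current].append(line)
    return chains


def rules_for_interface(chains, interface):
    """Return (chain, rule) pairs whose -i or -o argument equals `interface`."""
    hits = []
    for name, rules in chains.items():
        for rule in rules:
            tokens = rule.split()
            pairs = zip(tokens, tokens[1:])
            if any(flag in ("-i", "-o") and arg == interface for flag, arg in pairs):
                hits.append((name, rule))
    return hits


chains = parse_chains(SAMPLE_LISTING)
for chain, rule in rules_for_interface(chains, "vnet0"):
    print(f"{chain}: {rule}")
```

Following the jump target of each hit (here the hypothetical `libvirt-I-vnet0` chain) shows which frame types are accepted and which fall through to a drop rule.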
Re: [ovirt-users] vlan-tagging on non-tagged network
Hi Ido, hi everybody,

Sorry that I kept you waiting for two months; I only just found the time to go back to this problem.

You were completely right with your guess. The Ethernet frames do appear on the vnet interface, but not on the bridge. The dropped counter seems to be independent of these losses, though.

However, while this tells me *where* the problem is, I still don't know *what* the problem is. I've done some research, but couldn't find anything particularly helpful.

An interesting point may be that this problem is unidirectional. That is, the bridge happily passes VLAN-tagged frames from the Ethernet device to the vnet, but not the other way around. Untagged Ethernet frames make their way through the bridge, no matter where they come from.

The vlan module is loaded. As for the version questions:

    # cat /etc/centos-release ; uname -s -v -r
    CentOS Linux release 7.1.1503 (Core)
    Linux 3.10.0-229.7.2.el7.x86_64 #1 SMP Tue Jun 23 22:06:11 UTC 2015

The guest OS is an up-to-date Debian Jessie, which should not matter, though, as the frames get lost on the doorstep of the bridge on the host.

Again, any suggestions are much appreciated!

Regards,
Felix

On 16.06.2015 at 08:27, Ido Barkan wrote:

> Hey Felix.
>
> IIUC your frames are dropped by the bridge. oVirt uses Linux bridges to connect virtual machines to 'networks'. The guest connects to the bridge using a tap device, which is usually called 'vnet<number>'.
>
> So, just to verify, can you please tcpdump both on the bridge device and on the tap device? The bridge can be quite noisy, so I suggest filtering traffic using the guest's MAC address. I am not sure what protocol you use for tunneling, but applying a filter similar to this one should do the job:
>
>     tcpdump -n -i vnet0 -vvv -s 1500 'udp[38:4]=0x001a4aaeec8e'
>
> My guess is that you will observe traffic on the tap device, but not on the bridge.
>
> You didn't specify which CentOS version you use, but I do remember seeing people complaining about Linux bridges discarding their tagged frames. You can (maybe) also observe the 'dropped' counter increasing on the bridge by running:
>
>     ip -s link show dev trunk
>
> There were a few bugs on RHEL 6/7 about this; specifically I remember
> https://bugzilla.redhat.com/show_bug.cgi?id=1174291
> and
> https://bugzilla.redhat.com/show_bug.cgi?id=1200275#c20
>
> Also, is the vlan module loaded on your host?
>
>     lsmod | grep 8021q
>
> Thanks,
> Ido
>
> ----- Original Message -----
>> From: Felix Pepinghege pepingh...@ira.uka.de
>> To: Users@ovirt.org
>> Sent: Monday, June 15, 2015 11:33:39 AM
>> Subject: [ovirt-users] vlan-tagging on non-tagged network
>>
>> Hi everybody!
>>
>> I am experiencing a behaviour of oVirt, and I don't know whether it is expected or not. My setup is as follows:
>>
>> A virtual machine has a logical network attached to it, which is configured without VLAN tagging and is named 'trunk'. The VM is running an OpenVPN server. It is a patched OpenVPN version that includes VLAN tagging; that is, OpenVPN clients get a VLAN tag. This should not really be an issue, but it should answer the "why do you want to do this in the first place" questions.
>>
>> Anyhow, effectively, the VM simply puts VLAN-tagged Ethernet frames on the virtual network. These frames, however, never make it to the host's network bridge, which represents the logical network.
>>
>> My observations are:
>>
>> According to tcpdump, the VLAN-tagged packets arrive at the eth1 interface inside the VM (which *is* the correct interface). Again according to tcpdump, these packets never arrive at the corresponding network bridge (i.e., the interface 'trunk') on the host.
>>
>> I know that the setup itself is feasible with KVM; I have it working on a Proxmox machine. Therefore, my conclusion is that oVirt doesn't like VLAN-tagged Ethernet frames on non-tagged logical networks and somehow filters them out, though I don't really see at what level that would happen (handling the Ethernet frames should be a concern of KVM/QEMU/Linux only, once oVirt has started the VM). So this problem could be a CentOS issue, but I really don't see why CentOS should act differently than Debian does (Proxmox is Debian-based).
>>
>> Is this a known/wanted/expected behaviour of oVirt, and can I somehow prevent or work around it?
>>
>> Any help is much appreciated! Of course I am happy to provide more information if that helps you help me :)
>>
>> Regards,
>> Felix
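Ido's suggestion of capturing on both the tap device and the bridge can be sketched as follows. This Python helper only builds the two tcpdump command lines; it does not run them. The filter used here (`ether src <mac> and vlan`) is a simpler alternative to the byte-offset filter in the quoted message and is an assumption about what is useful, not something proposed in the thread. The MAC address is the one embedded in Ido's example filter, and the interface names `vnet0` and `trunk` come from the messages above.

```python
import shlex


def capture_command(interface, guest_mac, vlan_only=True):
    """Build a tcpdump argv that shows frames sent by `guest_mac` on `interface`.

    -e prints the link-level header, so the 802.1Q tag is visible in the output.
    """
    bpf = f"ether src {guest_mac}"
    if vlan_only:
        bpf += " and vlan"  # match only 802.1Q-tagged frames
    return ["tcpdump", "-e", "-n", "-i", interface, bpf]


# One capture on the guest's tap device, one on the bridge of the logical network.
for interface in ("vnet0", "trunk"):
    print(shlex.join(capture_command(interface, "00:1a:4a:ae:ec:8e")))
```

Running both captures at the same time while the guest sends tagged traffic should show whether frames seen on the tap device also reach the bridge.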
Re: [ovirt-users] error adding node.
On Mon, Aug 17, 2015 at 10:30 AM, Kruijf, Erik-Jan de erik-jan.de.kru...@cgi.com wrote:

> Hi,
>
> I hope I can explain this properly ;) (my English isn't that good).
>
> This is the situation:
>
> 1. We did a clean install of CentOS 7 on the host.
> 2. We created a glusterfs share on the host for the engine.

Please note that this is a hyper-converged setup and it's not supposed to work even in 3.6 yet.

> 3. We ran hosted-engine --deploy and followed the wizard.
> 4. We got the message to open the VNC connection to install the engine (we use CentOS 6.7 for the engine).
> 5. We continued the wizard and got the new VNC connection to install the engine.
> 6. When the engine is running, the wizard tries to add the host to the engine, and then we get the error that it cannot connect to the engine (we did get the health OK message). The hosts file and DNS servers are working correctly.

Please don't use gluster storage for HE on the same host running Hosted Engine. This is not a supported configuration.

> 7. We tried to add the host to the engine manually, because once the wizard stops at that point it cannot try again, since there is already a VM running.
> 8. When we try to add the host via the engine, we get the same error message, the one saying that it cannot connect to the host. When I try to SSH from the console on the engine to the host, it works. What I find strange is that it gives the message almost immediately (within 3 seconds).
>
> I hope this explains the steps we took.
>
> Kind regards,
> Erik-Jan de Kruijf
>
> From: Sandro Bonazzola [sbona...@redhat.com]
> Sent: Monday, August 17, 2015 10:18 AM
> To: Kruijf, Erik-Jan de
> Cc: users@ovirt.org
> Subject: Re: [ovirt-users] error adding node.
>
>> On Thu, Aug 13, 2015 at 2:40 PM, Kruijf, Erik-Jan de erik-jan.de.kru...@cgi.com wrote:
>>
>>> Hi,
>>>
>>> I am trying to add the first node to a self-hosted engine. When I click OK, I immediately get the error that it cannot connect to the host. But if I try to SSH to the host from the engine, it works.
>>>
>>> Can someone please point me to a solution? If any logs are needed, please let me know.
>>
>> Just to clarify, are you trying to add the host running the HE VM using the Web UI? Or are you trying to add the first non-HE related node?
>>
>>> Kind regards,
>>> Erik-jan de Kruijf

-- 
Sandro Bonazzola
Better technology. Faster innovation. Powered by community collaboration.
See how it works at redhat.com
Re: [ovirt-users] problem with iSCSI/multipath.
----- Original Message -----
> From: Giorgio Bersano giorgio.bers...@gmail.com
> To: users@ovirt.org
> Sent: Monday, July 27, 2015 1:19:23 PM
> Subject: [ovirt-users] problem with iSCSI/multipath.
>
> Hi all.
>
> We have an oVirt cluster in production, happily running since the beginning of 2014. It started as 3.3 beta and is now at version 3.4.4-1.el6. Shared storage is provided by an HP P2000 G3 iSCSI MSA.
>
> The storage server is fully redundant (2 controllers, dual-port disks, 4 iSCSI connections per controller), and so is the connectivity (two switches, multiple Ethernet cards per server).
>
> From now on, let's only talk about iSCSI connectivity.
>
> The two oldest servers have 2 NICs each; they were configured by hand, with routes set so that every iSCSI target can be reached from every NIC.
>
> On the new server we installed oVirt 3.5 to have a look at the network configuration provided by oVirt. In Data Center - iSCSI Multipathing we defined an iSCSI bond binding together 3 of the server's NICs and the 8 NICs of the MSA. The result is a system that has been functioning for months.
>
> Recently we had to upgrade the storage firmware. This procedure uploads the firmware to one of the MSA controllers and then reboots it. If that succeeds, it is repeated on the other controller. There is an impact on I/O performance, but there should be no problems, as every volume on the MSA remains visible on other paths.
>
> Well, that's the theory. On the two hand-configured hosts we had no significant problems. On the 3.5 host, VMs started to migrate due to storage problems; then the situation got worse, and it took more than an hour to bring the system back to a good operating level.
>
> I am inclined to believe that the culprit is the server's routing table. It seems to me that the one generated by oVirt is too simplistic and prone to problems in case of connectivity loss (as in our situation, or when you have to reboot one of the switches).
>
> Does anyone on this list have strong experience with a similar setup?
>
> I have included some background information below. I'm available to provide anything useful to further investigate the case.
>
> TIA,
> Giorgio.

Hi Giorgio,

There were some issues related to iSCSI multipathing that were already solved in versions later than 3.4, AFAIK. I'm adding Sergey and Maor (the feature owners) to respond on whether related fixes were made.

Thanks,
Liron.

> --- context information ---
>
> oVirt Compatibility Version: 3.4
>
> two FUJITSU PRIMERGY RX300 S5 hosts
> CPU: Intel(R) Xeon(R) E5504 @ 2.00GHz / Intel Nehalem Family
> OS Version: RHEL - 6 - 6.el6.centos.12.2
> Kernel Version: 2.6.32 - 504.16.2.el6.x86_64
> KVM Version: 0.12.1.2 - 2.448.el6_6.2
> LIBVIRT Version: libvirt-0.10.2-46.el6_6.6
> VDSM Version: vdsm-4.14.17-0.el6
> RAM: 40GB
>
> mom-0.4.3-1.el6.noarch.rpm
> ovirt-release34-1.0.3-1.noarch.rpm
> qemu-img-rhev-0.12.1.2-2.448.el6_6.2.x86_64.rpm
> qemu-kvm-rhev-0.12.1.2-2.448.el6_6.2.x86_64.rpm
> qemu-kvm-rhev-tools-0.12.1.2-2.448.el6_6.2.x86_64.rpm
> vdsm-4.14.17-0.el6.x86_64.rpm
> vdsm-cli-4.14.17-0.el6.noarch.rpm
> vdsm-hook-hostusb-4.14.17-0.el6.noarch.rpm
> vdsm-hook-macspoof-4.14.17-0.el6.noarch.rpm
> vdsm-python-4.14.17-0.el6.x86_64.rpm
> vdsm-python-zombiereaper-4.14.17-0.el6.noarch.rpm
> vdsm-xmlrpc-4.14.17-0.el6.noarch.rpm
>
> # ip route list table all |grep 192.168.126.
> 192.168.126.87 dev eth4 table 4 proto kernel scope link src 192.168.126.65
> 192.168.126.86 dev eth4 table 4 proto kernel scope link src 192.168.126.65
> 192.168.126.81 dev eth4 table 4 proto kernel scope link src 192.168.126.65
> 192.168.126.80 dev eth4 table 4 proto kernel scope link src 192.168.126.65
> 192.168.126.77 dev eth4 table 4 proto kernel scope link src 192.168.126.65
> 192.168.126.0/24 dev eth4 table 4 proto kernel scope link src 192.168.126.65
> 192.168.126.0/24 dev eth3 proto kernel scope link src 192.168.126.64
> 192.168.126.0/24 dev eth4 proto kernel scope link src 192.168.126.65
> 192.168.126.85 dev eth3 table 3 proto kernel scope link src 192.168.126.64
> 192.168.126.84 dev eth3 table 3 proto kernel scope link src 192.168.126.64
> 192.168.126.83 dev eth3 table 3 proto kernel scope link src 192.168.126.64
> 192.168.126.82 dev eth3 table 3 proto kernel scope link src 192.168.126.64
> 192.168.126.76 dev eth3 table 3 proto kernel scope link src 192.168.126.64
> 192.168.126.0/24 dev eth3 table 3 proto kernel scope link src 192.168.126.64
> broadcast 192.168.126.0 dev eth3 table local proto kernel scope link src 192.168.126.64
> broadcast 192.168.126.0 dev eth4 table local proto kernel scope link src 192.168.126.65
> local 192.168.126.65 dev eth4 table local proto kernel scope host src 192.168.126.65
> local 192.168.126.64 dev eth3 table local proto kernel scope host src 192.168.126.64
> broadcast 192.168.126.255 dev eth3 table local proto kernel scope link src 192.168.126.64
> broadcast 192.168.126.255
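The routing table quoted above is easier to reason about when the host routes are grouped by routing table and NIC. The following Python sketch does that for a subset of the lines from Giorgio's output. The parser and its function names are illustrative assumptions, and it handles only the simple `key value` layout seen in this listing, not every form `ip route` can print.

```python
from collections import defaultdict

# A subset of the routes quoted in the thread.
ROUTES = """\
192.168.126.87 dev eth4 table 4 proto kernel scope link src 192.168.126.65
192.168.126.86 dev eth4 table 4 proto kernel scope link src 192.168.126.65
192.168.126.0/24 dev eth4 table 4 proto kernel scope link src 192.168.126.65
192.168.126.0/24 dev eth3 proto kernel scope link src 192.168.126.64
192.168.126.85 dev eth3 table 3 proto kernel scope link src 192.168.126.64
192.168.126.76 dev eth3 table 3 proto kernel scope link src 192.168.126.64
local 192.168.126.64 dev eth3 table local proto kernel scope host src 192.168.126.64
"""

ROUTE_TYPES = {"unicast", "local", "broadcast"}


def parse_route(line):
    """Parse one `ip route` line of the form '[type] dest key value key value ...'."""
    tokens = line.split()
    route_type = tokens.pop(0) if tokens[0] in ROUTE_TYPES else "unicast"
    route = {"type": route_type, "dest": tokens[0], "table": "main"}
    rest = tokens[1:]
    for key, value in zip(rest[::2], rest[1::2]):
        route[key] = value
    return route


def host_routes_by_table(text):
    """Group unicast host routes (no prefix length) by (table, device)."""
    grouped = defaultdict(list)
    for line in text.splitlines():
        if not line.strip():
            continue
        route = parse_route(line)
        if route["type"] == "unicast" and "/" not in route["dest"]:
            grouped[(route["table"], route["dev"])].append(route["dest"])
    return dict(grouped)


for (table, dev), targets in host_routes_by_table(ROUTES).items():
    print(f"table {table} via {dev}: {', '.join(targets)}")
```

Applied to the full listing, this shows that each NIC has its own table with host routes to a distinct set of target addresses, which is the per-NIC layout Giorgio describes for the hand-configured hosts.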
Re: [ovirt-users] error adding node.
Hi Sandro,

Thank you for the explanation. We used this manual:
http://community.redhat.com/blog/2014/10/up-and-running-with-ovirt-3-5/

That's why we created a local glusterfs storage for the HE. I find it strange that we can start the engine with the local glusterfs. Or is this just dumb luck?

Regards,
Erik

From: Sandro Bonazzola [sbona...@redhat.com]
Sent: Monday, August 17, 2015 10:57 AM
To: Kruijf, Erik-Jan de
Cc: users@ovirt.org
Subject: Re: [ovirt-users] error adding node.

>> 2. We created a glusterfs share on the host for the engine.
>
> Please note that this is a hyper-converged setup and it's not supposed to work even in 3.6 yet.
>
>> 6. When the engine is running, the wizard tries to add the host to the engine, and then we get the error that it cannot connect to the engine (we did get the health OK message). The hosts file and DNS servers are working correctly.
>
> Please don't use gluster storage for HE on the same host running Hosted Engine. This is not a supported configuration.
Re: [ovirt-users] error adding node.
On Mon, Aug 17, 2015 at 11:03 AM, Kruijf, Erik-Jan de erik-jan.de.kru...@cgi.com wrote:

> Hi Sandro,
>
> Thank you for the explanation. We used this manual:
> http://community.redhat.com/blog/2014/10/up-and-running-with-ovirt-3-5/
>
> That's why we created a local glusterfs storage for the HE. I find it strange that we can start the engine with the local glusterfs. Or is this just dumb luck?

So you're running an NFS data domain over gluster, not just gluster, right? In this case it may work, provided you use no-replica or replica 3.

> Regards,
> Erik
Re: [ovirt-users] error adding node.
We have now tried without gluster, but even then it cannot add the host. Could it be that oVirt doesn't work with CentOS 7.1?

From: users-boun...@ovirt.org [users-boun...@ovirt.org] on behalf of Kruijf, Erik-Jan de [erik-jan.de.kru...@cgi.com]
Sent: Monday, August 17, 2015 11:12 AM
To: Sandro Bonazzola
Cc: users@ovirt.org
Subject: Re: [ovirt-users] error adding node.

> Hi Sandro,
>
> Yes, we are indeed running an NFS data domain over gluster. But what do you mean by no-replica or replica 3?
Re: [ovirt-users] upgrade issues
From the engine.log: the gluster volumes are queried on host node1, which returns no volumes.

1. Your cluster r710cluster1: which nodes are added to it? node1 alone, or node0 and node2 as well?
2. Was the attached supervdsm.log from node1?
3. Which node was the gluster volume info output below taken from? What is the output of gluster peer status and gluster volume info on node1?

On 08/17/2015 12:49 PM, Demeter Tibor wrote:

Dear Sahina,

Thank you for your reply.

Volume Name: g2sata
Type: Replicate
Volume ID: 49d76fc8-853e-4c7d-82a5-b12ec98dadd8
Status: Started
Number of Bricks: 1 x 2 = 2
Transport-type: tcp
Bricks:
Brick1: 172.16.0.10:/data/sata/brick2
Brick2: 172.16.0.12:/data/sata/brick2
Options Reconfigured:
nfs.disable: on
user.cifs: disable
auth.allow: 172.16.*
storage.owner-uid: 36
storage.owner-gid: 36

Volume Name: g4sata
Type: Replicate
Volume ID: f26ed231-c951-431f-8a2f-e8818b58cfb4
Status: Started
Number of Bricks: 1 x 2 = 2
Transport-type: tcp
Bricks:
Brick1: 172.16.0.10:/data/sata/iso
Brick2: 172.16.0.12:/data/sata/iso
Options Reconfigured:
nfs.disable: off
user.cifs: disable
auth.allow: 172.16.0.*
storage.owner-uid: 36
storage.owner-gid: 36

Also, I have attached the logs.

Thanks in advance,
Tibor

On 17 Aug 2015, at 8:40, Sahina Bose sab...@redhat.com wrote:

Please provide the output of the gluster volume info command, vdsm.log and engine.log. There could be a mismatch between the node information in the engine database and gluster. One possible reason is that the gluster server UUID changed on the node, and we will need to see why.

On 08/17/2015 12:35 AM, Demeter Tibor wrote:

Hi All,

I have to upgrade oVirt 3.5.0 to 3.5.3. We have a 3-node system with a gluster replica between 2 of these 3 servers. I had a gluster volume between node0 and node2, but I wanted to create a new volume between node1 and node2. It didn't work; it completely killed my node1, because glusterd did not start. I always got the error: gluster peer rejected (related to node1).

I followed this article http://www.gluster.org/community/documentation/index.php/Resolving_Peer_Rejected and it helped: my gluster service works again, but oVirt reported these errors:

Detected deletion of volume g2sata on cluster r710cluster1, and deleted it from engine DB.
Detected deletion of volume g4sata on cluster r710cluster1, and deleted it from engine DB.

And oVirt does not see my gluster volumes anymore. I've checked with gluster volume status and gluster volume heal g2sata info; it seems to be working, and my VMs are OK. How can I reimport my lost volumes into oVirt?

Thanks in advance,
Tibor

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
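For anyone comparing what gluster reports on each node with what the engine shows, here is a minimal sketch that turns `gluster volume info` text (in the layout pasted in this thread) into a dictionary. The function name `parse_volume_info` and the result layout are made up for illustration; this is not part of oVirt or gluster, and it assumes the plain "key: value" output format shown above.

```python
def parse_volume_info(text):
    """Parse plain `gluster volume info` output into {volume_name: details}.

    Hypothetical helper: assumes the "key: value" layout quoted in this thread.
    """
    volumes = {}
    current = None
    in_options = False
    for raw in text.splitlines():
        key, sep, value = raw.strip().partition(":")
        if not sep:
            continue  # blank line or a line without a "key: value" shape
        key, value = key.strip(), value.strip()
        if key == "Volume Name":
            # A new volume section starts here.
            current = {"fields": {}, "bricks": [], "options": {}}
            volumes[value] = current
            in_options = False
        elif current is None or key == "Bricks":
            continue  # text before the first volume, or the bare "Bricks:" header
        elif key == "Options Reconfigured":
            in_options = True
        elif key.startswith("Brick") and key[5:].isdigit():
            current["bricks"].append(value)  # e.g. "172.16.0.10:/data/sata/brick2"
        elif in_options:
            current["options"][key] = value
        else:
            current["fields"][key] = value
    return volumes
```

Running this on the output from node0, node1 and node2 and comparing the volume names and brick lists should make it easier to see which node disagrees with the others.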
Re: [ovirt-users] stuck hosts - how can I delete them?
Yes - thanks!

On Sunday, August 16, 2015, Sahina Bose sab...@redhat.com wrote:

On 08/13/2015 11:48 PM, Chris Liebman wrote:

I've just force deleted a DC. I did this because gluster was completely hosed. Multiple nodes with broken disks - don't ask... Anyway - now I see that the Cluster still exists with the hosts. And I can't remove, re-install etc. the hosts, nor can I delete the cluster. Help!

Are you facing the same issue as https://bugzilla.redhat.com/show_bug.cgi?id=1244935

-- Chris

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] Regarding meetup events
Hello Doron,

It was an awesome experience to host this oVirt meetup. We will host one more next month.

Thanks
Ashutosh

On Sun, 2015-08-09 at 17:19 +0300, Doron Fediuck wrote:

Hi Ashutosh, any updates from the meetup? How did it go?

On Sun, Jul 12, 2015 at 2:29 PM, Ashutosh Bhakare unnatisales...@gmail.com wrote:

Hello All,

Happy to share that we hosted the 1st meetup in Aurangabad (MS), India. This meetup was scheduled for 11th July 2015 at 6 PM, focusing on an introduction to the oVirt project. Here are some links:

https://www.facebook.com/RedHatATP/posts/967562879955137?pnref=story
http://www.meetup.com/Aurangabad-Ovirt-Meetup/events/223693898/

We will schedule a 2nd meetup asap, focusing on deployment of oVirt.

Thanks.
Ashutosh S. Bhakare
RHCA-V, RHCI, JBCI

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] Testing Ovirt 3.6 Beta 2
Hi,

I updated the package ovirt-engine-extension-aaa-jdbc to the latest one available, and the engine's installation completed successfully. This time I disabled SELinux and the firewall on the hypervisor and on the engine VM.

1 - Still no trace of the engine VM on the webui, even after adding a storage domain.
2 - The ovirt-ha-agent crashes with this message:

ovirt-ha-agent ovirt_hosted_engine_ha.agent.agent.Agent ERROR Error: ' 'StorageServer' object has no attribute '_self'' - trying to restart agent

2015-08-17 0:19 GMT+01:00 wodel youchi wodel.you...@gmail.com:

Hi folks,

Two days ago I redid a test with the second beta, and I had the same problem with the engine VM not being present on the webui. I cleaned up everything and redid the test today. This time I couldn't complete the engine's installation; I got this error after the creation of the database:

/usr/share/ovirt-engine-extension-aaa-jdbc/dbscripts/schema.sh invalid option -- e

The script does not accept the -e option.

Regards.

2015-08-06 18:42 GMT+01:00 Alexander Wels aw...@redhat.com:

On Thursday, August 06, 2015 06:04:44 PM wodel youchi wrote:

Hi,

A new test with CentOS 7 as host and CentOS 7 as engine VM. The same problem: no engine VM on the webui.

- I can create and start VMs
- I could import my export domain and old data domain

There is another problem with host editing: I can't get past these two parameters when I edit a host:

- host groups
- compute resources

That is a known bug [1], which should be fixed once [2] is merged.

[1] https://bugzilla.redhat.com/show_bug.cgi?id=1250547
[2] https://gerrit.ovirt.org/#/c/44498/

I don't know what they mean; they are blank, and if I change the host's configuration, the two parameters become red, and I don't know what to fill in.

Regards

2015-08-04 10:54 GMT+01:00 Simone Tiraboschi stira...@redhat.com:

On Tue, Aug 4, 2015 at 11:25 AM, wodel youchi wodel.you...@gmail.com wrote:

Hi again,

Yes, I mean the hosted-engine VM. I added the first storage domain (NFS4), I even added the ISO domain, but still no engine VM is shown on the webui.

Could you please attach your engine logs to bug # 1222010?

thanks, Simone

Regards.

2015-08-04 8:19 GMT+01:00 Simone Tiraboschi stira...@redhat.com:

On Tue, Aug 4, 2015 at 1:28 AM, wodel youchi wodel.you...@gmail.com wrote:

Hi,

I redid the installation with Fc22 for the host and the VM engine, and I still have the same problems:

Did you mean hosted-engine?

- No VM engine on the webui

If so, there is an open bug on that: https://bugzilla.redhat.com/show_bug.cgi?id=1222010 Adding the first normal (non-HE) storage domain is enough to solve it: when you add the first data domain, the datacenter comes up and the engine VM is shown.

- Cannot start a created VM, DB error

Then I tested with Fc22 for the host and CentOS 7 for the VM engine:

- Still no VM engine on webui
- But this time no DB error, and the created VM did start.

Regards.

2015-08-03 14:40 GMT+01:00 Sandro Bonazzola sbona...@redhat.com:

No, no specific known issue.

On Sat, Aug 1, 2015 at 8:57 PM, Maor Lipchuk mlipc...@redhat.com wrote:

Sandro, Eyal, is there any known issue with this specific build?

Regards, Maor

- Original Message -
From: wodel youchi wodel.you...@gmail.com
To: Maor Lipchuk mlipc...@redhat.com
Cc: users users@ovirt.org
Sent: Saturday, August 1, 2015 3:24:21 PM
Subject: Re: [ovirt-users] Testing Ovirt 3.6

Hi,

Here are the logs:

engine.log
hosted-engine setup log
vdsm.log
agent and broker logs

About the postgresql function: it exists as gethostnetworksbycluster(uuid), but the webgui is calling it with parameters that are not defined.
2015-07-31 22:05:20,449 ERROR [org.ovirt.engine.core.bll.RunVmCommand] (default task-23) [7acb8bf] Data access error during CanDoActionFailure.: org.springframework.jdbc.BadSqlGrammarException: PreparedStatementCallback; bad SQL grammar [select * from gethostnetworksbycluster(?, ?, ?)]; nested exception is org.postgresql.util.PSQLException: ERROR: function gethostnetworksbycluster(uuid, unknown, character varying) does not exist
Hint: No function matches the given name and argument types. You might need to add explicit type casts.

Regards

2015-08-01 10:01 GMT+01:00 Maor Lipchuk mlipc...@redhat.com:

Hi wodel, can you please attach the engine.log and also the hosted engine log?

Regards, Maor

- Original Message -
From: wodel youchi wodel.you...@gmail.com
To: users users@ovirt.org
Sent:
Re: [ovirt-users] live migration
On Sun, Aug 16, 2015 at 10:31 PM, Demeter Tibor tdeme...@itsmart.hu wrote:

Hi,

I reinstalled one of my nodes (node1) because I had to replace my HDDs. I installed CentOS 6.6 minimal, but the node re-adding procedure installed newer qemu-kvm-rhev packages. Since the reinstall I can run VMs on this node and I can live migrate from this node to the others, but not back. I remember that maybe a year ago it was required to install Red Hat's version of the qemu-kvm-rhev package for this feature. Is it still necessary?

My versions:

node0 KVM: 0.12.1.2 - 2.415.el6_5.14, LIBVIRT: libvirt-0.10.2-46.el6_6.1
node1 KVM: 0.12.1.2 - 2.448.el6_6.4, LIBVIRT: libvirt-0.10.2-54.el6
node2 KVM: 0.12.1.2 - 2.448.el6_6.4, LIBVIRT: libvirt-0.10.2-46.el6_6.1

Can I upgrade these hosts manually? I haven't restarted my hosts because node0 and node2 are a gluster replica.

Thanks in advance,
Tibor

Not sure about the versions, but what is the error you see in the source host's vdsm.log when migration fails?

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] live migration
Are all the hostnames of the machines different? I've had it before where migrations fail because hosts have the same hostname, or the same UUID for that matter.

From: users-boun...@ovirt.org [mailto:users-boun...@ovirt.org] On Behalf Of Omer Frenkel
Sent: Monday, August 17, 2015 01:02 AM
To: Demeter Tibor
Cc: users
Subject: Re: [ovirt-users] live migration

On Sun, Aug 16, 2015 at 10:31 PM, Demeter Tibor tdeme...@itsmart.hu wrote:

Hi,

I reinstalled one of my nodes (node1) because I had to replace my HDDs. I installed CentOS 6.6 minimal, but the node re-adding procedure installed newer qemu-kvm-rhev packages. Since the reinstall I can run VMs on this node and I can live migrate from this node to the others, but not back. I remember that maybe a year ago it was required to install Red Hat's version of the qemu-kvm-rhev package for this feature. Is it still necessary?

My versions:

node0 KVM: 0.12.1.2 - 2.415.el6_5.14, LIBVIRT: libvirt-0.10.2-46.el6_6.1
node1 KVM: 0.12.1.2 - 2.448.el6_6.4, LIBVIRT: libvirt-0.10.2-54.el6
node2 KVM: 0.12.1.2 - 2.448.el6_6.4, LIBVIRT: libvirt-0.10.2-46.el6_6.1

Can I upgrade these hosts manually? I haven't restarted my hosts because node0 and node2 are a gluster replica.

Thanks in advance,
Tibor

Not sure about the versions, but what is the error you see in the source host's vdsm.log when migration fails?

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
[ovirt-users] R: vlan-tagging on non-tagged network
Sorry for jumping into the discussion, but I'm currently experiencing exactly the opposite behaviour. The DHCP offer reaches the host bridge interface, but it is not propagated to the vnet device. I'm using VLAN tagging. Incoming packets are otherwise handled correctly up to the vnet device; the problem seems to affect only DHCP packets. With a static IP assigned, the VM stays active and reachable. Just my two cents on a topic that I suppose could be related to my issue.

Roberto

Original message
From: Felix Pepinghege pepingh...@ira.uka.de
Date: 17/08/2015 13:36 (GMT+01:00)
To: Users@ovirt.org
Subject: Re: [ovirt-users] vlan-tagging on non-tagged network

Hi Ido, hi everybody,

sorry that I kept you waiting for two months, I only just found the time to go back to this problem. You were completely right with your guess. The ethernet frames do appear on the vnet interface, but not on the bridge. The dropped counter seems to be independent of these losses, though. However, while this tells me *where* the problem is, I still don't know *what* the problem is. I've done some research, but couldn't find anything particularly helpful.

An interesting point may be that this problem is mono-directional. That is, the bridge happily passes vlan-tagged frames from the ethernet device to the vnet, but not the other way around. Untagged ethernet frames make their way through the bridge, no matter where they come from.

The vlan module is loaded. As to the versioning questions:

# cat /etc/centos-release ; uname -s -v -r
CentOS Linux release 7.1.1503 (Core)
Linux 3.10.0-229.7.2.el7.x86_64 #1 SMP Tue Jun 23 22:06:11 UTC 2015

The guest OS is an up-to-date Debian Jessie, which should not matter, though, as the frames get lost on the doorstep of the bridge on the host.

Again, any suggestions are much appreciated!

Regards,
Felix

On 16.06.2015 at 08:27, Ido Barkan wrote:

Hey Felix.

IIUC your frames are dropped by the bridge. oVirt uses Linux bridges to connect virtual machines to 'networks'. The guest connects to the bridge using a tap device, which is usually called 'vnet<number>'. So, just to verify, can you please tcpdump both on the bridge device and on the tap device? The bridge can be quite noisy, so I suggest filtering traffic using the guest's MAC address. I am not sure what protocol you use for tunneling, but applying a filter similar to this one should do the job:

tcpdump -n -i vnet0 -vvv -s 1500 'udp[38:4]=0x001a4aaeec8e'

My guess is that you will observe traffic on the tap device, but not on the bridge. You didn't specify which CentOS version you use, but I do remember seeing people complaining about Linux bridges discarding their tagged frames. You can -maybe- also observe the 'dropped' counter increasing on the bridge by running:

ip -s link show dev trunk

There were a few bugs on rhel6/7 about this; specifically I remember https://bugzilla.redhat.com/show_bug.cgi?id=1174291 and https://bugzilla.redhat.com/show_bug.cgi?id=1200275#c20

Also, is the vlan module loaded on your host?

lsmod |grep 8021q

Thanks,
Ido

- Original Message -
From: Felix Pepinghege pepingh...@ira.uka.de
To: Users@ovirt.org
Sent: Monday, June 15, 2015 11:33:39 AM
Subject: [ovirt-users] vlan-tagging on non-tagged network

Hi everybody!

I am experiencing a behaviour of oVirt, of which I don't know whether it is expected or not. My setup is as follows: A virtual machine has a logical network attached to it, which is configured without vlan-tagging and goes by the name 'trunk'. The VM is running an openvpn server. It is a patched openvpn version, including vlan-tagging. That is, openvpn clients get a vlan tag. This should not really be an issue, but should satisfy the "why do you want to do it in the first place" questions.

Anyhow, effectively, the VM simply puts vlan-tagged ethernet frames on the virtual network. These frames, however, never make it to the host's network bridge, which represents the logical network. My observations are: According to tcpdump, the vlan-tagged packets arrive at the eth1 interface inside the VM (which *is* the correct interface). Again according to tcpdump, these packets never arrive at the corresponding network bridge (i.e., the interface 'trunk') on the host.

I know that the setup itself is feasible with KVM; I have it working on a proxmox machine. Therefore, my conclusion is that oVirt doesn't like vlan-tagged ethernet frames on non-tagged logical networks, and somehow filters them out, though I don't really see on what level that would happen (handling the ethernet frames should be a concern of KVM/QEMU/Linux only, once oVirt has started the VM). So this problem could be a CentOS issue, but I really don't see why CentOS should act differently than Debian does (proxmox is Debian-based).

Is this a known/wanted/expected behaviour of oVirt, and can I somehow prevent or elude it? Any help is
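When comparing captures from the vnet device and from the bridge, it can help to tell tagged frames from untagged ones mechanically. Below is a minimal sketch, assuming raw Ethernet frames as bytes (for example exported from a capture). The function name `vlan_id` is made up for illustration; the only protocol fact it relies on is that an 802.1Q tag puts the TPID value 0x8100 directly after the two MAC addresses, followed by a 16-bit field whose low 12 bits are the VLAN ID.

```python
import struct

TPID_8021Q = 0x8100  # EtherType value that marks an 802.1Q tag


def vlan_id(frame):
    """Return the VLAN ID of an 802.1Q-tagged Ethernet frame, or None if untagged.

    Hypothetical helper: `frame` is the raw frame starting at the destination MAC.
    """
    if len(frame) < 16:
        return None  # too short to hold both MACs plus a 4-byte tag
    tpid, tci = struct.unpack("!HH", frame[12:16])
    if tpid != TPID_8021Q:
        return None  # bytes 12-13 are an ordinary EtherType, so no tag
    return tci & 0x0FFF  # low 12 bits of the tag control field
```

Counting how many frames from the guest's MAC return a VLAN ID on the tap device versus on the bridge would show whether only the tagged ones go missing.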
Re: [ovirt-users] stuck hosts - how can I delete them?
On 08/13/2015 11:48 PM, Chris Liebman wrote:

I've just force deleted a DC. I did this because gluster was completely hosed. Multiple nodes with broken disks - don't ask... Anyway - now I see that the Cluster still exists with the hosts. And I can't remove, re-install etc. the hosts, nor can I delete the cluster. Help!

Are you facing the same issue as https://bugzilla.redhat.com/show_bug.cgi?id=1244935

-- Chris

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] upgrade issues
Please provide the output of the gluster volume info command, vdsm.log and engine.log. There could be a mismatch between the node information in the engine database and gluster. One possible reason is that the gluster server UUID changed on the node, and we will need to see why.

On 08/17/2015 12:35 AM, Demeter Tibor wrote:

Hi All,

I have to upgrade oVirt 3.5.0 to 3.5.3. We have a 3-node system with a gluster replica between 2 of these 3 servers. I had a gluster volume between node0 and node2, but I wanted to create a new volume between node1 and node2. It didn't work; it completely killed my node1, because glusterd did not start. I always got the error: gluster peer rejected (related to node1).

I followed this article http://www.gluster.org/community/documentation/index.php/Resolving_Peer_Rejected and it helped: my gluster service works again, but oVirt reported these errors:

Detected deletion of volume g2sata on cluster r710cluster1, and deleted it from engine DB.
Detected deletion of volume g4sata on cluster r710cluster1, and deleted it from engine DB.

And oVirt does not see my gluster volumes anymore. I've checked with gluster volume status and gluster volume heal g2sata info; it seems to be working, and my VMs are OK. How can I reimport my lost volumes into oVirt?

Thanks in advance,
Tibor

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] Gluster Replica 3 using arbiter node?
On 07/30/2015 08:11 PM, Adrian Lewis wrote:

Hi,

Just wondering if it will be possible to create gluster replica 3 volumes in oVirt 3.6 that use the arbiter function instead of actually storing three copies of the data? If so, could this be used for the hosted engine on gluster feature, which from what I can tell requires replica 3?

We currently do not have support for creating a volume with an arbiter node from oVirt; changes are required in the Add Volume flow to enable it. I've opened a BZ to track this, but it will not be available in oVirt 3.6 (https://bugzilla.redhat.com/show_bug.cgi?id=1254073).

If you want to use such a volume as a storage domain, you will need to create the volume outside of oVirt using gluster CLI commands. This has not yet been integrated and tested with the hosted engine flow.

Your feedback is welcome.

Many thanks,
Adrian

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
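As a rough illustration of "create the volume outside of oVirt", here is a small sketch that only assembles the gluster CLI arguments for an arbiter volume; it does not run anything. The `replica 3 arbiter 1` form is the syntax documented for GlusterFS releases that support arbiter volumes, so please check it against the documentation for your gluster version. The helper name `arbiter_volume_cmd` is made up, and any host names or brick paths you pass in are your own.

```python
def arbiter_volume_cmd(name, bricks):
    """Build the argument list for creating a replica 3 volume with one arbiter.

    Hypothetical helper: it only builds the command and never executes it.
    Bricks must come in groups of three; by gluster's convention the third
    brick of each group is the arbiter (it stores metadata, not file data).
    """
    if not bricks or len(bricks) % 3 != 0:
        raise ValueError("arbiter volumes need bricks in multiples of three")
    return ["gluster", "volume", "create", name,
            "replica", "3", "arbiter", "1", *bricks]
```

The resulting list can be joined with spaces to review the command before running it by hand on one of the gluster nodes.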
[ovirt-users] Hosted Engine and Sanlock
Hi there,

recently there was a network failure in our oVirt infrastructure, which caused the oVirt engine to become unstable. It restarts after 10-20 minutes. The load average was high, and issued commands hang. Looking at the host logs, there are endless locking errors (/var/log/sanlock.log, below).

I tried to re-initialize the lockspace by stopping the HE HA agent/broker on all hosts and issuing the following commands on one of the hosts:

# su - vdsm -s /bin/bash
$ sanlock direct init -s hosted-engine:0:/rhev/data-center/mnt/192.168.10.10\:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/ha_agent/hosted-engine.lockspace:0

and then restarting both agent and broker on the same host. However, I'm still getting the same problem. Any advice on this matter?

Installation Infos:
-- ovirt 3.5.3
vdsm-xmlrpc-4.16.24-0.el6.noarch
vdsm-python-zombiereaper-4.16.24-0.el6.noarch
vdsm-python-4.16.24-0.el6.noarch
vdsm-jsonrpc-4.16.24-0.el6.noarch
vdsm-4.16.24-0.el6.x86_64
vdsm-cli-4.16.24-0.el6.noarch
vdsm-yajsonrpc-4.16.24-0.el6.noarch
ovirt-hosted-engine-ha-1.2.6-2.el6.noarch
ovirt-hosted-engine-setup-1.2.6-0.0.master.20150812080635.git5295df1.el6.noarch
- end installation infos --

/var/log/sanlock.log

2015-08-18 04:20:02+0800 1704 [9385]: s2 delta_renew read rv -202 offset 0 /rhev/data-center/mnt/192.168.10.10:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/dom_md/ids
2015-08-18 04:20:02+0800 1704 [9385]: s2 renewal error -202 delta_length 11 last_success 1662
2015-08-18 04:20:11+0800 1713 [9385]: a184f8ac aio collect 0 0x7fc5040008c0:0x7fc5040008d0:0x7fc50b9f7000 result 1048576:0 other free
2015-08-18 04:20:11+0800 1713 [9833]: hosted-e aio collect 0 0x7fc4f80008c0:0x7fc4f80008d0:0x7fc50baf9000 result 1048576:0 other free
2015-08-18 04:20:11+0800 1713 [9385]: a184f8ac aio collect 0 0x7fc504000910:0x7fc504000920:0x7fc50bbfb000 result 1048576:0 other free
2015-08-18 04:20:11+0800 1713 [9833]: hosted-e aio collect 0 0x7fc4f8000910:0x7fc4f8000920:0x7fc50beff000 result 1048576:0 other free
2015-08-18 04:21:43+0800 1805 [9385]: a184f8ac aio timeout 0 0x7fc5040008c0:0x7fc5040008d0:0x7fc50adf2000 ioto 10 to_count 18
2015-08-18 04:21:43+0800 1805 [9385]: s2 delta_renew read rv -202 offset 0 /rhev/data-center/mnt/192.168.10.10:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/dom_md/ids
2015-08-18 04:21:43+0800 1805 [9385]: s2 renewal error -202 delta_length 10 last_success 1774
2015-08-18 04:21:43+0800 1805 [9833]: hosted-e aio timeout 0 0x7fc4f80008c0:0x7fc4f80008d0:0x7fc50aef4000 ioto 10 to_count 14
2015-08-18 04:21:43+0800 1805 [9833]: s3 delta_renew read rv -202 offset 0 /rhev/data-center/mnt/192.168.10.10:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/images/190f4d2a-77f4-4403-af0d-62853560c653/2be7db4d-f30e-4873-b4ef-cff9e757341c
2015-08-18 04:21:43+0800 1805 [9833]: s3 renewal error -202 delta_length 10 last_success 1774
2015-08-18 04:21:52+0800 1814 [9385]: a184f8ac aio collect 0 0x7fc5040008c0:0x7fc5040008d0:0x7fc50adf2000 result 1048576:0 other free
2015-08-18 04:21:52+0800 1814 [9833]: hosted-e aio collect 0 0x7fc4f80008c0:0x7fc4f80008d0:0x7fc50aef4000 result 1048576:0 other free
2015-08-18 04:23:04+0800 1885 [9833]: hosted-e aio timeout 0 0x7fc4f80008c0:0x7fc4f80008d0:0x7fc50bbfb000 ioto 10 to_count 15
2015-08-18 04:23:04+0800 1885 [9833]: s3 delta_renew read rv -202 offset 0 /rhev/data-center/mnt/192.168.10.10:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/images/190f4d2a-77f4-4403-af0d-62853560c653/2be7db4d-f30e-4873-b4ef-cff9e757341c
2015-08-18 04:23:04+0800 1885 [9833]: s3 renewal error -202 delta_length 10 last_success 1855
2015-08-18 04:23:04+0800 1886 [9385]: a184f8ac aio timeout 0 0x7fc5040008c0:0x7fc5040008d0:0x7fc50beff000 ioto 10 to_count 19
2015-08-18 04:23:04+0800 1886 [9385]: s2 delta_renew read rv -202 offset 0 /rhev/data-center/mnt/192.168.10.10:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/dom_md/ids
2015-08-18 04:23:04+0800 1886 [9385]: s2 renewal error -202 delta_length 10 last_success 1855
2015-08-18 04:23:15+0800 1896 [9833]: hosted-e aio timeout 0 0x7fc4f8000910:0x7fc4f8000920:0x7fc50baf9000 ioto 10 to_count 16
2015-08-18 04:23:15+0800 1896 [9833]: s3 delta_renew read rv -202 offset 0 /rhev/data-center/mnt/192.168.10.10:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/images/190f4d2a-77f4-4403-af0d-62853560c653/2be7db4d-f30e-4873-b4ef-cff9e757341c
2015-08-18 04:23:15+0800 1896 [9833]: s3 renewal error -202 delta_length 10 last_success 1855
2015-08-18 04:23:15+0800 1897 [9385]: a184f8ac aio timeout 0 0x7fc504000910:0x7fc504000920:0x7fc50b9f7000 ioto 10 to_count 20
2015-08-18 04:23:15+0800 1897 [9385]: s2 delta_renew read rv -202 offset 0 /rhev/data-center/mnt/192.168.10.10:_engine/a184f8ac-b779-4bf8-81c3-751115e15436/dom_md/ids
2015-08-18 04:23:15+0800 1897 [9385]: s2 renewal error -202 delta_length 11 last_success 1855
2015-08-18 04:23:26+0800 1907 [9833]: hosted-e aio timeout 0 0x7fc4f8000960:0x7fc4f8000970:0x7fc50aef4000 ioto 10 to_count 17
2015-08-18 04:23:26+0800 1907 [9833]:
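To get an overview of a long sanlock.log like the one above, the "renewal error" lines can be summarised per lockspace. This is a minimal sketch, not a sanlock tool: the function names are made up, the regular expression is fitted to the lines quoted in this message, and it assumes that the number after the timestamp and the `last_success` value are on the same clock, so that their difference is the number of seconds since the last successful renewal.

```python
import re

# Fitted to lines such as:
# 2015-08-18 04:20:02+0800 1704 [9385]: s2 renewal error -202 delta_length 11 last_success 1662
RENEWAL_RE = re.compile(
    r"^(?P<stamp>\S+ \S+) (?P<mono>\d+) \[(?P<tid>\d+)\]: "
    r"(?P<space>s\d+) renewal error (?P<rv>-?\d+) "
    r"delta_length (?P<delta>\d+) last_success (?P<last>\d+)$"
)


def renewal_errors(lines):
    """Return one dict per 'renewal error' line; all other lines are skipped."""
    errors = []
    for line in lines:
        m = RENEWAL_RE.match(line.strip())
        if m:
            errors.append({
                "space": m["space"],
                "rv": int(m["rv"]),
                # Assumption: both counters are seconds on the same clock.
                "stale_for": int(m["mono"]) - int(m["last"]),
            })
    return errors


def worst_gap_per_lockspace(lines):
    """Map each lockspace id (s2, s3, ...) to its longest gap without a renewal."""
    worst = {}
    for err in renewal_errors(lines):
        worst[err["space"]] = max(worst.get(err["space"], 0), err["stale_for"])
    return worst
```

In the excerpt above every renewal error carries -202 and sits next to an "aio timeout" line, which points at slow or unreachable storage I/O rather than at a damaged lockspace, so the gap per lockspace is probably the more interesting number to watch.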
Re: [ovirt-users] error adding node.
On Thu, Aug 13, 2015 at 2:40 PM, Kruijf, Erik-Jan de erik-jan.de.kru...@cgi.com wrote:

Hi,

I am trying to add the first node to a self hosted engine. When I click OK I immediately get the error that it cannot connect to the host. But if I try to ssh to the host from the engine, it works. Can someone please point me to a solution? If any logs are needed, please let me know.

Just to clarify, are you trying to add the host running the HE VM using the Web UI? Or are you trying to add the first non-HE related node?

Kind regards,
Erik-jan de Kruijf

-- Sandro Bonazzola Better technology. Faster innovation. Powered by community collaboration. See how it works at redhat.com

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [ovirt-users] ovirt 3.5.2 issues with nodes becoming Non Operational
On Wed, Aug 12, 2015 at 8:20 PM, Chris Liebman chri...@taboola.com wrote:

I may have figured this out. The systems that failed are running the Oracle unbreakable kernel: 3.8.13-98.el6uek.x86_64. The working systems are running the default CentOS 6 2.6 kernel.

There you go, this kernel is unfakeable ;)
- fabian

And the error from the vdsm.log only shows up on the UEK kernel.

-- Chris

On Wed, Aug 12, 2015 at 9:34 AM, Chris Liebman chri...@taboola.com wrote:

Hi,

I'm new to oVirt and recently built a 10 node oVirt 3.5 DC with shared storage using gluster configured as distributed-replicated (replication = 2). Shortly after, 7 of the 10 nodes dropped, one at a time over a few hours, into Non Operational state. Attempting to activate one of these nodes gives the error:

Failed to connect Host ovirt-node260 to Storage Pool LADC-TBX.

Attempting to put the node into Maintenance leaves the node stuck in Preparing For Maintenance. When I rebooted one of the nodes I saw this in the node's event list:

Host ovirt-node269 reports about one of the Active Storage Domains as Problematic.

I see many of these errors in the vdsm log from the failed nodes:

Thread-1::ERROR::2015-08-12 10:01:17,748::__init__::506::jsonrpc.JsonRpcServer::(_serveRequest) Internal server error
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/yajsonrpc/__init__.py", line 501, in _serveRequest
    res = method(**params)
  File "/usr/share/vdsm/rpc/Bridge.py", line 267, in _dynamicMethod
    result = fn(*methodArgs)
  File "/usr/share/vdsm/API.py", line 1330, in getStats
    stats.update(self._cif.mom.getKsmStats())
  File "/usr/share/vdsm/momIF.py", line 60, in getKsmStats
    stats = self._mom.getStatistics()['host']
  File "/usr/lib/python2.6/site-packages/mom/MOMFuncs.py", line 75, in getStatistics
    host_stats = self.threads['host_monitor'].interrogate().statistics[-1]
AttributeError: 'NoneType' object has no attribute 'statistics'

Any help here is appreciated.

-- Chris

-- Fabian Deutsch fdeut...@redhat.com RHEV Hypervisor Red Hat

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
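Since the failing nodes in this thread were the ones on the UEK kernel, a quick way to sort a larger cluster is to group hosts by their `uname -r` string. A minimal sketch, assuming you have already collected those strings yourself (for example over ssh); the function names are made up, and the only thing it checks is whether the release string contains "uek", as in 3.8.13-98.el6uek.x86_64.

```python
def is_uek_kernel(release):
    """True if a `uname -r` string looks like an Oracle UEK kernel release."""
    return "uek" in release.lower()


def split_hosts_by_kernel(releases):
    """Split {hostname: uname_r_string} into (uek_hosts, other_hosts), both sorted."""
    uek = sorted(h for h, r in releases.items() if is_uek_kernel(r))
    other = sorted(h for h, r in releases.items() if not is_uek_kernel(r))
    return uek, other
```

If the hosts in the first list match the ones that went Non Operational, that supports the kernel being the common factor.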
Re: [ovirt-users] error adding node.
Hi,

I hope I can explain this properly ;) (my English isn't that good). This is the situation:

1. We did a clean install of CentOS 7 on the host.
2. We created a glusterfs share on the host for the engine.
3. We ran hosted-engine --deploy and followed the wizard.
4. We get the message to open the VNC connection to install the engine (we use CentOS 6.7 for the engine).
5. We continue the wizard and get the new VNC connection to install the engine.
6. When the engine is running, the wizard tries to add the host to the engine, and then we get the error that it cannot connect to the engine. (We did get the health OK message.) The hosts file and DNS servers are working correctly.
7. We try to add the host to the engine manually, because if the wizard stops at that point it cannot try again, since there is already a VM running.
8. When we try to add the host via the engine we get the same error message: it cannot connect to the host. When I ssh from the console on the engine to the host, it works. What I find strange is that the message appears almost immediately (within 3 seconds).

I hope this explains the steps we took.

Kind regards
Erik-Jan de Kruijf

From: Sandro Bonazzola [sbona...@redhat.com]
Sent: Monday, August 17, 2015 10:18 AM
To: Kruijf, Erik-Jan de
Cc: users@ovirt.org
Subject: Re: [ovirt-users] error adding node.

On Thu, Aug 13, 2015 at 2:40 PM, Kruijf, Erik-Jan de erik-jan.de.kru...@cgi.com wrote:

Hi,

I am trying to add the first node to a self hosted engine. When I click OK I immediately get the error that it cannot connect to the host. But if I try to ssh to the host from the engine, it works. Can someone please point me to a solution? If any logs are needed, please let me know.

Just to clarify, are you trying to add the host running the HE VM using the Web UI? Or are you trying to add the first non-HE related node?

Kind regards,
Erik-jan de Kruijf

-- Sandro Bonazzola Better technology. Faster innovation. Powered by community collaboration. See how it works at redhat.com

___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
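Because the error appears within seconds although interactive ssh works, one thing that can be checked from the engine VM is plain TCP reachability of the host. The sketch below is only a generic port probe; the function name is made up, and which ports matter is an assumption about a default setup (22 for ssh, and 54321, which is the default VDSM port as far as I know). The host name in the usage line is a placeholder.

```python
import socket


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False
```

Usage from the engine VM would look like `port_open("host1.example.com", 22)`, with your own host name. If port 22 is open and the add still fails right away, the engine log is probably the next place to look.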