[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
elp.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 20 Apr 2019, at 10:54, Benny Zlotnik <bzlot...@redhat.com> wrote:

> Sorry, I kind of lost track of what the problem is.
>
> The "KeyError: 'appsList'" issue is a known bug [1].
>
> If a manual (not via vdsm) run of qemu-img is actually stuck, then let's
> involve the qemu-discuss list, with the versions of the relevant packages
> (qemu, qemu-img, kernel, your distro) and the output of the gdb commands.
>
> [1] https://bugzilla.redhat.com/show_bug.cgi?id=1690301
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
Dear Benny and others,

So it seems I wasn’t being patient with GDB and it does show me some output.

This qemu-img convert failure is even preventing an update of ovirt-node from 4.3.2 to 4.3.3.1. I get a feeling the following is an unrelated error, but I thought I’d be complete. Excuse any typos, I’m having to type this manually from a remote session, but the error is:

    [733272.427922] hid-generic 0003:0624:0249.0001: usb_submit_urb(ctrl) failed: -19

If this bug is preventing even a local yum update, I can’t see how it is anything other than an issue with the hardware of the hypervisor; surely our network and storage configuration must be irrelevant at this stage?

Regards,
Callum
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
Without sudo, and running in a directory that root has access to, gdb produces no output:

[attached image: cid:CCCDE95C-DD82-48AE-90E0-68295827C3F9@in.bmrc.ox.ac.uk]

Regards,
Callum
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
Some more information: running qemu-img convert manually, using the command captured from the previous failed attempt:

    sudo -u vdsm /usr/bin/qemu-img convert -p -t none -T none -f raw \
        /rhev/data-center/mnt/10.141.15.248:_export_instruct_vm__storage/0e01f014-530b-4067-aa1d-4e9378626a9d/images/6597eede-9fa0-4451-84fc-9f9c070cb5f3/765fa48b-2e77-4637-b4ca-e1affcd71e48 \
        -O raw \
        /rhev/data-center/mnt/10.141.15.248:_export_instruct_vm__storage/0e01f014-530b-4067-aa1d-4e9378626a9d/images/9cc99110-70a2-477f-b3ef-1031a912d12b/c2776107-4579-43a6-9d60-93a5ea9c64c5 \
        -W

I added the -W flag just to see what would happen:

    gdb -p 79913 -batch -ex "t a a bt"
    [Thread debugging using libthread_db enabled]
    Using host libthread_db library "/lib64/libthread_db.so.1".
    0x7f528d7661f0 in __poll_nocancel () from /lib64/libc.so.6

    Thread 1 (Thread 0x7f528e6bb840 (LWP 79913)):
    #0 0x7f528d7661f0 in __poll_nocancel () from /lib64/libc.so.6
    #1 0x7f528dc510fb in sudo_ev_scan_impl () from /usr/libexec/sudo/libsudo_util.so.0
    #2 0x7f528dc49b44 in sudo_ev_loop_v1 () from /usr/libexec/sudo/libsudo_util.so.0
    #3 0x55e94aa0e271 in exec_nopty ()
    #4 0x55e94aa0afda in sudo_execute ()
    #5 0x55e94aa18a12 in run_command ()
    #6 0x55e94aa0969e in main ()

And without -W:

    gdb -p 85235 -batch -ex "t a a bt"
    [Thread debugging using libthread_db enabled]
    Using host libthread_db library "/lib64/libthread_db.so.1".
    0x7fc0cc69b1f0 in __poll_nocancel () from /lib64/libc.so.6

    Thread 1 (Thread 0x7fc0cd5f0840 (LWP 85235)):
    #0 0x7fc0cc69b1f0 in __poll_nocancel () from /lib64/libc.so.6
    #1 0x7fc0ccb860fb in sudo_ev_scan_impl () from /usr/libexec/sudo/libsudo_util.so.0
    #2 0x7fc0ccb7eb44 in sudo_ev_loop_v1 () from /usr/libexec/sudo/libsudo_util.so.0
    #3 0x5610f4397271 in exec_nopty ()
    #4 0x5610f4393fda in sudo_execute ()
    #5 0x5610f43a1a12 in run_command ()
    #6 0x5610f439269e in main ()

Regards,
Callum
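A note on the two backtraces in this message: every named frame (sudo_ev_loop_v1, exec_nopty, sudo_execute, run_command) belongs to sudo, which suggests gdb was attached to the sudo wrapper process rather than to qemu-img itself. A minimal sketch for checking what a PID actually is before attaching, assuming a Linux host with /proc; the helper names proc_name and is_proc are hypothetical and not from the thread:

```shell
#!/bin/sh
# Hypothetical helper (assumption, not from the thread): print the command
# name of a PID as recorded by the kernel in /proc/<pid>/comm (Linux only).
proc_name() {
    cat "/proc/$1/comm"
}

# Hypothetical helper: succeed only if the PID's command name equals the
# expected name given as the second argument.
is_proc() {
    [ "$(proc_name "$1" 2>/dev/null)" = "$2" ]
}

# Example (not run here): attach only if PID 85235 is really qemu-img.
# is_proc 85235 qemu-img && gdb -p 85235 -batch -ex "thread apply all bt"
```

If the PID turns out to be sudo, the qemu-img process is likely one of its children, which `pgrep -P <sudo pid>` should list.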
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
Dear Benny,

It would seem that even cloning a VM is failing, although creating a VM works on the same storage. This is the only error I could find:

    ERROR Internal server error
    Traceback (most recent call last):
      File "/usr/lib/python2.7/site-packages/yajsonrpc/__init__.py", line 345, in _handle_request
        res = method(**params)
      File "/usr/lib/python2.7/site-packages/vdsm/rpc/Bridge.py", line 194, in _dynamicMethod
        result = fn(*methodArgs)
      File "", line 2, in getAllVmStats
      File "/usr/lib/python2.7/site-packages/vdsm/common/api.py", line 50, in method
        ret = func(*args, **kwargs)
      File "/usr/lib/python2.7/site-packages/vdsm/API.py", line 1388, in getAllVmStats
        statsList = self._cif.getAllVmStats()
      File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 567, in getAllVmStats
        return [v.getStats() for v in self.vmContainer.values()]
      File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 1766, in getStats
        oga_stats = self._getGuestStats()
      File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 1967, in _getGuestStats
        stats = self.guestAgent.getGuestInfo()
      File "/usr/lib/python2.7/site-packages/vdsm/virt/guestagent.py", line 505, in getGuestInfo
        del qga['appsList']
    KeyError: 'appsList'

It's definitely the qemu-img convert that is failing to do anything; this is the command from the clone:

    /usr/bin/qemu-img convert -p -t none -T none -f raw \
        /rhev/data-center/mnt/10.141.15.248:_export_instruct_vm__storage/0e01f014-530b-4067-aa1d-4e9378626a9d/images/6597eede-9fa0-4451-84fc-9f9c070cb5f3/765fa48b-2e77-4637-b4ca-e1affcd71e48 \
        -O raw \
        /rhev/data-center/mnt/10.141.15.248:_export_instruct_vm__storage/0e01f014-530b-4067-aa1d-4e9378626a9d/images/f0700631-e60b-4c2a-a6f5-a6c818ae7651/d4fb05ec-7c78-4d89-9a66-614c093c6e16

gdb has blank output for this, though.

This means 4.3.2 is fairly unusable for us, so two questions: can I downgrade to 4.2, and is there a fix coming in 4.3.3 for this?

Regards,
Callum
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
Creating a disk on the target share works fine. This seems to be specifically an issue with moving a disk to/from a share.

Regards,
Callum

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/MICXCNEDJL5NOFUNS7R7W5CYDNEFINVK/
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
    gdb -p $(pidof qemu-img convert) -batch -ex "t a a bt"
    289444: No such file or directory.

Regards,
Callum

On 10 Apr 2019, at 09:36, Benny Zlotnik <bzlot...@redhat.com> wrote:

> Can you run:
> $ gdb -p $(pidof qemu-img convert) -batch -ex "t a a bt"

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/SV3V34DQJZ7A3MH4CQMMIZ4GPPHWFB7E/
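One possible reading of the "289444: No such file or directory" error: pidof takes program names, so `pidof qemu-img convert` looks up two names (qemu-img and convert) and may print more than one PID, and gdb then tries to open the extra argument as a file. A minimal sketch that attaches to each PID separately, assuming pgrep is available; the helper name bt_each and the GDB override variable are hypothetical:

```shell
#!/bin/sh
# GDB can be overridden for a dry run; it defaults to the real gdb binary.
# (The GDB variable is an assumption of this sketch, not a gdb feature.)
GDB=${GDB:-gdb}

# Hypothetical helper: run one batch backtrace per PID given as an argument,
# so gdb never receives more than one PID at a time.
# "thread apply all bt" is the long form of "t a a bt".
bt_each() {
    for pid in "$@"; do
        "$GDB" -p "$pid" -batch -ex "thread apply all bt"
    done
}

# Example (not run here): pgrep -x matches the exact process name.
# bt_each $(pgrep -x qemu-img)
```

Running it as the same user as the target process, or as root, is usually needed for gdb to be allowed to attach.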
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
Dear All,

Further to this, I can't migrate a disk to different storage using the GUI. Both disks are configured identically and on the same physical NFS provider.

Regards,
Callum

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/GD2XCKEIPNCXXQWMR4OP5IKGCORJQEGB/
[ovirt-users] Re: Import of VMs failing - 0% progress on qemu-img
Dear All,

It would seem this is a bug in 4.3.?, as upgrading the old oVirt HE to 4.3 (from 4.2.latest) now means that exporting VMs to the export domain no longer works. Again, qemu-img convert is using some CPU but no network. Progress is 0.

Regards,
Callum

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/6EE6X43GTAJ6L4QBH2XQJ4LVPIXCZC3T/
[ovirt-users] Import of VMs failing - 0% progress on qemu-img
Dear All, We've exported some VMs from our old oVirt infrastructure and want to import them into the new one, but qemu-img appears to be failing. We have mounted an export domain populated from the old oVirt in the new hosted engine and are using the GUI to import the VM. Manually running the command sits at 16% CPU, 0% network usage and no progress. It appears to lock the NFS mount, and ls and lsof both hang. sudo -u vdsm /usr/bin/qemu-img convert -p -t none -T none -f raw -O raw Conversely, a simple cp will work (ruling out file permission errors): sudo -u vdsm cp What might we be doing wrong? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/ZUBKWQIGGSJETPVRWN42R4J7COPFV6GS/
[ovirt-users] UI bug viewing/editing host
et.jar:] at com.google.gwt.core.client.impl.Impl.entry0(Impl.java:275) [gwt-servlet.jar:] at Unknown.Su/<(https://he.virt.in.bmrc.ox.ac.uk/ovirt-engine/webadmin/?locale=en_US#hosts-network_interfaces;name=virthyp04.virt.in.bmrc.ox.ac.uk) at Unknown.anonymous(Unknown) Caused by: com.google.gwt.core.client.JavaScriptException: (TypeError) : oab(...) is null at org.ovirt.engine.ui.uicommonweb.models.hosts.HostListModel.$onSave(HostListModel.java:816) at org.ovirt.engine.ui.uicommonweb.models.hosts.HostListModel.executeCommand(HostListModel.java:1969) at org.ovirt.engine.ui.uicommonweb.UICommand.$execute(UICommand.java:163) at org.ovirt.engine.ui.common.presenter.AbstractModelBoundPopupPresenterWidget.$lambda$4(AbstractModelBoundPopupPresenterWidget.java:306) at org.ovirt.engine.ui.common.presenter.AbstractModelBoundPopupPresenterWidget$lambda$4$Type.onClick(AbstractModelBoundPopupPresenterWidget.java:306) at com.google.gwt.event.dom.client.ClickEvent.dispatch(ClickEvent.java:55) [gwt-servlet.jar:] at com.google.gwt.event.shared.GwtEvent.dispatch(GwtEvent.java:76) [gwt-servlet.jar:] at com.google.web.bindery.event.shared.SimpleEventBus.$doFire(SimpleEventBus.java:173) [gwt-servlet.jar:] ... 12 more Clean install ovirt-node-ng-4.3.0-0.20190204.0+1 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/JHQVLYNWOSB3FC23ZI7PII7HAPK76WZO/
[ovirt-users] Re: Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
So close now: fatal: [virthyp04.virt.in.bmrc.ox.ac.uk]: FAILED! => {"changed": true, "cmd": ["dd", "bs=20480", "count=1", "oflag=direct", "if=/var/tmp/localvmDBMVgn/e208e0f9-0f4d-4d0d-9104-10d8a26bfab6", "of=/rhev/data-center/mnt/10.141.15.248:_export_virtman_hosted__storage/b4f93b28-1497-44b0-9eaf-5e5e2b71bce8/images/645c4286-71e4-4cce-9049-345903929e1b/e208e0f9-0f4d-4d0d-9104-10d8a26bfab6"], "delta": "0:00:00.005134", "end": "2019-03-29 17:04:27.952367", "msg": "non-zero return code", "rc": 1, "start": "2019-03-29 17:04:27.947233", "stderr": "dd: failed to open ‘/rhev/data-center/mnt/10.141.15.248:_export_virtman_hosted__storage/b4f93b28-1497-44b0-9eaf-5e5e2b71bce8/images/645c4286-71e4-4cce-9049-345903929e1b/e208e0f9-0f4d-4d0d-9104-10d8a26bfab6’: Permission denied", "stderr_lines": ["dd: failed to open ‘/rhev/data-center/mnt/10.141.15.248:_export_virtman_hosted__storage/b4f93b28-1497-44b0-9eaf-5e5e2b71bce8/images/645c4286-71e4-4cce-9049-345903929e1b/e208e0f9-0f4d-4d0d-9104-10d8a26bfab6’: Permission denied"], "stdout": "", "stdout_lines": []} to retry, use: --limit @/etc/ansible/playbook/ovirt.retry ls -laZ /rhev/data-center/mnt/10.141.15.248:_export_virtman_hosted__storage/b4f93b28-1497-44b0-9eaf-5e5e2b71bce8/images/645c4286-71e4-4cce-9049-345903929e1b/e208e0f9-0f4d-4d0d-9104-10d8a26bfab6 -rw-rw. vdsm kvm system_u:object_r:nfs_t:s0 /rhev/data-center/mnt/10.141.15.248:_export_virtman_hosted__storage/b4f93b28-1497-44b0-9eaf-5e5e2b71bce8/images/645c4286-71e4-4cce-9049-345903929e1b/e208e0f9-0f4d-4d0d-9104-10d8a26bfab6 Any ideas on this one? I can almost touch this deployment now... Looking at the command it should run as `vdsm` so should work fine, could this be SELinux? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 Mar 2019, at 15:50, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Guilty, will roll back and try again! 
Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk On 29 Mar 2019, at 15:35, Simone Tiraboschi <stira...@redhat.com> wrote: The error comes from here: TASK [ovirt.hosted_engine_setup : Parse OVF] *** fatal: [virthyp04.virt.in.bmrc.ox.ac.uk]: FAILED! => {"changed": false, "msg": "missing parameter(s) required by 'attribute': value"} but are you really using it with ansible 2.8 alpha 1? I'd strongly suggest switching back to a stable release of ansible, which is currently 2.7.9. That one was due to: https://github.com/ansible/ansible/issues/53459 In the next ansible build it will be just a warning, as per: https://github.com/ansible/ansible/pull/54336 https://github.com/oVirt/ovirt-ansible-hosted-engine-setup/pull/150/files already addresses this in ovirt-ansible-hosted-engine-setup, to be compatible with future ansible releases.
On Fri, Mar 29, 2019 at 3:53 PM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: The OVF in question is here: http://schemas.dmtf.org/ovf/envelope/1/; xmlns:rasd="http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_ResourceAllocationSettingData; xmlns:vssd="http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_VirtualSystemSettingData; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance;>List of Networkshttp://www.vmware.com/specifications/vmdk.html#sparse; ovf:parentRef="" ovf:size="51" ovf:vm_snapshot_id="5f2be758-82d7-4c07-a220-9060e782dc7a" ovf:volume-format="COW" ovf:volume-type="Sparse" ovf:wipe-after-delete="false" />074a62d4-44f9-4ffe-a172-2702a9fe96df074a62d4-44f9-4ffe-a172-2702a9fe96dfCreated by OVABuilder2019/03/19 08:35:09false101Guest OSOtherLinux4 CPU, 16384 MemoryRHEVM 4.6.0.1634 virtual CPUNumber of virtual CPU131416384 MB of memoryMemory Size24MegaBytes16384Drive 172bc3948-5d8d-4877-bac8-7db4995045b5176f76686b-199c-4cb3-bbbe-86fc34365745/72bc3948-5d8d-4877-bac8-7db4995045b5----0000-0000-0000----------2019/03/19 08:35:092019/03/19 08:35:09Ethernet 0 rhevm3103rhevmeth01000Graphics5201 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.o
[ovirt-users] Re: Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
Guilty, will roll back and try again! Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 Mar 2019, at 15:35, Simone Tiraboschi mailto:stira...@redhat.com>> wrote: The error comes from here: TASK [ovirt.hosted_engine_setup : Parse OVF] *** fatal: [virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>]: FAILED! => {"changed": false, "msg": "missing parameter(s) required by 'attribute': value"} but are you really using it with ansible 2.8 alpha 1? I'd strongly suggest to switch back to a stable release of ansible which is currently 2.7.9. That one was due to: https://github.com/ansible/ansible/issues/53459 In the next ansible build it will be just a warning as for: https://github.com/ansible/ansible/pull/54336 https://github.com/oVirt/ovirt-ansible-hosted-engine-setup/pull/150/files already address this on ovirt-ansible-hosted-engine-setup to be compatible with future ansible releases. 
On Fri, Mar 29, 2019 at 3:53 PM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: The OVF in question is here: http://schemas.dmtf.org/ovf/envelope/1/; xmlns:rasd="http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_ResourceAllocationSettingData; xmlns:vssd="http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_VirtualSystemSettingData; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance;>List of Networkshttp://www.vmware.com/specifications/vmdk.html#sparse; ovf:parentRef="" ovf:size="51" ovf:vm_snapshot_id="5f2be758-82d7-4c07-a220-9060e782dc7a" ovf:volume-format="COW" ovf:volume-type="Sparse" ovf:wipe-after-delete="false" />074a62d4-44f9-4ffe-a172-2702a9fe96df074a62d4-44f9-4ffe-a172-2702a9fe96dfCreated by OVABuilder2019/03/19 08:35:09false101Guest OSOtherLinux4 CPU, 16384 MemoryRHEVM 4.6.0.1634 virtual CPUNumber of virtual CPU131416384 MB of memoryMemory Size24MegaBytes16384Drive 172bc3948-5d8d-4877-bac8-7db4995045b5176f76686b-199c-4cb3-bbbe-86fc34365745/72bc3948-5d8d-4877-bac8-7db4995045b5------------0000----2019/03/19 08:35:092019/03/19 08:35:09Ethernet 0 rhevm3103rhevmeth01000Graphics5201 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 Mar 2019, at 14:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok so we're getting very close now, weird OVF error: Full ansible log attached Only error in the engine.log looks normal/expected to me: 2019-03-29 14:32:44,370Z ERROR [org.ovirt.engine.core.bll.pm.FenceProxyLocator] (EE-ManagedThreadFactory-engineScheduled-Thread-71) [4405d6db] Can not run fence action on host 'vir thyp04.virt.in.bmrc.ox.ac.uk<http://thyp04.virt.in.bmrc.ox.ac.uk/>', no suitable proxy host was found. Feeling damn close to success here, but have managed to replicate this issue twice re-running the installer. 
Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 Mar 2019, at 11:50, Simone Tiraboschi mailto:stira...@redhat.com>> wrote: On Fri, Mar 29, 2019 at 12:36 PM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: ip link del ovirtmgmt has done the job Another issue, but this is likely due to randomised MAC addresses: fatal: [virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>]: FAILED! => {"changed": true, "cmd": ["virt-install", "-n", "HostedEngineLocal", "--os-variant", "rhel7", "--virt-type", "kvm", "--memory", "4096", "--vcpus", "64", "--network", "network=default,mac=fe:58:6c:da:1e:cc,model=virtio", "--disk", "/var/tmp/localvmOCYiyF/images/6f76686b-199c-4cb3-bbbe-86fc34365745/72bc3948-5d8d-4877-bac8-7db4995045b5", "--import", "--disk", "path=/var/tmp/localvmOCYiyF/seed.iso,device=cdrom", "--noautoconsole", "--rng", "/dev/random", "--graphics", "vnc", "--video", "vga", "--sound", "none", "--controller", "usb,model=none", "--memballoon", "none", "--boot", "hd,menu=off", "--clock", "kvmclock_present=yes"], &q
[ovirt-users] Re: Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
The OVF in question is here: http://schemas.dmtf.org/ovf/envelope/1/; xmlns:rasd="http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_ResourceAllocationSettingData; xmlns:vssd="http://schemas.dmtf.org/wbem/wscim/1/cim-schema/2/CIM_VirtualSystemSettingData; xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance;>List of Networkshttp://www.vmware.com/specifications/vmdk.html#sparse; ovf:parentRef="" ovf:size="51" ovf:vm_snapshot_id="5f2be758-82d7-4c07-a220-9060e782dc7a" ovf:volume-format="COW" ovf:volume-type="Sparse" ovf:wipe-after-delete="false" />074a62d4-44f9-4ffe-a172-2702a9fe96df074a62d4-44f9-4ffe-a172-2702a9fe96dfCreated by OVABuilder2019/03/19 08:35:09false101Guest OSOtherLinux4 CPU, 16384 MemoryRHEVM 4.6.0.1634 virtual CPUNumber of virtual CPU131416384 MB of memoryMemory Size24MegaBytes16384Drive 172bc3948-5d8d-4877-bac8-7db4995045b5176f76686b-199c-4cb3-bbbe-86fc34365745/72bc3948-5d8d-4877-bac8-7db4995045b5----------------2019/03/19 08:35:092019/03/19 08:35:09Ethernet 0 rhevm3103rhevmeth01000Graphics5201 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 Mar 2019, at 14:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok so we're getting very close now, weird OVF error: Full ansible log attached Only error in the engine.log looks normal/expected to me: 2019-03-29 14:32:44,370Z ERROR [org.ovirt.engine.core.bll.pm.FenceProxyLocator] (EE-ManagedThreadFactory-engineScheduled-Thread-71) [4405d6db] Can not run fence action on host 'vir thyp04.virt.in.bmrc.ox.ac.uk', no suitable proxy host was found. Feeling damn close to success here, but have managed to replicate this issue twice re-running the installer. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 Mar 2019, at 11:50, Simone Tiraboschi mailto:stira...@redhat.com>> wrote: On Fri, Mar 29, 2019 at 12:36 PM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: ip link del ovirtmgmt has done the job Another issue, but this is likely due to randomised MAC addresses: fatal: [virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>]: FAILED! => {"changed": true, "cmd": ["virt-install", "-n", "HostedEngineLocal", "--os-variant", "rhel7", "--virt-type", "kvm", "--memory", "4096", "--vcpus", "64", "--network", "network=default,mac=fe:58:6c:da:1e:cc,model=virtio", "--disk", "/var/tmp/localvmOCYiyF/images/6f76686b-199c-4cb3-bbbe-86fc34365745/72bc3948-5d8d-4877-bac8-7db4995045b5", "--import", "--disk", "path=/var/tmp/localvmOCYiyF/seed.iso,device=cdrom", "--noautoconsole", "--rng", "/dev/random", "--graphics", "vnc", "--video", "vga", "--sound", "none", "--controller", "usb,model=none", "--memballoon", "none", "--boot", "hd,menu=off", "--clock", "kvmclock_present=yes"], "delta": "0:00:01.355834", "end": "2019-03-29 11:31:02.100143", "msg": "non-zero return code", "rc": 1, "start": "2019-03-29 11:31:00.744309", "stderr": "ERROR unsupported configuration: Unable to use MAC address starting with reserved value 0xFE - 'fe:58:6c:da:1e:cc' - \nDomain installation does not appear to have been successful.\nIf it was, you can restart your domain by running:\n virsh --connect qemu:///system start HostedEngineLocal\notherwise, please restart your installation.", "stderr_lines": ["ERRORunsupported configuration: Unable to use MAC address starting with reserved value 0xFE - 'fe:58:6c:da:1e:cc' - ", "Domain installation does not appear to have been successful.", "If it was, you can restart your domain by running:", " virsh --connect qemu:///system start HostedEngineLocal", "otherwise, please restart your installation."], "stdout": "\nStarting install...", "stdout_lines": ["", "Starting install..."]} Seems it doesn't 
take into account reserved values when generating. If not specified by the user, a random unicast MAC address is randomly generated here: https://github.com/oVirt/ovirt-ansible-hosted-e
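For illustration only, here is a minimal Python sketch of how a generator could avoid the prefix that libvirt rejected in the error above. It is not the code the oVirt role uses (the link above is cut off), and the name `random_vm_mac` is hypothetical. It draws a locally administered unicast address and re-draws whenever the first octet comes out as 0xFE, the value named in the error message.

```python
import random

# First-octet value rejected in the virt-install error quoted above.
RESERVED_FIRST_OCTETS = {0xFE}


def random_vm_mac(rng=None):
    """Return a random locally administered, unicast MAC address string."""
    rng = rng or random.Random()
    while True:
        # Clear bit 0 (unicast) and set bit 1 (locally administered).
        first = (rng.randrange(256) & 0xFC) | 0x02
        if first not in RESERVED_FIRST_OCTETS:
            break
    rest = [rng.randrange(256) for _ in range(5)]
    return ":".join("%02x" % octet for octet in [first] + rest)


if __name__ == "__main__":
    print(random_vm_mac())
```

Since the reply says an address is only generated when the user does not specify one, putting a fixed, known-good address in the variables file is another way around the problem.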
[ovirt-users] Re: Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
ip link del ovirtmgmt has done the job Another issue, but this is likely due to randomised MAC addresses: fatal: [virthyp04.virt.in.bmrc.ox.ac.uk]: FAILED! => {"changed": true, "cmd": ["virt-install", "-n", "HostedEngineLocal", "--os-variant", "rhel7", "--virt-type", "kvm", "--memory", "4096", "--vcpus", "64", "--network", "network=default,mac=fe:58:6c:da:1e:cc,model=virtio", "--disk", "/var/tmp/localvmOCYiyF/images/6f76686b-199c-4cb3-bbbe-86fc34365745/72bc3948-5d8d-4877-bac8-7db4995045b5", "--import", "--disk", "path=/var/tmp/localvmOCYiyF/seed.iso,device=cdrom", "--noautoconsole", "--rng", "/dev/random", "--graphics", "vnc", "--video", "vga", "--sound", "none", "--controller", "usb,model=none", "--memballoon", "none", "--boot", "hd,menu=off", "--clock", "kvmclock_present=yes"], "delta": "0:00:01.355834", "end": "2019-03-29 11:31:02.100143", "msg": "non-zero return code", "rc": 1, "start": "2019-03-29 11:31:00.744309", "stderr": "ERROR unsupported configuration: Unable to use MAC address starting with reserved value 0xFE - 'fe:58:6c:da:1e:cc' - \nDomain installation does not appear to have been successful.\nIf it was, you can restart your domain by running:\n virsh --connect qemu:///system start HostedEngineLocal\notherwise, please restart your installation.", "stderr_lines": ["ERRORunsupported configuration: Unable to use MAC address starting with reserved value 0xFE - 'fe:58:6c:da:1e:cc' - ", "Domain installation does not appear to have been successful.", "If it was, you can restart your domain by running:", " virsh --connect qemu:///system start HostedEngineLocal", "otherwise, please restart your installation."], "stdout": "\nStarting install...", "stdout_lines": ["", "Starting install..."]} Seems it doesn't take into account reserved values when generating. I hope this feedback is valuable - I have a good feeling about the current deploy otherwise though. 
Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 Mar 2019, at 11:01, Simone Tiraboschi mailto:stira...@redhat.com>> wrote: On Fri, Mar 29, 2019 at 11:56 AM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Simone, It doesn't seem to want to work: # Settings he_fqdn: "he.virt.in.bmrc.ox.ac.uk<http://he.virt.in.bmrc.ox.ac.uk/>" he_ansible_host_name: "virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>" he_admin_password: he_appliance_password: # Resources he_mem_size_MB: "4096" # Storage he_domain_type: "nfs" he_storage_domain_addr: he_storage_domain_path: # Network he_vm_ip_addr: "10.141.31.240" he_vm_ip_prefix: "20" he_dns_addr: ["10.141.31.251","10.141.31.252","10.141.31.253"] he_default_gateway_4: "10.141.31.254" he_gateway: he_default_gateway_4 he_force_ip4: true he_bridge_if: bond0.910 #he_just_collect_network_interfaces: true # Email he_smtp_port: 25 he_smtp_server: smtp.ox.ac.uk<http://smtp.ox.ac.uk/> he_dest_email: rescomp-...@well.ox.ac.uk<mailto:rescomp-...@well.ox.ac.uk> he_source_email: ov...@bmrc.ox.ac.uk<mailto:ov...@bmrc.ox.ac.uk> # Ansible Stuff ansible_ssh_user: root ansible_become: false host_key_checking: false I've attached the output of the ansible command as a log file, this is what happens when the IF bond0.910 is assigned the IP and `ovirtmgmt` is not defined on the host. TASK [ovirt.hosted_engine_setup : debug] ******* ok: [virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>] => { "target_address_v4": { "changed": true, "cmd": "ip addr show ovirtmgmt | grep 'inet ' | cut -d' ' -f6 | cut -d'/' -f1", "delta": "0:00:00.008744", "end": "2019-03-29 10:26:07.510481", "failed": false, "rc": 0, "start": "2019-03-29 10:26:07.501737", "stderr": "", "stderr_lines": [], "stdout"
[ovirt-users] Re: Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
Dear Simone, It doesn't seem to want to work: # Settings he_fqdn: "he.virt.in.bmrc.ox.ac.uk" he_ansible_host_name: "virthyp04.virt.in.bmrc.ox.ac.uk" he_admin_password: he_appliance_password: # Resources he_mem_size_MB: "4096" # Storage he_domain_type: "nfs" he_storage_domain_addr: he_storage_domain_path: # Network he_vm_ip_addr: "10.141.31.240" he_vm_ip_prefix: "20" he_dns_addr: ["10.141.31.251","10.141.31.252","10.141.31.253"] he_default_gateway_4: "10.141.31.254" he_gateway: he_default_gateway_4 he_force_ip4: true he_bridge_if: bond0.910 #he_just_collect_network_interfaces: true # Email he_smtp_port: 25 he_smtp_server: smtp.ox.ac.uk<http://smtp.ox.ac.uk> he_dest_email: rescomp-...@well.ox.ac.uk<mailto:rescomp-...@well.ox.ac.uk> he_source_email: ov...@bmrc.ox.ac.uk<mailto:ov...@bmrc.ox.ac.uk> # Ansible Stuff ansible_ssh_user: root ansible_become: false host_key_checking: false I've attached the output of the ansible command as a log file, this is what happens when the IF bond0.910 is assigned the IP and `ovirtmgmt` is not defined on the host. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 28 Mar 2019, at 16:23, Simone Tiraboschi mailto:stira...@redhat.com>> wrote: On Thu, Mar 28, 2019 at 1:44 PM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Simone, This is my experience too, but I'm now hitting this error on the hosted-engine install at the part where it registers the hypervisor as the first host in the engine: 2019-03-28 12:40:50,025Z INFO [org.ovirt.engine.core.bll.host.HostConnectivityChecker] (EE-ManagedThreadFactory-engine-Thread-1) [49f371c1] Engine managed to communicate with VDSM agent on host 'virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>' with address 'virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>' ('db571f8a-fc85-40d3-b86f-c0038e3cd7e7') 2019-03-28 12:40:53,111Z WARN [org.ovirt.engine.core.bll.network.NetworkConfigurator] (EE-ManagedThreadFactory-engine-Thread-1) [49f371c1] Failed to find a valid interface for the management network of host virthyp04.virt.in.bmrc.ox.ac.uk<http://virthyp04.virt.in.bmrc.ox.ac.uk/>. If the interface ovirtmgmt is a bridge, it should be torn-down manually. 2019-03-28 12:40:53,111Z ERROR [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand] (EE-ManagedThreadFactory-engine-Thread-1) [49f371c1] Exception: org.ovirt.engine.cor e.bll.network.NetworkConfigurator$NetworkConfiguratorException: Interface ovirtmgmt is invalid for management network The host's ovirtmgmt network connection is a statically assigned IP on a VLAN on a bond, how should I be configuring this if not manually? If you need to deploy over vlan 123 over bond0 simply configure a device exactly called bond0.123 and statically configure your IP address there. Choose it for hosted-engine deployment, nothing more: ovirtmgmt will be automatically created over that device and the vlan ID will be set at engine level for the whole management network. 
Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 27 Mar 2019, at 17:09, Simone Tiraboschi mailto:stira...@redhat.com>> wrote: On Wed, Mar 27, 2019 at 4:27 PM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: It's ok, migrating to 4.3.2 on the oVirt node (from 4.3.0) did the job of fixing it. It is a bug if you intend on using the ovirtmgmt network to deploy your ansible from This is a bit tricky: when the engine brings up the host it also creates the management bridge and this could lead to a temporary network down on the selected interface for the bridge creation time (a couple of seconds?) I tried it on a LAN and ansible ssh connection always survived but I 'm not sure it's always true. , and you need it to have an IP address already on that range! But - it works as expected with the ovirtmgmt bridge setup so nothing to worry about. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 27 Mar 2019, at 14:57, Simone Tiraboschi mailto:stira...@redhat.com>> wrote: On Wed, Mar 27, 2019 at 3:24 PM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, We're trying to deploy our hosted engine remotely using the ansible hosted engine playbook, which has been a rocky road but we're now at the point where it's installing, and failing. We've got a pre-defined bond/VLAN setup for our interface which has the correct bond0 bond0.123 and ovirtmgmt bri
[ovirt-users] Re: HE Install cannot login to VM
In this email thread I'm talking about running `hosted-engine --deploy` as root directly on the hypervisor. Checked the known_hosts and it was empty - I think we're going to go back to trying to fix the remote-deploy option! Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk On 28 Mar 2019, at 16:27, Simone Tiraboschi <stira...@redhat.com> wrote: On Thu, Mar 28, 2019 at 4:53 PM Callum Smith <cal...@well.ox.ac.uk> wrote: Dear all, Giving up on running ansible playbook remotely we're hitting earlier issues running hosted-engine --deploy The issue is just running the role as unprivileged user on the ansible controller node; I opened a bug here: https://bugzilla.redhat.com/show_bug.cgi?id=1693607 For now simply run your playbook as root instead of as a regular user with become: true. I think it will be fixed for the next build. 2019-03-28 15:42:33,975+ INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:109 TASK [ovirt.hosted_engine_setup : Wait for the local VM] 2019-03-28 15:45:40,739+ DEBUG otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:103 {u'msg': u"timed out waiting for ping module test success: Using a SSH password instead of a key is not possible because Host Key checking is enabled and sshpass does not support this. Please add this host's fingerprint to your known_hosts file to manage this host.", u'elapsed': 186, u'changed': False, u'_ansible_no_log': False, u'_ansible_delegated_vars': {u'ansible_delegated_host': u'he.virt.in.bmrc.ox.ac.uk', u'ansible_host': u'he.virt.in.bmrc.ox.ac.uk'}} 2019-03-28 15:45:40,840+ ERROR otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:107 fatal: [localhost -> he.virt.in.bmrc.ox.ac.uk]: FAILED!
=> {"changed": false, "elapsed": 186, "msg": "timed out waiting for ping module test success: Using a SSH password instead of a key is not possible because Host Key checking is enabled and sshpass does not support this. Please add this host's fingerprint to your known_hosts file to manage this host."} At a bit of a loss, especially since the old no-ansible install is not available anymore? Please check ~/.ssh/known_hosts for leftovers from previous attempts. On each deploy attempt the engine VM will get fresh SSH keys. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/GO5CTEMEOGAAQYHLSXVODMJJBBEOXMBM/
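The check suggested in the reply above could be sketched in Python roughly as follows: list the entries in a known_hosts file that mention the engine VM's name, so leftovers from an earlier attempt can be spotted. This is only an illustration. The helper name `stale_entries` is made up here, and it assumes plain (unhashed) host names, so hashed entries will not be found this way.

```python
def stale_entries(known_hosts_text, hostname):
    """Return (line_number, line) pairs whose host field lists hostname."""
    matches = []
    for number, line in enumerate(known_hosts_text.splitlines(), start=1):
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        fields = stripped.split()
        # Lines may begin with a marker such as @cert-authority or @revoked.
        if fields[0].startswith("@") and len(fields) > 1:
            host_field = fields[1]
        else:
            host_field = fields[0]
        # Entries for non-default ports are written as [host]:port.
        names = [h.split("]")[0].lstrip("[") for h in host_field.split(",")]
        if hostname in names:
            matches.append((number, line))
    return matches


if __name__ == "__main__":
    # Made-up sample content; in practice read the deploying user's file.
    sample = (
        "he.virt.in.bmrc.ox.ac.uk ssh-ed25519 AAAA\n"
        "other.example ssh-rsa BBBB\n"
    )
    for number, line in stale_entries(sample, "he.virt.in.bmrc.ox.ac.uk"):
        print(number, line)
```

Because the deploy in this thread was run as root, the file to inspect would be root's own known_hosts. `ssh-keygen -R <hostname>` removes matching entries, including hashed ones.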
[ovirt-users] HE Install cannot login to VM
Dear all, Giving up on running ansible playbook remotely we're hitting earlier issues running hosted-engine --deploy 2019-03-28 15:42:33,975+ INFO otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:109 TASK [ovirt.hosted_engine_setup : Wait for the local VM] 2019-03-28 15:45:40,739+ DEBUG otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:103 {u'msg': u"timed out waiting for ping module test success: Using a SSH password instead of a key is not possible because Host Key checking is enabled and sshpass does not support this. Please add this host's fingerprint to your known_hosts file to manage this host.", u'elapsed': 186, u'changed': False, u'_ansible_no_log': False, u'_ansible_delegated_vars': {u'ansible_delegated_host': u'he.virt.in.bmrc.ox.ac.uk', u'ansible_host': u'he.virt.in.bmrc.ox.ac.uk'}} 2019-03-28 15:45:40,840+ ERROR otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils._process_output:107 fatal: [localhost -> he.virt.in.bmrc.ox.ac.uk]: FAILED! => {"changed": false, "elapsed": 186, "msg": "timed out waiting for ping module test success: Using a SSH password instead of a key is not possible because Host Key checking is enabled and sshpass does not support this. Please add this host's fingerprint to your known_hosts file to manage this host."} At a bit of a loss, especially since the old no-ansible install is not available anymore? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org Privacy Statement: https://www.ovirt.org/site/privacy-policy/ oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/ List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/OLOH2VEI64CBDAMVJPZJL5WOGPTU5D2N/
[ovirt-users] Re: Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
Dear Simone,

This is my experience too, but I'm now hitting this error on the hosted-engine install at the part where it registers the hypervisor as the first host in the engine:

2019-03-28 12:40:50,025Z INFO [org.ovirt.engine.core.bll.host.HostConnectivityChecker] (EE-ManagedThreadFactory-engine-Thread-1) [49f371c1] Engine managed to communicate with VDSM agent on host 'virthyp04.virt.in.bmrc.ox.ac.uk' with address 'virthyp04.virt.in.bmrc.ox.ac.uk' ('db571f8a-fc85-40d3-b86f-c0038e3cd7e7')
2019-03-28 12:40:53,111Z WARN [org.ovirt.engine.core.bll.network.NetworkConfigurator] (EE-ManagedThreadFactory-engine-Thread-1) [49f371c1] Failed to find a valid interface for the management network of host virthyp04.virt.in.bmrc.ox.ac.uk. If the interface ovirtmgmt is a bridge, it should be torn-down manually.
2019-03-28 12:40:53,111Z ERROR [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand] (EE-ManagedThreadFactory-engine-Thread-1) [49f371c1] Exception: org.ovirt.engine.core.bll.network.NetworkConfigurator$NetworkConfiguratorException: Interface ovirtmgmt is invalid for management network

The host's ovirtmgmt network connection is a statically assigned IP on a VLAN on a bond, how should I be configuring this if not manually?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Mar 2019, at 17:09, Simone Tiraboschi <stira...@redhat.com> wrote:

On Wed, Mar 27, 2019 at 4:27 PM Callum Smith <cal...@well.ox.ac.uk> wrote:

It's ok, migrating to 4.3.2 on the oVirt node (from 4.3.0) did the job of fixing it. It is a bug if you intend on using the ovirtmgmt network to deploy your ansible from

This is a bit tricky: when the engine brings up the host it also creates the management bridge and this could lead to a temporary network down on the selected interface for the bridge creation time (a couple of seconds?) I tried it on a LAN and ansible ssh connection always survived but I'm not sure it's always true.

, and you need it to have an IP address already on that range! But - it works as expected with the ovirtmgmt bridge setup so nothing to worry about.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Mar 2019, at 14:57, Simone Tiraboschi <stira...@redhat.com> wrote:

On Wed, Mar 27, 2019 at 3:24 PM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

We're trying to deploy our hosted engine remotely using the ansible hosted engine playbook, which has been a rocky road but we're now at the point where it's installing, and failing. We've got a pre-defined bond/VLAN setup for our interface which has the correct bond0 bond0.123 and ovirtmgmt bridge on top but we're hitting the classic error:

Failed to find a valid interface for the management network of host virthyp04.virt.in.bmrc.ox.ac.uk. If the interface ovirtmgmt is a bridge, it should be torn-down manually.

Does this bug still exist in the latest (4.3) version, and is installing using ansible with this network configuration impossible?

I don't think it's a bug; please avoid manually creating ovirtmgmt and simply set: he_bridge_if: "bond0.123", in ansible variable file and the management bridge will be created for you at host-deploy time.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/SBOZ6FRBRQKE7FMTEHT6FBKHOW3PIPHK/

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/DYYGXZBVPLT6D2O7QWUK5R5IJ7KKZT7C/
[ovirt-users] Re: Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
It's ok, migrating to 4.3.2 on the oVirt node (from 4.3.0) did the job of fixing it. It is a bug if you intend on using the ovirtmgmt network to deploy your ansible from, and you need it to have an IP address already on that range! But - it works as expected with the ovirtmgmt bridge setup so nothing to worry about.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Mar 2019, at 14:57, Simone Tiraboschi <stira...@redhat.com> wrote:

On Wed, Mar 27, 2019 at 3:24 PM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

We're trying to deploy our hosted engine remotely using the ansible hosted engine playbook, which has been a rocky road but we're now at the point where it's installing, and failing. We've got a pre-defined bond/VLAN setup for our interface which has the correct bond0 bond0.123 and ovirtmgmt bridge on top but we're hitting the classic error:

Failed to find a valid interface for the management network of host virthyp04.virt.in.bmrc.ox.ac.uk. If the interface ovirtmgmt is a bridge, it should be torn-down manually.

Does this bug still exist in the latest (4.3) version, and is installing using ansible with this network configuration impossible?

I don't think it's a bug; please avoid manually creating ovirtmgmt and simply set: he_bridge_if: "bond0.123", in ansible variable file and the management bridge will be created for you at host-deploy time.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/SBOZ6FRBRQKE7FMTEHT6FBKHOW3PIPHK/

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/RM5U5IT2X6FXGMTMU6JASPET5FQS5BVY/
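Simone's suggestion in this thread is to leave ovirtmgmt to host-deploy and only name the underlying interface in the Ansible variable file. A minimal sketch of producing that one variable line, assuming a VLAN-on-bond name of the form parent.vlan_id; the helper name he_bridge_vars is hypothetical, and only he_bridge_if and "bond0.123" come from the thread:

```python
import re


def he_bridge_vars(parent, vlan_id=None):
    """Return the he_bridge_if line for an Ansible variable file.

    Hypothetical helper: it only formats the interface name that the
    management bridge should be created on; it does not touch the host.
    """
    iface = parent if vlan_id is None else "{}.{}".format(parent, vlan_id)
    # Linux interface names are limited to 15 visible characters.
    if not re.fullmatch(r"[A-Za-z0-9_.-]{1,15}", iface):
        raise ValueError("unusable interface name: {!r}".format(iface))
    return 'he_bridge_if: "{}"\n'.format(iface)


if __name__ == "__main__":
    # The bond/VLAN pair described in the thread.
    print(he_bridge_vars("bond0", 123), end="")
```

The point of the sketch is that the variable names the VLAN interface (bond0.123), not a pre-built ovirtmgmt bridge.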
[ovirt-users] Ansible hosted-engine deploy still doesnt support manually defined ovirtmgmt?
Dear All,

We're trying to deploy our hosted engine remotely using the ansible hosted engine playbook, which has been a rocky road but we're now at the point where it's installing, and failing. We've got a pre-defined bond/VLAN setup for our interface which has the correct bond0 bond0.123 and ovirtmgmt bridge on top but we're hitting the classic error:

Failed to find a valid interface for the management network of host virthyp04.virt.in.bmrc.ox.ac.uk. If the interface ovirtmgmt is a bridge, it should be torn-down manually.

Does this bug still exist in the latest (4.3) version, and is installing using ansible with this network configuration impossible?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/SBOZ6FRBRQKE7FMTEHT6FBKHOW3PIPHK/
[ovirt-users] Adding host with local-storage VMs configured
Dear All,

We’re in a world where we had to recover from an outage by installing a clean hosted engine and importing VMs from storage. We do, however, have a host that is configured for local-storage and has a VM running on it. What is the best way to re-introduce this host to a new hosted-engine with minimal outage of the VM on that host?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/AVZX7U7BSMIMYUOR6ZBVULMFGMXJNH7H/
[ovirt-users] exporting VMs using vdsm-client
Is it possible to export VMs using vdsm-client so that, in a case of catastrophic failure, they can be imported into a new oVirt engine setup?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/5RNYW2W2LLZTISTPTNARMFVQ6OQ7R477/
[ovirt-users] JSON internal error setting bonding using vdsm-client
Dear All,

I’m getting the error:

(code=32603, message=Internal JSON-RPC error: {'reason': "'unicode' object has no attribute 'sort'"})

when trying to apply networking configuration with:

vdsm-client -f bond.json Host setupNetworks

bond.json contains:

{
  "networks": {},
  "bondings": {
    "bond0": {
      "nics": "eno1+eno2",
      "options": "mode=4"
    }
  },
  "options": {}
}

Of course python is handling all the arguments as unicode entities rather than strings. Any idea what might be wrong?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/UON6H2RCEUY37KIYYUEK7TPNZ6SPHB7L/
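The error text suggests that something on the receiving side calls .sort() on the "nics" value, which a string does not support. A minimal sketch of generating bond.json with "nics" as a JSON array instead of "eno1+eno2"; that setupNetworks expects a list here is an assumption drawn from the error message, not something confirmed in this thread, and build_bond_config is a hypothetical helper name:

```python
import json


def build_bond_config(bond_name, nics, mode=4):
    """Build a setupNetworks-style payload with the bond's NICs as a list.

    Assumption: "nics" should be a JSON array of interface names rather
    than a "+"-joined string.
    """
    return {
        "networks": {},
        "bondings": {
            bond_name: {
                "nics": sorted(nics),  # a list, so .sort() would succeed
                "options": "mode={}".format(mode),
            }
        },
        "options": {},
    }


if __name__ == "__main__":
    # Prints JSON that could be saved as bond.json.
    print(json.dumps(build_bond_config("bond0", ["eno1", "eno2"]), indent=2))
```

If the assumption holds, the only change from the file in the message is the type of the "nics" value.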
[ovirt-users] Re: Disk corruption on hosted engine
Dear Simone,

Thanks for the response. Unfortunately I didn't get the install up to 4.2.7 in time for this event, so we might go down the spare host route just to be triple safe pending the arrival of some networking kit.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 18 Dec 2018, at 10:08, Simone Tiraboschi <stira...@redhat.com> wrote:

On Mon, Dec 17, 2018 at 2:24 PM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

So we've had some major disk corruption on our hosted engine (hardware to blame), and we have taken backups. However, the hosted-engine VM will no longer boot at all, database is thoroughly corrupted, and we need to rebuild the thing. Just a sanity check on the best route:

Preamble:
VMs are still running fine - only hosted engine affected
VMs are distributed across our entire 3 node cluster
All 3 nodes are registered as hosted engine candidates

1. Do another hosted-engine --deploy on one of the existing hosts, and then restore the backup into that
2. Build a new host, deploy the hosted-engine, then restore a backup on a fresh node

Hi, since 4.2.7 you can use

hosted-engine --deploy --restore-from-file=backup.tar.gz

and the deployment will restore your backup on the fly. Technically you can also use one of the existing hosts with running VMs, but if you want to be on the safe side and you have a spare host I'd suggest using that one. You will be asked to create a new SD for the new engine VM; the previous HE SD will still be visible in the engine if you have to migrate other disks stored there.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/VYQM7WWIBP7Z7FQQ6TG7HH3IFDZ64LXQ/

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/PTCSNZT7V3NA7WAHDANKHZE4Z2AHMQE3/
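Simone's reply ties the restore-on-deploy option to version 4.2.7 or later. A minimal sketch of that check, building the command line without running it; the helper names and the idea of passing the installed version as a dotted string are assumptions for illustration, while the command and the 4.2.7 threshold are taken from the reply:

```python
def parse_version(text):
    """Turn a dotted version string such as '4.2.7' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def restore_deploy_command(backup_file, installed_version):
    """Return the argv for a restore-on-deploy, or raise if too old.

    Hypothetical helper: it only assembles the command quoted in the
    thread; it does not execute anything.
    """
    if parse_version(installed_version) < (4, 2, 7):
        raise ValueError("--restore-from-file needs 4.2.7 or later")
    return [
        "hosted-engine",
        "--deploy",
        "--restore-from-file={}".format(backup_file),
    ]


if __name__ == "__main__":
    print(" ".join(restore_deploy_command("backup.tar.gz", "4.2.7")))
```

On an older install the sketch raises instead of returning a command, which matches the situation described in the message above, where the fallback was a fresh deploy on a spare host.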
[ovirt-users] Re: Disk corruption on hosted engine
A memory error on the storage array caused postgres to become corrupted. Attempting to restore from backup before the DIMM was replaced was ill-advised, and now the whole HE is trash. We're having a long talk with our vendor of the affected piece of kit.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 17 Dec 2018, at 15:45, femi adegoke <ov...@fateknollogee.com> wrote:

Curious question: How did the hardware corrupt the HE?

On Dec 17 2018, at 5:02 am, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

So we've had some major disk corruption on our hosted engine (hardware to blame), and we have taken backups. However, the hosted-engine VM will no longer boot at all, database is thoroughly corrupted, and we need to rebuild the thing. Just a sanity check on the best route:

Preamble:
VMs are still running fine - only hosted engine affected
VMs are distributed across our entire 3 node cluster
All 3 nodes are registered as hosted engine candidates

1. Do another hosted-engine --deploy on one of the existing hosts, and then restore the backup into that
2. Build a new host, deploy the hosted-engine, then restore a backup on a fresh node

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/VYQM7WWIBP7Z7FQQ6TG7HH3IFDZ64LXQ/

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/3MWI53U5OHUQJ2IJFMF4L6GNNNH2QV44/
[ovirt-users] Disk corruption on hosted engine
Dear All,

So we've had some major disk corruption on our hosted engine (hardware to blame), and we have taken backups. However, the hosted-engine VM will no longer boot at all, database is thoroughly corrupted, and we need to rebuild the thing. Just a sanity check on the best route:

Preamble:
VMs are still running fine - only hosted engine affected
VMs are distributed across our entire 3 node cluster
All 3 nodes are registered as hosted engine candidates

1. Do another hosted-engine --deploy on one of the existing hosts, and then restore the backup into that
2. Build a new host, deploy the hosted-engine, then restore a backup on a fresh node

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/VYQM7WWIBP7Z7FQQ6TG7HH3IFDZ64LXQ/
[ovirt-users] Re: Using the web-ui VM portal through a proxy failing
Dear Greg,

Here's my config, this is based on the original guide and some other stuff that I found to help make it work.

Squid Cache: Version 3.5.20

https_port 443 accel key=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.key cert=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.crt defaultsite=
cache_peer parent 443 0 no-query originserver ssl sslcafile=/etc/squid/ca.pem sslflags=DONT_VERIFY_PEER name=engine
cache_peer_access engine allow all
ssl_bump allow all
http_port 3128
acl ovirt_nodes dst
acl ovirt_engine dstdomain .
acl all_ips src 1.1.1.1/1
http_access allow ovirt_nodes ovirt_engine
http_access allow all_ips
http_access allow all
# Following are from:
# https://access.redhat.com/solutions/425693
# Leave coredumps in the first cache dir
coredump_dir /var/spool/squid
# RHEV and Spice may leave connections idle for long periods
pconn_timeout 12 hours
request_timeout 12 hours
read_timeout 12 hours
# We need approx 20 open filehandles per spice client
max_filedesc 16384

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 3 Oct 2018, at 00:39, Greg Sheremeta <gsher...@redhat.com> wrote:

Hi Callum,

I took a look at this, but got in the weeds pretty quickly with squid configuration. I can help more offline, but it might be a while. It'll probably be easier if you can provide me exact steps for how I could reproduce. Looks like I need to generate some keys. Can you create and share a simple reproducer?

Greg

On Thu, Sep 20, 2018 at 11:37 AM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear Greg,

Did you manage to get any further with this, reverse proxy is rather critical to this project.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 6 Aug 2018, at 12:13, Greg Sheremeta <gsher...@redhat.com> wrote:

I'll look into it and get back to you.

On Mon, Aug 6, 2018 at 7:02 AM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear Greg,

So what's the go-to here, it seems so close but something in the API ajax is failing.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Jul 2018, at 12:21, Greg Sheremeta <gsher...@redhat.com> wrote:

On Fri, Jul 27, 2018 at 4:39 AM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear Greg,

Indeed, always the latest and greatest for us while trying to get this running.

https://www.ovirt.org/documentation/security/squid-reverse-proxy/

Arrggghh, that is referring to the old GWT UserPortal and not the new react-based VM Portal. (I'll delete it / mark it obsolete. I apologize for the out-of-date state of our documentation. I am working on improving it.) Unfortunately we have never tested VM Portal with squid. @Lukas Svaty <lsv...@redhat.com> any chance you or someone on the team can assist?

And the squid.conf file looks like this:

https_port 443 accel key=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.key cert=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.crt defaultsite=ovirtengine.cluster
cache_peer ovirtengine.cluster parent 443 0 no-query originserver ssl sslcafile=/etc/squid/ca.pem sslflags=DONT_VERIFY_PEER name=engine
cache_peer_access engine allow all
ssl_bump allow all
http_port 3128
acl ovirt_nodes dst 192.168.64.0/24
acl ovirt_engine dstdomain .ovirtengine.cluster
acl all_ips src 1.1.1.1/1
http_access allow ovirt_nodes ovirt_engine
http_access allow all_ips
http_access allow all
# Following are from:
# https://access.redhat.com/solutions/425693
# Leave coredumps in the first cache dir
coredump_dir /var/spool/squid
# RHEV and Spice may leave connections idle for long periods
pconn_timeout 12 hours
request_timeout 12 hours
read_timeout 12 hours
# We need approx 20 open filehandles per spice client
max_filedesc 16384

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Jul 2018, at 01:15, Greg Sheremeta <gsher...@redhat.com> wrote:

From your other thread, I'm guessing 4.2.4. Can you send the link to the squid guide you used?

On Wed, Jul 25, 2018 at 7:55 PM Greg Sheremeta <gsher...@redhat.com> wrote:

Hi Callum,

What version of ovirt-web-ui is this?

Greg

On Wed, Jul 18, 2018 at 7:12 AM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

Those error logs are relevant only to another issue, please ignore. There appears to be a problem to do with authentication through the squid proxy though, which presents differently in
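A side note on the quoted squid.conf: the ACL named all_ips is declared as src 1.1.1.1/1, and a /1 mask covers only half of the IPv4 space. A minimal sketch of what that range contains, assuming Squid masks off the host bits the same way Python's ipaddress module does with strict=False; acl_covers is a hypothetical helper, and 192.168.1.241 is the client address that appears in the engine log quoted elsewhere in this thread:

```python
import ipaddress


def acl_covers(cidr, client_ip):
    """Return the normalised network for cidr and whether client_ip is in it."""
    # strict=False drops the host bits, so 1.1.1.1/1 becomes 0.0.0.0/1.
    network = ipaddress.ip_network(cidr, strict=False)
    return network, ipaddress.ip_address(client_ip) in network


if __name__ == "__main__":
    for client in ("10.0.0.5", "192.168.1.241"):
        network, matched = acl_covers("1.1.1.1/1", client)
        print(network, client, matched)
```

In the posted configuration the later "http_access allow all" line admits everything anyway, so this ACL is probably not the cause of the failure discussed here; the sketch only shows that all_ips does not mean what its name suggests.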
[ovirt-users] Re: Cannot configure local storage, network issue?
Dear Dominik,

Thanks for taking the time to reply, and sorry for my delay in replying. I don't actually want to remove the host from the cluster, I want to configure local storage. The whole point of the configure local storage tool is to create a new cluster with the local storage of the machine configured, right?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 21 Sep 2018, at 15:14, Dominik Holler <dhol...@redhat.com> wrote:

On Fri, 21 Sep 2018 15:05:59 +0200 Sandro Bonazzola <sbona...@redhat.com> wrote:

On Fri, 21 Sep 2018 at 14:58, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

I'm getting this problem when trying to add local storage from the node:

Cannot edit Host. Moving a host to a cluster with different management network is not allowed. That might cause connectivity loss.

The cluster the host is in uses a different default network to the default cluster - is this the problem?

Yes, but you can remove the host from oVirt and add again to the new cluster. Adding Dominik

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/UW7UZBPD5SWQ54PWETWPYY2AYZDV5IVK/

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/ZYZXAAFTG7PMK5SZ6ITBHH5ZBUVGPVHJ/
[ovirt-users] Cannot configure local storage, network issue?
Dear All,

I'm getting this problem when trying to add local storage from the node:

Cannot edit Host. Moving a host to a cluster with different management network is not allowed. That might cause connectivity loss.

The cluster the host is in uses a different default network to the default cluster - is this the problem?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/UW7UZBPD5SWQ54PWETWPYY2AYZDV5IVK/
[ovirt-users] Re: Using the web-ui VM portal through a proxy failing
Dear Greg, Did you manage to get any further with this, reverse proxy is rather critical to this project. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 6 Aug 2018, at 12:13, Greg Sheremeta mailto:gsher...@redhat.com>> wrote: I'll look into it and get back to you. On Mon, Aug 6, 2018 at 7:02 AM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Greg, So what's the go-to here, it seems so close but something in the API ajax is failing. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 27 Jul 2018, at 12:21, Greg Sheremeta mailto:gsher...@redhat.com>> wrote: On Fri, Jul 27, 2018 at 4:39 AM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Greg, Indeed, always the latest and greatest for us while trying to get this running. https://www.ovirt.org/documentation/security/squid-reverse-proxy/ Arrggghh, that is referring to the old GWT UserPortal and not the new react-based VM Portal. (I'll delete it / mark it obsolete. I apologize for the out-of-date state of our documentation. I am working on improving it.) Unfortunately we have never tested VM Portal with squid. @Lukas Svaty<mailto:lsv...@redhat.com> any chance you or someone on the team can assist? 
And the squid.conf file looks like this: https_port 443 accel key=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.key cert=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.crt defaultsite=ovirtengine.cluster cache_peer ovirtengine.cluster parent 443 0 no-query originserver ssl sslcafile=/etc/squid/ca.pem sslflags=DONT_VERIFY_PEER name=engine cache_peer_access engine allow all ssl_bump allow all http_port 3128 acl ovirt_nodes dst 192.168.64.0/24<http://192.168.64.0/24> acl ovirt_engine dstdomain .ovirtengine.cluster acl all_ips src 1.1.1.1/1<http://1.1.1.1/1> http_access allow ovirt_nodes ovirt_engine http_access allow all_ips http_access allow all # Following are from: # https://access.redhat.com/solutions/425693 # Leave coredumps in the first cache dir coredump_dir /var/spool/squid # RHEV and Spice may leave connections idle for long periods pconn_timeout 12 hours request_timeout 12 hours read_timeout12 hours # We need approx 20 open filehandles per spice client max_filedesc 16384 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 27 Jul 2018, at 01:15, Greg Sheremeta mailto:gsher...@redhat.com>> wrote: >From your other thread, I'm guessing 4.2.4. Can you send the link to the squid guide you used? On Wed, Jul 25, 2018 at 7:55 PM Greg Sheremeta mailto:gsher...@redhat.com>> wrote: Hi Callum, What version of ovirt-web-ui is this? Greg On Wed, Jul 18, 2018 at 7:12 AM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Those error logs are relevant only to another issue, please ignore. There appears to be a problem to do with authentication through the squid proxy though, which presents differently in Safari and Firefox: [X][X] Sorry for the screenshots but its the only way i can extract this data due to the page-refresh. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 18 Jul 2018, at 10:54, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Some relevant error logs: 2018-07-18 10:51:33,554+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-9) [557ca876] Running command : CreateUserSessionCommand internal: false. 2018-07-18 10:51:33,575+01 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-9) [557ca876] E VENT_ID: USER_VDC_LOGIN(30), User callum@Biomedical Research Computing connecting from '192.168.1.241' using session 'wiWA25wdaRP1zay iyTSGBJKpvi89LdzgKqeX12BcZhNVhpV2BIA+zkAnT50xOSDglxnhfAi3S2ZiODls8JYFUA==' logged in. 2018-07-18 10:51:34,135+01 ERROR [org.ovirt.engine.core.bll.GetSystemStatisticsQuery] (default task-5) [8d830cdb-fc11-4e68-94e6-73309 65c4488] Query execution failed due to insufficient permissions. 2018-07-18 10:51:34,205+01 ERROR [org.ovirt.engine.core.bll.GetPermissionsForObjectQuery] (default task-26) [ba1825f1-60fb-44cd-8b57- ea701cf698c0] Query execution failed due to insufficient permissions. 2018-07-18 10:51:34,242+01 ERROR [org.ovirt.engine.api.restapi.resource.AbstractBackendResource] (default task-26) [] Operation Faile d: query execution failed due to insufficient permissions. 2018-07-18 10:51:34,389+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (defaul
[ovirt-users] Re: Using the web-ui VM portal through a proxy failing
Dear Greg, So what's the go-to here, it seems so close but something in the API ajax is failing. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 27 Jul 2018, at 12:21, Greg Sheremeta mailto:gsher...@redhat.com>> wrote: On Fri, Jul 27, 2018 at 4:39 AM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Greg, Indeed, always the latest and greatest for us while trying to get this running. https://www.ovirt.org/documentation/security/squid-reverse-proxy/ Arrggghh, that is referring to the old GWT UserPortal and not the new react-based VM Portal. (I'll delete it / mark it obsolete. I apologize for the out-of-date state of our documentation. I am working on improving it.) Unfortunately we have never tested VM Portal with squid. @Lukas Svaty<mailto:lsv...@redhat.com> any chance you or someone on the team can assist? And the squid.conf file looks like this: https_port 443 accel key=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.key cert=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.crt defaultsite=ovirtengine.cluster cache_peer ovirtengine.cluster parent 443 0 no-query originserver ssl sslcafile=/etc/squid/ca.pem sslflags=DONT_VERIFY_PEER name=engine cache_peer_access engine allow all ssl_bump allow all http_port 3128 acl ovirt_nodes dst 192.168.64.0/24<http://192.168.64.0/24> acl ovirt_engine dstdomain .ovirtengine.cluster acl all_ips src 1.1.1.1/1<http://1.1.1.1/1> http_access allow ovirt_nodes ovirt_engine http_access allow all_ips http_access allow all # Following are from: # https://access.redhat.com/solutions/425693 # Leave coredumps in the first cache dir coredump_dir /var/spool/squid # RHEV and Spice may leave connections idle for long periods pconn_timeout 12 hours request_timeout 12 hours read_timeout12 hours # We need approx 20 open filehandles per spice client max_filedesc 16384 Regards, Callum -- Callum Smith Research Computing Core 
Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 27 Jul 2018, at 01:15, Greg Sheremeta mailto:gsher...@redhat.com>> wrote: >From your other thread, I'm guessing 4.2.4. Can you send the link to the squid guide you used? On Wed, Jul 25, 2018 at 7:55 PM Greg Sheremeta mailto:gsher...@redhat.com>> wrote: Hi Callum, What version of ovirt-web-ui is this? Greg On Wed, Jul 18, 2018 at 7:12 AM Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Those error logs are relevant only to another issue, please ignore. There appears to be a problem to do with authentication through the squid proxy though, which presents differently in Safari and Firefox: [X][X] Sorry for the screenshots but its the only way i can extract this data due to the page-refresh. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 18 Jul 2018, at 10:54, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Some relevant error logs: 2018-07-18 10:51:33,554+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-9) [557ca876] Running command : CreateUserSessionCommand internal: false. 2018-07-18 10:51:33,575+01 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-9) [557ca876] E VENT_ID: USER_VDC_LOGIN(30), User callum@Biomedical Research Computing connecting from '192.168.1.241' using session 'wiWA25wdaRP1zay iyTSGBJKpvi89LdzgKqeX12BcZhNVhpV2BIA+zkAnT50xOSDglxnhfAi3S2ZiODls8JYFUA==' logged in. 2018-07-18 10:51:34,135+01 ERROR [org.ovirt.engine.core.bll.GetSystemStatisticsQuery] (default task-5) [8d830cdb-fc11-4e68-94e6-73309 65c4488] Query execution failed due to insufficient permissions. 
2018-07-18 10:51:34,205+01 ERROR [org.ovirt.engine.core.bll.GetPermissionsForObjectQuery] (default task-26) [ba1825f1-60fb-44cd-8b57- ea701cf698c0] Query execution failed due to insufficient permissions. 2018-07-18 10:51:34,242+01 ERROR [org.ovirt.engine.api.restapi.resource.AbstractBackendResource] (default task-26) [] Operation Faile d: query execution failed due to insufficient permissions. 2018-07-18 10:51:34,389+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366 -44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions. 2018-07-18 10:51:34,393+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366 -44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions. 2018-07-18 10:51:34,394+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366 -44b0-4370-ab83-4781065e46c2] Query
[ovirt-users] Re: VM User with UserRole missing permissions to activate console and other actions
Dear Michal,

We're on the latest release version, 4.2.4. VM permissions are UserRole and no others.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Jul 2018, at 18:14, Michal Skrivanek <mskri...@redhat.com> wrote:

On 27 Jul 2018, at 16:37, Callum Smith <cal...@well.ox.ac.uk> wrote:

We have to go with full release versions ideally as the ovirtengine is in production, I'd rather avoid the master branch version.

sure, so which ones do you have? both ovirt-engine and ovirt-web-ui
Also please add what all permissions you have set on that VM, and which user tries to access it
[ovirt-users] Re: VM User with UserRole missing permissions to activate console and other actions
Dear Scott,

You're not wrong, the browser console reports the error:

Reducing action: {"type":"VM_ACTIONS_SET_CONSOLE_IN_USE","payload":{"vmId":"a437d298-59b3-4e10-9ced-516edb38a7ea","consoleInUse":false}} helpers.js:19:6
Reducing action: {"type":"SET_VM_CONSOLES","payload":{"vmId":"a437d298-59b3-4e10-9ced-516edb38a7ea","consoles":[{"id":"7370696365","protocol":"spice"},{"id":"766e63","protocol":"vnc"}]}} helpers.js:19:6
Reducing action: {"type":"VM_ACTION_IN_PROGRESS","payload":{"vmId":"a437d298-59b3-4e10-9ced-516edb38a7ea","name":"getConsole","started":false}} helpers.js:19:6
External action console() starts on {"type":"INTERNAL_CONSOLE","payload":{"vmId":"a437d298-59b3-4e10-9ced-516edb38a7ea","consoleId":"7370696365"}} helpers.js:19:6
_httpGet start: url="/ovirt-engine/api/vms/a437d298-59b3-4e10-9ced-516edb38a7ea/graphicsconsoles/7370696365" helpers.js:19:6
_httpGet: url="/ovirt-engine/api/vms/a437d298-59b3-4e10-9ced-516edb38a7ea/graphicsconsoles/7370696365", headers="{"Authorization":"Bearer 5orPWAJBkJx5g1bZDxi7Q3h3oQHFpWZW6XP5rZo_Moy3IShguyuA6VPpa4EB-o7mc8g_gHuua5BdMy8wwet3Yw","Accept-Language":"en_US","Filter":true,"Accept":"application/x-virt-viewer"}"
XML Parsing Error: no root element found Location: https://ovirtengine.cluster/ovirt-engine/api/vms/a437d298-59b3-4e10-9ced-516edb38a7ea/graphicsconsoles/7370696365 Line Number 1, Column 1:
Ajax failed: {"readyState":4,"responseText":"","status":403,"statusText":"Forbidden"} helpers.js:19:6
External action exception: {"readyState":4,"responseText":"","status":403,"statusText":"Forbidden"} helpers.js:19:6
Reducing action: {"type":"FAILED_EXTERNAL_ACTION","payload":{"message":"Failed to retrieve VM console details\nForbidden","shortMessage":"Failed to retrieve VM console details","type":403,"action":{"type":"INTERNAL_CONSOLE","payload":{"vmId":"a437d298-59b3-4e10-9ced-516edb38a7ea","consoleId":"7370696365"

All of the Forbidden responses could be misleading, but together with the server-side permission errors they lead me to believe it could be a combination of the issue you suggest and another underlying one. We ideally have to stay on full release versions as the ovirtengine is in production, so I'd rather avoid the master branch version.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Jul 2018, at 15:25, Scott Dickerson <sdick...@redhat.com> wrote:

Callum,

If you are seeing the error when opening the console from VM Portal (web-ui), this sounds a lot like issue [1]. A patch against the ovirt REST api was done to help mitigate some errors that should not happen. I'll take a look to see how it is working in the current master snapshot of ovirt-engine and VM Portal. What version of oVirt are you using?

[1] - https://github.com/oVirt/ovirt-web-ui/issues/509

Regards,
Scott Dickerson
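A side note on the console IDs in the browser log: the two graphics console IDs look like the hex-encoded ASCII of the protocol name, which makes it easier to see which console the failing request was for. A minimal sketch in Python follows; the helper name is made up for illustration, and the idea that the engine always derives IDs this way is an assumption based only on the two IDs shown in the log.

```python
def decode_console_id(console_id: str) -> str:
    """Decode a hex-encoded graphics console ID into ASCII text."""
    return bytes.fromhex(console_id).decode("ascii")


# The two IDs from the SET_VM_CONSOLES action in the log.
for cid in ("7370696365", "766e63"):
    print(cid, "->", decode_console_id(cid))
```

Read this way, the 403 on .../graphicsconsoles/7370696365 is the request for the SPICE console.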
[ovirt-users] Re: VM User with UserRole missing permissions to activate console and other actions
Thanks Greg,

It really would be great to resolve this; I would really like to avoid a clean install of the engine on a new hostname (especially without a guarantee that it will work). Migrating is a significant amount of work.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Jul 2018, at 01:20, Greg Sheremeta <gsher...@redhat.com> wrote:

Adding some people who may be able to help.

--
GREG SHEREMETA
SENIOR SOFTWARE ENGINEER - TEAM LEAD - RHV UX
Red Hat NA <https://www.redhat.com/>
gsher...@redhat.com IRC: gshereme
[ovirt-users] Re: Using the web-ui VM portal through a proxy failing
Dear Greg,

Indeed, always the latest and greatest for us while trying to get this running. The guide we used:
https://www.ovirt.org/documentation/security/squid-reverse-proxy/

And the squid.conf file looks like this:

https_port 443 accel key=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.key cert=/etc/squid/rescomp-vmgw.well.ox.ac.uk.proxy.crt defaultsite=ovirtengine.cluster
cache_peer ovirtengine.cluster parent 443 0 no-query originserver ssl sslcafile=/etc/squid/ca.pem sslflags=DONT_VERIFY_PEER name=engine
cache_peer_access engine allow all
ssl_bump allow all
http_port 3128
acl ovirt_nodes dst 192.168.64.0/24
acl ovirt_engine dstdomain .ovirtengine.cluster
acl all_ips src 1.1.1.1/1
http_access allow ovirt_nodes ovirt_engine
http_access allow all_ips
http_access allow all
# Following are from:
# https://access.redhat.com/solutions/425693
# Leave coredumps in the first cache dir
coredump_dir /var/spool/squid
# RHEV and Spice may leave connections idle for long periods
pconn_timeout 12 hours
request_timeout 12 hours
read_timeout 12 hours
# We need approx 20 open filehandles per spice client
max_filedesc 16384

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 27 Jul 2018, at 01:15, Greg Sheremeta <gsher...@redhat.com> wrote:

From your other thread, I'm guessing 4.2.4. Can you send the link to the squid guide you used?

On Wed, Jul 25, 2018 at 7:55 PM Greg Sheremeta <gsher...@redhat.com> wrote:

Hi Callum,
What version of ovirt-web-ui is this?
Greg

On Wed, Jul 18, 2018 at 7:12 AM Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

Those error logs are relevant only to another issue, please ignore. There appears to be a problem to do with authentication through the squid proxy though, which presents differently in Safari and Firefox.

Sorry for the screenshots, but it's the only way I can extract this data due to the page refresh.
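One detail of the squid.conf in this message is easy to misread: `acl all_ips src 1.1.1.1/1` does not cover every address. A small Python sketch with the standard `ipaddress` module shows which range a /1 mask describes. It assumes Squid applies the mask the same way the module does, and the helper name `acl_matches` is made up for illustration. The later `http_access allow all` line admits everyone anyway, so this is not offered as the cause of the hanging requests.

```python
import ipaddress

# The source range from `acl all_ips src 1.1.1.1/1`.
# strict=False masks off the host bits instead of raising ValueError.
all_ips = ipaddress.ip_network("1.1.1.1/1", strict=False)
print(all_ips)


def acl_matches(client: str, network=all_ips) -> bool:
    """Return True if the client address falls inside the ACL network."""
    return ipaddress.ip_address(client) in network


# 192.168.1.241 is the client address that appears in the engine.log
# excerpts in this archive; 10.0.0.1 is an arbitrary example address.
print(acl_matches("192.168.1.241"))
print(acl_matches("10.0.0.1"))
```

If the intent of `all_ips` was "any source", Squid's built-in `all` ACL already expresses that.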
[ovirt-users] VM User with UserRole missing permissions to activate console and other actions
Dear All,

Please see the errors below. I'm seeing this in the engine.log when, as a user, I try to either activate a VM console or reboot a VM which I have access to ("UserRole" permission assigned to the VM).

2018-07-18 10:51:33,554+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-9) [557ca876] Running command : CreateUserSessionCommand internal: false.
2018-07-18 10:51:33,575+01 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-9) [557ca876] EVENT_ID: USER_VDC_LOGIN(30), User callum@Biomedical Research Computing connecting from '192.168.1.241' using session 'wiWA25wdaRP1zayiyTSGBJKpvi89LdzgKqeX12BcZhNVhpV2BIA+zkAnT50xOSDglxnhfAi3S2ZiODls8JYFUA==' logged in.
2018-07-18 10:51:34,135+01 ERROR [org.ovirt.engine.core.bll.GetSystemStatisticsQuery] (default task-5) [8d830cdb-fc11-4e68-94e6-7330965c4488] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,205+01 ERROR [org.ovirt.engine.core.bll.GetPermissionsForObjectQuery] (default task-26) [ba1825f1-60fb-44cd-8b57-ea701cf698c0] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,242+01 ERROR [org.ovirt.engine.api.restapi.resource.AbstractBackendResource] (default task-26) [] Operation Failed: query execution failed due to insufficient permissions.
2018-07-18 10:51:34,389+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,393+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,394+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,396+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:59,195+01 WARN [org.ovirt.engine.core.bll.SetVmTicketCommand] (default task-18) [7881a832] User '9386d6f5-f172-4cdb-abca-62492a357888' is trying to take the console of virtual machine 'ddb23e0a-01d5-403c-89ab-37c400d2c938', but the console is already taken by user 'd021fc10-4f7c-11e8-88cb-00163e6a7aff'.
2018-07-18 10:51:59,197+01 INFO [org.ovirt.engine.core.bll.SetVmTicketCommand] (default task-18) [7881a832] No permission found for user '9386d6f5-f172-4cdb-abca-62492a357888' or one of the groups he is member of, when running action 'SetVmTicket', Required permissions are: Action type: 'USER' Action group: 'RECONNECT_TO_VM' Object type: 'VM' Object ID: 'ddb23e0a-01d5-403c-89ab-37c400d2c938'.
2018-07-18 10:51:59,197+01 WARN [org.ovirt.engine.core.bll.SetVmTicketCommand] (default task-18) [7881a832] Validation of action 'SetVmTicket' failed for user callum@Biomedical Research Computing. Reasons: VAR__ACTION__SET,VAR__TYPE__VM_TICKET,USER_CANNOT_FORCE_RECONNECT_TO_VM
2018-07-18 10:51:59,198+01 ERROR [org.ovirt.engine.api.restapi.resource.BackendVmGraphicsConsoleResource] (default task-18) [] Operation Failed: USER_CANNOT_FORCE_RECONNECT_TO_VM

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/D7GSJDZ32DBIMYRMX3MF2TKVNTP5Q6DW/
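The last four entries of the log say what was actually denied: the console of that VM was already taken by another user, so the engine asked for the RECONNECT_TO_VM action group on the VM, which the user did not have. To pull such denials out of a longer engine.log, a rough Python sketch could look like the following. The pattern is written against the single entry shown here and assumes each entry sits on one line; `DENIAL` and `missing_permissions` are illustrative names, not part of any oVirt tooling.

```python
import re

# Matches the "No permission found" entry written by SetVmTicketCommand,
# assuming the entry sits on a single line as the engine writes it.
DENIAL = re.compile(
    r"No permission found for user '(?P<user>[^']+)'.*?"
    r"when running action '(?P<action>[^']+)'.*?"
    r"Action group: '(?P<group>[^']+)'.*?"
    r"Object type: '(?P<object_type>[^']+)'"
)


def missing_permissions(log_lines):
    """Yield (user, action, action_group, object_type) per denial entry."""
    for line in log_lines:
        match = DENIAL.search(line)
        if match:
            yield match.group("user", "action", "group", "object_type")


# The denial entry from the log above, joined back onto one line.
SAMPLE = (
    "2018-07-18 10:51:59,197+01 INFO [org.ovirt.engine.core.bll.SetVmTicketCommand] "
    "(default task-18) [7881a832] No permission found for user "
    "'9386d6f5-f172-4cdb-abca-62492a357888' or one of the groups he is member of, "
    "when running action 'SetVmTicket', Required permissions are: Action type: 'USER' "
    "Action group: 'RECONNECT_TO_VM' Object type: 'VM' "
    "Object ID: 'ddb23e0a-01d5-403c-89ab-37c400d2c938'."
)

for denial in missing_permissions([SAMPLE]):
    print(denial)
```

In practice the lines would come from the engine.log file itself, for example by passing an open file object instead of the one-element list.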
[ovirt-users] Re: Using the web-ui VM portal through a proxy failing
Dear All,

Some relevant error logs:

2018-07-18 10:51:33,554+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-9) [557ca876] Running command : CreateUserSessionCommand internal: false.
2018-07-18 10:51:33,575+01 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-9) [557ca876] EVENT_ID: USER_VDC_LOGIN(30), User callum@Biomedical Research Computing connecting from '192.168.1.241' using session 'wiWA25wdaRP1zayiyTSGBJKpvi89LdzgKqeX12BcZhNVhpV2BIA+zkAnT50xOSDglxnhfAi3S2ZiODls8JYFUA==' logged in.
2018-07-18 10:51:34,135+01 ERROR [org.ovirt.engine.core.bll.GetSystemStatisticsQuery] (default task-5) [8d830cdb-fc11-4e68-94e6-7330965c4488] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,205+01 ERROR [org.ovirt.engine.core.bll.GetPermissionsForObjectQuery] (default task-26) [ba1825f1-60fb-44cd-8b57-ea701cf698c0] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,242+01 ERROR [org.ovirt.engine.api.restapi.resource.AbstractBackendResource] (default task-26) [] Operation Failed: query execution failed due to insufficient permissions.
2018-07-18 10:51:34,389+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,393+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,394+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:34,396+01 ERROR [org.ovirt.engine.core.bll.storage.domain.GetStorageDomainListByIdQuery] (default task-17) [02965366-44b0-4370-ab83-4781065e46c2] Query execution failed due to insufficient permissions.
2018-07-18 10:51:59,195+01 WARN [org.ovirt.engine.core.bll.SetVmTicketCommand] (default task-18) [7881a832] User '9386d6f5-f172-4cdb-abca-62492a357888' is trying to take the console of virtual machine 'ddb23e0a-01d5-403c-89ab-37c400d2c938', but the console is already taken by user 'd021fc10-4f7c-11e8-88cb-00163e6a7aff'.
2018-07-18 10:51:59,197+01 INFO [org.ovirt.engine.core.bll.SetVmTicketCommand] (default task-18) [7881a832] No permission found for user '9386d6f5-f172-4cdb-abca-62492a357888' or one of the groups he is member of, when running action 'SetVmTicket', Required permissions are: Action type: 'USER' Action group: 'RECONNECT_TO_VM' Object type: 'VM' Object ID: 'ddb23e0a-01d5-403c-89ab-37c400d2c938'.
2018-07-18 10:51:59,197+01 WARN [org.ovirt.engine.core.bll.SetVmTicketCommand] (default task-18) [7881a832] Validation of action 'SetVmTicket' failed for user callum@Biomedical Research Computing. Reasons: VAR__ACTION__SET,VAR__TYPE__VM_TICKET,USER_CANNOT_FORCE_RECONNECT_TO_VM
2018-07-18 10:51:59,198+01 ERROR [org.ovirt.engine.api.restapi.resource.BackendVmGraphicsConsoleResource] (default task-18) [] Operation Failed: USER_CANNOT_FORCE_RECONNECT_TO_VM

Seems like there's a permission missing in there - this is a newly attached LDAP group.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk
[ovirt-users] Re: Issue when updating node to 4.2.4
Dear All,

A combination of steps fixed this:

yum downgrade ovirt-node-ng-image-update
lvremove /dev//.0
lvremove /dev//.0+1
lvremove /dev//var_crash
fstrim -av
reboot

After that, the next yum update ovirt-node-ng-image-update worked. Thanks all for your help!

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 18 Jul 2018, at 07:07, Oliver Riesener <oliver.riese...@hs-bremen.de> wrote:

Hi Callum,

your LVM thin pool seems to be full. Try to trim it with: fstrim -av

On 12.07.2018 at 13:13, Callum Smith <cal...@well.ox.ac.uk> wrote:

'lvcreate', '--thin', '--virtualsize', u'349121282048B', '--name', 'ovirt-node-ng-4.2.4-0.20180626.0', u'onn_virta003/pool00']'

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/SGJTCFJJZDZ6CZL3JXICH3X2DFY7HUX5/
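Oliver's diagnosis (a full thin pool) can be checked before and after the cleanup by looking at the pool's data and metadata usage. Below is a minimal Python sketch that filters the output of `lvs --noheadings --separator , -o lv_name,data_percent,metadata_percent onn_virta003` for volumes above a threshold. The sample output, the 90% threshold and the function name are assumptions made up for illustration, not values taken from the affected host.

```python
def full_pools(lvs_output: str, threshold: float = 90.0):
    """Return (lv_name, data%, metadata%) rows at or above the threshold."""
    rows = []
    for line in lvs_output.splitlines():
        fields = [f.strip() for f in line.split(",")]
        if len(fields) != 3 or not fields[1]:
            continue  # non-thin LVs report empty percentages
        name = fields[0]
        data = float(fields[1])
        meta = float(fields[2] or 0)  # thin volumes have no metadata figure
        if data >= threshold or meta >= threshold:
            rows.append((name, data, meta))
    return rows


# Hypothetical lvs output; real numbers will differ per host.
SAMPLE = """\
  pool00,97.52,12.30
  var_crash,2.86,
  swap,,
"""

print(full_pools(SAMPLE))
```

On a real host the text would come from running the `lvs` command as root and passing its output to `full_pools`.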
[ovirt-users] Re: Issue when updating node to 4.2.4
Dear all,

Has anyone else experienced these issues, or are they exclusive to these nodes? I'd rather avoid re-installing them as a work-around if possible.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 12 Jul 2018, at 12:13, Callum Smith <cal...@well.ox.ac.uk> wrote:

Error in the scripts as part of doing a yum update on the host:

/var/log/messages

Jul 12 12:07:00 virtA003 imgbased: 2018-07-12 12:07:00,170 [INFO] (MainThread) Extracting image '/usr/share/ovirt-node-ng/image//ovirt-node-ng-4.2.0-0.20180626.0.el7.squashfs.img'
Jul 12 12:07:00 virtA003 kernel: EXT4-fs (loop3): mounted filesystem with ordered data mode. Opts: (null)
Jul 12 12:07:00 virtA003 imgbased: 2018-07-12 12:07:00,295 [INFO] (MainThread) Starting base creation
Jul 12 12:07:00 virtA003 imgbased: 2018-07-12 12:07:00,295 [INFO] (MainThread) New base will be: ovirt-node-ng-4.2.4-0.20180626.0
Jul 12 12:07:00 virtA003 python: detected unhandled Python exception in '/tmp/tmp.awWEfeJStR/usr/lib/python2.7/site-packages/imgbased/__main__.py'
Jul 12 12:07:00 virtA003 abrt-server: Executable '/tmp/tmp.awWEfeJStR/usr/lib/python2.7/site-packages/imgbased/__main__.py' doesn't belong to any package and ProcessUnpackaged is set to 'no'
Jul 12 12:07:00 virtA003 abrt-server: 'post-create' on '/var/tmp/abrt/Python-2018-07-12-12:07:00-4688' exited with 1
Jul 12 12:07:00 virtA003 abrt-server: Deleting problem directory '/var/tmp/abrt/Python-2018-07-12-12:07:00-4688'
Jul 12 12:07:00 virtA003 yum[4558]: Updated: ovirt-node-ng-image-update-4.2.4-1.el7.noarch

/tmp/imgbased.log

2018-07-12 12:07:00,166 [DEBUG] (MainThread) Version: imgbased-1.0.20
2018-07-12 12:07:00,170 [DEBUG] (MainThread) Arguments: Namespace(FILENAME='/usr/share/ovirt-node-ng/image//ovirt-node-ng-4.2.0-0.20180626.0.el7.squashfs.img', command='update', debug=True, experimental=False, format='liveimg', stream='Image')
2018-07-12 12:07:00,170 [INFO] (MainThread) Extracting image '/usr/share/ovirt-node-ng/image//ovirt-node-ng-4.2.0-0.20180626.0.el7.squashfs.img'
2018-07-12 12:07:00,170 [DEBUG] (MainThread) Calling binary: (['mktemp', '-d', '--tmpdir', 'mnt.X'],) {}
2018-07-12 12:07:00,170 [DEBUG] (MainThread) Calling: (['mktemp', '-d', '--tmpdir', 'mnt.X'],) {'close_fds': True, 'stderr': -2}
2018-07-12 12:07:00,173 [DEBUG] (MainThread) Returned: /tmp/mnt.qTRik
2018-07-12 12:07:00,173 [DEBUG] (MainThread) Calling binary: (['mount', '/usr/share/ovirt-node-ng/image//ovirt-node-ng-4.2.0-0.20180626.0.el7.squashfs.img', u'/tmp/mnt.qTRik'],) {}
2018-07-12 12:07:00,174 [DEBUG] (MainThread) Calling: (['mount', '/usr/share/ovirt-node-ng/image//ovirt-node-ng-4.2.0-0.20180626.0.el7.squashfs.img', u'/tmp/mnt.qTRik'],) {'close_fds': True, 'stderr': -2}
2018-07-12 12:07:00,180 [DEBUG] (MainThread) Returned:
2018-07-12 12:07:00,180 [DEBUG] (MainThread) Mounted squashfs
2018-07-12 12:07:00,180 [DEBUG] (MainThread) Found fsimage at '/tmp/mnt.qTRik/LiveOS/rootfs.img'
2018-07-12 12:07:00,180 [DEBUG] (MainThread) Calling binary: (['mktemp', '-d', '--tmpdir', 'mnt.X'],) {}
2018-07-12 12:07:00,181 [DEBUG] (MainThread) Calling: (['mktemp', '-d', '--tmpdir', 'mnt.X'],) {'close_fds': True, 'stderr': -2}
2018-07-12 12:07:00,184 [DEBUG] (MainThread) Returned: /tmp/mnt.SIk9q
2018-07-12 12:07:00,184 [DEBUG] (MainThread) Calling binary: (['mount', u'/tmp/mnt.qTRik/LiveOS/rootfs.img', u'/tmp/mnt.SIk9q'],) {}
2018-07-12 12:07:00,184 [DEBUG] (MainThread) Calling: (['mount', u'/tmp/mnt.qTRik/LiveOS/rootfs.img', u'/tmp/mnt.SIk9q'],) {'close_fds': True, 'stderr': -2}
2018-07-12 12:07:00,199 [DEBUG] (MainThread) Returned:
2018-07-12 12:07:00,212 [DEBUG] (MainThread) Using nvr: ovirt-node-ng-4.2.4-0.20180626.0
2018-07-12 12:07:00,212 [DEBUG] (MainThread) Fetching image for '/'
2018-07-12 12:07:00,212 [DEBUG] (MainThread) Calling binary: (['findmnt', '--noheadings', '-o', 'SOURCE', '/'],) {}
2018-07-12 12:07:00,212 [DEBUG] (MainThread) Calling: (['findmnt', '--noheadings', '-o', 'SOURCE', '/'],) {'close_fds': True, 'stderr': -2}
2018-07-12 12:07:00,217 [DEBUG] (MainThread) Returned: /dev/mapper/onn_virta003-ovirt--node--ng--4.2.3.1--0.20180530.0+1
2018-07-12 12:07:00,218 [DEBUG] (MainThread) Found '/dev/mapper/onn_virta003-ovirt--node--ng--4.2.3.1--0.20180530.0+1'
2018-07-12 12:07:00,218 [DEBUG] (MainThread) Calling binary: (['lvs', '--noheadings', '--ignoreskippedcluster', '-ovg_name,lv_name', u'/dev/mapper/onn_virta003-ovirt--node--ng--4.2.3.1--0.20180530.0+1'],) {'stderr': }
2018-07-12 12:07:00,218 [DEBUG] (MainThread) Calling: (['lvs', '--noheadings', '--ignoreskippedcluster', '-ovg_name,lv_name', u'/dev/mapper/onn_virta003-ovirt--node--ng--4.2.3.1--0.20180530.0+1'],) {'close_fds': True, 'stderr': }
2018-07-12 12:07:00,248 [DEBUG] (MainThread) Returned: onn_virta003 ovirt-node-ng-4.2.3.1-0.20180530.0+1
2018-07-12 12:07:0
[ovirt-users] Re: Using the web-ui VM portal through a proxy failing
Dear All,

Does anyone know how to set such options in the web-ui?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 12 Jul 2018, at 11:09, Callum Smith <cal...@well.ox.ac.uk> wrote:

Is there a way to force noVNC HTML as the console mode through the web-ui, or at least have it as an option if not default?

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/7NBOGYVL4EAH4QQI6ETPMFNXC5VSTZCP/
[ovirt-users] Issue when updating node to 4.2.4
  File "/tmp/tmp.awWEfeJStR/usr/lib/python2.7/site-packages/imgbased/utils.py", line 378, in call
    stdout = call(*args, **kwargs)
  File "/tmp/tmp.awWEfeJStR/usr/lib/python2.7/site-packages/imgbased/utils.py", line 153, in call
    return subprocess.check_output(*args, **kwargs).strip()
  File "/usr/lib64/python2.7/subprocess.py", line 575, in check_output
    raise CalledProcessError(retcode, cmd, output=output)
subprocess.CalledProcessError: Command '['lvcreate', '--thin', '--virtualsize', u'349121282048B', '--name', 'ovirt-node-ng-4.2.4-0.20180626.0', u'onn_virta003/pool00']' returned non-zero exit status 5

How would I move forward on this?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/EMEF6P2RMCDUVQST42BMPEN3O3NQGCET/
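The traceback above reports only that lvcreate exited with status 5; whatever LVM wrote to stderr to explain the failure is not part of the exception message. As a minimal sketch of how a caller could surface that reason, the following Python 3 snippet (the traceback itself comes from Python 2.7) captures stderr together with the exit code. `run_and_report` is a hypothetical helper, not part of imgbased, and the `sh -c` command is only a stand-in for the real lvcreate call.

```python
import subprocess


def run_and_report(cmd):
    """Run cmd and return (returncode, stdout, stderr) instead of raising.

    Hypothetical helper for illustration only; not part of imgbased.
    """
    proc = subprocess.run(
        cmd,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        universal_newlines=True,  # decode output as text
    )
    return proc.returncode, proc.stdout.strip(), proc.stderr.strip()


if __name__ == "__main__":
    # Stand-in for the failing lvcreate call: prints a reason on stderr
    # and exits with status 5, like the command in the traceback.
    fake_cmd = ["sh", "-c", "echo 'simulated LVM error' >&2; exit 5"]
    code, out, err = run_and_report(fake_cmd)
    print("exit status:", code)
    print("stderr:", err)
```

Running the same lvcreate arguments by hand on the node would show the equivalent stderr text directly, which is usually the quickest way to see why the thin volume could not be created.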
[ovirt-users] Using the web-ui VM portal through a proxy failing
Dear oVirt Gurus,

Using the oVirt user VM portal seems to not work through the squid proxy setup (configured as per the guide). The page loads and login works fine through the proxy, but the asynchronous requests just hang. I've attached a screenshot, but you can see the "api" endpoint just hanging in a web inspector: "https://proxyfqdn/ovirt-engine/api/"

This works fine when not going through the proxy.

Is there a way to force noVNC HTML as the console mode through the web-ui, or at least have it as an option if not default? The console seems not to work when logged in with a base 'user role'.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/VZIGGZZ2IIHBZ65QCX5PLB65DEMRQD4X/
[ovirt-users] Re: vGPU setup guide
Works perfectly on our P40; we are going to try it on a 32GB V100 card too. We just have console and general user-portal woes to get through before we can do proper scale testing.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 24 Jun 2018, at 13:20, femi adegoke <ov...@fateknollogee.com> wrote:

What's the latest, any updates? What GPU are you using?

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/I252NSG6P7FSZOBNBIWCI2I7I3UPIBAC/
[ovirt-users] Re: Hosted engine cannot migrate
For the one host with the stale-data, putting it into maintenance and then rebooting seems to have brought it back and stopped the errors. It seems the ha-broker is a bit more temperamental than usual after the 4.2.3.1 release.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 14 Jun 2018, at 10:34, Martin Sivak <msi...@redhat.com> wrote:

Dear Callum,

unknown stale-data means the hosts did not submit a status update during the last minute. That might be just a glitch, or something happened to the storage connection there.

Best regards
Martin Sivak

On Thu, Jun 14, 2018 at 11:28 AM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear Martin,

The engine is running happily and migration appears to work, although it appears one node has dropped into "unknown stale-data" on vm-status (the node that is neither the originator nor the target for the migration).

Regards,
Callum

On 14 Jun 2018, at 10:22, Martin Sivak <msi...@redhat.com> wrote:

Hi,

is the engine running even though you see the errors in the log? Hosted engine agents fight for the lock when starting the engine VM. One wins and the others report an issue. We have some checks in place to silence those, but maybe it leaked again. This might be just annoying as long as the engine is up. Manually clicking the migrate button should also work.

Best regards
Martin Sivak

On Thu, Jun 14, 2018 at 10:41 AM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

Getting an issue where the HE can't be migrated, the log is full of:

"VM HostedEngine is down with error. Exit message: resource busy: Failed to acquire lock: Lease is held by another host."

engine.log attached

Regards,
Callum

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/JXTP2AUZ7P7YJY7XQFE7DM2YWOOZLFRE/
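Martin's description of "unknown stale-data" (no status update from a host during the last minute) amounts to a simple freshness check. The sketch below illustrates that idea only; the 60-second threshold is taken from the wording "during the last minute", and `is_stale` is a hypothetical name, not the hosted-engine agent's actual code.

```python
from datetime import datetime, timedelta

# Assumption: "during the last minute" is read as a 60-second threshold.
STALE_AFTER = timedelta(seconds=60)


def is_stale(last_update, now, threshold=STALE_AFTER):
    """Return True if a host's last status update is older than threshold.

    Hypothetical illustration; the real agent logic is not shown in the thread.
    """
    return (now - last_update) > threshold


if __name__ == "__main__":
    now = datetime(2018, 6, 14, 10, 30, 0)
    fresh = now - timedelta(seconds=20)
    old = now - timedelta(seconds=90)
    print("fresh host stale?", is_stale(fresh, now))
    print("old host stale?", is_stale(old, now))
```

Under this reading, a host can flip to stale after a brief storage hiccup and recover on its next successful update, which matches the "might be just a glitch" remark.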
[ovirt-users] Re: Hosted engine cannot migrate
Dear Martin,

The engine is running happily and migration appears to work, although it appears one node has dropped into "unknown stale-data" on vm-status (the node that is neither the originator nor the target for the migration).

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 14 Jun 2018, at 10:22, Martin Sivak <msi...@redhat.com> wrote:

Hi,

is the engine running even though you see the errors in the log? Hosted engine agents fight for the lock when starting the engine VM. One wins and the others report an issue. We have some checks in place to silence those, but maybe it leaked again. This might be just annoying as long as the engine is up. Manually clicking the migrate button should also work.

Best regards
Martin Sivak

On Thu, Jun 14, 2018 at 10:41 AM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

Getting an issue where the HE can't be migrated, the log is full of:

"VM HostedEngine is down with error. Exit message: resource busy: Failed to acquire lock: Lease is held by another host."

engine.log attached

Regards,
Callum

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/KRGOH56HKO6SHS7CEMSMS7JMZKXSCPDG/
[ovirt-users] Hosted engine cannot migrate
Dear All,

Getting an issue where the HE can't be migrated, the log is full of:

"VM HostedEngine is down with error. Exit message: resource busy: Failed to acquire lock: Lease is held by another host."

engine.log attached

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

Attachment: engine.log.tar.gz

List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/SR5RVQ2WFDMKNO4RXSCU45GJ7RTXDZX4/
[ovirt-users] Re: LDAP Authentication issues
Dear All,

It seems the required element for the CPU Profile to work is in the roles_groups table:

insert into roles_groups (role_id, action_group_id) VALUES ('def00017----def00017', '1668');

Whether the action_group_id is install-specific or not is unclear, but the role UUID for "CPUOperator" should be standard.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 12 Jun 2018, at 09:47, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

The process of database "fixing" is required because adding system permissions to the "Everyone" group is a one-way process that causes many problems, and there is no way to rescue it from the GUI; the only options are to restore from backup or rebuild the permissions database.

The next issue is that CPU Profiles are locked out to even the SuperUser, so creating a VM with the SuperUser account with reset permissions is denied:

User doesn't have permissions to assign the cpu profile Default with id 58ca604e-01a7-003f-01de-0250 to VMs.

I consider that to be a bug, personally, as a SuperUser should have access to everything by definition. To solve this in the meantime, I need to know the object_type_id of a cpu profile to manually reassign permissions to it (you can't control CPU profile permissions in the GUI either, only view).

Regards,
Callum

On 12 Jun 2018, at 06:44, Roy Golan <rgo...@redhat.com> wrote:

On Tue, 12 Jun 2018 at 02:24 Donny Davis <do...@fortnebula.com> wrote:

I am happy to help where I can. I would also not recommend tinkering around in the database, but I am happy to hear you have it all running. :) Everything you should ever be doing in the engine is available via the API/UI. Just some general advice.
On Mon, Jun 11, 2018 at 9:31 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All & Donny, Thank you for the clarifications, very useful indeed. A note for future users who go down this path and dont want to restore or reinstall: Cleaning out the `permissions` table in the database and restoring the defaults will solve the issue, but you need to restore the SuperUser permission on the admin@internal account: Learning from here: https://www.ovirt.org/develop/developer-guide/action-permissions-overview/ Clean out your `roles_groups` and `permissions` DELETE FROM `permissions`; DELETE FROM `roles_groups`; Restore the defaults: https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00600_insert_permissions.sql https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00700_insert_roles_groups.sql Re-assign the SuperUser role to the admin@internal user: Either: https://github.com/oVirt/ovirt-engine/blob/master/packaging/bin/ovirt-engine-role.sh Or just go straight into your localhost psql on your engine, replacing information as appropriate: Get your external_id from the users table and use it in the function: SELECT external_id FROM `users` WHERE `name` = 'admin' AND `domain` = 'internal-authz'; select attach_user_to_role('admin','internal-authz','*','#external_id#','SuperUser'); Regards, Callum I think the root cause here is that you are trying to login to the webadmin and not the vm portal. User are authorized to login to the web admin only if they have a role of type 'admin'. And UserRole is a 'user' type. So the solution is not the give SuperUser for all those users, this is just a workaround. If you want to know for sure, go to Administration - Configure - Roles. So ask yourself why users need access to the webadmin at all. If they need admin permission assign them an appropriate role on the DC or the cluster. If not, they use the VM portal. 
Having said all that, if nothing helps and the db needs 'fixing' (I doubt it though) then this is a bug. -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:57, Donny Davis mailto:do...@fortnebula.com>> wrote: https://lists.ovirt.org/pipermail/users/2015-January/030981.html This is the thread where I discussed a bit of the permissions thing. I am sure things have changed since 3.5.1, but should get you down the right path. On Mon, Jun 11, 2018 at 6:54 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Yes, in process of trying to fix/identify things - need to undo this. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:48, Donny Dav
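Roy's point in this message is that webadmin login depends on the type of the roles a user holds, not just on having any role: only roles of type 'admin' qualify, and UserRole is of type 'user'. A toy model of that rule, as a sketch only: the `ROLE_TYPES` mapping and `can_login_webadmin` are hypothetical names for illustration and do not reflect how the engine stores or evaluates roles.

```python
# Illustrative mapping of role name -> role type. Only the two roles
# discussed in the thread are listed; this is not the engine's role table.
ROLE_TYPES = {
    "SuperUser": "admin",
    "UserRole": "user",
}


def can_login_webadmin(assigned_roles, role_types=ROLE_TYPES):
    """Return True if at least one assigned role is of type 'admin'.

    Hypothetical model of the rule described in the thread.
    """
    return any(role_types.get(role) == "admin" for role in assigned_roles)


if __name__ == "__main__":
    print("UserRole only:", can_login_webadmin(["UserRole"]))
    print("UserRole + SuperUser:", can_login_webadmin(["UserRole", "SuperUser"]))
```

In this model a user holding only UserRole is refused by the webadmin but would still be expected to use the VM portal, which is the distinction Roy draws.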
[ovirt-users] Re: LDAP Authentication issues
Yes indeed Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 12 Jun 2018, at 10:18, Roy Golan mailto:rgo...@redhat.com>> wrote: On Tue, 12 Jun 2018 at 11:48 Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Process of database "fixing" is required because adding system permissions to the "Everyone" group is a one-way process that causes many problems and there is no way to rescue from the GUI, only options are to restore from backup or rebuild the permissions database. The next issue, is that CPU Profiles are locked out to even the SuperUser - so creating a VM with the SuperUser account with reset permissions is denied: User doesn't have permissions to assign the cpu profile Default with id 58ca604e-01a7-003f-01de-0250 to VMs. I consider that to be a bug, personally, as a SuperUser should have access to everything by definition. Is that user is admin@internal? To solve this in the mean time, i need to know the object_type_id of a cpu profile to manually reassign permissions to it (you can't control CPU profile permissions in the GUI either, only view). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 12 Jun 2018, at 06:44, Roy Golan mailto:rgo...@redhat.com>> wrote: On Tue, 12 Jun 2018 at 02:24 Donny Davis mailto:do...@fortnebula.com>> wrote: I am happy to help where I can. I would also not recommend tinkering around in the database, but I am happy to hear you have it all running. :) Everything you should every be doing in the engine is available via the API/UI. Just some general advice. On Mon, Jun 11, 2018 at 9:31 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All & Donny, Thank you for the clarifications, very useful indeed. 
A note for future users who go down this path and dont want to restore or reinstall: Cleaning out the `permissions` table in the database and restoring the defaults will solve the issue, but you need to restore the SuperUser permission on the admin@internal account: Learning from here: https://www.ovirt.org/develop/developer-guide/action-permissions-overview/ Clean out your `roles_groups` and `permissions` DELETE FROM `permissions`; DELETE FROM `roles_groups`; Restore the defaults: https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00600_insert_permissions.sql https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00700_insert_roles_groups.sql Re-assign the SuperUser role to the admin@internal user: Either: https://github.com/oVirt/ovirt-engine/blob/master/packaging/bin/ovirt-engine-role.sh Or just go straight into your localhost psql on your engine, replacing information as appropriate: Get your external_id from the users table and use it in the function: SELECT external_id FROM `users` WHERE `name` = 'admin' AND `domain` = 'internal-authz'; select attach_user_to_role('admin','internal-authz','*','#external_id#','SuperUser'); Regards, Callum I think the root cause here is that you are trying to login to the webadmin and not the vm portal. User are authorized to login to the web admin only if they have a role of type 'admin'. And UserRole is a 'user' type. So the solution is not the give SuperUser for all those users, this is just a workaround. If you want to know for sure, go to Administration - Configure - Roles. So ask yourself why users need access to the webadmin at all. If they need admin permission assign them an appropriate role on the DC or the cluster. If not, they use the VM portal. Having said all that, if nothing helps and the db needs 'fixing' (I doubt it though) then this is a bug. -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:57, Donny Davis mailto:do...@fortnebula.com>> wrote: https://lists.ovirt.org/pipermail/users/2015-January/030981.html This is the thread where I discussed a bit of the permissions thing. I am sure things have changed since 3.5.1, but should get you down the right path. On Mon, Jun 11, 2018 at 6:54 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Yes, in process of trying to fix/identify things - need to undo this. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:48, Donny Davis mailto:do...@fortnebula.com>> wrote: did you add system permissions to the everyone group? On Mon, Jun 11, 2018 at 6:42 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Happy for you to
[ovirt-users] Re: LDAP Authentication issues
Dear All, Process of database "fixing" is required because adding system permissions to the "Everyone" group is a one-way process that causes many problems and there is no way to rescue from the GUI, only options are to restore from backup or rebuild the permissions database. The next issue, is that CPU Profiles are locked out to even the SuperUser - so creating a VM with the SuperUser account with reset permissions is denied: User doesn't have permissions to assign the cpu profile Default with id 58ca604e-01a7-003f-01de-0250 to VMs. I consider that to be a bug, personally, as a SuperUser should have access to everything by definition. To solve this in the mean time, i need to know the object_type_id of a cpu profile to manually reassign permissions to it (you can't control CPU profile permissions in the GUI either, only view). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 12 Jun 2018, at 06:44, Roy Golan mailto:rgo...@redhat.com>> wrote: On Tue, 12 Jun 2018 at 02:24 Donny Davis mailto:do...@fortnebula.com>> wrote: I am happy to help where I can. I would also not recommend tinkering around in the database, but I am happy to hear you have it all running. :) Everything you should every be doing in the engine is available via the API/UI. Just some general advice. On Mon, Jun 11, 2018 at 9:31 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All & Donny, Thank you for the clarifications, very useful indeed. 
A note for future users who go down this path and dont want to restore or reinstall: Cleaning out the `permissions` table in the database and restoring the defaults will solve the issue, but you need to restore the SuperUser permission on the admin@internal account: Learning from here: https://www.ovirt.org/develop/developer-guide/action-permissions-overview/ Clean out your `roles_groups` and `permissions` DELETE FROM `permissions`; DELETE FROM `roles_groups`; Restore the defaults: https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00600_insert_permissions.sql https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00700_insert_roles_groups.sql Re-assign the SuperUser role to the admin@internal user: Either: https://github.com/oVirt/ovirt-engine/blob/master/packaging/bin/ovirt-engine-role.sh Or just go straight into your localhost psql on your engine, replacing information as appropriate: Get your external_id from the users table and use it in the function: SELECT external_id FROM `users` WHERE `name` = 'admin' AND `domain` = 'internal-authz'; select attach_user_to_role('admin','internal-authz','*','#external_id#','SuperUser'); Regards, Callum I think the root cause here is that you are trying to login to the webadmin and not the vm portal. User are authorized to login to the web admin only if they have a role of type 'admin'. And UserRole is a 'user' type. So the solution is not the give SuperUser for all those users, this is just a workaround. If you want to know for sure, go to Administration - Configure - Roles. So ask yourself why users need access to the webadmin at all. If they need admin permission assign them an appropriate role on the DC or the cluster. If not, they use the VM portal. Having said all that, if nothing helps and the db needs 'fixing' (I doubt it though) then this is a bug. -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:57, Donny Davis mailto:do...@fortnebula.com>> wrote: https://lists.ovirt.org/pipermail/users/2015-January/030981.html This is the thread where I discussed a bit of the permissions thing. I am sure things have changed since 3.5.1, but should get you down the right path. On Mon, Jun 11, 2018 at 6:54 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Yes, in process of trying to fix/identify things - need to undo this. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:48, Donny Davis mailto:do...@fortnebula.com>> wrote: did you add system permissions to the everyone group? On Mon, Jun 11, 2018 at 6:42 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Happy for you to link me a guide, googlefu is failing me. How do i get around this "It's not allowed to remove system permissions assigned to built-in Everyone group" - to remove permissions erroneously added. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun
[ovirt-users] Re: LDAP Authentication issues
Dear All & Donny,

Thank you for the clarifications, very useful indeed.

A note for future users who go down this path and don't want to restore or reinstall: cleaning out the `permissions` table in the database and restoring the defaults will solve the issue, but you need to restore the SuperUser permission on the admin@internal account.

Learning from here: https://www.ovirt.org/develop/developer-guide/action-permissions-overview/

Clean out your `roles_groups` and `permissions`:

DELETE FROM permissions;
DELETE FROM roles_groups;

Restore the defaults:
https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00600_insert_permissions.sql
https://github.com/oVirt/ovirt-engine/blob/master/packaging/dbscripts/data/00700_insert_roles_groups.sql

Re-assign the SuperUser role to the admin@internal user. Either use:
https://github.com/oVirt/ovirt-engine/blob/master/packaging/bin/ovirt-engine-role.sh

Or just go straight into your localhost psql on your engine, replacing information as appropriate. Get your external_id from the users table and use it in the function:

SELECT external_id FROM users WHERE name = 'admin' AND domain = 'internal-authz';
select attach_user_to_role('admin','internal-authz','*','#external_id#','SuperUser');

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 11 Jun 2018, at 11:57, Donny Davis <do...@fortnebula.com> wrote:

https://lists.ovirt.org/pipermail/users/2015-January/030981.html

This is the thread where I discussed a bit of the permissions thing. I am sure things have changed since 3.5.1, but it should get you down the right path.

On Mon, Jun 11, 2018 at 6:54 AM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Yes, in process of trying to fix/identify things - need to undo this.

Regards,
Callum

-- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e.
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:48, Donny Davis mailto:do...@fortnebula.com>> wrote: did you add system permissions to the everyone group? On Mon, Jun 11, 2018 at 6:42 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Happy for you to link me a guide, googlefu is failing me. How do i get around this "It's not allowed to remove system permissions assigned to built-in Everyone group" - to remove permissions erroneously added. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:38, Donny Davis mailto:do...@fortnebula.com>> wrote: You can create a profile that has the proper permissions to allow what you are looking for, and then assign that profile to the groups you wish. I wrote a post on this quite a while back on how to setup oVirt to appear to be multi-tenant. Happy to see you don't have an ldap issue :) >This will be a problem for us to now create group permissions for all 100+ >groups since Everyone === No-one. -sigh- On Mon, Jun 11, 2018 at 6:34 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ah, this appears to be an issue with the proxy - setting up the spice proxy as indicated in the guides is causing this issue, and likely will need support. https://www.ovirt.org/documentation/admin-guide/chap-Proxies/ Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:29, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok, the user now logs in! This will be a problem for us to now create group permissions for all 100+ groups since Everyone === No-one. -sigh- A new issue, when in the VM portal as the LDAP user, i get HTTP basic auth login prompts, and a "Authorization expired" error, then a page reload. 
Nothing in the logs seem to indicate an issue. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:26, Donny Davis mailto:do...@fortnebula.com>> wrote: Try giving your user system permissions as a superuser and see if it goes away. I wouldn't leave it like that, but it will help isolate your issue. I don't think you have an ldap issue... the log entry is telling you that user has no permissions >The user callum@Biomedical Research Computing is not authorized to perform >login On Mon, Jun 11, 2018 at 6:23 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Donny, No, though the user shows the permissions inherited from the Everyone group: Regards, Callum -- Callum Smith Research Computin
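The SuperUser restore step in this message (look up the external_id, then call attach_user_to_role) can be scripted so that the statements are generated consistently. The sketch below only builds the SQL text; it does not connect to the engine database. `build_restore_statements` and `sql_literal` are hypothetical helper names, and the statements use plain identifiers because PostgreSQL, which psql talks to, does not accept backtick-quoted names.

```python
def sql_literal(value):
    """Quote a string as a SQL literal by doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def build_restore_statements(external_id, name="admin", domain="internal-authz"):
    """Return the lookup and attach statements described in the thread.

    Hypothetical helper for illustration: it only formats SQL text and
    leaves running it (for example in psql on the engine) to the operator.
    """
    lookup = "SELECT external_id FROM users WHERE name = {} AND domain = {};".format(
        sql_literal(name), sql_literal(domain)
    )
    attach = "SELECT attach_user_to_role({}, {}, '*', {}, 'SuperUser');".format(
        sql_literal(name), sql_literal(domain), sql_literal(external_id)
    )
    return [lookup, attach]


if __name__ == "__main__":
    # "REPLACE-WITH-EXTERNAL-ID" is a placeholder; use the value returned
    # by the lookup statement on your own engine.
    for statement in build_restore_statements("REPLACE-WITH-EXTERNAL-ID"):
        print(statement)
```

As Donny and Roy both caution in the thread, editing the engine database directly is a last resort, so a backup before running any of these statements is advisable.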
[ovirt-users] Re: LDAP Authentication issues
Yes, in process of trying to fix/identify things - need to undo this. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:48, Donny Davis mailto:do...@fortnebula.com>> wrote: did you add system permissions to the everyone group? On Mon, Jun 11, 2018 at 6:42 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Happy for you to link me a guide, googlefu is failing me. How do i get around this "It's not allowed to remove system permissions assigned to built-in Everyone group" - to remove permissions erroneously added. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:38, Donny Davis mailto:do...@fortnebula.com>> wrote: You can create a profile that has the proper permissions to allow what you are looking for, and then assign that profile to the groups you wish. I wrote a post on this quite a while back on how to setup oVirt to appear to be multi-tenant. Happy to see you don't have an ldap issue :) >This will be a problem for us to now create group permissions for all 100+ >groups since Everyone === No-one. -sigh- On Mon, Jun 11, 2018 at 6:34 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ah, this appears to be an issue with the proxy - setting up the spice proxy as indicated in the guides is causing this issue, and likely will need support. https://www.ovirt.org/documentation/admin-guide/chap-Proxies/ Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:29, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok, the user now logs in! This will be a problem for us to now create group permissions for all 100+ groups since Everyone === No-one. 
-sigh- A new issue, when in the VM portal as the LDAP user, i get HTTP basic auth login prompts, and a "Authorization expired" error, then a page reload. Nothing in the logs seem to indicate an issue. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:26, Donny Davis mailto:do...@fortnebula.com>> wrote: Try giving your user system permissions as a superuser and see if it goes away. I wouldn't leave it like that, but it will help isolate your issue. I don't think you have an ldap issue... the log entry is telling you that user has no permissions >The user callum@Biomedical Research Computing is not authorized to perform >login On Mon, Jun 11, 2018 at 6:23 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Donny, No, though the user shows the permissions inherited from the Everyone group: Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:21, Donny Davis mailto:do...@fortnebula.com>> wrote: Just a shot in the dark, but after you setup ldap did you go in as the default admin and give an ldap account permissions? On Mon, Jun 11, 2018 at 6:04 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Could this be as our LDAP is fairly short on attributes? 2018-06-11 11:00:52,856+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-5) [5dff9eb0] Running command: CreateUserSessionCommand internal: false. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-5) [5dff9eb0] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '--ipaddr--' failed to log in. 
2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-5) [] The user callum@Biomedical Research Computing is not authorized to perform login I note that a number of variables are included in this action, but which are required and which are optional is the question: https://github.com/oVirt/ovirt-engine/blob/master/backend/manager/modules/aaa/src/main/java/org/ovirt/engine/core/aaa/servlet/SsoPostLoginServlet.java#L88 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 09:35, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: What would be the next step to help solve this issue? All users authenticating through LDAP get "This user is not authorised to perform authentication". Regar
[ovirt-users] Re: LDAP Authentication issues
Happy for you to link me a guide, googlefu is failing me. How do i get around this "It's not allowed to remove system permissions assigned to built-in Everyone group" - to remove permissions erroneously added. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:38, Donny Davis mailto:do...@fortnebula.com>> wrote: You can create a profile that has the proper permissions to allow what you are looking for, and then assign that profile to the groups you wish. I wrote a post on this quite a while back on how to setup oVirt to appear to be multi-tenant. Happy to see you don't have an ldap issue :) >This will be a problem for us to now create group permissions for all 100+ >groups since Everyone === No-one. -sigh- On Mon, Jun 11, 2018 at 6:34 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ah, this appears to be an issue with the proxy - setting up the spice proxy as indicated in the guides is causing this issue, and likely will need support. https://www.ovirt.org/documentation/admin-guide/chap-Proxies/ Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:29, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok, the user now logs in! This will be a problem for us to now create group permissions for all 100+ groups since Everyone === No-one. -sigh- A new issue, when in the VM portal as the LDAP user, i get HTTP basic auth login prompts, and a "Authorization expired" error, then a page reload. Nothing in the logs seem to indicate an issue. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:26, Donny Davis mailto:do...@fortnebula.com>> wrote: Try giving your user system permissions as a superuser and see if it goes away. I wouldn't leave it like that, but it will help isolate your issue. I don't think you have an ldap issue... the log entry is telling you that user has no permissions >The user callum@Biomedical Research Computing is not authorized to perform >login On Mon, Jun 11, 2018 at 6:23 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Donny, No, though the user shows the permissions inherited from the Everyone group: Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:21, Donny Davis mailto:do...@fortnebula.com>> wrote: Just a shot in the dark, but after you setup ldap did you go in as the default admin and give an ldap account permissions? On Mon, Jun 11, 2018 at 6:04 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Could this be as our LDAP is fairly short on attributes? 2018-06-11 11:00:52,856+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-5) [5dff9eb0] Running command: CreateUserSessionCommand internal: false. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-5) [5dff9eb0] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '--ipaddr--' failed to log in. 
2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-5) [] The user callum@Biomedical Research Computing is not authorized to perform login I note that a number of variables are included in this action, but which are required and which are optional is the question: https://github.com/oVirt/ovirt-engine/blob/master/backend/manager/modules/aaa/src/main/java/org/ovirt/engine/core/aaa/servlet/SsoPostLoginServlet.java#L88 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 09:35, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: What would be the next step to help solve this issue? All users authenticating through LDAP get "This user is not authorised to perform authentication". Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 11:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok I spoke too soon, I have resolved the groups, but authentication still isn't working for LDAP users, same error as before (114). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human G
[ovirt-users] Re: LDAP Authentication issues
Ah, this appears to be an issue with the proxy - setting up the spice proxy as indicated in the guides is causing this issue, and likely will need support. https://www.ovirt.org/documentation/admin-guide/chap-Proxies/ Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk On 11 Jun 2018, at 11:29, Callum Smith wrote: Ok, the user now logs in! This will be a problem for us to now create group permissions for all 100+ groups since Everyone === No-one. -sigh- A new issue, when in the VM portal as the LDAP user, i get HTTP basic auth login prompts, and a "Authorization expired" error, then a page reload. Nothing in the logs seem to indicate an issue. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:26, Donny Davis mailto:do...@fortnebula.com>> wrote: Try giving your user system permissions as a superuser and see if it goes away. I wouldn't leave it like that, but it will help isolate your issue. I don't think you have an ldap issue... the log entry is telling you that user has no permissions >The user callum@Biomedical Research Computing is not authorized to perform >login On Mon, Jun 11, 2018 at 6:23 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Donny, No, though the user shows the permissions inherited from the Everyone group: Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:21, Donny Davis mailto:do...@fortnebula.com>> wrote: Just a shot in the dark, but after you setup ldap did you go in as the default admin and give an ldap account permissions? 
On Mon, Jun 11, 2018 at 6:04 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Could this be as our LDAP is fairly short on attributes? 2018-06-11 11:00:52,856+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-5) [5dff9eb0] Running command: CreateUserSessionCommand internal: false. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-5) [5dff9eb0] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '--ipaddr--' failed to log in. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-5) [] The user callum@Biomedical Research Computing is not authorized to perform login I note that a number of variables are included in this action, but which are required and which are optional is the question: https://github.com/oVirt/ovirt-engine/blob/master/backend/manager/modules/aaa/src/main/java/org/ovirt/engine/core/aaa/servlet/SsoPostLoginServlet.java#L88 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 09:35, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: What would be the next step to help solve this issue? All users authenticating through LDAP get "This user is not authorised to perform authentication". Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 11:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok I spoke too soon, I have resolved the groups, but authentication still isn't working for LDAP users, same error as before (114). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 10:14, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Ondra, all, Managed to solve this once i got my head around the properties file. Conceptually the problem is that users are typically not a member of their primary group in a POSIX scenario, and their primary group is set by the gidNumber of the user's record, with additional group memberships specified by memberUid entries against a posixGroup entry. search.rfc2307-resolve-groups-memberUid.search-request.filter = &(objectClass=posixGroup)(|(memberUid=${seq:_rfc2307_uid_encoded})(gidNumber=${seq:_rfc2307_gid_encoded})) search.rfc2307-resolve-principal-uid.search-request.attributes = uid, gidNumber sequence.bmrc-resolve-groups.010.description = set dn sequence.bmrc-resolve-groups.010.type = var-set sequence.bmrc-resolve-groups.010.var-set.variable = _rfc2307_dn sequence.bmrc-resolve-groups.010.var-set.value = ${seq:dn} sequence.bmrc-resolve-groups.010.descripti
[ovirt-users] Re: LDAP Authentication issues
Ok, the user now logs in! This will be a problem for us to now create group permissions for all 100+ groups since Everyone === No-one. -sigh- A new issue, when in the VM portal as the LDAP user, i get HTTP basic auth login prompts, and a "Authorization expired" error, then a page reload. Nothing in the logs seem to indicate an issue. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:26, Donny Davis mailto:do...@fortnebula.com>> wrote: Try giving your user system permissions as a superuser and see if it goes away. I wouldn't leave it like that, but it will help isolate your issue. I don't think you have an ldap issue... the log entry is telling you that user has no permissions >The user callum@Biomedical Research Computing is not authorized to perform >login On Mon, Jun 11, 2018 at 6:23 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Donny, No, though the user shows the permissions inherited from the Everyone group: Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:21, Donny Davis mailto:do...@fortnebula.com>> wrote: Just a shot in the dark, but after you setup ldap did you go in as the default admin and give an ldap account permissions? On Mon, Jun 11, 2018 at 6:04 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Could this be as our LDAP is fairly short on attributes? 2018-06-11 11:00:52,856+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-5) [5dff9eb0] Running command: CreateUserSessionCommand internal: false. 
2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-5) [5dff9eb0] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '--ipaddr--' failed to log in. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-5) [] The user callum@Biomedical Research Computing is not authorized to perform login I note that a number of variables are included in this action, but which are required and which are optional is the question: https://github.com/oVirt/ovirt-engine/blob/master/backend/manager/modules/aaa/src/main/java/org/ovirt/engine/core/aaa/servlet/SsoPostLoginServlet.java#L88 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 09:35, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: What would be the next step to help solve this issue? All users authenticating through LDAP get "This user is not authorised to perform authentication". Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 11:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok I spoke too soon, I have resolved the groups, but authentication still isn't working for LDAP users, same error as before (114). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 10:14, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Ondra, all, Managed to solve this once i got my head around the properties file. 
Conceptually the problem is that users are typically not a member of their primary group in a POSIX scenario, and their primary group is set by the gidNumber of the user's record, with additional group memberships specified by memberUid entries against a posixGroup entry. search.rfc2307-resolve-groups-memberUid.search-request.filter = &(objectClass=posixGroup)(|(memberUid=${seq:_rfc2307_uid_encoded})(gidNumber=${seq:_rfc2307_gid_encoded})) search.rfc2307-resolve-principal-uid.search-request.attributes = uid, gidNumber sequence.bmrc-resolve-groups.010.description = set dn sequence.bmrc-resolve-groups.010.type = var-set sequence.bmrc-resolve-groups.010.var-set.variable = _rfc2307_dn sequence.bmrc-resolve-groups.010.var-set.value = ${seq:dn} sequence.bmrc-resolve-groups.010.description = resolve uid sequence.bmrc-resolve-groups.020.type = fetch-record sequence.bmrc-resolve-groups.020.fetch-record.search = rfc2307-resolve-principal-uid sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name<http://sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name/> = _rfc2307_uid sequence.bmrc-resolve-groups.030.description = resolve gid sequence.bmrc-resolve-groups.030.type = fetch-record
[ovirt-users] Re: LDAP Authentication issues
Dear Donny, No, though the user shows the permissions inherited from the Everyone group: [cid:3C4DA68E-6FBF-4D50-AA88-9E063CFBED6C@well.ox.ac.uk] Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 11:21, Donny Davis mailto:do...@fortnebula.com>> wrote: Just a shot in the dark, but after you setup ldap did you go in as the default admin and give an ldap account permissions? On Mon, Jun 11, 2018 at 6:04 AM, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Could this be as our LDAP is fairly short on attributes? 2018-06-11 11:00:52,856+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-5) [5dff9eb0] Running command: CreateUserSessionCommand internal: false. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-5) [5dff9eb0] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '--ipaddr--' failed to log in. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-5) [] The user callum@Biomedical Research Computing is not authorized to perform login I note that a number of variables are included in this action, but which are required and which are optional is the question: https://github.com/oVirt/ovirt-engine/blob/master/backend/manager/modules/aaa/src/main/java/org/ovirt/engine/core/aaa/servlet/SsoPostLoginServlet.java#L88 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 09:35, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: What would be the next step to help solve this issue? All users authenticating through LDAP get "This user is not authorised to perform authentication". 
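When triaging failures like the USER_VDC_LOGIN_FAILED(114) entries quoted above, it can help to pull the user and source address out of engine.log mechanically. The following is only a rough Python sketch: the regular expression is inferred from the two log lines quoted in this thread, not from any documented log format, and the names LOGIN_FAILED and parse_login_failure are made up for illustration.

```python
import re
from typing import Optional

# Pattern inferred from the log lines quoted in this thread (an assumption,
# not an official description of the engine.log format).
LOGIN_FAILED = re.compile(
    r"^(?P<timestamp>\S+ \S+) (?P<level>[A-Z]+)\s+"
    r"\[(?P<logger>[^\]]+)\] \((?P<thread>[^)]*)\) \[(?P<correlation>[^\]]*)\] "
    r"EVENT_ID: USER_VDC_LOGIN_FAILED\((?P<code>\d+)\), "
    r"User (?P<user>.+?) connecting from '(?P<address>[^']*)' failed to log in\."
)


def parse_login_failure(line: str) -> Optional[dict]:
    """Return the fields of a failed-login audit line, or None for any other line."""
    match = LOGIN_FAILED.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["code"] = int(record["code"])
    return record


# Two lines copied from the thread: one unrelated INFO line, one failed login.
SAMPLE_LINES = [
    "2018-06-11 11:00:52,856+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] "
    "(default task-5) [5dff9eb0] Running command: CreateUserSessionCommand internal: false.",
    "2018-06-11 11:00:52,884+01 ERROR "
    "[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] "
    "(default task-5) [5dff9eb0] EVENT_ID: USER_VDC_LOGIN_FAILED(114), "
    "User callum@Biomedical Research Computing connecting from '--ipaddr--' failed to log in.",
]

if __name__ == "__main__":
    for sample in SAMPLE_LINES:
        print(parse_login_failure(sample))
```

Grouping the parsed records by user or by correlation ID would be one way to check whether every LDAP user is affected or only some of them.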
Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 11:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok I spoke too soon, I have resolved the groups, but authentication still isn't working for LDAP users, same error as before (114). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 10:14, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Ondra, all, Managed to solve this once i got my head around the properties file. Conceptually the problem is that users are typically not a member of their primary group in a POSIX scenario, and their primary group is set by the gidNumber of the user's record, with additional group memberships specified by memberUid entries against a posixGroup entry. search.rfc2307-resolve-groups-memberUid.search-request.filter = &(objectClass=posixGroup)(|(memberUid=${seq:_rfc2307_uid_encoded})(gidNumber=${seq:_rfc2307_gid_encoded})) search.rfc2307-resolve-principal-uid.search-request.attributes = uid, gidNumber sequence.bmrc-resolve-groups.010.description = set dn sequence.bmrc-resolve-groups.010.type = var-set sequence.bmrc-resolve-groups.010.var-set.variable = _rfc2307_dn sequence.bmrc-resolve-groups.010.var-set.value = ${seq:dn} sequence.bmrc-resolve-groups.010.description = resolve uid sequence.bmrc-resolve-groups.020.type = fetch-record sequence.bmrc-resolve-groups.020.fetch-record.search = rfc2307-resolve-principal-uid sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name<http://sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name/> = _rfc2307_uid sequence.bmrc-resolve-groups.030.description = resolve gid sequence.bmrc-resolve-groups.030.type = fetch-record sequence.bmrc-resolve-groups.030.fetch-record.search = rfc2307-resolve-principal-uid 
sequence.bmrc-resolve-groups.030.fetch-record.map.gidNumber.name<http://sequence.bmrc-resolve-groups.030.fetch-record.map.gidnumber.name/> = _rfc2307_gid sequence.bmrc-resolve-groups.040.description = query groups sequence.bmrc-resolve-groups.040.type = search-open sequence.bmrc-resolve-groups.040.search-open.search = rfc2307-resolve-groups-memberUid sequence.bmrc-resolve-groups.040.search-open.variable = queryRFC2307ByMemberUid sequence.rfc2307-resolve-groups.020.call.name<http://sequence.rfc2307-resolve-groups.020.call.name/> = bmrc-resolve-groups Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 4 Jun 2018, at 15:07, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Ondra, I went for openldap-rfc2307 as that best desc
[ovirt-users] Re: LDAP Authentication issues
Dear All, Could this be as our LDAP is fairly short on attributes? 2018-06-11 11:00:52,856+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-5) [5dff9eb0] Running command: CreateUserSessionCommand internal: false. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-5) [5dff9eb0] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '--ipaddr--' failed to log in. 2018-06-11 11:00:52,884+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-5) [] The user callum@Biomedical Research Computing is not authorized to perform login I note that a number of variables are included in this action, but which are required and which are optional is the question: https://github.com/oVirt/ovirt-engine/blob/master/backend/manager/modules/aaa/src/main/java/org/ovirt/engine/core/aaa/servlet/SsoPostLoginServlet.java#L88 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 11 Jun 2018, at 09:35, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: What would be the next step to help solve this issue? All users authenticating through LDAP get "This user is not authorised to perform authentication". Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 11:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok I spoke too soon, I have resolved the groups, but authentication still isn't working for LDAP users, same error as before (114). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 10:14, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Ondra, all, Managed to solve this once i got my head around the properties file. Conceptually the problem is that users are typically not a member of their primary group in a POSIX scenario, and their primary group is set by the gidNumber of the user's record, with additional group memberships specified by memberUid entries against a posixGroup entry. search.rfc2307-resolve-groups-memberUid.search-request.filter = &(objectClass=posixGroup)(|(memberUid=${seq:_rfc2307_uid_encoded})(gidNumber=${seq:_rfc2307_gid_encoded})) search.rfc2307-resolve-principal-uid.search-request.attributes = uid, gidNumber sequence.bmrc-resolve-groups.010.description = set dn sequence.bmrc-resolve-groups.010.type = var-set sequence.bmrc-resolve-groups.010.var-set.variable = _rfc2307_dn sequence.bmrc-resolve-groups.010.var-set.value = ${seq:dn} sequence.bmrc-resolve-groups.010.description = resolve uid sequence.bmrc-resolve-groups.020.type = fetch-record sequence.bmrc-resolve-groups.020.fetch-record.search = rfc2307-resolve-principal-uid sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name = _rfc2307_uid sequence.bmrc-resolve-groups.030.description = resolve gid sequence.bmrc-resolve-groups.030.type = fetch-record sequence.bmrc-resolve-groups.030.fetch-record.search = rfc2307-resolve-principal-uid sequence.bmrc-resolve-groups.030.fetch-record.map.gidNumber.name = _rfc2307_gid sequence.bmrc-resolve-groups.040.description = query groups sequence.bmrc-resolve-groups.040.type = search-open sequence.bmrc-resolve-groups.040.search-open.search = rfc2307-resolve-groups-memberUid sequence.bmrc-resolve-groups.040.search-open.variable = queryRFC2307ByMemberUid sequence.rfc2307-resolve-groups.020.call.name = bmrc-resolve-groups Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 4 Jun 2018, at 15:07, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Ondra, I went for openldap-rfc2307 as that best describes our ldap setup. The issue seems to be that the gidNumber is set, but users are not a member of their primary group within the LDAP. So, user's gidNumber represents primary group and posixGroup membership (memberUid) represents their secondary groups. What's the best way to approach this (fix the filters on oVirt end or change the LDAP? This is a question of what is most compliant with standards really). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 29 May 2018, at 11:29, Ondra Machacek mailto:omach...@redhat.com>> wrote: What's you LDAP and what profile did you choose? This looks like you have chosen incorect profile during setup. Are you sure you arent using posix group and using n
[ovirt-users] Re: LDAP Authentication issues
What would be the next step to help solve this issue? All users authenticating through LDAP get "This user is not authorised to perform authentication". Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 11:42, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Ok I spoke too soon, I have resolved the groups, but authentication still isn't working for LDAP users, same error as before (114). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 5 Jun 2018, at 10:14, Callum Smith mailto:cal...@well.ox.ac.uk>> wrote: Dear Ondra, all, Managed to solve this once i got my head around the properties file. Conceptually the problem is that users are typically not a member of their primary group in a POSIX scenario, and their primary group is set by the gidNumber of the user's record, with additional group memberships specified by memberUid entries against a posixGroup entry. 
search.rfc2307-resolve-groups-memberUid.search-request.filter = &(objectClass=posixGroup)(|(memberUid=${seq:_rfc2307_uid_encoded})(gidNumber=${seq:_rfc2307_gid_encoded}))
search.rfc2307-resolve-principal-uid.search-request.attributes = uid, gidNumber
sequence.bmrc-resolve-groups.010.description = set dn
sequence.bmrc-resolve-groups.010.type = var-set
sequence.bmrc-resolve-groups.010.var-set.variable = _rfc2307_dn
sequence.bmrc-resolve-groups.010.var-set.value = ${seq:dn}
sequence.bmrc-resolve-groups.020.description = resolve uid
sequence.bmrc-resolve-groups.020.type = fetch-record
sequence.bmrc-resolve-groups.020.fetch-record.search = rfc2307-resolve-principal-uid
sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name = _rfc2307_uid
sequence.bmrc-resolve-groups.030.description = resolve gid
sequence.bmrc-resolve-groups.030.type = fetch-record
sequence.bmrc-resolve-groups.030.fetch-record.search = rfc2307-resolve-principal-uid
sequence.bmrc-resolve-groups.030.fetch-record.map.gidNumber.name = _rfc2307_gid
sequence.bmrc-resolve-groups.040.description = query groups
sequence.bmrc-resolve-groups.040.type = search-open
sequence.bmrc-resolve-groups.040.search-open.search = rfc2307-resolve-groups-memberUid
sequence.bmrc-resolve-groups.040.search-open.variable = queryRFC2307ByMemberUid
sequence.rfc2307-resolve-groups.020.call.name = bmrc-resolve-groups

Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk

On 4 Jun 2018, at 15:07, Callum Smith <cal...@well.ox.ac.uk> wrote: Dear Ondra, I went for openldap-rfc2307 as that best describes our LDAP setup. The issue seems to be that the gidNumber is set, but users are not a member of their primary group within the LDAP. So, the user's gidNumber represents their primary group, and posixGroup membership (memberUid) represents their secondary groups.
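The primary-versus-secondary group distinction described above can be sketched in plain Python. This is only an illustration of the matching rule (a posixGroup counts if it lists the user in memberUid or if its gidNumber equals the user's gidNumber); it is not how the oVirt extension is implemented. PosixGroup, build_group_filter, resolve_groups and the sample groups are all hypothetical, and the filter is written in the standard RFC 4515 string form with outer parentheses rather than in the notation used in the properties file.

```python
from dataclasses import dataclass, field
from typing import FrozenSet, Iterable, List


@dataclass(frozen=True)
class PosixGroup:
    """Hypothetical in-memory stand-in for an RFC 2307 posixGroup entry."""
    cn: str
    gid_number: int
    member_uids: FrozenSet[str] = field(default_factory=frozenset)


def escape_filter_value(value: str) -> str:
    """Escape characters that are special inside an LDAP filter value (RFC 4515)."""
    replacements = {"\\": r"\5c", "*": r"\2a", "(": r"\28", ")": r"\29", "\0": r"\00"}
    return "".join(replacements.get(ch, ch) for ch in value)


def build_group_filter(uid: str, gid_number: int) -> str:
    """Build a filter matching secondary groups (memberUid) OR the primary group (gidNumber)."""
    return (
        "(&(objectClass=posixGroup)"
        f"(|(memberUid={escape_filter_value(uid)})(gidNumber={gid_number})))"
    )


def resolve_groups(uid: str, gid_number: int, groups: Iterable[PosixGroup]) -> List[str]:
    """Apply the same OR rule locally: explicit membership or matching primary gid."""
    return sorted(
        group.cn
        for group in groups
        if uid in group.member_uids or group.gid_number == gid_number
    )


if __name__ == "__main__":
    # Made-up directory contents, purely for illustration.
    directory = [
        PosixGroup("staff", 1000),                            # primary group, no memberUid
        PosixGroup("genomics", 2000, frozenset({"callum"})),  # secondary group
        PosixGroup("admins", 3000, frozenset({"other"})),     # unrelated group
    ]
    print(build_group_filter("callum", 1000))
    print(resolve_groups("callum", 1000, directory))
```

With a memberUid-only rule, the "staff" group in this made-up data would be missed, which is the gap the modified filter is meant to close.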
What's the best way to approach this: fix the filters on the oVirt end, or change the LDAP? This is really a question of which is most compliant with standards. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk

On 29 May 2018, at 11:29, Ondra Machacek <omach...@redhat.com> wrote: What's your LDAP and what profile did you choose? It looks like you chose an incorrect profile during setup. Are you sure you aren't using posix groups with a non-posix aaa profile? Sharing a debug log of ovirt-engine-extensions-tool would be helpful.

On Fri, May 25, 2018, 10:04 AM Callum Smith <cal...@well.ox.ac.uk> wrote: Dear All, I'm having problems getting LDAP running. Login works, but I'm getting "user is not authorised to perform login", even if I assign the UserRole specifically to the LDAP group the user is in.
2018-05-25 08:56:16,212+01 INFO [org.ovirt.engine.core.sso.utils.AuthenticationUtils] (default task-23) [] User callum@Biomedical Research Computing successfully logged in with scopes: ovirt-app-admin ovirt-app-api ovirt-app-portal ovirt-ext=auth:sequence-priority=~ ovirt-ext=revoke:revoke-all ovirt-ext=token-info:authz-search ovirt-ext=token-info:public-authz-search ovirt-ext=token-info:validate ovirt-ext=token:password-access
2018-05-25 08:56:16,391+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-25) [63e60fe9] Running command: CreateUserSessionCommand internal: false.
2018-05-25 08:56:16,430+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-25) [63e60fe9] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connec
[ovirt-users] Re: LDAP Authentication issues
OK, I spoke too soon. I have resolved the groups, but authentication still isn't working for LDAP users; same error as before (114). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk

On 5 Jun 2018, at 10:14, Callum Smith <cal...@well.ox.ac.uk> wrote: Dear Ondra, all, Managed to solve this once I got my head around the properties file. Conceptually the problem is that users are typically not members of their primary group in a POSIX scenario: their primary group is set by the gidNumber of the user's record, with additional group memberships specified by memberUid entries against a posixGroup entry.
search.rfc2307-resolve-groups-memberUid.search-request.filter = &(objectClass=posixGroup)(|(memberUid=${seq:_rfc2307_uid_encoded})(gidNumber=${seq:_rfc2307_gid_encoded}))
search.rfc2307-resolve-principal-uid.search-request.attributes = uid, gidNumber
sequence.bmrc-resolve-groups.010.description = set dn
sequence.bmrc-resolve-groups.010.type = var-set
sequence.bmrc-resolve-groups.010.var-set.variable = _rfc2307_dn
sequence.bmrc-resolve-groups.010.var-set.value = ${seq:dn}
sequence.bmrc-resolve-groups.020.description = resolve uid
sequence.bmrc-resolve-groups.020.type = fetch-record
sequence.bmrc-resolve-groups.020.fetch-record.search = rfc2307-resolve-principal-uid
sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name = _rfc2307_uid
sequence.bmrc-resolve-groups.030.description = resolve gid
sequence.bmrc-resolve-groups.030.type = fetch-record
sequence.bmrc-resolve-groups.030.fetch-record.search = rfc2307-resolve-principal-uid
sequence.bmrc-resolve-groups.030.fetch-record.map.gidNumber.name = _rfc2307_gid
sequence.bmrc-resolve-groups.040.description = query groups
sequence.bmrc-resolve-groups.040.type = search-open
sequence.bmrc-resolve-groups.040.search-open.search = rfc2307-resolve-groups-memberUid
sequence.bmrc-resolve-groups.040.search-open.variable = queryRFC2307ByMemberUid
sequence.rfc2307-resolve-groups.020.call.name = bmrc-resolve-groups

Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk

On 4 Jun 2018, at 15:07, Callum Smith <cal...@well.ox.ac.uk> wrote: Dear Ondra, I went for openldap-rfc2307 as that best describes our LDAP setup. The issue seems to be that the gidNumber is set, but users are not a member of their primary group within the LDAP. So, the user's gidNumber represents their primary group, and posixGroup membership (memberUid) represents their secondary groups. What's the best way to approach this: fix the filters on the oVirt end, or change the LDAP? This is really a question of which is most compliant with standards. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk

On 29 May 2018, at 11:29, Ondra Machacek <omach...@redhat.com> wrote: What's your LDAP and what profile did you choose? It looks like you chose an incorrect profile during setup. Are you sure you aren't using posix groups with a non-posix aaa profile? Sharing a debug log of ovirt-engine-extensions-tool would be helpful.

On Fri, May 25, 2018, 10:04 AM Callum Smith <cal...@well.ox.ac.uk> wrote: Dear All, I'm having problems getting LDAP running. Login works, but I'm getting "user is not authorised to perform login", even if I assign the UserRole specifically to the LDAP group the user is in.
2018-05-25 08:56:16,212+01 INFO [org.ovirt.engine.core.sso.utils.AuthenticationUtils] (default task-23) [] User callum@Biomedical Research Computing successfully logged in with scopes: ovirt-app-admin ovirt-app-api ovirt-app-portal ovirt-ext=auth:sequence-priority=~ ovirt-ext=revoke:revoke-all ovirt-ext=token-info:authz-search ovirt-ext=token-info:public-authz-search ovirt-ext=token-info:validate ovirt-ext=token:password-access 2018-05-25 08:56:16,391+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-25) [63e60fe9] Running command: CreateUserSessionCommand internal: false. 2018-05-25 08:56:16,430+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-25) [63e60fe9] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '192.168.65.254' failed to log in. 2018-05-25 08:56:16,430+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-25) [] The user callum@Biomedical Research Computing is not authorized to perform login on a side note: is it possible to assign permissions to all members of an LDAP tree where they dont have a common group membership? Regards, Callum -- Callum Smith Research Computing
[ovirt-users] Re: LDAP Authentication issues
Dear Ondra, all,

Managed to solve this once I got my head around the properties file. Conceptually, the problem is that users are typically not a member of their primary group in a POSIX scenario: the primary group is set by the gidNumber of the user's record, with additional group memberships specified by memberUid entries against a posixGroup entry.

search.rfc2307-resolve-groups-memberUid.search-request.filter = &(objectClass=posixGroup)(|(memberUid=${seq:_rfc2307_uid_encoded})(gidNumber=${seq:_rfc2307_gid_encoded}))
search.rfc2307-resolve-principal-uid.search-request.attributes = uid, gidNumber
sequence.bmrc-resolve-groups.010.description = set dn
sequence.bmrc-resolve-groups.010.type = var-set
sequence.bmrc-resolve-groups.010.var-set.variable = _rfc2307_dn
sequence.bmrc-resolve-groups.010.var-set.value = ${seq:dn}
sequence.bmrc-resolve-groups.020.description = resolve uid
sequence.bmrc-resolve-groups.020.type = fetch-record
sequence.bmrc-resolve-groups.020.fetch-record.search = rfc2307-resolve-principal-uid
sequence.bmrc-resolve-groups.020.fetch-record.map.uid.name = _rfc2307_uid
sequence.bmrc-resolve-groups.030.description = resolve gid
sequence.bmrc-resolve-groups.030.type = fetch-record
sequence.bmrc-resolve-groups.030.fetch-record.search = rfc2307-resolve-principal-uid
sequence.bmrc-resolve-groups.030.fetch-record.map.gidNumber.name = _rfc2307_gid
sequence.bmrc-resolve-groups.040.description = query groups
sequence.bmrc-resolve-groups.040.type = search-open
sequence.bmrc-resolve-groups.040.search-open.search = rfc2307-resolve-groups-memberUid
sequence.bmrc-resolve-groups.040.search-open.variable = queryRFC2307ByMemberUid
sequence.rfc2307-resolve-groups.020.call.name = bmrc-resolve-groups

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk
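The lookup rule described in this message (a posixGroup matches if it lists the user's uid in memberUid, or if its gidNumber equals the user's primary gidNumber) can be sketched in Python. This is only an illustration, not oVirt code: the helper names `escape_filter_value` and `build_group_filter` and the sample values are hypothetical, and the filter is written in the fully parenthesised RFC 4515 form, which may differ from what the aaa-ldap properties file expects.

```python
def escape_filter_value(value: str) -> str:
    """Escape characters that are special in LDAP search filters (RFC 4515)."""
    replacements = {
        "\\": r"\5c",
        "*": r"\2a",
        "(": r"\28",
        ")": r"\29",
        "\0": r"\00",
    }
    return "".join(replacements.get(ch, ch) for ch in value)


def build_group_filter(uid: str, gid_number: int) -> str:
    """Match groups listing the uid as memberUid OR owning the primary gid."""
    uid_encoded = escape_filter_value(uid)
    return (
        "(&(objectClass=posixGroup)"
        f"(|(memberUid={uid_encoded})(gidNumber={gid_number})))"
    )


if __name__ == "__main__":
    # Hypothetical user: uid "callum", primary gidNumber 1000.
    print(build_group_filter("callum", 1000))
```

Running the resulting filter with a generic LDAP search tool against the directory is one way to check that both the primary and the secondary groups come back before changing the oVirt profile.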
[ovirt-users] Re: LDAP Authentication issues
Dear Ondra,

I went for openldap-rfc2307 as that best describes our LDAP setup. The issue seems to be that the gidNumber is set, but users are not a member of their primary group within the LDAP. So, a user's gidNumber represents their primary group, and posixGroup membership (memberUid) represents their secondary groups.

What's the best way to approach this: fix the filters on the oVirt end, or change the LDAP? This is really a question of what is most compliant with standards.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 29 May 2018, at 11:29, Ondra Machacek <omach...@redhat.com> wrote:
> What's your LDAP and what profile did you choose? This looks like you have chosen an incorrect profile during setup. Are you sure you aren't using posix groups together with a non-posix aaa profile? Sharing a debug log of ovirt-engine-extensions-tool would be helpful.

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: https://www.ovirt.org/community/about/community-guidelines/
List Archives: https://lists.ovirt.org/archives/list/users@ovirt.org/message/NAEUHLW3YMYAP6L44RRS5MCLRU2OTXPZ/
[ovirt-users] LDAP Authentication issues
Dear All,

I'm having problems getting LDAP running. Login works, but I'm getting "user is not authorised to perform login", even if I assign the UserRole specifically to the LDAP group the user is in.

2018-05-25 08:56:16,212+01 INFO [org.ovirt.engine.core.sso.utils.AuthenticationUtils] (default task-23) [] User callum@Biomedical Research Computing successfully logged in with scopes: ovirt-app-admin ovirt-app-api ovirt-app-portal ovirt-ext=auth:sequence-priority=~ ovirt-ext=revoke:revoke-all ovirt-ext=token-info:authz-search ovirt-ext=token-info:public-authz-search ovirt-ext=token-info:validate ovirt-ext=token:password-access
2018-05-25 08:56:16,391+01 INFO [org.ovirt.engine.core.bll.aaa.CreateUserSessionCommand] (default task-25) [63e60fe9] Running command: CreateUserSessionCommand internal: false.
2018-05-25 08:56:16,430+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (default task-25) [63e60fe9] EVENT_ID: USER_VDC_LOGIN_FAILED(114), User callum@Biomedical Research Computing connecting from '192.168.65.254' failed to log in.
2018-05-25 08:56:16,430+01 ERROR [org.ovirt.engine.core.aaa.servlet.SsoPostLoginServlet] (default task-25) [] The user callum@Biomedical Research Computing is not authorized to perform login

On a side note: is it possible to assign permissions to all members of an LDAP tree where they don't have a common group membership?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
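When testing whether a profile change has any effect, it can help to pull only the failed-login audit events out of engine.log. The following is a minimal sketch that assumes the line layout shown in the excerpt above; the regular expression and the name `failed_logins` are illustrative and not part of oVirt.

```python
import re

# Matches the AuditLogDirector line for a failed login, as seen in the excerpt.
FAILED_LOGIN = re.compile(
    r"^(?P<timestamp>\S+ \S+) ERROR .*"
    r"EVENT_ID: USER_VDC_LOGIN_FAILED\((?P<event_id>\d+)\), "
    r"User (?P<user>.+?) connecting from '(?P<address>[^']+)'"
)


def failed_logins(lines):
    """Yield a dict (timestamp, event_id, user, address) per failed login."""
    for line in lines:
        match = FAILED_LOGIN.search(line)
        if match:
            yield match.groupdict()


if __name__ == "__main__":
    import sys

    # Usage (path is an assumption): python failed_logins.py < engine.log
    for event in failed_logins(sys.stdin):
        print(event["timestamp"], event["user"], event["address"])
```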
[ovirt-users] Re: vGPU VM not starting
Dear Ales,

4.2.3.5-1

Through extensive testing done with the help of Martin Polednik, the issues with the vGPU startup appear to be within the NVIDIA drivers, so that issue is now being pursued with NVIDIA. The issue with the NIC MTU appears to have gone away with the upgrade of the host to the version above.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 21 May 2018, at 13:23, Ales Musil <amu...@redhat.com> wrote:
> On Mon, May 21, 2018 at 1:15 PM, Francesco Romani <from...@redhat.com> wrote:
>> On 05/17/2018 12:01 AM, Callum Smith wrote:
>>> Dear All,
>>>
>>> Our vGPU installation is progressing, though the VM is failing to start.
>>>
>>> 2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943)
>>> Traceback (most recent call last):
>>>   File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm
>>>     self._run()
>>>   File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in _run
>>>     dom.createWithFlags(flags)
>>>   File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 130, in wrapper
>>>     ret = f(*args, **kwargs)
>>>   File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 92, in wrapper
>>>     return func(inst, *args, **kwargs)
>>>   File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in createWithFlags
>>>     if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed', dom=self)
>>> libvirtError: Cannot get interface MTU on '': No such device
>>
>> This is another bug, related to https://bugzilla.redhat.com/show_bug.cgi?id=1561010. The proper fix is on the Engine side, even though we can fix it on the Vdsm side too.
>>
>> Bests,
>> --
>> Francesco Romani
>> Senior SW Eng., Virtualization R&D
>> Red Hat
>> IRC: fromani github: @fromanirh
>
> Dear Callum,
>
> can you send us also the engine.log? And which version of the engine are you using?
>
> Regards,
> Ales.
>
> --
> ALES MUSIL
> INTERN - rhv network
> Red Hat EMEA
> amu...@redhat.com IM: amusil
[ovirt-users] Re: vGPU VM not starting
Dear All,

I'm still having the same problems, is this a bug or something that's configured incorrectly?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk
[ovirt-users] Re: vGPU VM not starting
Yep, creating the mdev manually works, and in fact, like I said previously, the VM does actually create an mdev successfully, as you can see the UUID of the device (and it is correctly identifiable through /sys/class/mdev_bus/${DEVICE_ADDR}/${UUID}/mdev_type/name). In this specific case, to help with the logs, the UUID generated is consistently the same (even after manual deletion): "f5dc8396-dad5-3893-9eb4-94eedf60a881".

The VM then fails to start because of the MTU issue. Restarting the VM on the node then produces the issue of the device not being available (because the device with the previous UUID exists and the type has max_instance=1). So it's the first VM start with the MTU issue that needs resolving, with the added complication that the MTU (network) issue is caused by the mdev being set. The same error does not happen when mdev is not set.

PS. In fact this was the guide I followed, so thank you Martin for writing it; without it, getting this far would have been very difficult: https://mpolednik.github.io/2017/09/13/vgpu-in-ovirt/

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 18 May 2018, at 13:05, Martin Polednik <mpoled...@redhat.com> wrote:
> On 18/05/18 13:42 +0200, Francesco Romani wrote:
>> Hi,
>>
>> On 05/17/2018 10:56 AM, Callum Smith wrote:
>>> In an attempt not to mislead you guys as well, there appears to be a separate, vGPU specific, issue.
>>>
>>> https://www.dropbox.com/s/hlymmf9d6rn12tq/vdsm.vgpu.log?dl=0
>>>
>>> I've uploaded the full vdsm.log to dropbox. Most recently I tried unmounting all network devices from the VM and booting it, and I get a different issue around the vGPU:
>>>
>>> 2018-05-17 09:48:24,806+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_hostedengine: rc=0 err= (hooks:110)
>>> 2018-05-17 09:48:24,953+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_vfio_mdev: rc=1 err=vgpu: No device with type nvidia-61 is available. (hooks:110)
>>> 2018-05-17 09:48:25,069+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_vhostmd: rc=0 err= (hooks:110)
>>> 2018-05-17 09:48:25,070+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943)
>>> Traceback (most recent call last):
>>>   File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm
>>>     self._run()
>>>   File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2862, in _run
>>>     self._custom)
>>>   File "/usr/lib/python2.7/site-packages/vdsm/common/hooks.py", line 153, in before_vm_start
>>>     return _runHooksDir(domxml, 'before_vm_start', vmconf=vmconf)
>>>   File "/usr/lib/python2.7/site-packages/vdsm/common/hooks.py", line 120, in _runHooksDir
>>>     raise exception.HookError(err)
>>> HookError: Hook Error: ('',)
>>>
>>> Despite the nvidia-61 being an option on the GPU: https://pastebin.com/bucw21DG
>>
>> Let's tackle one issue at a time :)
>>
>> From the shared logs, the VM start failed because of
>>
>> 2018-05-17 10:11:12,681+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_hostedengine: rc=0 err= (hooks:110)
>> 2018-05-17 10:11:12,837+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_vfio_mdev: rc=1 err=vgpu: No device with type nvidia-53 is available.
>>
>> maybe Martin can shed some light here?
>
> Given that the actual slice is available in sysfs (as indicated by one of the other branches of this thread), I fear we may be facing some weird issue with the driver itself. Can you create the mdev manually?
>
> $ uuidgen > /sys/class/mdev_bus/${DEVICE_ADDR}/mdev_supported_types/nvidia-61/create
>
> should be enough for a test.
>
>> Callum, please share Vdsm logs showing the network failure
>>
>> Bests,
>> --
>> Francesco Romani
>> Senior SW Eng., Virtualization R&D
>> Red Hat
>> IRC: fromani github: @fromanirh
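The hook error "No device with type nvidia-61 is available" can be cross-checked against sysfs directly. Below is a minimal sketch, assuming the standard kernel mdev layout in which each parent device under /sys/class/mdev_bus has `mdev_supported_types/<type>/available_instances`. The helper `available_mdev_types` is hypothetical, and its `root` parameter exists only so it can be pointed at a test directory.

```python
from pathlib import Path


def available_mdev_types(root="/sys/class/mdev_bus"):
    """Map each parent device to {mdev type name: available instance count}."""
    result = {}
    for device in sorted(Path(root).glob("*")):
        types = {}
        for type_dir in sorted((device / "mdev_supported_types").glob("*")):
            counter = type_dir / "available_instances"
            if counter.is_file():
                types[type_dir.name] = int(counter.read_text().strip())
        result[device.name] = types
    return result


if __name__ == "__main__":
    for device, types in available_mdev_types().items():
        for mdev_type, count in types.items():
            print(f"{device} {mdev_type}: {count} available")
```

A count of 0 for the requested type would be consistent with the explanation above, where an mdev left over from an earlier failed start still occupies the only instance.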
[ovirt-users] Re: vGPU VM not starting
Dear All,

Similar issues with a clean install:

https://www.dropbox.com/s/jf9pwapohn5dq5p/vdsm.gpu2.log?dl=0

The link above is the log from the clean install. This VM has a custom "mdev_type" of "nvidia-53", which relates to a specific GRID P40-24Q instance. Even looking in /sys/class/mdev_bus/*/ you can see that a vGPU slice has been correctly created as part of the boot of the machine, but you still get this error:

2018-05-17 14:19:42,757+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_vfio_mdev: rc=1 err=vgpu: No device with type nvidia-53 is available. (hooks:110)
2018-05-17 14:19:42,873+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_vhostmd: rc=0 err= (hooks:110)
2018-05-17 14:19:42,874+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943)

Thanks all for your input.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk
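Because the failing hook is buried among successful ones, a small filter over vdsm.log makes it easier to spot. This sketch assumes the `rc=<n> err=<text> (hooks:<line>)` layout seen in the hook lines of this thread; the pattern and the name `failed_hooks` are illustrative only and not part of Vdsm.

```python
import re

# Hook result lines look like:
#   .../before_vm_start/50_vfio_mdev: rc=1 err=<message> (hooks:110)
HOOK_RESULT = re.compile(
    r"(?P<hook>/usr/libexec/vdsm/hooks/\S+): "
    r"rc=(?P<rc>\d+) err=(?P<err>.*?)\s*\(hooks:\d+\)"
)


def failed_hooks(lines):
    """Return (hook path, return code, error text) for hooks with rc != 0."""
    failures = []
    for line in lines:
        match = HOOK_RESULT.search(line)
        if match and int(match.group("rc")) != 0:
            failures.append(
                (match.group("hook"), int(match.group("rc")), match.group("err"))
            )
    return failures


if __name__ == "__main__":
    import sys

    # Usage (path is an assumption): python failed_hooks.py < vdsm.log
    for hook, rc, err in failed_hooks(sys.stdin):
        print(f"{hook} rc={rc}: {err}")
```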
[ovirt-users] Re: vGPU VM not starting
Dear Yaniv,

Please see my most recent response:

https://www.dropbox.com/s/hlymmf9d6rn12tq/vdsm.vgpu.log?dl=0

I'm doing a clean install of the host right now to see if doing the exact same procedure a second time produces different results (this way lies madness, but our bosses are excited about vGPUs on oVirt).

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 17 May 2018, at 14:02, Yaniv Kaul <yk...@redhat.com> wrote:
> It'd be easier if you could share the complete vdsm log. Perhaps file a bug and we can investigate it?
> Y.
>
> On Thu, May 17, 2018 at 11:25 AM, Callum Smith <cal...@well.ox.ac.uk> wrote:
>> Some information that appears to be from around the time of installation to the cluster:
>>
>> WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -X libvirt-O-vnet0' failed: Chain 'libvirt-O-vnet0' doesn't exist. firewalld
>> WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -F libvirt-O-vnet0' failed: Chain 'libvirt-O-vnet0' doesn't exist. firewalld
>> WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -L libvirt-O-vnet0' failed: Chain 'libvirt-O-vnet0' doesn't exist. firewalld
>> WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -D POSTROUTING -o vnet0 -j libvirt-O-vnet0' failed: Illegal target name 'libvirt-O-vnet0'. firewalld
>> WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -X HI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld
>> WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -F HI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld
>> WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -X FI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld
>> WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -F FI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld
>>
>> Regards,
>> Callum
>>
>> On 17 May 2018, at 09:20, Callum Smith <cal...@well.ox.ac.uk> wrote:
>>> PS. some other WARN's that come up on the host:
>>>
>>> WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.org.qemu.guest_agent.0 already removed vdsm
>>> WARN Attempting to remove a non existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm
>>> WARN Attempting to remove a non existing network: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm
>>> WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.ovirt-guest-agent.0 already removed vdsm
>>> WARN Attempting to add an existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm
>>>
>>> Regards,
>>> Callum
>>>
>>> On 17 May 2018, at 09:16, Callum Smith <cal...@well.ox.ac.uk> wrote:
>>>> OVN Network provider is used, and the node is running 4.2.3 (specifically 2018051606 clean install last night).
>>>>
>>>> Regards,
>>>> Callum
>>>>
>>>> On 17 May 2018, at 07:47, Ales Musil <amu...@redhat.com> wrote:
>>>>> On Thu, May 17, 2018 at 12:01 AM, Callum Smith <cal...@well.ox.ac.uk> wrote:
>>>>>> Dear All,
>>>>>>
>>>>>> Our vGPU installation is progressing, though the VM is failing to start.
>>>>>>
>>>>>> 2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943)
>>>>>> Traceback (most recent call last):
>>>>>>   File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm
>>>>>>     self._run()
>>>>>>   File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in _run
>>>>>>     dom.createWithFlags(flags)
>>>>>>   File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 130, in wrapper
>>>>>>     ret = f(*args, **kwargs)
>>>>>>   File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 92, in wrapper
>>>>>>     return func(inst, *args, **kwargs)
>>>>>>   File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in createWithFlags
>>>>>>     if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed', dom=self)
>>>>>> libvirtError: Cannot get interface MTU on '': No such device
>>>>>>
>>>>>> That's the specific error, some other information. It seems the GPU 'allocation' of uuid against the nvidia-xx mdev type is proceeding correctly, and the device is being created by the VM instantiation but the VM does not su
[ovirt-users] Re: vGPU VM not starting
So as not to mislead you, there appears to be a separate, vGPU-specific issue. I've uploaded the full vdsm.log to Dropbox:

https://www.dropbox.com/s/hlymmf9d6rn12tq/vdsm.vgpu.log?dl=0

Most recently I tried detaching all network devices from the VM and booting it, and I get a different error around the vGPU:

2018-05-17 09:48:24,806+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_hostedengine: rc=0 err= (hooks:110)
2018-05-17 09:48:24,953+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_vfio_mdev: rc=1 err=vgpu: No device with type nvidia-61 is available. (hooks:110)
2018-05-17 09:48:25,069+0100 INFO (vm/1bc9dae8) [root] /usr/libexec/vdsm/hooks/before_vm_start/50_vhostmd: rc=0 err= (hooks:110)
2018-05-17 09:48:25,070+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943)
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm
    self._run()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2862, in _run
    self._custom)
  File "/usr/lib/python2.7/site-packages/vdsm/common/hooks.py", line 153, in before_vm_start
    return _runHooksDir(domxml, 'before_vm_start', vmconf=vmconf)
  File "/usr/lib/python2.7/site-packages/vdsm/common/hooks.py", line 120, in _runHooksDir
    raise exception.HookError(err)
HookError: Hook Error: ('',)

This is despite nvidia-61 being listed as an option on the GPU: https://pastebin.com/bucw21DG

So I think we have two issues here, one relating to the network and one to the GPU. Thanks all for your rapid and very useful help!

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 17 May 2018, at 09:28, Ales Musil <amu...@redhat.com> wrote:

Seems like some vdsm problem with xml generation.
+Francesco On Thu, May 17, 2018 at 10:20 AM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: PS. some other WARN's that come up on the host: WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.org.qemu.guest_agent.0 already removed vdsm WARN Attempting to remove a non existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm WARN Attempting to remove a non existing network: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.ovirt-guest-agent.0 already removed vdsm WARN Attempting to add an existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 09:16, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: OVN Network provider is used, and the node is running 4.2.3 (specifically 2018051606 clean install last night). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 07:47, Ales Musil <amu...@redhat.com<mailto:amu...@redhat.com>> wrote: On Thu, May 17, 2018 at 12:01 AM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Our vGPU installation is progressing, though the VM is failing to start. 
2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943) Traceback (most recent call last): File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm self._run() File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in _run dom.createWithFlags(flags) File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 130, in wrapper ret = f(*args, **kwargs) File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 92, in wrapper return func(inst, *args, **kwargs) File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in createWithFlags if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed', dom=self) libvirtError: Cannot get interface MTU on '': No such device That's the specific error, some other information. It seems the GPU 'allocation' of uuid against the nvidia-xx mdev type is proceeding correctly, and the device is being created by the VM instantiation but the VM does not succeed in going up with this error. Any other logs or information relevant to help diagnose? Reg
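For the hook failure reported in this thread ("No device with type nvidia-61 is available"), one thing worth checking on the host is how many instances of that mdev type the kernel still reports as free. The sketch below reads the standard mdev sysfs layout (/sys/class/mdev_bus/<parent>/mdev_supported_types/<type>/available_instances). The function name and the sysfs_root parameter are made up for illustration, and it is an assumption, not something confirmed in the thread, that the vdsm hook fails for the same reason this check would reveal.

```python
from pathlib import Path


def available_mdev_instances(mdev_type, sysfs_root="/sys/class/mdev_bus"):
    """Return {parent_device: free_instance_count} for one mdev type.

    Hypothetical helper: it only reads sysfs and never creates devices.
    Parent devices that do not offer the type are left out of the result.
    """
    result = {}
    root = Path(sysfs_root)
    if not root.is_dir():
        # No mdev-capable device (or no vGPU host driver) is registered.
        return result
    for parent in sorted(root.iterdir()):
        counter = parent / "mdev_supported_types" / mdev_type / "available_instances"
        if counter.is_file():
            result[parent.name] = int(counter.read_text().strip())
    return result


if __name__ == "__main__":
    # "nvidia-61" is the type named in the hook error above.
    print(available_mdev_instances("nvidia-61"))
```

An empty result would suggest that no parent device offers the type at all, while a count of 0 would suggest the type exists but all instances are already allocated (for example by a device left over from an earlier failed start).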
[ovirt-users] Re: vGPU VM not starting
Some information that appears to be from around the time of installation to the cluster: WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -X libvirt-O-vnet0' failed: Chain 'libvirt-O-vnet0' doesn't exist. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -F libvirt-O-vnet0' failed: Chain 'libvirt-O-vnet0' doesn't exist. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -L libvirt-O-vnet0' failed: Chain 'libvirt-O-vnet0' doesn't exist. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -D POSTROUTING -o vnet0 -j libvirt-O-vnet0' failed: Illegal target name 'libvirt-O-vnet0'. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -X HI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -F HI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -X FI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ip6tables -w2 -w -F FI-vnet0' failed: ip6tables: No chain/target/match by that name. firewalld Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 09:20, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: PS. 
some other WARN's that come up on the host: WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.org.qemu.guest_agent.0 already removed vdsm WARN Attempting to remove a non existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm WARN Attempting to remove a non existing network: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.ovirt-guest-agent.0 already removed vdsm WARN Attempting to add an existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 09:16, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: OVN Network provider is used, and the node is running 4.2.3 (specifically 2018051606 clean install last night). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 07:47, Ales Musil <amu...@redhat.com<mailto:amu...@redhat.com>> wrote: On Thu, May 17, 2018 at 12:01 AM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Our vGPU installation is progressing, though the VM is failing to start. 
2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943) Traceback (most recent call last): File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm self._run() File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in _run dom.createWithFlags(flags) File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 130, in wrapper ret = f(*args, **kwargs) File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 92, in wrapper return func(inst, *args, **kwargs) File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in createWithFlags if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed', dom=self) libvirtError: Cannot get interface MTU on '': No such device That's the specific error, some other information. It seems the GPU 'allocation' of uuid against the nvidia-xx mdev type is proceeding correctly, and the device is being created by the VM instantiation but the VM does not succeed in going up with this error. Any other logs or information relevant to help diagnose? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> ___ Users mailing list -- users@ovirt.org<mailto:users@ovirt.org> To unsubscribe send an email to users-le...@ovirt.org<mailto:users-le...@ovirt.org> Hi Callum, can you share your version of the setup? Also do you use OVS switch type in the cluster? Regards, Ales. -- ALES MUSIL INTERN - rhv network Red Hat EMEA<https://www.redhat.com/> amu...@redhat.com<mailto:amu...@redhat.com> IM: amusil [https://www.redhat.com/files/brand/email/sig-redhat.png]<https://red.ht/sig&g
[ovirt-users] Re: vGPU VM not starting
PS. some other WARN's that come up on the host: WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.org.qemu.guest_agent.0 already removed vdsm WARN Attempting to remove a non existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm WARN Attempting to remove a non existing network: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.ovirt-guest-agent.0 already removed vdsm WARN Attempting to add an existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 09:16, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: OVN Network provider is used, and the node is running 4.2.3 (specifically 2018051606 clean install last night). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 07:47, Ales Musil <amu...@redhat.com<mailto:amu...@redhat.com>> wrote: On Thu, May 17, 2018 at 12:01 AM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Our vGPU installation is progressing, though the VM is failing to start. 
2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943) Traceback (most recent call last): File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm self._run() File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in _run dom.createWithFlags(flags) File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 130, in wrapper ret = f(*args, **kwargs) File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 92, in wrapper return func(inst, *args, **kwargs) File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in createWithFlags if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed', dom=self) libvirtError: Cannot get interface MTU on '': No such device That's the specific error, some other information. It seems the GPU 'allocation' of uuid against the nvidia-xx mdev type is proceeding correctly, and the device is being created by the VM instantiation but the VM does not succeed in going up with this error. Any other logs or information relevant to help diagnose? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> ___ Users mailing list -- users@ovirt.org<mailto:users@ovirt.org> To unsubscribe send an email to users-le...@ovirt.org<mailto:users-le...@ovirt.org> Hi Callum, can you share your version of the setup? Also do you use OVS switch type in the cluster? Regards, Ales. 
-- ALES MUSIL INTERN - rhv network Red Hat EMEA<https://www.redhat.com/> amu...@redhat.com<mailto:amu...@redhat.com> IM: amusil [https://www.redhat.com/files/brand/email/sig-redhat.png]<https://red.ht/sig> ___ Users mailing list -- users@ovirt.org<mailto:users@ovirt.org> To unsubscribe send an email to users-le...@ovirt.org<mailto:users-le...@ovirt.org> ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org
[ovirt-users] Re: Fwd: vGPU VM not starting
OVN Network provider is used, and the node is running 4.2.3 (specifically 2018051606 clean install last night). Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 17 May 2018, at 07:47, Ales Musil <amu...@redhat.com<mailto:amu...@redhat.com>> wrote: On Thu, May 17, 2018 at 12:01 AM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: Dear All, Our vGPU installation is progressing, though the VM is failing to start. 2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943) Traceback (most recent call last): File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm self._run() File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in _run dom.createWithFlags(flags) File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 130, in wrapper ret = f(*args, **kwargs) File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 92, in wrapper return func(inst, *args, **kwargs) File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in createWithFlags if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed', dom=self) libvirtError: Cannot get interface MTU on '': No such device That's the specific error, some other information. It seems the GPU 'allocation' of uuid against the nvidia-xx mdev type is proceeding correctly, and the device is being created by the VM instantiation but the VM does not succeed in going up with this error. Any other logs or information relevant to help diagnose? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. 
cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> ___ Users mailing list -- users@ovirt.org<mailto:users@ovirt.org> To unsubscribe send an email to users-le...@ovirt.org<mailto:users-le...@ovirt.org> Hi Callum, can you share your version of the setup? Also do you use OVS switch type in the cluster? Regards, Ales. -- ALES MUSIL INTERN - rhv network Red Hat EMEA<https://www.redhat.com/> amu...@redhat.com<mailto:amu...@redhat.com> IM: amusil [https://www.redhat.com/files/brand/email/sig-redhat.png]<https://red.ht/sig> ___ Users mailing list -- users@ovirt.org<mailto:users@ovirt.org> To unsubscribe send an email to users-le...@ovirt.org<mailto:users-le...@ovirt.org> ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org
[ovirt-users] vGPU VM not starting
Dear All,

Our vGPU installation is progressing, though the VM is failing to start.

2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed (vm:943)
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in _startUnderlyingVm
    self._run()
  File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in _run
    dom.createWithFlags(flags)
  File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 130, in wrapper
    ret = f(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 92, in wrapper
    return func(inst, *args, **kwargs)
  File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in createWithFlags
    if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed', dom=self)
libvirtError: Cannot get interface MTU on '': No such device

That's the specific error. Some other information: the GPU 'allocation' of a UUID against the nvidia-xx mdev type seems to proceed correctly, and the device is created during VM instantiation, but the VM fails to come up with this error. Are there any other logs or information that would help diagnose this?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk
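The empty name in "Cannot get interface MTU on ''" hints that libvirt was handed an interface whose backing device name is blank. A minimal sketch for checking that in a saved copy of the domain XML follows. It assumes the XML is available as a string and that the blank name would show up as an empty or missing bridge/network/dev attribute on the interface's <source> element; that is a guess, not something confirmed in this thread. The function name is hypothetical.

```python
import xml.etree.ElementTree as ET


def interfaces_without_source(domain_xml):
    """Return MAC addresses ('?' if unknown) of interfaces with no named source.

    Hypothetical helper: it inspects only the <devices>/<interface> elements
    of a libvirt domain XML document passed in as a string.
    """
    root = ET.fromstring(domain_xml)
    suspects = []
    for iface in root.findall("./devices/interface"):
        source = iface.find("source")
        names = []
        if source is not None:
            # The backing device is named in one of these attributes,
            # depending on the interface type.
            names = [source.get(attr, "") for attr in ("bridge", "network", "dev")]
        if not any(names):
            mac = iface.find("mac")
            suspects.append(mac.get("address", "?") if mac is not None else "?")
    return suspects
```

Any MAC address it returns points at a NIC definition worth comparing against the networks actually present on the host.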
[ovirt-users] Re: vGPU setup guide
OK, I guess it was literally just a breath away:

https://blogs.nvidia.com/blog/2018/05/15/red-hat-virtualization-vgpu-support/

So now that it is actually supported, is this guide still relevant?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 14 May 2018, at 21:48, Don Dupuis <donds...@gmail.com> wrote:

No, if you look at the support matrix, there is no RHEV/oVirt. RHEL KVM only supports passthrough, not vGPU! That driver only supports 1-to-1 passthrough, no vGPU profiles. I hope it gets released as soon as RHEV 4.2 is released.

Don

On Mon, May 14, 2018 at 3:33 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:

That should be fine then, because they have done so, right?

https://docs.nvidia.com/grid/6.0/product-support-matrix/

And inside my product manager for NVIDIA I can download "NVIDIA vGPU for RHEL KVM", which comes with the hypervisor driver.

Regards,
Callum

> On 14 May 2018, at 21:19, Don Dupuis <donds...@gmail.com> wrote:
>
> Nvidia vGPU support won't work until Nvidia releases hypervisor drivers for
> RHEV/oVirt.
>
> Don
>
> On Mon, May 14, 2018 at 3:08 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:
> Dear All,
>
> Is this the most current and useful example of implementing vGPUs in oVirt? I
> had understood that 4.2 had NVIDIA GRID support as a flagship feature, but
> this appears to be 4.1.4? It seems a very reasonable and decent guide, I just
> don't want to go down this route if there are alternatives now available in
> 4.2.x.
>
> https://mpolednik.github.io/2017/09/13/vgpu-in-ovirt/
>
> Regards,
> Callum
[ovirt-users] Re: vGPU setup guide
That should be fine then, because they have done so, right?

https://docs.nvidia.com/grid/6.0/product-support-matrix/

And inside my product manager for NVIDIA I can download "NVIDIA vGPU for RHEL KVM", which comes with the hypervisor driver.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

> On 14 May 2018, at 21:19, Don Dupuis <donds...@gmail.com> wrote:
>
> Nvidia vGPU support won't work until Nvidia releases hypervisor drivers for
> RHEV/oVirt.
>
> Don
>
> On Mon, May 14, 2018 at 3:08 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:
> Dear All,
>
> Is this the most current and useful example of implementing vGPUs in oVirt? I
> had understood that 4.2 had NVIDIA GRID support as a flagship feature, but
> this appears to be 4.1.4? It seems a very reasonable and decent guide, I just
> don't want to go down this route if there are alternatives now available in
> 4.2.x.
>
> https://mpolednik.github.io/2017/09/13/vgpu-in-ovirt/
>
> Regards,
> Callum
[ovirt-users] vGPU setup guide
Dear All,

Is this the most current and useful example of implementing vGPUs in oVirt? I had understood that 4.2 had NVIDIA GRID support as a flagship feature, but this appears to be 4.1.4? It seems a very reasonable and decent guide, I just don't want to go down this route if there are alternatives now available in 4.2.x.

https://mpolednik.github.io/2017/09/13/vgpu-in-ovirt/

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk
[ovirt-users] Re: Host install failed - cannot set maintenance or remove
Sorry, this is the existing discussion, duplicated because of the mailing list migration.

Cheers,
Callum

On 13 May 2018, at 13:06, Michael Burman <mbur...@redhat.com> wrote:

Adding Alona,

On Tue, May 8, 2018 at 12:25 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

There appears to be an issue with the host install on these hosts: quite a lot of errors are being written to the logs as they cycle through attempting to activate, failing, and then failing to go into maintenance. I can't remove the hosts from the engine to attempt to reinstall them.

The issue arose from having a statically configured network on the ovirtmgmt network on the hosts before running the host install. There are SQL errors (FK missing) and assorted goodness in there.

Any help greatly appreciated, I'm not sure what the next step forward is from here.

Engine.log on Dropbox: https://www.dropbox.com/s/82iem0ov869yh32/engine.log-20180508.zip?dl=0

Regards,
Callum

--
Michael Burman
Senior Quality engineer - rhv network - redhat israel
Red Hat <https://www.redhat.com>
mbur...@redhat.com M: 0545355725 IM: mburman
[ovirt-users] Host install failed - cannot set maintenance or remove
Dear All,

There appears to be an issue with the host install on these hosts: quite a lot of errors are being written to the logs as they cycle through attempting to activate, failing, and then failing to go into maintenance. I can't remove the hosts from the engine to attempt to reinstall them.

The issue arose from having a statically configured network on the ovirtmgmt network on the hosts before running the host install. There are SQL errors (FK missing) and assorted goodness in there.

Any help greatly appreciated, I'm not sure what the next step forward is from here.

Engine.log on Dropbox: https://www.dropbox.com/s/82iem0ov869yh32/engine.log-20180508.zip?dl=0

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk
[ovirt-users] Re: Host install failed - cannot set maintenance or remove
Dear Michael,

No luck, the cycling is still happening. This is after a hard reset of the hosted-engine VM and the host, and after updating the node versions on everything. Would it be really bad to just delete the nodes from the DB and clean install them?

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 8 May 2018, at 13:44, Michael Burman <mbur...@redhat.com> wrote:

You should only remove the 'required' property from the network. This will release the host from the activating cycle in which it's stuck now.

On Tue, May 8, 2018 at 3:39 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear Michael,

Good to know, I didn't know about that path to configure networks. What's the procedure to remove the failing hosts and re-install? I can't get them into maintenance mode to then install them.

Regards,
Callum

On 8 May 2018, at 12:37, Michael Burman <mbur...@redhat.com> wrote:

Hi Callum,

Looks like you are hitting https://bugzilla.redhat.com/show_bug.cgi?id=1570388

Do you have a required network in your env (other than ovirtmgmt)? Try unchecking 'required' for this network (via Clusters > Networks or Networks > Clusters); this will solve the issue. The bug was fixed in ovirt-engine-4.2.3.4.

Cheers)

On Tue, May 8, 2018 at 12:55 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

There appears to be an issue with the host install on these hosts: quite a lot of errors are being written to the logs as they cycle through attempting to activate, failing, and then failing to go into maintenance. I can't remove the hosts from the engine to attempt to reinstall them.

The issue arose from having a statically configured network on the ovirtmgmt network on the hosts before running the host install. There are SQL errors (FK missing) and assorted goodness in there.

Any help greatly appreciated, I'm not sure what the next step forward is from here.

Engine.log on Dropbox: https://www.dropbox.com/s/82iem0ov869yh32/engine.log-20180508.zip?dl=0

PS. Sorry if this comes through a few times, mailing list membership seems to be having a funny turn this morning.

Regards,
Callum

--
Michael Burman
Senior Quality engineer - rhv network - redhat israel
Red Hat <https://www.redhat.com/>
mbur...@redhat.com M: 0545355725 IM: mburman
[ovirt-users] Re: Host install failed - cannot set maintenance or remove
Dear Michael,

Good to know, I didn't know about that path to configure networks. What's the procedure to remove the failing hosts and re-install? I can't get them into maintenance mode to then install them.

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 8 May 2018, at 12:37, Michael Burman <mbur...@redhat.com> wrote:

Hi Callum,

Looks like you are hitting https://bugzilla.redhat.com/show_bug.cgi?id=1570388

Do you have a required network in your env (other than ovirtmgmt)? Try unchecking 'required' for this network (via Clusters > Networks or Networks > Clusters); this will solve the issue. The bug was fixed in ovirt-engine-4.2.3.4.

Cheers)

On Tue, May 8, 2018 at 12:55 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Dear All,

There appears to be an issue with the host install on these hosts: quite a lot of errors are being written to the logs as they cycle through attempting to activate, failing, and then failing to go into maintenance. I can't remove the hosts from the engine to attempt to reinstall them.

The issue arose from having a statically configured network on the ovirtmgmt network on the hosts before running the host install. There are SQL errors (FK missing) and assorted goodness in there.

Any help greatly appreciated, I'm not sure what the next step forward is from here.

Engine.log on Dropbox: https://www.dropbox.com/s/82iem0ov869yh32/engine.log-20180508.zip?dl=0

PS. Sorry if this comes through a few times, mailing list membership seems to be having a funny turn this morning.

Regards,
Callum

--
Michael Burman
Senior Quality engineer - rhv network - redhat israel
Red Hat <https://www.redhat.com/>
mbur...@redhat.com M: 0545355725 IM: mburman
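Michael's suggestion above is made through the Administration Portal. If the portal is awkward to use while hosts are cycling, the same change can probably be expressed as a REST call. The sketch below only builds the request and does not send it. The endpoint path (clusters/<id>/networks/<id>) and the XML body are my reading of the oVirt v4 REST API and should be checked against the API documentation served by your own engine. The function name and both IDs are placeholders.

```python
def build_unrequire_request(engine_url, cluster_id, network_id):
    """Build (method, url, headers, body) for marking a cluster network
    as not required.

    Hypothetical helper: nothing is sent, so the result can be inspected
    first and then passed to whichever HTTP client is preferred.
    """
    url = "{}/ovirt-engine/api/clusters/{}/networks/{}".format(
        engine_url.rstrip("/"), cluster_id, network_id
    )
    headers = {"Content-Type": "application/xml", "Accept": "application/xml"}
    # Assumed body: only the 'required' flag is changed.
    body = "<network><required>false</required></network>"
    return "PUT", url, headers, body


if __name__ == "__main__":
    # Placeholder engine address and IDs, for illustration only.
    for part in build_unrequire_request(
        "https://engine.example.com", "CLUSTER-ID", "NETWORK-ID"
    ):
        print(part)
```

Building the request separately from sending it keeps the sketch free of credentials and makes it easy to review the URL before anything touches the engine.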
[ovirt-users] Re: Re-attaching ISOs and moving ISOs storage
Dear Fred,

Hosts can indeed access the share; in fact I can manually mount it on the hosts and the engine. I've attached the day's logs from when the ISO domain was detached; again, expect lots of noise. The time pressure on this issue has gone, as we now have a functional fresh install on another node and the ability to export the VMs across using a shared export domain.

https://www.dropbox.com/s/rdcyzoef3j3cjgx/engine.log-20180501.gz?dl=0

Regards,
Callum

--
Callum Smith
Research Computing Core
Wellcome Trust Centre for Human Genetics
University of Oxford
e. cal...@well.ox.ac.uk

On 6 May 2018, at 15:38, Fred Rolland <froll...@redhat.com> wrote:

I am trying to reproduce this on my setup, with no success for now. Can you share the engine log corresponding to the time you tried to detach? What options are currently available? I understand that "Activate" does not work; what about "Detach"? Can your hosts access the ISO domain?

On Fri, May 4, 2018 at 2:09 PM, Callum Smith <cal...@well.ox.ac.uk> wrote:

Is there any sensible way to either clean up the existing ISOs storage or re-attach it? I'm struggling to even export VMs and migrate them elsewhere with this and need to recover them asap.

Regards,
Callum

On 2 May 2018, at 15:09, Callum Smith <cal...@well.ox.ac.uk> wrote:

Attached, thank you for looking into this.

https://HOSTNAME/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae

VMISOs 11770357874688 false 0 5 false ok false backoffice01.cluster /vm-iso nfs v1 false false iso 38654705664 10 false

https://HOSTNAME/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains

tegile-virtman-backup 17519171600384 false 0 5 false ok false maintenance 192.168.64.248 auto /export/virtman/backup nfs v1 false false export 8589934592 10 false
VMStorage 11770357874688 false 118111600640 5 false ok true active backoffice01.cluster /vm-storage2 nfs v4 false false data 38654705664 10 false
tegile-virtman 2190433320960 false 226559524864 5 false ok false active 192.168.64.248 auto /export/virtman/VirtualServerShare_1 nfs v4 false false data 8589934592 10 false
VMISOs 11770357874688 false 0 5 false ok false maintenance backoffice01.cluster /vm-iso nfs v1 false false iso 38654705664 10 false

Regards,
Callum

On 2 May 2018, at 14:46, Fred Rolland <froll...@redhat.com> wrote:

Can you share the REST API data of the Storage domain and Data Center? Here is an example of the URLs; you will need to replace the IDs with the correct ones.
http://MY-SERVER/ovirt-engine/api/v4/storagedomains/13461356-f6f7-4a58-9897-2fac61ff40af<http://my-server/ovirt-engine/api/v4/storagedomains/13461356-f6f7-4a58-9897-2fac61ff40af> http://MY-SERVER/ovirt-engine/api/v4/datacenters/5a5df553-022d-036d-01e8-0071/storagedomains<http://my-server/ovirt-engine/api/v4/datacenters/5a5df553-022d-036d-01e8-0071/storagedomains> On Wed, May 2, 2018 at 12:53 PM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: This is on 4.2.0.2-1, I've linked the main logs to dropbox simply
[ovirt-users] Host install failed - cannot set maintenance or remove
Dear All, There appears to be an issue with the host install on these hosts: quite a lot of errors are being kicked out into the logs as they cycle through attempting to activate, failing, and then failing to go into maintenance. I can't remove the hosts from the engine to attempt to reinstall them. The issue arose from having a statically configured network on the ovirtmgmt network on the hosts before running the host install. There are SQL errors (FK missing) and assorted goodness in there. Any help greatly appreciated; I'm not sure what the next step forward is from here. Engine.log on dropbox: https://www.dropbox.com/s/82iem0ov869yh32/engine.log-20180508.zip?dl=0 PS. Sorry if this comes through a few times, mailing list membership seems to be having a funny turn this morning. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> ___ Users mailing list -- users@ovirt.org To unsubscribe send an email to users-le...@ovirt.org
Re: [ovirt-users] Re-attaching ISOs and moving ISOs storage
Is there any sensible way to either clean up the existing ISOs storage or re-attach it? I'm struggling to even export VMs and migrate them elsewhere with this and need to recover them asap. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 2 May 2018, at 15:09, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: Attached, thank you for looking into this https://HOSTNAME/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae>" id="f5914df0-f46c-4cc0-b666-c929aa0225ae"> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/isattached>" rel="isattached"/> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/reduceluns>" rel="reduceluns"/> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/refreshluns>" rel="refreshluns"/> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/updateovfstore>" rel="updateovfstore"/> VMISOs https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/disksnapshots>" rel="disksnapshots"/> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/files>" rel="files"/> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/diskprofiles>" rel="diskprofiles"/> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/storageconnections>" rel="storageconnections"/> https://virtman.cluster/ovirt-engine/api/v4/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/permissions>" rel="permissions"/> 11770357874688 false 0 5 false ok false backoffice01.cluster /vm-iso nfs v1 false 
false iso 38654705664 10 false https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://HOSTNAME/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/d9f8093b-71cd-423d-9b7d-91a613ef98ba>" id="d9f8093b-71cd-423d-9b7d-91a613ef98ba"> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/d9f8093b-71cd-423d-9b7d-91a613ef98ba/activate>" rel="activate"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/d9f8093b-71cd-423d-9b7d-91a613ef98ba/deactivate>" rel="deactivate"/> tegile-virtman-backup https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/d9f8093b-71cd-423d-9b7d-91a613ef98ba/disks>" rel="disks"/> 17519171600384 false 0 5 false ok false maintenance 192.168.64.248 auto /export/virtman/backup nfs v1 false false export 8589934592 10 false https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/017dd620-65ec-4fa4-8d46-67737dedd7a1>" id="017dd620-65ec-4fa4-8d46-67737dedd7a1"> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/017dd620-65ec-4fa4-8d46-67737dedd7a1/activate>" rel="activate"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/017dd620-65ec-4fa4-8d46-67737dedd7a1/deactivate>" rel="deactivate"/> VMStorage https://virtman.cluster/o
Re: [ovirt-users] Re-attaching ISOs and moving ISOs storage
s://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/e572c874-17f8-4ad8-9bac-b6f563a9fc17>" id="e572c874-17f8-4ad8-9bac-b6f563a9fc17"> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/e572c874-17f8-4ad8-9bac-b6f563a9fc17/activate>" rel="activate"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/e572c874-17f8-4ad8-9bac-b6f563a9fc17/deactivate>" rel="deactivate"/> tegile-virtman https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/e572c874-17f8-4ad8-9bac-b6f563a9fc17/disks>" rel="disks"/> 2190433320960 false 226559524864 5 false ok false active 192.168.64.248 auto /export/virtman/VirtualServerShare_1 nfs v4 false false data 8589934592 10 false https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae>" id="f5914df0-f46c-4cc0-b666-c929aa0225ae"> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/activate>" rel="activate"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304/storagedomains/f5914df0-f46c-4cc0-b666-c929aa0225ae/deactivate>" rel="deactivate"/> VMISOs 11770357874688 false 0 5 false ok false maintenance backoffice01.cluster /vm-iso nfs v1 false false iso 38654705664 10 false 
https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> https://virtman.cluster/ovirt-engine/api/v4/datacenters/5a54bf81-0228-02bc-0358-0304>" id="5a54bf81-0228-02bc-0358-0304"/> [cid:921E5257-26F4-4E0D-83B0-68F996842D08@well.ox.ac.uk] Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 2 May 2018, at 14:46, Fred Rolland <froll...@redhat.com<mailto:froll...@redhat.com>> wrote: Can you share the REST API data of the Storage domain and Data Center? Here an example of the URLs, you will need to replace with correct ids. http://MY-SERVER/ovirt-engine/api/v4/storagedomains/13461356-f6f7-4a58-9897-2fac61ff40af<http://my-server/ovirt-engine/api/v4/storagedomains/13461356-f6f7-4a58-9897-2fac61ff40af> http://MY-SERVER/ovirt-engine/api/v4/datacenters/5a5df553-022d-036d-01e8-0071/storagedomains<http://my-server/ovirt-engine/api/v4/datacenters/5a5df553-022d-036d-01e8-0071/storagedomains> On Wed, May 2, 2018 at 12:53 PM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: This is on 4.2.0.2-1, I've linked the main logs to dropbox simply because they're big, full of noise right now. https://www.dropbox.com/s/f8q3m5amro2a1b2/engine.log?dl=0 https://www.dropbox.com/s/uods85jk65halo3/vdsm.log?dl=0 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 2 May 2018, at 10:43, Fred Rolland <froll...@redhat.com<mailto:froll...@redhat.com>> wrote: Which version are you using? Can you provide the whole log? For some reason, it looks like the Vdsm thinks that the Storage Domain is not part of the pool. On Wed, May 2, 2018 at 11:20 AM, Callum Smith
Re: [ovirt-users] Re-attaching ISOs and moving ISOs storage
This is on 4.2.0.2-1, I've linked the main logs to dropbox simply because they're big, full of noise right now. https://www.dropbox.com/s/f8q3m5amro2a1b2/engine.log?dl=0 https://www.dropbox.com/s/uods85jk65halo3/vdsm.log?dl=0 Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 2 May 2018, at 10:43, Fred Rolland <froll...@redhat.com<mailto:froll...@redhat.com>> wrote: Which version are you using? Can you provide the whole log? For some reason, it looks like the Vdsm thinks that the Storage Domain is not part of the pool. On Wed, May 2, 2018 at 11:20 AM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: State is maintenance for the ISOs storage. I've extracted what is hopefully the relevant bits of the log. VDSM.log (SPM) 2018-05-02 09:16:03,455+0100 INFO (ioprocess communication (179084)) [IOProcess] Starting ioprocess (__init__:447) 2018-05-02 09:16:03,456+0100 INFO (ioprocess communication (179091)) [IOProcess] Starting ioprocess (__init__:447) 2018-05-02 09:16:03,461+0100 INFO (jsonrpc/0) [vdsm.api] FINISH activateStorageDomain error=Storage domain not in pool: u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, pool=5a54bf81-0228-02bc-0358-0304' from=:::192.168.64.254,58968, flow_id=93433989-8e26-48a9-bd3a-2ab95f296c08, task_id=7f21f911-348f-45a3-b79c-e3cb11642035 (api:50) 2018-05-02 09:16:03,461+0100 ERROR (jsonrpc/0) [storage.TaskManager.Task] (Task='7f21f911-348f-45a3-b79c-e3cb11642035') Unexpected error (task:875) Traceback (most recent call last): File "/usr/lib/python2.7/site-packages/vdsm/storage/task.py", line 882, in _run return fn(*args, **kargs) File "", line 2, in activateStorageDomain File "/usr/lib/python2.7/site-packages/vdsm/common/api.py", line 48, in method ret = func(*args, **kwargs) File "/usr/lib/python2.7/site-packages/vdsm/storage/hsm.py", line 1256, in activateStorageDomain pool.activateSD(sdUUID) File 
"/usr/lib/python2.7/site-packages/vdsm/storage/securable.py", line 79, in wrapper return method(self, *args, **kwargs) File "/usr/lib/python2.7/site-packages/vdsm/storage/sp.py", line 1130, in activateSD self.validateAttachedDomain(dom) File "/usr/lib/python2.7/site-packages/vdsm/storage/securable.py", line 79, in wrapper return method(self, *args, **kwargs) File "/usr/lib/python2.7/site-packages/vdsm/storage/sp.py", line 557, in validateAttachedDomain raise se.StorageDomainNotInPool(self.spUUID, dom.sdUUID) StorageDomainNotInPool: Storage domain not in pool: u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, pool=5a54bf81-0228-02bc-0358-0304' 2018-05-02 09:16:03,461+0100 INFO (jsonrpc/0) [storage.TaskManager.Task] (Task='7f21f911-348f-45a3-b79c-e3cb11642035') aborting: Task is aborted: "Storage domain not in pool: u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, pool=5a54bf81-0228-02bc-0358-0304'" - code 353 (task:1181) 2018-05-02 09:16:03,462+0100 ERROR (jsonrpc/0) [storage.Dispatcher] FINISH activateStorageDomain error=Storage domain not in pool: u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, pool=5a54bf81-0228-02bc-0358-0304' (dispatcher:82) engine.log 2018-05-02 09:16:02,326+01 INFO [org.ovirt.engine.core.bll.storage.domain.ActivateStorageDomainCommand] (default task-20) [93433989-8e26-48a9-bd3a-2ab95f296c08] Lock Acquired to object 'EngineLock:{exclusiveLocks='[f5914df0-f46c-4cc0-b666-c929aa0225ae=STORAGE]', sharedLocks=''}' 2018-05-02 09:16:02,376+01 INFO [org.ovirt.engine.core.bll.storage.domain.ActivateStorageDomainCommand] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] Running command: ActivateStorageDomainCommand internal: false. 
Entities affected : ID: f5914df0-f46c-4cc0-b666-c929aa0225ae Type: StorageAction group MANIPULATE_STORAGE_DOMAIN with role type ADMIN 2018-05-02 09:16:02,385+01 INFO [org.ovirt.engine.core.bll.storage.domain.ActivateStorageDomainCommand] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] Lock freed to object 'EngineLock:{exclusiveLocks='[f5914df0-f46c-4cc0-b666-c929aa0225ae=STORAGE]', sharedLocks=''}' 2018-05-02 09:16:02,385+01 INFO [org.ovirt.engine.core.bll.storage.domain.ActivateStorageDomainCommand] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] ActivateStorage Domain. Before Connect all hosts to pool. Time: Wed May 02 09:16:02 BST 2018 2018-05-02 09:16:02,407+01 INFO [org.ovirt.engine.core.bll.storage.connection.ConnectStorageToVdsCommand] (EE-ManagedThreadFactory-engine-Thread-33456) [40a82b47] Running command: ConnectStorageToVdsCommand internal: true. Entities affected : ID: aaa0--000
Re: [ovirt-users] Re-attaching ISOs and moving ISOs storage
.core.vdsbroker.vdsbroker.ConnectStorageServerVDSCommand] (EE-ManagedThreadFactory-engine-Thread-33456) [40a82b47] FINISH, ConnectStorageServerVDSCommand, return: {da392861-aedc-4f1e-97f4-6919fb01f1e9=0}, log id: 23ce648f 2018-05-02 09:16:02,450+01 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.ActivateStorageDomainVDSCommand] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] START, ActivateStorageDomainVDSCommand( ActivateStorageDomainVDSCommandParameters:{storagePoolId='5a54bf81-0228-02bc-0358-0304', ignoreFailoverLimit='false', storageDomainId='f5914df0-f46c-4cc0-b666-c929aa0225ae'}), log id: 5c864594 2018-05-02 09:16:02,635+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] EVENT_ID: IRS_BROKER_COMMAND_FAILURE(10,803), VDSM command ActivateStorageDomainVDS failed: Storage domain not in pool: u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, pool=5a54bf81-0228-02bc-0358-0304' 2018-05-02 09:16:02,635+01 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.ActivateStorageDomainVDSCommand] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] Command 'ActivateStorageDomainVDSCommand( ActivateStorageDomainVDSCommandParameters:{storagePoolId='5a54bf81-0228-02bc-0358-0304', ignoreFailoverLimit='false', storageDomainId='f5914df0-f46c-4cc0-b666-c929aa0225ae'})' execution failed: IRSGenericException: IRSErrorException: Storage domain not in pool: u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, pool=5a54bf81-0228-02bc-0358-0304' 2018-05-02 09:16:02,635+01 INFO [org.ovirt.engine.core.vdsbroker.irsbroker.ActivateStorageDomainVDSCommand] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] FINISH, ActivateStorageDomainVDSCommand, log id: 5c864594 2018-05-02 09:16:02,635+01 ERROR [org.ovirt.engine.core.bll.storage.domain.ActivateStorageDomainCommand] 
(EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] Command 'org.ovirt.engine.core.bll.storage.domain.ActivateStorageDomainCommand' failed: EngineException: org.ovirt.engine.core.vdsbroker.irsbroker.IrsOperationFailedNoFailoverException: IRSGenericException: IRSErrorException: Storage domain not in pool: u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, pool=5a54bf81-0228-02bc-0358-0304' (Failed with error StorageDomainNotInPool and code 353) 2018-05-02 09:16:02,636+01 INFO [org.ovirt.engine.core.bll.storage.domain.ActivateStorageDomainCommand] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] Command [id=22b0f3c1-9a09-4e26-8096-d83465c8f4ee]: Compensating CHANGED_STATUS_ONLY of org.ovirt.engine.core.common.businessentities.StoragePoolIsoMap; snapshot: EntityStatusSnapshot:{id='StoragePoolIsoMapId:{storagePoolId='5a54bf81-0228-02bc-0358-0304', storageId='f5914df0-f46c-4cc0-b666-c929aa0225ae'}', status='Maintenance'}. 2018-05-02 09:16:02,660+01 ERROR [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (EE-ManagedThreadFactory-engine-Thread-33455) [93433989-8e26-48a9-bd3a-2ab95f296c08] EVENT_ID: USER_ACTIVATE_STORAGE_DOMAIN_FAILED(967), Failed to activate Storage Domain VMISOs (Data Center Default) by admin@internal-authz Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> On 2 May 2018, at 08:44, Fred Rolland <froll...@redhat.com<mailto:froll...@redhat.com>> wrote: Hi, Can you provide logs from engine and Vdsm(SPM)? What is the state now? Thanks, Fred On Tue, May 1, 2018 at 4:11 PM, Callum Smith <cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk>> wrote: Dear All, It appears that clicking "detach" on the ISO storage domain is a really bad idea. This has gotten half way through the procedure and now can't be recovered from. 
Is there any advice for re-attaching the ISO storage domain manually? An NFS mount didn't add it back to the "pool" unfortunately. On a separate note, is it possible to migrate this storage to a new location? And if so how. Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> ___ Users mailing list Users@ovirt.org<mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
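For anyone following this thread with big, noisy logs: the relevant lines are the ones carrying "Storage domain not in pool", and the two UUIDs in them are the same ones needed for the REST API URLs Fred asked for. A minimal sketch in Python of pulling those IDs out of a log excerpt and building the two URLs. The helper names find_not_in_pool and api_urls are made up for illustration and are not part of vdsm or the engine; the message format and URL layout are taken from the excerpts quoted in this thread, and HOSTNAME is the same placeholder used there.

```python
import re

# Pattern for the message as it appears in the vdsm.log and engine.log
# excerpts in this thread, for example:
#   Storage domain not in pool: u'domain=<uuid>, pool=<uuid>'
NOT_IN_POOL = re.compile(
    r"Storage domain not in pool: u?'domain=(?P<domain>[0-9a-f-]+), "
    r"pool=(?P<pool>[0-9a-f-]+)'"
)


def find_not_in_pool(log_text):
    """Return the sorted, de-duplicated (domain, pool) ID pairs in log_text."""
    pairs = {(m.group("domain"), m.group("pool"))
             for m in NOT_IN_POOL.finditer(log_text)}
    return sorted(pairs)


def api_urls(engine_host, domain_id, pool_id):
    """Build the storage domain and data center URLs in the form used in this thread."""
    base = f"https://{engine_host}/ovirt-engine/api/v4"
    return (
        f"{base}/storagedomains/{domain_id}",
        f"{base}/datacenters/{pool_id}/storagedomains",
    )


if __name__ == "__main__":
    # One line copied from the vdsm.log excerpt above.
    line = ("2018-05-02 09:16:03,462+0100 ERROR (jsonrpc/0) [storage.Dispatcher] "
            "FINISH activateStorageDomain error=Storage domain not in pool: "
            "u'domain=f5914df0-f46c-4cc0-b666-c929aa0225ae, "
            "pool=5a54bf81-0228-02bc-0358-0304' (dispatcher:82)")
    for domain_id, pool_id in find_not_in_pool(line):
        for url in api_urls("HOSTNAME", domain_id, pool_id):
            print(url)
```

The same error is repeated several times per failed activation, which is why the pairs are de-duplicated before the URLs are built.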
[ovirt-users] Re-attaching ISOs and moving ISOs storage
Dear All, It appears that clicking "detach" on the ISO storage domain is a really bad idea. The detach got halfway through the procedure and now can't be recovered from. Is there any advice for re-attaching the ISO storage domain manually? An NFS mount didn't add it back to the "pool", unfortunately. On a separate note, is it possible to migrate this storage to a new location? And if so, how? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk<mailto:cal...@well.ox.ac.uk> ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users