[ovirt-users] Re: vGPU VM not starting

2018-06-24 Thread femi adegoke
Any updates, did this get resolved?

[ovirt-users] Re: vGPU VM not starting

2018-05-21 Thread Callum Smith
Dear Ales, 4.2.3.5-1. Through extensive testing done with the help of Martin Polednik, the issues with the vGPU startup appear to be within the NVIDIA drivers, so that issue is now being pursued with NVIDIA. The issue with the NICs' MTU appears to have gone away with the upgrade of

[ovirt-users] Re: vGPU VM not starting

2018-05-21 Thread Francesco Romani
On 05/21/2018 02:42 PM, Callum Smith wrote: > Dear Ales, > > 4.2.3.5-1 > > Through extensive testing done with the help of Martin Polednik the > issues with the vGPU startup appear to be within the NVIDIA drivers, > so continuation of that issue is now going through NVIDIA. > > The issue with the

[ovirt-users] Re: vGPU VM not starting

2018-05-21 Thread Ales Musil
On Mon, May 21, 2018 at 1:15 PM, Francesco Romani wrote: > > On 05/17/2018 12:01 AM, Callum Smith wrote: > > Dear All, > > > > Our vGPU installation is progressing, though the VM is failing to start. > > > > 2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] > >

[ovirt-users] Re: vGPU VM not starting

2018-05-21 Thread Francesco Romani
On 05/17/2018 12:01 AM, Callum Smith wrote: > Dear All, > > Our vGPU installation is progressing, though the VM is failing to start. > > 2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm] > (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process > failed (vm:943) > Traceback

[ovirt-users] Re: vGPU VM not starting

2018-05-21 Thread Callum Smith
Dear All, I'm still having the same problems, is this a bug or something that's configured incorrectly? Regards, Callum -- Callum Smith Research Computing Core Wellcome Trust Centre for Human Genetics University of Oxford e. cal...@well.ox.ac.uk On 18 May 2018,

[ovirt-users] Re: vGPU VM not starting

2018-05-18 Thread Callum Smith
Yep, creating the mdev manually works, and in fact, like I said previously, the VM does actually create an mdev successfully, as you can see the UUID of the device (and it is correctly identifiable through /sys/class/mdev_bus/${DEVICE_ADDR}/${UUID}/mdev_type/name). In this specific case to help

[ovirt-users] Re: vGPU VM not starting

2018-05-18 Thread Martin Polednik
On 18/05/18 13:42 +0200, Francesco Romani wrote: Hi, On 05/17/2018 10:56 AM, Callum Smith wrote: In an attempt not to mislead you guys as well, there appears to be a separate, vGPU specific, issue. https://www.dropbox.com/s/hlymmf9d6rn12tq/vdsm.vgpu.log?dl=0 I've uploaded the full vdsm.log

[ovirt-users] Re: vGPU VM not starting

2018-05-17 Thread Callum Smith
Dear All, Similar issues with a clean install https://www.dropbox.com/s/jf9pwapohn5dq5p/vdsm.gpu2.log?dl=0 Above is the dropbox of the log of the clean install. This VM has a custom "mdev_type" of "nvidia-53" which relates to a specific GRID P40-24Q instance. Even looking in

[ovirt-users] Re: vGPU VM not starting

2018-05-17 Thread Callum Smith
Dear Yaniv, Please see my most recent response: https://www.dropbox.com/s/hlymmf9d6rn12tq/vdsm.vgpu.log?dl=0 I'm doing a clean install of the host right now to see if doing the exact same procedure a second time produces different results (this way lies madness, but we have excited bosses

[ovirt-users] Re: vGPU VM not starting

2018-05-17 Thread Yaniv Kaul
It'd be easier if you could share the complete vdsm log. Perhaps file a bug and we can investigate it? Y. On Thu, May 17, 2018 at 11:25 AM, Callum Smith wrote: > Some information that appears to be from around the time of installation > to the cluster: > > WARNING:

[ovirt-users] Re: vGPU VM not starting

2018-05-17 Thread Callum Smith
In an attempt not to mislead you guys as well, there appears to be a separate, vGPU-specific issue. https://www.dropbox.com/s/hlymmf9d6rn12tq/vdsm.vgpu.log?dl=0 I've uploaded the full vdsm.log to Dropbox. Most recently I tried unmounting all network devices from the VM and booting it and i

[ovirt-users] Re: vGPU VM not starting

2018-05-17 Thread Ales Musil
Seems like some vdsm problem with xml generation. +Francesco On Thu, May 17, 2018 at 10:20 AM, Callum Smith wrote: > PS. some other WARN's that come up on the host: > > WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3- >

[ovirt-users] Re: vGPU VM not starting

2018-05-17 Thread Callum Smith
Some information that appears to be from around the time of installation to the cluster: WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat -X libvirt-O-vnet0' failed: Chain 'libvirt-O-vnet0' doesn't exist. firewalld WARNING: COMMAND_FAILED: '/usr/sbin/ebtables --concurrent -t nat

[ovirt-users] Re: vGPU VM not starting

2018-05-17 Thread Callum Smith
PS. Some other WARNs that come up on the host: WARN File: /var/lib/libvirt/qemu/channels/1bc9dae8-a0ea-44b3-9103-5805100648d0.org.qemu.guest_agent.0 already removed vdsm WARN Attempting to remove a non existing net user: ovirtmgmt/1bc9dae8-a0ea-44b3-9103-5805100648d0 vdsm WARN Attempting to