[ovirt-users] Re: Dead agent

2022-06-15 Thread Sketch

On Wed, 15 Jun 2022, Valerio Luccio wrote:


I have an ovirt 4.4 installation whit self-hosted engine where the agent
seems to have died. The VMs are still running, so I assume that the engine
itself is still running (is this a wrong assumption ?). Can I restart the
agent without affecting the running VMs, that is how will restarting the
agent affect the running VMs ? If I can restart the agent, what's the
correct way of doing it ?


If the engine is down, the VMs will continue to run.  You just won't be 
able to start/migrate/configure/etc them.


If the engine VM is still running, you may want to SSH into it and look at 
the state of the system to see if you can see what went wrong.


systemctl status 'ovirt*' may tell you if just a single service is down 
(such as ovirt-engine).  You might also check the logs in 
/var/log/ovirt-engine


If the VM is down or inaccessible, SSH into one of the hosts capable of 
running the engine and run the following to check VM status:


hosted-engine --vm-status

This should tell you if and whre it's running.  If it isn't dead, You can 
stop it with:


hosted-engine --vm-shutdown

Check the status and wait until it's actually down, then you can start it 
up again:


hosted-engine --vm-start
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/KWGZQ4CETYECKR5MLCVNDHNHIERSY2CQ/


[ovirt-users] Re: Ovirt manager engine certificate expired

2022-06-15 Thread Don Dupuis
Hello
After another try with engine-setup, I seem to have new certificates that
was created today, but now I get this error when I try to log into Admin
Portal and also in engine.log "PKIX path building failed:
sun.security.provider.certpath.SunCertPathBuilderException: unable to find
valid certification path to requested target". I also  have General
SSLEngine problem messages, but I know that is related to the fact that all
of my 32 hypervisor certificates are expired also. I wanted to get the
engine back going again, and then tackle the hypervisors. What am I missing?

Thanks
Don

On Tue, Jun 14, 2022 at 11:29 AM Don Dupuis  wrote:

> This is on a standalone engine and it didn't prompt me to upgrade
> certificate.
>
> Don
>
> On Tue, Jun 14, 2022 at 11:09 AM Don Dupuis  wrote:
>
>> Hello
>> I have an older installation that was running just great and the
>> certificate expired, it is version 4.2.8.2. I ran the engine-setup as per
>> this "access.redhat.com/solutions/4780411". It didn't update the
>> certificate or the dates and stil have the same issue. Any ideas to fix
>> this issue?
>>
>> Thanks
>> Don
>>
>
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RYAHHSV7WLVYTGRGL7M3WNTTRTLNUYFK/


[ovirt-users] Dead agent

2022-06-15 Thread Valerio Luccio

Hello all,

I have an ovirt 4.4 installation whit self-hosted engine where the agent 
seems to have died. The VMs are still running, so I assume that the 
engine itself is still running (is this a wrong assumption ?). Can I 
restart the agent without affecting the running VMs, that is how will 
restarting the agent affect the running VMs ? If I can restart the 
agent, what's the correct way of doing it ?


Thanks,

--
Valerio Luccio  
High Performance Computing  10 Astor Place, Room 416D
New York University New York, NY 10003

   "In an open world, who needs windows or gates ?"
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/JA3RHEJCYV5DV4CVIPDLKHJDDLX56LXI/


[ovirt-users] Re: oVirt 4.5 linux guest vm with host device added to it fails to start

2022-06-15 Thread Milan Zamazal
Don Dupuis  writes:

> Hello
> Anyone have any ideas?
>
> Don
>
> On Fri, Jun 10, 2022 at 11:45 AM Don Dupuis  wrote:
>
>> THis is for version oVirt 4.5.0.8-1. Sorry left out the exact release.
>>
>> Don
>>
>> On Fri, Jun 10, 2022 at 11:41 AM Don Dupuis  wrote:
>>
>>> Hello
>>> I have a RHEL 8.6 based hypervisor with a Mellanox ConnectX-5 IB card
>>> installed with SRIOV enabled. The host device I am assigning is
>>> pci__af_00_2. The card is working as I can talk to other infiniband
>>> interfaces on other servers. Below is the output of lspci.
>>> 3b:00.0 Ethernet controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5]
>>> 3b:00.1 Ethernet controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5]
>>> af:00.0 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5]
>>> af:00.1 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>> af:00.2 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>> af:00.3 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>> af:00.4 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>>
>>> The linux vm is configured as Q35 Chipset with UEFI, 16 cpus, numa
>>> enabled, and cpu pinning enabled. OS is RHEL 7.9. As soon as I start the
>>> vm, I get an immediate error message stating "Cannot run VM. There is no
>>> host that satisfies current scheduling constraints. See below for details:,
>>> The host rvsh002 did not satisfy internal filter HostDevice because some of
>>> the required host devices are unavailable." If I remove the host device
>>> from the vm config, then it starts and runs fine. This setup was working
>>> just fine on RHEL8.4 and oVirt 4.4.7 using the proper driver for RHEL 8.4.

Engine apparently cannot find a host with enough CPUs and free memory,
matching the NUMA and CPU pinning configurations, and having the given
host device available.  According to the log, rvsh002 doesn't have the
host device, other hosts apparently don't satisfy some of the other
conditions.  Also, isn't the VM pinned to some hosts?

Maybe someone could provide a better advice, but if you think there is a
host satisfying all the conditions, you can try to start the VM there
with "Run Once" and see if Engine provides a reason why it cannot be
started there.

>>> Here is the engine.log after I press the run button.
>>> 2022-06-10 11:22:10,506-05 INFO  [org.ovirt.engine.core.bll.RunVmCommand]
>>> (default task-1) [81144b66-e5f9-474e-a922-e2ce49cdc8ca] Lock Acquired to
>>> object
>>> 'EngineLock:{exclusiveLocks='[de54b903-7204-4966-95a3-05f64ed17f68=VM]',
>>> sharedLocks=''}'
>>> 2022-06-10 11:22:10,520-05 INFO
>>>  [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (default
>>> task-1) [81144b66-e5f9-474e-a922-e2ce49cdc8ca] START,
>>> IsVmDuringInitiatingVDSCommand(
>>> IsVmDuringInitiatingVDSCommandParameters:{vmId='de54b903-7204-4966-95a3-05f64ed17f68'}),
>>> log id: 6faf22a5
>>> 2022-06-10 11:22:10,520-05 INFO
>>>  [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (default
>>> task-1) [81144b66-e5f9-474e-a922-e2ce49cdc8ca] FINISH,
>>> IsVmDuringInitiatingVDSCommand, return: false, log id: 6faf22a5
>>> 2022-06-10 11:22:10,560-05 INFO
>>>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-1)
>>> [] Candidate host 'rvsh002' ('f68352c2-6ddc-44ae-a19b-9262e92327f8') was
>>> filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'HostDevice'
>>> (correlation id: null)
>>> 2022-06-10 11:22:10,569-05 ERROR
>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>>> (default task-1) [] EVENT_ID: USER_FAILED_RUN_VM(54), Failed to run VM
>>> ws006 due to a failed validation: [Cannot run VM. There is no host that
>>> satisfies current scheduling constraints. See below for details:, The host
>>> rvsh002 did not satisfy internal filter HostDevice because some of the
>>> required host devices are unavailable.] (User: admin@internal-authz).
>>> 2022-06-10 11:22:10,569-05 WARN  [org.ovirt.engine.core.bll.RunVmCommand]
>>> (default task-1) [] Validation of action 'RunVm' failed for user
>>> admin@internal-authz. Reasons:
>>> VAR__ACTION__RUN,VAR__TYPE__VM,SCHEDULING_ALL_HOSTS_FILTERED_OUT,VAR__FILTERTYPE__INTERNAL,$hostName
>>> rvsh002,$filterName
>>> HostDevice,VAR__DETAIL__HOST_DEVICE_UNAVAILABLE,SCHEDULING_HOST_FILTERED_REASON_WITH_DETAIL
>>> 2022-06-10 11:22:10,570-0
>>>
>>> There was nothing in the vdsm.log on the hypervisor related to this issue
>>> that I could see after hitting the run button.

Engine couldn't find a matching host so the VM is not attempted to start 
anywhere.

>>> Thanks
>>> Don
>>>
>>
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code 

[ovirt-users] Re: oVirt 4.5 linux guest vm with host device added to it fails to start

2022-06-15 Thread Milan Zamazal
Don Dupuis  writes:

> Hello
> Anyone have any ideas?
>
> Don
>
> On Fri, Jun 10, 2022 at 11:45 AM Don Dupuis  wrote:
>
>> THis is for version oVirt 4.5.0.8-1. Sorry left out the exact release.
>>
>> Don
>>
>> On Fri, Jun 10, 2022 at 11:41 AM Don Dupuis  wrote:
>>
>>> Hello
>>> I have a RHEL 8.6 based hypervisor with a Mellanox ConnectX-5 IB card
>>> installed with SRIOV enabled. The host device I am assigning is
>>> pci__af_00_2. The card is working as I can talk to other infiniband
>>> interfaces on other servers. Below is the output of lspci.
>>> 3b:00.0 Ethernet controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5]
>>> 3b:00.1 Ethernet controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5]
>>> af:00.0 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5]
>>> af:00.1 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>> af:00.2 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>> af:00.3 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>> af:00.4 Infiniband controller: Mellanox Technologies MT27800 Family
>>> [ConnectX-5 Virtual Function]
>>>
>>> The linux vm is configured as Q35 Chipset with UEFI, 16 cpus, numa
>>> enabled, and cpu pinning enabled. OS is RHEL 7.9. As soon as I start the
>>> vm, I get an immediate error message stating "Cannot run VM. There is no
>>> host that satisfies current scheduling constraints. See below for details:,
>>> The host rvsh002 did not satisfy internal filter HostDevice because some of
>>> the required host devices are unavailable." If I remove the host device
>>> from the vm config, then it starts and runs fine. This setup was working
>>> just fine on RHEL8.4 and oVirt 4.4.7 using the proper driver for RHEL 8.4.

Engine apparently cannot find a host with enough CPUs and free memory,
matching the NUMA and CPU pinning configurations, and having the given
host device available.  According to the log, rvsh002 doesn't have the
host device, other hosts apparently don't satisfy some of the other
conditions.  Also, isn't the VM pinned to some hosts?

Maybe someone could provide a better advice, but if you think there is a
host satisfying all the conditions, you can try to start the VM there
with "Run Once" and see if Engine provides a reason why it cannot be
started there.

>>> Here is the engine.log after I press the run button.
>>> 2022-06-10 11:22:10,506-05 INFO  [org.ovirt.engine.core.bll.RunVmCommand]
>>> (default task-1) [81144b66-e5f9-474e-a922-e2ce49cdc8ca] Lock Acquired to
>>> object
>>> 'EngineLock:{exclusiveLocks='[de54b903-7204-4966-95a3-05f64ed17f68=VM]',
>>> sharedLocks=''}'
>>> 2022-06-10 11:22:10,520-05 INFO
>>>  [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (default
>>> task-1) [81144b66-e5f9-474e-a922-e2ce49cdc8ca] START,
>>> IsVmDuringInitiatingVDSCommand(
>>> IsVmDuringInitiatingVDSCommandParameters:{vmId='de54b903-7204-4966-95a3-05f64ed17f68'}),
>>> log id: 6faf22a5
>>> 2022-06-10 11:22:10,520-05 INFO
>>>  [org.ovirt.engine.core.vdsbroker.IsVmDuringInitiatingVDSCommand] (default
>>> task-1) [81144b66-e5f9-474e-a922-e2ce49cdc8ca] FINISH,
>>> IsVmDuringInitiatingVDSCommand, return: false, log id: 6faf22a5
>>> 2022-06-10 11:22:10,560-05 INFO
>>>  [org.ovirt.engine.core.bll.scheduling.SchedulingManager] (default task-1)
>>> [] Candidate host 'rvsh002' ('f68352c2-6ddc-44ae-a19b-9262e92327f8') was
>>> filtered out by 'VAR__FILTERTYPE__INTERNAL' filter 'HostDevice'
>>> (correlation id: null)
>>> 2022-06-10 11:22:10,569-05 ERROR
>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>>> (default task-1) [] EVENT_ID: USER_FAILED_RUN_VM(54), Failed to run VM
>>> ws006 due to a failed validation: [Cannot run VM. There is no host that
>>> satisfies current scheduling constraints. See below for details:, The host
>>> rvsh002 did not satisfy internal filter HostDevice because some of the
>>> required host devices are unavailable.] (User: admin@internal-authz).
>>> 2022-06-10 11:22:10,569-05 WARN  [org.ovirt.engine.core.bll.RunVmCommand]
>>> (default task-1) [] Validation of action 'RunVm' failed for user
>>> admin@internal-authz. Reasons:
>>> VAR__ACTION__RUN,VAR__TYPE__VM,SCHEDULING_ALL_HOSTS_FILTERED_OUT,VAR__FILTERTYPE__INTERNAL,$hostName
>>> rvsh002,$filterName
>>> HostDevice,VAR__DETAIL__HOST_DEVICE_UNAVAILABLE,SCHEDULING_HOST_FILTERED_REASON_WITH_DETAIL
>>> 2022-06-10 11:22:10,570-0
>>>
>>> There was nothing in the vdsm.log on the hypervisor related to this issue
>>> that I could see after hitting the run button.

Engine couldn't find a matching host so the VM is not attempted to start 
anywhere.

>>> Thanks
>>> Don
>>>
>>
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code