[ovirt-users] Re: ovirt small network outage causes HE root xfs crash due to race condition

2018-12-23 Thread Mike Lykov

21.12.2018 14:24, Mike Lykov пишет:


I have a 4.2.7 setup hyperconverged, two deployed VM Engine images and i 
have 20-30 second network outage. After some pinging to start engine on 
host 1, then 2, then again 1 Engine image stuck at

"Probing EDD (edd=off to disable)... _"
as here: https://bugzilla.redhat.com/show_bug.cgi?id=1569827


Now I looking to the logs.
Full /var/log archives are here:
https://yadi.sk/d/XZ5jJfQLN6QMlA (HE engine logs) - 36 Mb
https://yadi.sk/d/bZ0TYGxFoHGgIQ (ovirtnode6 logs) - 144  Mb

I do some CCs in this email to personal addresses, if i's not relevant - 
please ignore.


Host nodes (centos 7.5) named ovirtnode1,5,6. Timeouts (in ha agent) are 
default. Sanlock are configured (as i think)

HE running on ovirtnode6, and spare HE deployed on ovirtnode1.

There is two network links: ovirtmgmt over "ovirtmgmt: port 
1(enp59s0f0)" and glusterfs storage network over ib0 interface 
(different subnet)


messages log on ovirtnode6:
That outage:

---
Dec 21 12:32:56 ovirtnode6 kernel: bnx2x :3b:00.0 enp59s0f0: NIC 
Link is Down
Dec 21 12:32:56 ovirtnode6 kernel: ovirtmgmt: port 1(enp59s0f0) entered 
disabled state
Dec 21 12:33:13 ovirtnode6 kernel: bnx2x :3b:00.0 enp59s0f0: NIC 
Link is Up, 1 Mbps full duplex, Flow control: ON - receive & transmit
Dec 21 12:33:13 ovirtnode6 kernel: IPv6: ADDRCONF(NETDEV_CHANGE): 
enp59s0f0: link becomes ready
Dec 21 12:33:13 ovirtnode6 kernel: ovirtmgmt: port 1(enp59s0f0) entered 
forwarding state
Dec 21 12:33:13 ovirtnode6 NetworkManager[1715]:  
[1545381193.2204] device (enp59s0f0): carrier: link connected

---

There is 17 second. at 33:13 link are back. BUT all events lead to crash 
follow later:


HA agent log:
--
MainThread::INFO::2018-12-21 
12:32:59,540::states::444::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(consume) 
Engine vm running on localhost
MainThread::INFO::2018-12-21 
12:32:59,662::hosted_engine::491::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_monitoring_loop) 
Current state EngineUp (score: 3400)
MainThread::INFO::2018-12-21 
12:33:09,797::states::136::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(score) 
Penalizing score by 1280 due to gateway status
MainThread::INFO::2018-12-21 
12:33:09,798::hosted_engine::491::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_monitoring_loop) 
Current state EngineUp (score: 2120)
MainThread::ERROR::2018-12-21 
12:33:19,815::states::436::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(consume) 
Host ovirtnode1.miac (id 1) score is significantly better than local 
score, shutting down VM on this host

--


syslog messages:

Dec 21 12:33:19 ovirtnode6 journal: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Host 
ovirtnode1.miac (id 1) score is significantly better than local score, 
shutting down VM on this host
Dec 21 12:33:29 ovirtnode6 journal: ovirt-ha-agent 
ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Engine VM 
stopped on localhost
Dec 21 12:33:37 ovirtnode6 kernel: ovirtmgmt: port 3(vnet1) entered 
disabled state

Dec 21 12:33:37 ovirtnode6 kernel: device vnet1 left promiscuous mode
Dec 21 12:33:37 ovirtnode6 kernel: ovirtmgmt: port 3(vnet1) entered 
disabled state
Dec 21 12:33:37 ovirtnode6 NetworkManager[1715]:  
[1545381217.1796] device (vnet1): state change: disconnected -> 
unmanaged (reason 'unmanaged', sys-iface-state: 'removed')
Dec 21 12:33:37 ovirtnode6 NetworkManager[1715]:  
[1545381217.1798] device (vnet1): released from master device ovirtmgmt
Dec 21 12:33:37 ovirtnode6 libvirtd: 2018-12-21 08:33:37.192+: 2783: 
**error : qemuMonitorIO:719 : internal error: End of file 
from qemu monitor*  - WHAT IS THIS?

Dec 21 12:33:37 ovirtnode6 kvm: 2 guests now active
Dec 21 12:33:37 ovirtnode6 systemd-machined: Machine qemu-2-HostedEngine 
terminated.
Dec 21 12:33:37 ovirtnode6 firewalld[1693]: WARNING: COMMAND_FAILED: 
'/usr/sbin/iptables -w2 -w -D libvirt-out -m physdev 
--physdev-is-bridged --physdev-out vnet1 -g FP-vnet1' failed: iptables 
v1.4.21: goto 'FP-vnet1' is not a chain#012#0

12Try `iptables -h' or 'iptables --help' for more information.

Dec 21 12:33:55 ovirtnode6 kernel: ovirtmgmt: port 3(vnet1) entered 
blocking state
Dec 21 12:33:55 ovirtnode6 kernel: ovirtmgmt: port 3(vnet1) entered 
disabled state

Dec 21 12:33:55 ovirtnode6 kernel: device vnet1 entered promiscuous mode
Dec 21 12:33:55 ovirtnode6 kernel: ovirtmgmt: port 3(vnet1) entered 
blocking state
Dec 21 12:33:55 ovirtnode6 kernel: ovirtmgmt: port 3(vnet1) entered 
forwarding state
Dec 21 12:33:55 ovirtnode6 lldpad: recvfrom(Event interface): No buffer 
space available
Dec 21 12:33:55 ovirtnode6 NetworkManager[1715]:  
[1545381235.8086] manager: (vnet1): new Tun device 
(/org/freedesktop/NetworkManager/Devices/37)
Dec 21 12:33:55 ovirtnode6 NetworkM

[ovirt-users] Ovirt does not support AMD EPYC CPUs?

2018-12-23 Thread Erick Perez
Hi,
In its current release, OVIRT only shows AMD Opteron G1,G2,G3,G4,G5 and no 
mention
of EPYC.

This thread mentioned that a respin of Ovirt will address this, but never 
happened.
https://lists.ovirt.org/archives/list/users@ovirt.org/message/2PEAYGK5PE33WLS3T6ATGQEUXI25SEZT/


comments?
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CLK4GIK5IVFQM3FDYGFBF5SBFIGGSJS3/


[ovirt-users] ISO Domain Problems

2018-12-23 Thread JC Clark

Hello Fellow users,

Platform : Ovirt Engine 4.1

Problem : ISO Domain server has crashed. It is a separate NFS server.  I 
am unable to replace the ISO Domain.  I have the old one in maintenance 
but it won't detach.  It says.




VDSM command ActivateStorageDomainVDS failed: Storage domain does not 
exist: (u'0ad098e9-65e7-494a-90f7-e42949da3f85',)


Which is quite true it does not exist..

Any suggestions?

Thank you

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/DTN3DDG6D6XC25O5W7XSQPWZ4252WJBN/


[ovirt-users] ISO Domain Problems

2018-12-23 Thread JC Clark

Hello Fellow users,

Platform : Ovirt Engine 4.1

Problem : ISO Domain server has crashed. It is a separate NFS server.  I 
am unable to replace the ISO Domain.  I have the old one in maintenance 
but it won't detach.  It says.




VDSM command ActivateStorageDomainVDS failed: Storage domain does not 
exist: (u'0ad098e9-65e7-494a-90f7-e42949da3f85',)


Which is quite true it does not exist..

Any suggestions?

Thank you

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/M3DN6WFGWVQXEKMFSXLSFFE7YT3VUYWJ/


[ovirt-users] Re: Acquire an XML dump of a VM oVirt?

2018-12-23 Thread Arik Hadas
On Thu, Dec 20, 2018 at 6:28 PM Jacob Green  wrote:

> What if you cannot run the VM, so its not running on any specific host.
> But you want the XML to identify the  information.
>
>
> Thank you.
>
> On 12/20/2018 09:10 AM, Benny Zlotnik wrote:
>
> You can run `virsh -r dumpxml  `  on the relevant host
>
> On Thu, Dec 20, 2018, 16:17 Jacob Green 
>>  How does one get an XML dump of a VM from ovirt? I have seen ovirt
>> do it in the engine.log, but not sure how to force it to generate one
>> when I need it.
>>
>
oVirt doesn't store the domain xml internally.
The domain xml is generated only when trying to start the vm (that's the
output you've seen in engine.log).
I'm afraid there is no other trigger to force generating that xml at the
moment.
But the generation of the xml is mostly 1:1 mapping of some configuration
that is stored differently in oVirt and can be found via the ui/rest-api.
What do you look for exactly?


>
>>
>> Thank you.
>>
>> --
>> Jacob Green
>>
>> Systems Admin
>>
>> American Alloy Steel
>>
>> 713-300-5690
>> ___
>> Users mailing list -- users@ovirt.org
>> To unsubscribe send an email to users-le...@ovirt.org
>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
>> oVirt Code of Conduct:
>> https://www.ovirt.org/community/about/community-guidelines/
>> List Archives:
>> https://lists.ovirt.org/archives/list/users@ovirt.org/message/YV7K4GQZID2UC2SPS3PNDEKQUDZ5HLGV/
>>
>
>
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/LNQTWNT3HLXPXOPZOUMBYTV4HOORAQ75/
>
>
> --
> Jacob Green
>
> Systems Admin
>
> American Alloy Steel
>
> 713-300-5690
>
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/JH4K6SH2DB7EB52KKE2CTD43PJFUMMVX/
>
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CTN4U5QJUU2LFXO7E3UYHGDIP2K3ZEFJ/


[ovirt-users] ISO Domain Problems

2018-12-23 Thread JC Clark

Hello Fellow users,

Platform : Ovirt Engine 4.1

Problem : ISO Domain server has crashed. It is a separate NFS server.  I 
am unable to replace the ISO Domain.  I have the old one in maintenance 
but it won't detach.  It says.




VDSM command ActivateStorageDomainVDS failed: Storage domain does not 
exist: (u'0ad098e9-65e7-494a-90f7-e42949da3f85',)


Which is quite true it does not exist..

Any suggestions?

Thank you
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/D7GCFCJZOV5TQYSFQODJ2UYYBUWMQY3B/


[ovirt-users] Failed to add host to oVirt

2018-12-23 Thread Eyal Shenitzky
Hey,

I am failing to add a host to oVirt due to the following error:

2018-12-23 11:15:29,482+0200 ERROR otopi.context context._executeMethod:152
Failed to execute stage 'Environment customization': Cannot find a valid
baseurl for repo: ovirt
-master-centos-gluster5/7Server/x86_64

Does someone encounter this issue?


-- 
Regards,
Eyal Shenitzky
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/VJR5K5OSBOUI57W3FVKE2J7YW7KQUWAX/