[ovirt-users] Re: Hosted Engine can't migrate

2020-11-17 Thread Yedidyah Bar David
Hi,

On Wed, Nov 18, 2020 at 8:08 AM McNamara, Bradley wrote:
>
> I'm new to oVirt, but not new to virtualization technologies.  I've run into 
> a wall, but the solution has gotta be simple.  I've seen other threads with 
> the same issue, but a solution is never posted.
>
> I've followed the guide perfectly.  I get  the self-hosted engine up and 
> running with no issues.  I add hosts to the cluster, but it won't let me 
> migrate the HE because of Hosted Engine HA score of "N/A" on every host 
> except the one it was initially deployed on.  The physical servers are 
> exactly the same:  same networks, storage, OS, patches, etc.  The additional 
> hosts are added thru the HE GUI and are added without issue.  Is there a 
> secret button I gotta push?
>
> What am I missing?  This is with 4.3 on CentOS 7.9.

Can other VMs migrate (ones that use the same storage server, so you know
it's not a storage/network issue)?

Please check/share /var/log/ovirt-hosted-engine-ha/* on all hosts, as
well as output of 'hosted-engine --vm-status' on all of them.
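
For comparing the hosts side by side, here is a minimal Python sketch that condenses the status into one line per host. It assumes the JSON form of the command ('hosted-engine --vm-status --json') and that each host entry carries "hostname", "score" and "engine-status" fields; those field names are assumptions, not something confirmed in this thread, so adjust them to what your hosts actually print.

```python
import json


def summarize_ha_status(json_text):
    """Return (hostname, score, engine_health) for every host entry.

    Assumed layout: a JSON object whose per-host entries are dictionaries
    keyed by host ID, next to scalar keys such as "global_maintenance".
    The field names used below are assumptions for illustration.
    """
    status = json.loads(json_text)
    rows = []
    for key in sorted(status):
        entry = status[key]
        if not isinstance(entry, dict):
            continue  # skip scalar entries such as "global_maintenance"
        engine = entry.get("engine-status")
        health = engine.get("health") if isinstance(engine, dict) else None
        rows.append((entry.get("hostname"), entry.get("score"), health))
    return rows


# Possible usage on a host (not executed here):
#   import subprocess
#   out = subprocess.run(["hosted-engine", "--vm-status", "--json"],
#                        capture_output=True, text=True, check=True).stdout
#   for row in summarize_ha_status(out):
#       print(row)
```

A host that is missing from the list, or listed without a score, would be the first one to look at in the ovirt-hosted-engine-ha logs.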

Good luck and best regards,
-- 
Didi
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/WZBXWHRGLAWWT675ROBSDPB6W5KTRG3F/


[ovirt-users] Hosted Engine can't migrate

2020-11-17 Thread McNamara, Bradley
I'm new to oVirt, but not new to virtualization technologies.  I've run into a 
wall, but the solution has gotta be simple.  I've seen other threads with the 
same issue, but a solution is never posted.

I've followed the guide perfectly.  I get  the self-hosted engine up and 
running with no issues.  I add hosts to the cluster, but it won't let me 
migrate the HE because of Hosted Engine HA score of "N/A" on every host except 
the one it was initially deployed on.  The physical servers are exactly the 
same:  same networks, storage, OS, patches, etc.  The additional hosts are 
added thru the HE GUI and are added without issue.  Is there a secret button I 
gotta push?

What am I missing?  This is with 4.3 on CentOS 7.9.

HELP!  Thanks.  :-)
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/TKNPG3TRZGZIUCVWHL6SPFOMZPFDXGYP/


[ovirt-users] OVIRT and KUBEVIRT

2020-11-17 Thread Andre Meyer Pflug
Has anyone built a non-persistent VDI on oVirt using KubeVirt (on OKD)
as the infrastructure for the VMs?

We plan a deployment for 1,000,000 concurrent users...

Any help is welcome!

Kind regards,


Andre Meyer Pflug
DDESK LLC
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/Y3SYDD7IECRVNRPFOJOJMIZ36KHYRIPR/


[ovirt-users] Found a host rebooting - ways to watch?

2020-11-17 Thread Chris Adams
I just noticed that one of my oVirt physical hosts has been rebooting
due to an apparent hardware voltage fault.  It's a Dell, and I've got
their tools installed and am monitoring status, but the issue clears
itself.  It has apparently been doing this for a bit now, and we didn't
catch it because (a) there weren't any VMs on it (probably were the
first time but they were restarted elsewhere fast enough that it wasn't
noticed) and (b) it reboots fast enough that at most it pops up in our
monitoring system for one pass and then clears so our NOC either didn't
see it or assumed it was okay since it cleared.

oVirt has been logging alerts when it happens, but seeing that requires
someone to log in and check the logs (and we've got a bunch of different
systems to manage, including multiple oVirt clusters, so nobody is doing
that on a regular basis).  We monitor most things with SNMP and/or CLI
checks (we have PRTG, Nagios, and LibreNMS for various different
things).

What are people doing to monitor the health of their oVirt systems?  Is
it possible to get alerts emailed to admins?  Is there any SNMP support
in oVirt to allow external systems to monitor its health?  This setup is
on 4.3.10 if that matters.

-- 
Chris Adams 
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/APZCECDMZDOGFBMXKAPSDJJENUSEOEOJ/


[ovirt-users] Re: Upgrading engine from 4.3 -> 4.4

2020-11-17 Thread Gianluca Cecchi
On Tue, Nov 17, 2020 at 6:41 PM David Metcalf 
wrote:

> Followed this for standalone hosts:
> https://ovirt.org/documentation/upgrade_guide/#Upgrading_from_4-3
>
> but get this message..
>
> engine-backup --mode=restore --file=engine-backup-2020-11-16.bak
> --provision-all-databases
> ...
> FATAL: Backup was created by version '4.3.9.4' and can not be restored
> using the installed version 4.4.3.11
>
>
As already answered in another recent thread, you need to be on the latest
4.3 release, which for oVirt is 4.3.10.
So, in particular, the backup has to be taken while the engine is at 4.3.10.
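
As a rough illustration of that rule, the version string from the FATAL message can be compared against 4.3.10 before attempting a restore. This is only a sketch of the comparison, not how engine-backup itself decides; the function names are made up for the example.

```python
def parse_version(text):
    """Turn a dotted version string such as '4.3.9.4' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def backup_is_recent_enough(backup_version, minimum="4.3.10"):
    """Return True if backup_version is at or above the minimum release.

    Hypothetical helper: only as many components as `minimum` has are
    compared, so '4.3.10.4' counts as 4.3.10.
    """
    required = parse_version(minimum)
    found = parse_version(backup_version)
    return found[:len(required)] >= required


# The backup from the error message above was taken on 4.3.9.4:
print(backup_is_recent_enough("4.3.9.4"))
```

Tuples of integers are compared here rather than strings, because a plain string comparison would wrongly rank "4.3.9" above "4.3.10".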

Gianluca
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5AK4EQ77PGCOAP6EWDAJSBUJEDPAT5CY/


[ovirt-users] Upgrading engine from 4.3 -> 4.4

2020-11-17 Thread David Metcalf
Followed this for standalone hosts:
https://ovirt.org/documentation/upgrade_guide/#Upgrading_from_4-3

but get this message..

engine-backup --mode=restore --file=engine-backup-2020-11-16.bak 
--provision-all-databases
...
FATAL: Backup was created by version '4.3.9.4' and can not be restored using 
the installed version 4.4.3.11
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5AOVTB4LSMPRG2TZM64L3TOELWJSMXHF/


[ovirt-users] Re: How to create a backup in event of hardware failure of a single hosted engine?

2020-11-17 Thread Alex K
On Tue, Nov 17, 2020, 11:08  wrote:

> Hello everyone,
> Currently we have a single bare-metal server that hosts oVirt and GlusterFS,
> which will later be converted to a 3-node HCI setup. Another bare-metal
> server is coming this week, and I was planning to set it up as a backup at first.
>
> Is it possible to deploy a new hosted engine, then create a Gluster volume
> for the first one as a backup domain and attach it to the new one if a
> hardware failure occurs? Or is there another kind of setup that can be done?
>
I would add the new server as an additional host, first preparing the
gluster volumes under the hood. This way you achieve HA for the engine and
the guest VMs. When the third server arrives, repeat the procedure to get a
proper 3-node replica self-hosted engine setup.

> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/EYOW4WNUACRCTWLG2IRXEGISYFKBWAU7/
>
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/KZ5L3E6UXPHY56WGERVGNEGFGTLQB7RS/


[ovirt-users] Re: ovirt 4.3 - locked image vm - unable to remove a failed deploy of a guest dom

2020-11-17 Thread 3c . monitor
Hi.
taskcleaner and unlock_entity are just two of the items from the Red Hat
solution I mentioned; I've already used them, but they didn't work.
The locked image is still there!
Thanks.
M.
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/74QMVFTM356GHFIHWPUKMIZLIRDOPYHT/


[ovirt-users] Re: ovirt 4.3 - locked image vm - unable to remove a failed deploy of a guest dom

2020-11-17 Thread Alex K
On Tue, Nov 17, 2020, 16:12 <3c.moni...@gruppofilippetti.it> wrote:

> Hi all.
> I've deployed a VM from a corrupted template (its disk is missing, but
> I only checked that later...).
> My software version is 4.3.
> Now I have an unmanaged VM in the inventory and I'm unable to remove it.
> Its reference is "locked image".
> I've restarted ovirt-engine many times on the self-hosted engine, and the
> hosts too, but with no benefit.
> So, what now?
> I've also consulted: https://access.redhat.com/solutions/396753
> but still no results.
> No tasks or items appear to be "locked"...
> Any other ideas?
>
You might try using some of the tools mentioned at
https://www.ovirt.org/develop/developer-guide/db-issues/helperutilities.html
There is an unlock tool.

> Thanks a lot.
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/LNT3SXR62FVCBAW4FKC273BA23HO74PV/


[ovirt-users] Re: oVirt Node Crash

2020-11-17 Thread Anton Louw via Users
Hi Sandro,

Have you perhaps seen anything in the SOS report that could shed some light on 
the issues?

Thanks


Anton Louw
Cloud Engineer: Storage and Virtualization
__
D: 087 805 1572 | M: N/A
A: Rutherford Estate, 1 Scott Street, Waverley, Johannesburg
anton.l...@voxtelecom.co.za

www.vox.co.za



From: Anton Louw
Sent: 16 November 2020 07:30
To: Sandro Bonazzola ; Arik Hadas ; 
Dominik Holler 
Cc: users@ovirt.org; Johan Koen 
Subject: RE: [ovirt-users] oVirt Node Crash

I have also attached the SOS report as requested

From: Anton Louw
Sent: 16 November 2020 06:54
To: Sandro Bonazzola <sbona...@redhat.com>; Arik Hadas <aha...@redhat.com>; 
Dominik Holler <dhol...@redhat.com>
Cc: users@ovirt.org; Johan Koen <johan.k...@voxtelecom.co.za>
Subject: RE: [ovirt-users] oVirt Node Crash

Hi Sandro,

Thanks for the response. I logged onto oVirt this morning, and I see the node 
is in an “Unassigned” state. I can ping it, but cannot SSH, so there is 
something that is causing the host to be unresponsive.

On Saturday after I sent the mail, I opened a console to the node, and I saw 
the below entries before logging in:

audit:backlog limit exceeded

I then tried the solution of increasing the buffer size in the audit.rules file 
in /etc/audit/rules.d/ , as per below, but it did not resolve the issue.

## First rule - delete all
-D

## Increase the buffers to survive stress events.
## Make this bigger for busy systems
-b 8192

## Set failure mode to syslog
-f 1

Is it possible to upgrade the node to 4.4 while the engine is still on 4.3?

Thanks

From: Sandro Bonazzola <sbona...@redhat.com>
Sent: 13 November 2020 18:39
To: Anton Louw <anton.l...@voxtelecom.co.za>; Arik Hadas 
<aha...@redhat.com>; Dominik Holler <dhol...@redhat.com>
Cc: users@ovirt.org; Johan Koen <johan.k...@voxtelecom.co.za>
Subject: Re: [ovirt-users] oVirt Node Crash



On Fri, Nov 13, 2020 at 17:37 Sandro Bonazzola 
<sbona...@redhat.com> wrote:


On Fri, Nov 13, 2020 at 13:38 Anton Louw via Users 
<users@ovirt.org> wrote:

Hi Everybody,

I have built a new host which has been running fine for the last couple of 
days. I noticed today that the host crashed, but it is not giving me a reason 
as to why.

It happened at 13:45 today, but I have given time before that on the logs as 
well.

Is there something I am missing here?

Not related to the crash, but I see in the logs that 5 out of 20 guests have 
qemu guest agent not responding.

Also you seem to have some issues with some firewalld rules. (Maybe +Dominik 
Holler would like to have a look)

I don't see anything explaining why the host got rebooted.

Still related to guest agent I find a bit alarming the following lines:
Nov 13 13:29:34 jb2-node03 libvirtd: 2020-11-13 11:29:34.294+: 12603: error 
: qemuDomainAgentAvailable:9144 : Guest agent is not responding: QEMU guest 
agent is not connected
Nov 13 13:29:34 jb2-node03 vdsm[13843]: ERROR Shutdown by QEMU Guest Agent 
failed#012Traceback (most recent call last):#012  File 
"/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 5304, in 
qemuGuestAgentShutdown#012
self._dom.shutdownFlags(libvirt.VIR_DOMAIN_SHUTDOWN_GUEST_AGENT)#012  File 
"/usr/lib/python2.7/site-packages/vdsm/virt/virdomain.py", line 100, in f#012   
 ret = attr(*args, **kwargs)#012  File 
"/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py", line 131, 
in wrapper#012ret = f(*args, **kwargs)#012  File 
"/usr/lib/python2.7/site-packages/vdsm/common/function.py", line 94, in 
wrapper#012return func(inst, *args, **kwargs)#012  File 
"/usr/lib64/python2.7/site-packages/libvirt.py", line 2517, in 
shutdownFlags#012if ret == -1: raise libvirtError 
('virDomainShutdownFlags() failed', dom=self)#012libvirtError: Guest agent is 
not responding: QEMU guest agent is not connected
Nov 13 13:29:42 jb2-node03 kernel: vlan0077: port 11(vnet15) entered disabled 
state
Nov 13 13:29:42 jb2-node03 kernel: device vnet15 left promiscuous mode
Nov 13 13:29:42 jb2-node03 kernel: vlan0077: port 11(vnet15) entered disabled 
state
Nov 13 13:29:42 jb2-node03 NetworkManager[6027]:   [1605266982.6539] 
device (vnet15): state change: disconnected -> unmanaged (reason 'unmanaged', 
sys-iface-state: 'removed')
Nov 13 13:29:42 jb2-node03 NetworkManager[6027]:   [1605266982.6550] 
device (vnet15): released from master device vlan0077
Nov 13 13:29:42 jb2-node03 libvirtd: 2020-11-13 11:29:42.669+: 12557: error 
: qemuMonitorIO:718 : internal error: End of file from qemu monitor

+Arik Hadas any clue?

About the crash, can you please provide full sos report from the host? the log 
you provided is not enough to understand what caused the reported crash

Also, given python2 is used here, I assume you're on 4.3 or older. I would 

[ovirt-users] ovirt 4.3 - locked image vm - unable to remove a failed deploy of a guest dom

2020-11-17 Thread 3c . monitor
Hi all.
I've deployed a VM from a corrupted template (its disk is missing, but I only 
checked that later...).
My software version is 4.3.
Now I have an unmanaged VM in the inventory and I'm unable to remove it.
Its reference is "locked image".
I've restarted ovirt-engine many times on the self-hosted engine, and the hosts 
too, but with no benefit.
So, what now?
I've also consulted: https://access.redhat.com/solutions/396753
but still no results.
No tasks or items appear to be "locked"...
Any other ideas?
Thanks a lot.
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/7IK2IAJTEMNR7IM4FJYVHJRJQTQZD537/


[ovirt-users] Re: ovirt-shell SSL Error

2020-11-17 Thread Luca 'remix_tj' Lorenzetto
Hello,

I don't know the tool, but maybe insecure = True will help.
Alternatively, try importing oVirt's CA certificate into your system
trust store.
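
Before changing anything, it may help to confirm that the file named in ca_file really exists and contains a PEM certificate. The following Python sketch does only that; it parses an .ovirtshellrc-style text with the standard configparser module and makes no claim about how ovirt-shell itself interprets these options. The function name and the wording of the findings are made up for the example.

```python
import configparser
import os

PEM_HEADER = "-----BEGIN CERTIFICATE-----"


def check_shell_config(config_text):
    """Return a list of findings about the [ovirt-shell] TLS settings.

    Hypothetical helper: it only inspects the 'insecure' and 'ca_file'
    options and checks the CA file on disk.
    """
    # interpolation=None so that '%' characters (e.g. in passwords) are kept literally
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(config_text)
    section = parser["ovirt-shell"]

    insecure = section.get("insecure", "False").strip().lower() == "true"
    ca_file = section.get("ca_file", "None").strip()

    if insecure:
        return ["insecure = True, so ca_file is not examined here"]
    if ca_file in ("", "None"):
        return ["no ca_file configured"]
    if not os.path.isfile(ca_file):
        return ["ca_file does not exist: " + ca_file]
    with open(ca_file, "r", errors="replace") as handle:
        if PEM_HEADER not in handle.read():
            return ["ca_file has no PEM certificate header: " + ca_file]
    return []


# Possible usage:
#   with open(os.path.expanduser("~/.ovirtshellrc")) as rc:
#       print(check_shell_config(rc.read()))
```

An empty list only means the file looks like a PEM certificate; it does not prove that it is the CA that signed the engine's certificate.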

Luca

On Tue, Nov 17, 2020 at 2:23 PM FRIES, Dirk, SHS Informatik via Users
 wrote:
>
> Hello there,
>
> I am trying to connect via ovirt-shell to my ovirt manager.
>
> To do this I've downloaded the CA File 
> (https://127.0.0.1/ovirt-engine/services/pki-resource?resource=ca-certificate&format=X509-PEM-CA)
>  to /tmp and set up my config in .ovirtshellrc :
>
> [cli]
> autoconnect = True
> autopage = True
> [ovirt-shell]
> username = adminuser
> renew_session = False
> timeout = None
> extended_prompt = False
> url = https://127.0.0.1/api
> insecure = False
> kerberos = False
> filter = False
> session_timeout = None
> ca_file = /tmp/pki-resource?resource=ca-certificate
> dont_validate_cert_chain = True
> key_file = None
> password = somepassword
> cert_file = None
>
> So when I am trying to connect I get the following error:
>
> (60, "Peer's certificate issuer has been marked as not trusted by the user.")
>
> While researching I also found a pretty similar issue here 
> https://lists.ovirt.org/archives/list/users@ovirt.org/thread/IYG6AQPII25366IZQ7E3VD622CKP2OSC/
>  but unfortunately there is no solution posted.
>
> Does anyone have a hint as to what could cause this problem, or where else 
> to look?
>
> Thanks a lot !
>
> Bye, Dirk
>



-- 
"It is absurd to employ men of excellent intelligence to perform
calculations that could be entrusted to anyone if machines were
used"
Gottfried Wilhelm von Leibnitz, Philosopher and Mathematician (1646-1716)

"The Internet is the largest library in the world.
But the problem is that the books are all scattered on the floor"
John Allen Paulos, Mathematician (1945-living)

Luca 'remix_tj' Lorenzetto, http://www.remixtj.net
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/JUT2MRVN7CNAF7GVM4XO2OSU7FZ5ZWQZ/


[ovirt-users] ovirt-shell SSL Error

2020-11-17 Thread FRIES, Dirk, SHS Informatik via Users
Hello there,

I am trying to connect via ovirt-shell to my ovirt manager.

To do this I've downloaded the CA File 
(https://127.0.0.1/ovirt-engine/services/pki-resource?resource=ca-certificate&format=X509-PEM-CA)
 to /tmp and set up my config in .ovirtshellrc :

[cli]
autoconnect = True
autopage = True
[ovirt-shell]
username = adminuser
renew_session = False
timeout = None
extended_prompt = False
url = https://127.0.0.1/api
insecure = False
kerberos = False
filter = False
session_timeout = None
ca_file = /tmp/pki-resource?resource=ca-certificate
dont_validate_cert_chain = True
key_file = None
password = somepassword
cert_file = None

So when I am trying to connect I get the following error:

(60, "Peer's certificate issuer has been marked as not trusted by the user.")

While researching I also found a pretty similar issue here 
https://lists.ovirt.org/archives/list/users@ovirt.org/thread/IYG6AQPII25366IZQ7E3VD622CKP2OSC/
 but unfortunately there is no solution posted.

Does anyone have a hint as to what could cause this problem, or where else 
to look?

Thanks a lot !

Bye, Dirk


List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/F3PR4RBMUQTFPRAS2QRBA3M5N2FXS6E3/


[ovirt-users] Re: Really slow manager node

2020-11-17 Thread Nicolás

Hi Toni,

(re-adding the users@ovirt.org list)

On 16/11/20 at 23:37, tfe...@swissonline.ch wrote:

> Hi,
>
> Chances are that this performance issue is unrelated to oVirt.
>
> The first thing that comes to my mind is that the system may have a
> bottleneck on the I/O.
>
> Please have a look at:
>
> iostat -x 5
>
> The system may have problems with disk performance if %util > 95 and
> avgqu-sz > 3 over longer periods of time.

Actually %util never goes beyond 85%, with the average at about 70%; for
avgqu-sz, the average value is about 25 (with peaks of 160 as far as
I could see).
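
As a rough sketch, the rule of thumb quoted above (%util > 95 and avgqu-sz > 3) can be written as a small Python check and applied to the averages reported here. The function name and the returned keys are made up for the example, and the thresholds are simply the ones from the quoted advice.

```python
def assess_disk_pressure(util_pct, avg_queue, util_limit=95.0, queue_limit=3.0):
    """Evaluate the two iostat thresholds separately and combined.

    util_pct is the %util column and avg_queue the avgqu-sz column of
    `iostat -x`, ideally averaged over a longer period.
    """
    util_high = util_pct > util_limit
    queue_high = avg_queue > queue_limit
    return {
        "util_high": util_high,
        "queue_high": queue_high,
        "saturated": util_high and queue_high,
    }


# Averages reported in this message: %util about 70, avgqu-sz about 25
print(assess_disk_pressure(70, 25))
```

With these numbers only the queue-size condition is exceeded, so the combined rule does not flag the disk even though the request queue is well above the suggested limit.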


> If that's not the case, you may next want to have a look at limits
> (/etc/security/limits.conf).
>
> There may also be default limits which may be too low. (nofile, nproc
> come to my mind)

This file seems not to have been touched (all the content is commented out).

> Are there any errors or warnings in the messages or audit logs?

Nothing special, I've checked the last couple of days and everything
seems ok.

> And also htop, iotop, vmstat may help to get a quick overview.

Yep, I've used most of these commands to check, but I've been unable to
see anything conclusive.

In my opinion the slowness could be caused by not using SSD disks and
by a low polling interval in the DWHd, but that's just my
assumption and I've been unable to get any data that would confirm this.

Thanks for the help!

Nico

> Hope this helps,
>
> Toni


On 16.11.20 14:08, Nicolás wrote:

Hi,

We're running oVirt 4.3.8, and although this is a problem we've had
for a long time (I would say since 4.0.x), I decided to look for
help in case anything can be done.

Our environment is heavily used by users in our University (about
3000 users), and currently our oVirt infrastructure has 1928 virtual
machines, 882 of which are currently running. We have a separate
physical machine for the manager node, and the problem is that this
machine is very, very, very slow, even though it has (from my point of
view) enough physical resources to run efficiently.

By slow I mean even SSH access to it takes about 10 seconds, not
just the admin/user portals. Any operation takes long enough to make
the experience uncomfortable for our users (entering the VM portal,
starting a VM, opening a console...).


Node machine's parameters are:
18GB of RAM memory
1 processor with 12CPUs
300GB SCSI Disk, local storage. No Storage Domain is stored in this 
node machine.


Currently, most consuming processes are:

28802 ovirt 20   0 3302876   1,5g  25988 S   1,0  8,7 22:46.36 ovirt-engine -server -XX:+TieredCompilation -Xms1024M -Xmx1024M -Xss1M -Djava.aw+
28701 ovirt 20   0 5465432 807704  13140 S   5,9  4,4  3:15.53 ovirt-engine-dwhd -Dorg.ovirt.engine.dwh.settings=/tmp/tmp8wtnTA/settings.proper+


# free -m
              total        used        free      shared  buff/cache   available
Mem:          17886        6669        2734         255        8482       10625
Swap:          6143        1002        5141

There are also a lot of postgresql processes:

postgres  2186  0.0  0.0 261812  4136 ?    Ss   jun05  37:39 /opt/rh/rh-postgresql10/root/usr/bin/postmaster -D /var/opt/rh/rh-postgresql10/lib/pgsql/data
postgres  3176  0.0  0.0 216084   656 ?    Ss   jun05   0:00 postgres: logger process
postgres  3290  0.0  0.2 262204 37476 ?    Ss   jun05  77:31 postgres: checkpointer process
postgres  3291  0.3  0.2 262052 36960 ?    Ss   jun05 754:02 postgres: writer process
postgres  3292  0.0  0.0 261812  1988 ?    Ss   jun05 100:51 postgres: wal writer process
postgres  3293  0.0  0.2 262380 36748 ?    Ss   jun05  21:58 postgres: autovacuum launcher process
postgres  3294  0.1  0.0 219216  1412 ?    Ds   jun05 335:41 postgres: stats collector process
postgres  3295  0.0  0.0 262220  1460 ?    Ss   jun05   0:11 postgres: bgworker: logical replication launcher
postgres  3393  0.0  0.2 265792 40452 ?    Ss   jun05  15:36 postgres: engine engine ::1(51664) idle
postgres  6105  0.3  0.0 271532 15976 ?    Ds   13:01   0:00 postgres: autovacuum worker process   ovirt_engine_history
postgres  6216  0.2  0.0 263864 11440 ?    Ss   13:02   0:00 postgres: autovacuum worker process   engine
postgres  6245  0.0  0.0 262888  6212 ?    Ss   13:02   0:00 postgres: engine engine 127.0.0.1(42400) idle
postgres  6246  0.0  0.0 262844  3256 ?    Ss   13:02   0:00 postgres: autovacuum worker process   template1
postgres 18815  0.0  0.0 262912  5852 ?    Ss   nov01   0:00 postgres: django django 127.0.0.1(59564) idle
postgres 23148  0.0  0.2 266052 43024 ?    Ss   oct28   9:01 postgres: engine engine 127.0.0.1(59714) idle
postgres 23149  0.0  0.0 262980  6820 ?    Ss   oct28   0:00 postgres: engine engine 127.0.0.1(59716) idle
postgres 28784  0.0  0.0 262816  3492 ?    Ss   12:02   0:00 postgres: ovirt_engine_history ovirt_engine_history 127.0.0.1(39470) idle
postgres 28785  0.0  0.0 262816  

[ovirt-users] Re: Cannot copy or move disks

2020-11-17 Thread suporte
https://bugzilla.redhat.com/show_bug.cgi?id=1898207 

Regards 

José 


From: "Vinayakswami Hariharmath"  
To: "suporte"  
Cc: "Sahina Bose" , "Nir Soffer" , 
"Gobinda Das" , "Ritesh Chikatwar" , 
"users" , "Krutika Dhananjay" , "Nisan, 
Tal"  
Sent: Tuesday, 17 November 2020 10:44:55 
Subject: Re: [ovirt-users] Cannot copy or move disks 

Can you please provide me the link to the ticket? 
Regards 
Vh 

On Tue, Nov 17, 2020 at 3:32 PM < supo...@logicworks.pt > wrote: 



I have already opened a ticket. 

Regards 

José 


From: "Vinayakswami Hariharmath" < vhari...@redhat.com > 
To: "Sahina Bose" < sab...@redhat.com > 
Cc: "Nir Soffer" < nsof...@redhat.com >, "Gobinda Das" < go...@redhat.com >, 
"Ritesh Chikatwar" < rchik...@redhat.com >, "suporte" < supo...@logicworks.pt 
>, "users" < users@ovirt.org >, "Krutika Dhananjay" < kdhan...@redhat.com >, 
"Nisan, Tal" < tni...@redhat.com > 
Sent: Tuesday, 17 November 2020 7:29:17 
Subject: Re: [ovirt-users] Cannot copy or move disks 

After a quick glance, we need some more information. 
I think it is better to open a ticket and provide the below information to 
verify the gluster issue 

1. gluster volume info 
2. gluster logs when you observe this issue. 
3. Explain the scenario in the ticket when this happened. 

Regards 
Vh 

On Tue, Nov 17, 2020 at 12:31 PM Vinayakswami Hariharmath < vhari...@redhat.com 
> wrote: 


Sure. I will check this up soon. 
Regards 
Vh 

On Tue, Nov 17, 2020 at 12:18 PM Sahina Bose < sab...@redhat.com > wrote: 


+Gobinda Das 
+Ritesh Chikatwar +Vinayakswami Hariharmath 
Vinayak, could you look at this? 


On Mon, Nov 16, 2020 at 3:10 PM Nir Soffer < nsof...@redhat.com > wrote: 

On Sun, Nov 15, 2020 at 10:27 PM < supo...@logicworks.pt > wrote: 
> 
> So, you think it's really a bug? 

I'm pretty sure this is a bug on gluster side. 

> 
>  
> From: "Nir Soffer" < nsof...@redhat.com > 
> To: supo...@logicworks.pt 
> Cc: "users" < users@ovirt.org >, "Sahina Bose" < sab...@redhat.com >, 
> "Krutika Dhananjay" < kdhan...@redhat.com >, "Nisan, Tal" < tni...@redhat.com 
> > 
> Sent: Sunday, 15 November 2020 15:03:21 
> Subject: Re: [ovirt-users] Cannot copy or move disks 
> 
> On Sat, Nov 14, 2020 at 4:45 PM < supo...@logicworks.pt > wrote: 
> > 
> > Hello, 
> > 
> > I just update to Version 4.4.3.11-1.el8. Engine and host 
> > 
> > and now I cannot copy or move disks. 
> > 
> > Storage domains are glusterfs 
> > 
> > # gluster --version 
> > glusterfs 7.8 
> > 
> > Here is what I found on vdsm.log 
> > 
> > 2020-11-14 14:08:16,917+ INFO (tasks/5) [storage.SANLock] Releasing 
> > Lease(name='01178644-2ad6-4d37-8657-f33f547bee6b', 
> > path='/rhev/data-center/mnt/glusterSD/node1-teste.aclou 
> > d.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b.lease',
> >  offset=0) (clusterlock:530) 
> > 2020-11-14 14:08:17,015+ INFO (tasks/5) [storage.SANLock] Successfully 
> > released Lease(name='01178644-2ad6-4d37-8657-f33f547bee6b', 
> > path='/rhev/data-center/mnt/glusterSD/node1 
> > -teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b.lease',
> >  offset=0) (clusterlock:540) 
> > 2020-11-14 14:08:17,016+ ERROR (tasks/5) [root] Job 
> > '8cd732fc-d69b-4c32-8b35-e4a8e47396fb' failed (jobs:223) 
> > Traceback (most recent call last): 
> > File "/usr/lib/python3.6/site-packages/vdsm/jobs.py", line 159, in run 
> > self._run() 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/sdm/api/copy_data.py", 
> > line 110, in _run 
> > self._operation.run() 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/qemuimg.py", line 374, 
> > in run 
> > for data in self._operation.watch(): 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/operation.py", line 
> > 106, in watch 
> > self._finalize(b"", err) 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/operation.py", line 
> > 179, in _finalize 
> > raise cmdutils.Error(self._cmd, rc, out, err) 
> > vdsm.common.cmdutils.Error: Command ['/usr/bin/qemu-img', 'convert', '-p', 
> > '-t', 'none', '-T', 'none', '-f', 'raw', '-O', 'raw', 
> > '/rhev/data-center/mnt/glusterSD/node1-teste.aclou 
> > d.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/789f6e50-b954-4dda-a6d5-077fdfb357d2/d95a3e83-74d2-40a6-9f8f-e6ae68794051',
> >  '/rhev/data-center/mnt/glusterSD/ node1-teste.ac 
> > loud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b']
> >  failed with rc=1 out=b'' err=bytearray(b'qem 
> > u-img: error while reading sector 260177858: No such file or directory\n') 
> 
> This is an impossible error for read(), preadv() etc. 
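
The reasoning behind that remark can be illustrated with a small Python sketch: "No such file or directory" (ENOENT) is normally a path-lookup error, so it is expected from open(), not from reads on a descriptor that is already open. On a local POSIX filesystem, a read keeps working even after the file name has been removed. The helper name is made up for the example, and the sketch says nothing about how a Gluster mount behaves.

```python
import os
import tempfile


def read_after_unlink(payload=b"oVirt"):
    """Write payload to a temp file, remove its name, then read it back.

    Demonstrates that reading through an open descriptor does not depend
    on the path still existing (local POSIX filesystem assumed).
    """
    fd, path = tempfile.mkstemp()
    try:
        os.write(fd, payload)
        os.unlink(path)  # the name is gone, the open descriptor is not
        return os.pread(fd, len(payload), 0)  # read from offset 0
    finally:
        os.close(fd)


print(read_after_unlink())
```

Seeing ENOENT from a read therefore points at the layer below qemu-img, which is consistent with suspecting the storage side.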
> 
> > 2020-11-14 14:08:17,017+ INFO (tasks/5) [root] Job 
> > '8cd732fc-d69b-4c32-8b35-e4a8e47396fb' will be deleted in 3600 

[ovirt-users] Re: Cannot copy or move disks

2020-11-17 Thread Vinayakswami Hariharmath
Can you please provide me the link to the ticket?

Regards
Vh

On Tue, Nov 17, 2020 at 3:32 PM  wrote:

> I have already opened a ticket.
>
> Regards
>
> José
>
> --
> *From: *"Vinayakswami Hariharmath" 
> *To: *"Sahina Bose" 
> *Cc: *"Nir Soffer" , "Gobinda Das" ,
> "Ritesh Chikatwar" , "suporte" ,
> "users" , "Krutika Dhananjay" ,
> "Nisan, Tal" 
> *Sent: *Tuesday, 17 November 2020 7:29:17
> *Subject: *Re: [ovirt-users] Cannot copy or move disks
>
> After a quick glance, we need some more information.
> I think it is better to open a ticket and provide the below information to
> verify the gluster issue
>
> 1. gluster volume info
> 2. gluster logs when you observe this issue.
> 3. Explain the scenario in the ticket when this happened.
>
> Regards
> Vh
>
> On Tue, Nov 17, 2020 at 12:31 PM Vinayakswami Hariharmath <
> vhari...@redhat.com> wrote:
>
>> Sure. I will check this up soon.
>> Regards
>> Vh
>>
>> On Tue, Nov 17, 2020 at 12:18 PM Sahina Bose  wrote:
>>
>>> +Gobinda Das 
>>>  +Ritesh Chikatwar  +Vinayakswami Hariharmath
>>> 
>>> Vinayak, could you look at this?
>>>
>>>
>>> On Mon, Nov 16, 2020 at 3:10 PM Nir Soffer  wrote:
>>>
 On Sun, Nov 15, 2020 at 10:27 PM  wrote:
 >
 > So, you think it's really a bug?

 I'm pretty sure this is a bug on gluster side.

 >
 > 
 > From: "Nir Soffer" 
 > To: supo...@logicworks.pt
 > Cc: "users" , "Sahina Bose" ,
 "Krutika Dhananjay" , "Nisan, Tal" <
 tni...@redhat.com>
 > Sent: Sunday, 15 November 2020 15:03:21
 > Subject: Re: [ovirt-users] Cannot copy or move disks
 >
 > On Sat, Nov 14, 2020 at 4:45 PM  wrote:
 > >
 > > Hello,
 > >
 > > I just updated the engine and host to version 4.4.3.11-1.el8,
 > > and now I cannot copy or move disks.
 > >
 > > Storage domains are glusterfs
 > >
 > > # gluster --version
 > > glusterfs 7.8
 > >
 > > Here is what I found on vdsm.log
 > >
 > > 2020-11-14 14:08:16,917+ INFO  (tasks/5) [storage.SANLock]
 > > Releasing Lease(name='01178644-2ad6-4d37-8657-f33f547bee6b',
 > > path='/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b.lease',
 > > offset=0) (clusterlock:530)
 > > 2020-11-14 14:08:17,015+ INFO  (tasks/5) [storage.SANLock]
 > > Successfully released Lease(name='01178644-2ad6-4d37-8657-f33f547bee6b',
 > > path='/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b.lease',
 > > offset=0) (clusterlock:540)
 > > 2020-11-14 14:08:17,016+ ERROR (tasks/5) [root] Job
 '8cd732fc-d69b-4c32-8b35-e4a8e47396fb' failed (jobs:223)
 > > Traceback (most recent call last):
 > >  File "/usr/lib/python3.6/site-packages/vdsm/jobs.py", line 159, in
 run
 > >self._run()
 > >  File
 "/usr/lib/python3.6/site-packages/vdsm/storage/sdm/api/copy_data.py", line
 110, in _run
 > >self._operation.run()
 > >  File "/usr/lib/python3.6/site-packages/vdsm/storage/qemuimg.py",
 line 374, in run
 > >for data in self._operation.watch():
 > >  File "/usr/lib/python3.6/site-packages/vdsm/storage/operation.py",
 line 106, in watch
 > >self._finalize(b"", err)
 > >  File "/usr/lib/python3.6/site-packages/vdsm/storage/operation.py",
 line 179, in _finalize
 > >raise cmdutils.Error(self._cmd, rc, out, err)
 > > vdsm.common.cmdutils.Error: Command ['/usr/bin/qemu-img',
 > > 'convert', '-p', '-t', 'none', '-T', 'none', '-f', 'raw', '-O', 'raw',
 > > '/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/789f6e50-b954-4dda-a6d5-077fdfb357d2/d95a3e83-74d2-40a6-9f8f-e6ae68794051',
 > > '/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b']
 > > failed with rc=1 out=b'' err=bytearray(b'qemu-img: error while reading
 > > sector 260177858: No such file or directory\n')
 >
 > This is an impossible error for read(), preadv() etc.
 >
 > > 2020-11-14 14:08:17,017+ INFO  (tasks/5) [root] Job
 '8cd732fc-d69b-4c32-8b35-e4a8e47396fb' will be deleted in 3600 seconds
 (jobs:251)
 > > 2020-11-14 14:08:17,017+ INFO  (tasks/5)
 [storage.ThreadPool.WorkerThread] FINISH task
 6cb1d496-d1ca-40b5-a488-a72982738bab (threadPool:151)
 > > 2020-11-14 14:08:17,316+ INFO  (jsonrpc/2) [api.host] START
 getJobs(job_type='storage',
 job_ids=['8cd732fc-d69b-4c32-8b35-e4a8e47396fb'])
 from=:::192.168.5.165,36616, flow
 > > 

[ovirt-users] Re: VM with illegal snapshots

2020-11-17 Thread Florian Schmid via Users


Hi,

I also sometimes have issues with snapshots or disk moves (which also involve
snapshots) when the engine process has been running for a long time.

After restarting the engine service, everything works again for several weeks,
until the process has been up for too long and the issues start to happen again.

I have never had the time to dig deeper into this, and I am never on a version
that is still supported, but it would be nice to hear whether you have the same
issue.

How long had your engine process been running when the issues started?

BR Florian


From: "Magnus Isaksson"  
To: "Alex K" , "Giorgio Biacchi"  
CC: "users"  
Sent: Tuesday, 17 November 2020 10:37:24 
Subject: [ovirt-users] Re: VM with illegal snapshots 

Hello 

We have the same issue, and not with just one VM: about 10-15 VMs have either
illegal snapshots or snapshots that cannot be removed.
This has been an issue for quite some time. Since the beginning of 4.3 we have
been getting "vm with illegal snapshots" almost every week. Most often we can
remove the snapshot and all is OK, but now this has escalated considerably and
leaves us with customer VMs that we may not be able to reboot or whose disks
get corrupted, which has happened a few times.
This is the VM with the most snapshots; we are unable to remove any of them.

Engine is on 4.3.10 and hosts on 4.3.9

How can I safely remove these snapshots?
(I am not very comfortable with database "hacking", so a detailed description
would be much appreciated.)

And how can we prevent this from happening?

Cheers 
Magnus 

From: Alex K  
Sent: 10 November 2020 05:05 
To: Giorgio Biacchi  
Cc: users  
Subject: [ovirt-users] Re: VM with illegal snapshots 


On Fri, Oct 9, 2020, 12:59 Giorgio Biacchi <gior...@di.unimi.it> wrote:


Hi, 
due to a bug in our Ovirt integrated backup system now we have some VMs 
with snapshots in illegal state. 

It seems that there's an inconsistency between the db and the real 
status of images on disk. 

Let me show an example: 

engine=# select image_guid,parentid,imagestatus,vm_snapshot_id,volume_type,volume_format,active
from images where image_group_id='e34f77cb-54d5-40d0-b539-e0a5fd512d2d';
              image_guid              |               parentid               | imagestatus |            vm_snapshot_id            | volume_type | volume_format | active
--------------------------------------+--------------------------------------+-------------+--------------------------------------+-------------+---------------+--------
 a107b6c4-842e-4b40-9215-c965431a0c0f | ----                                 |           4 | d19d6ca3-1989-4c67-8ee7-c0c43b3e6d74 |           2 |             4 | f
 a4c86a68-9123-454c-b417-1b15038a4bf2 | a107b6c4-842e-4b40-9215-c965431a0c0f |           1 | e7a405ee-8fd4-4733-ae9c-5252bf07c9d3 |           2 |             4 | f
 f6a61f2e-26bd-4b63-97c6-d66913ce48c5 | a4c86a68-9123-454c-b417-1b15038a4bf2 |           1 | 9d0958b9-4995-4e11-a027-a32d4bac52e4 |           2 |             4 | t
(3 rows)


[root@host02 ~]# lvs -o+lv_tags |grep e34f77cb-54d5-40d0-b539-e0a5fd512d2d
  a107b6c4-842e-4b40-9215-c965431a0c0f 459011cf-ebb6-46ff-831d-8ccfafd82c8a -wi--- 149.50g IU_e34f77cb-54d5-40d0-b539-e0a5fd512d2d,MD_68,PU_----
  f6a61f2e-26bd-4b63-97c6-d66913ce48c5 459011cf-ebb6-46ff-831d-8ccfafd82c8a -wi---  10.00g IU_e34f77cb-54d5-40d0-b539-e0a5fd512d2d,MD_348,PU_a107b6c4-842e-4b40-9215-c965431a0c0f

So image GUID a4c86a68-9123-454c-b417-1b15038a4bf2 is not present on disk. I
think the image was correctly merged but not removed from the database.

Any suggestions on how to fix the database to reflect the real situation on
disk?



In those cases I delete the entry from the engine DB so that it reflects the
status of the image chain.

BQ_BEGIN

TIA 
-- 
gb 

PGP Key: http://pgp.mit.edu/
Primary key fingerprint: C510 0765 943E EBED A4F2 69D3 16CC DC90 B9CB 0F34
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
List Archives:
https://lists.ovirt.org/archives/list/users@ovirt.org/message/OF4NTAC6BPGRP4YJZRWBXQCNBWLERL72/

BQ_END


___ 
Users mailing list -- users@ovirt.org 
To unsubscribe send an email to users-le...@ovirt.org 
Privacy Statement: https://www.ovirt.org/privacy-policy.html 
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/ 
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/FRAXF7POUODBM4AJ5BFC3TEK4Z323QEG/
 

[ovirt-users] Re: Cannot copy or move disks

2020-11-17 Thread suporte
I have already opened a ticket.

Regards 

José 


From: "Vinayakswami Hariharmath"  
To: "Sahina Bose"  
Cc: "Nir Soffer" , "Gobinda Das" , 
"Ritesh Chikatwar" , "suporte" , 
"users" , "Krutika Dhananjay" , "Nisan, 
Tal"  
Sent: Tuesday, 17 November 2020 7:29:17 
Subject: Re: [ovirt-users] Cannot copy or move disks 

After a quick glance, we need some more information.
I think it is better to open a ticket and provide the information below so
that we can verify the gluster issue:

1. Output of gluster volume info.
2. Gluster logs from the time you observe this issue.
3. A description of the scenario in which this happened.

Regards 
Vh 

On Tue, Nov 17, 2020 at 12:31 PM Vinayakswami Hariharmath < vhari...@redhat.com 
> wrote: 



Sure. I will look into this soon.
Regards 
Vh 

On Tue, Nov 17, 2020 at 12:18 PM Sahina Bose < sab...@redhat.com > wrote: 

BQ_BEGIN

+Gobinda Das 
+Ritesh Chikatwar +Vinayakswami Hariharmath 
Vinayak, could you look at this? 


On Mon, Nov 16, 2020 at 3:10 PM Nir Soffer < nsof...@redhat.com > wrote: 

BQ_BEGIN
On Sun, Nov 15, 2020 at 10:27 PM < supo...@logicworks.pt > wrote: 
> 
> So, you think it's really a bug? 

I'm pretty sure this is a bug on gluster side. 

> 
>  
> From: "Nir Soffer" < nsof...@redhat.com > 
> To: supo...@logicworks.pt 
> Cc: "users" < users@ovirt.org >, "Sahina Bose" < sab...@redhat.com >, 
> "Krutika Dhananjay" < kdhan...@redhat.com >, "Nisan, Tal" < tni...@redhat.com 
> > 
> Sent: Sunday, 15 November 2020 15:03:21 
> Subject: Re: [ovirt-users] Cannot copy or move disks 
> 
> On Sat, Nov 14, 2020 at 4:45 PM < supo...@logicworks.pt > wrote: 
> > 
> > Hello, 
> > 
> > I just updated the engine and host to version 4.4.3.11-1.el8,
> > and now I cannot copy or move disks.
> > 
> > Storage domains are glusterfs 
> > 
> > # gluster --version 
> > glusterfs 7.8 
> > 
> > Here is what I found on vdsm.log 
> > 
> > 2020-11-14 14:08:16,917+ INFO (tasks/5) [storage.SANLock] Releasing
> > Lease(name='01178644-2ad6-4d37-8657-f33f547bee6b',
> > path='/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b.lease',
> > offset=0) (clusterlock:530)
> > 2020-11-14 14:08:17,015+ INFO (tasks/5) [storage.SANLock] Successfully
> > released Lease(name='01178644-2ad6-4d37-8657-f33f547bee6b',
> > path='/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b.lease',
> > offset=0) (clusterlock:540)
> > 2020-11-14 14:08:17,016+ ERROR (tasks/5) [root] Job 
> > '8cd732fc-d69b-4c32-8b35-e4a8e47396fb' failed (jobs:223) 
> > Traceback (most recent call last): 
> > File "/usr/lib/python3.6/site-packages/vdsm/jobs.py", line 159, in run 
> > self._run() 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/sdm/api/copy_data.py", 
> > line 110, in _run 
> > self._operation.run() 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/qemuimg.py", line 374, 
> > in run 
> > for data in self._operation.watch(): 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/operation.py", line 
> > 106, in watch 
> > self._finalize(b"", err) 
> > File "/usr/lib/python3.6/site-packages/vdsm/storage/operation.py", line 
> > 179, in _finalize 
> > raise cmdutils.Error(self._cmd, rc, out, err) 
> > vdsm.common.cmdutils.Error: Command ['/usr/bin/qemu-img', 'convert', '-p',
> > '-t', 'none', '-T', 'none', '-f', 'raw', '-O', 'raw',
> > '/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/789f6e50-b954-4dda-a6d5-077fdfb357d2/d95a3e83-74d2-40a6-9f8f-e6ae68794051',
> > '/rhev/data-center/mnt/glusterSD/node1-teste.acloud.pt:_data2/83f8bbfd-cfa3-46d9-a823-c36054826d13/images/97977cbf-eecc-4476-a11f-7798425d40c4/01178644-2ad6-4d37-8657-f33f547bee6b']
> > failed with rc=1 out=b'' err=bytearray(b'qemu-img: error while reading
> > sector 260177858: No such file or directory\n')
> 
> This is an impossible error for read(), preadv() etc. 
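
For readers following along, here is a minimal Python sketch (not vdsm or
qemu code) of why this errno is surprising. ENOENT is an error one expects
from open(); a pread() on an already open descriptor normally returns data
or, past the end of the file, an empty result. The helper names are
hypothetical, the 512-byte sector size is an assumption about how qemu-img
counts sectors in this message, and os.pread() is only available on Unix.

```python
import errno
import os
import tempfile

SECTOR_SIZE = 512  # assumption: qemu-img reports 512-byte sectors here


def sector_to_offset(sector, sector_size=SECTOR_SIZE):
    """Convert a sector number from the log into a byte offset."""
    return sector * sector_size


def read_errno(path, offset, length=4096):
    """Open path and pread() at offset.

    Returns (stage, errno) for the first call that fails, or (None, 0)
    when both open() and pread() succeed.
    """
    try:
        fd = os.open(path, os.O_RDONLY)
    except OSError as e:
        return ("open", e.errno)
    try:
        os.pread(fd, length, offset)
    except OSError as e:
        return ("pread", e.errno)
    finally:
        os.close(fd)
    return (None, 0)


def demo():
    """Exercise both cases on a local temporary directory."""
    with tempfile.TemporaryDirectory() as tmp:
        missing = os.path.join(tmp, "missing.img")
        present = os.path.join(tmp, "present.img")
        with open(present, "wb") as f:
            f.write(b"x" * 1024)
        return (
            # A missing file fails at open() with ENOENT.
            read_errno(missing, 0),
            # Reading far past EOF of an existing file is not an error.
            read_errno(present, sector_to_offset(260177858)),
        )


if __name__ == "__main__":
    print(sector_to_offset(260177858))
    print(demo(), errno.ENOENT)
```

Under the 512-byte assumption, sector 260177858 is byte offset 133211063296,
roughly 124 GiB into the source image. On a local file system the sketch
only ever sees ENOENT from the open() stage, which is why the same errno
coming back from a read on the gluster mount points at the storage layer.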
> 
> > 2020-11-14 14:08:17,017+ INFO (tasks/5) [root] Job 
> > '8cd732fc-d69b-4c32-8b35-e4a8e47396fb' will be deleted in 3600 seconds 
> > (jobs:251) 
> > 2020-11-14 14:08:17,017+ INFO (tasks/5) 
> > [storage.ThreadPool.WorkerThread] FINISH task 
> > 6cb1d496-d1ca-40b5-a488-a72982738bab (threadPool:151) 
> > 2020-11-14 14:08:17,316+ INFO (jsonrpc/2) [api.host] START 
> > getJobs(job_type='storage', 
> > job_ids=['8cd732fc-d69b-4c32-8b35-e4a8e47396fb']) 
> > from=:::192.168.5.165,36616, flow 
> > _id=49320e0a-14fb-4cbb-bdfd-b2546c260bf7 (api:48) 
> 
> This was reported here a long time ago with various versions of gluster. 
> I don't think we got any response from gluster folks about it yet. 
> 
> Can you file an oVirt bug about this? 
> 
> Nir 






BQ_END


BQ_END



[ovirt-users] Re: VM with illegal snapshots

2020-11-17 Thread Magnus Isaksson
Hello

We have the same issue, and not with just one VM: about 10-15 VMs have either
illegal snapshots or snapshots that cannot be removed.
This has been an issue for quite some time. Since the beginning of 4.3 we have
been getting "vm with illegal snapshots" almost every week. Most often we can
remove the snapshot and all is OK, but now this has escalated considerably and
leaves us with customer VMs that we may not be able to reboot or whose disks
get corrupted, which has happened a few times.
This is the VM with the most snapshots; we are unable to remove any of them.

Engine is on 4.3.10 and hosts on 4.3.9

How can I safely remove these snapshots?
(I am not very comfortable with database "hacking", so a detailed description
would be much appreciated.)

And how can we prevent this from happening?

Cheers
 Magnus

From: Alex K 
Sent: 10 November 2020 05:05
To: Giorgio Biacchi 
Cc: users 
Subject: [ovirt-users] Re: VM with illegal snapshots



On Fri, Oct 9, 2020, 12:59 Giorgio Biacchi <gior...@di.unimi.it> wrote:
Hi,
due to a bug in our Ovirt integrated backup system now we have some VMs
with snapshots in illegal state.

It seems that there's an inconsistency between the db and the real
status of images on disk.

Let me show an example:

engine=# select image_guid,parentid,imagestatus,vm_snapshot_id,volume_type,volume_format,active
from images where image_group_id='e34f77cb-54d5-40d0-b539-e0a5fd512d2d';
              image_guid              |               parentid               | imagestatus |            vm_snapshot_id            | volume_type | volume_format | active
--------------------------------------+--------------------------------------+-------------+--------------------------------------+-------------+---------------+--------
 a107b6c4-842e-4b40-9215-c965431a0c0f | ----                                 |           4 | d19d6ca3-1989-4c67-8ee7-c0c43b3e6d74 |           2 |             4 | f
 a4c86a68-9123-454c-b417-1b15038a4bf2 | a107b6c4-842e-4b40-9215-c965431a0c0f |           1 | e7a405ee-8fd4-4733-ae9c-5252bf07c9d3 |           2 |             4 | f
 f6a61f2e-26bd-4b63-97c6-d66913ce48c5 | a4c86a68-9123-454c-b417-1b15038a4bf2 |           1 | 9d0958b9-4995-4e11-a027-a32d4bac52e4 |           2 |             4 | t
(3 rows)


[root@host02 ~]# lvs -o+lv_tags |grep e34f77cb-54d5-40d0-b539-e0a5fd512d2d
  a107b6c4-842e-4b40-9215-c965431a0c0f 459011cf-ebb6-46ff-831d-8ccfafd82c8a -wi--- 149.50g IU_e34f77cb-54d5-40d0-b539-e0a5fd512d2d,MD_68,PU_----
  f6a61f2e-26bd-4b63-97c6-d66913ce48c5 459011cf-ebb6-46ff-831d-8ccfafd82c8a -wi---  10.00g IU_e34f77cb-54d5-40d0-b539-e0a5fd512d2d,MD_348,PU_a107b6c4-842e-4b40-9215-c965431a0c0f

So image GUID a4c86a68-9123-454c-b417-1b15038a4bf2 is not present on disk. I
think the image was correctly merged but not removed from the database.
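
The comparison above can be sketched in Python. This is only an
illustration built from the rows quoted in this message, not an oVirt tool:
the function names are hypothetical, and reading the PU_ tag as the parent
volume is an assumption based on the lvs output shown here. It only reports
differences; it does not touch the database or the storage.

```python
# Rows copied from the engine query above: (image_guid, parentid).
# "----" is the parent value exactly as it appears in this archive.
DB_ROWS = [
    ("a107b6c4-842e-4b40-9215-c965431a0c0f", "----"),
    ("a4c86a68-9123-454c-b417-1b15038a4bf2",
     "a107b6c4-842e-4b40-9215-c965431a0c0f"),
    ("f6a61f2e-26bd-4b63-97c6-d66913ce48c5",
     "a4c86a68-9123-454c-b417-1b15038a4bf2"),
]

# Rows copied from the lvs output above: (lv_name, lv_tags).
LV_ROWS = [
    ("a107b6c4-842e-4b40-9215-c965431a0c0f",
     "IU_e34f77cb-54d5-40d0-b539-e0a5fd512d2d,MD_68,PU_----"),
    ("f6a61f2e-26bd-4b63-97c6-d66913ce48c5",
     "IU_e34f77cb-54d5-40d0-b539-e0a5fd512d2d,MD_348,"
     "PU_a107b6c4-842e-4b40-9215-c965431a0c0f"),
]


def lv_parents(lv_rows):
    """Map each LV name to the parent named in its PU_ tag (assumed meaning)."""
    parents = {}
    for name, tags in lv_rows:
        for tag in tags.split(","):
            if tag.startswith("PU_"):
                parents[name] = tag[len("PU_"):]
    return parents


def compare_chains(db_rows, lv_rows):
    """Report images that differ between the engine DB and the storage."""
    db = dict(db_rows)
    disk = lv_parents(lv_rows)
    return {
        "missing_on_disk": sorted(set(db) - set(disk)),
        "missing_in_db": sorted(set(disk) - set(db)),
        # guid -> (parent according to DB, parent according to LV tag)
        "parent_mismatch": {
            guid: (db[guid], disk[guid])
            for guid in sorted(set(db) & set(disk))
            if db[guid] != disk[guid]
        },
    }


if __name__ == "__main__":
    for key, value in compare_chains(DB_ROWS, LV_ROWS).items():
        print(key, value)
```

With the rows from this thread, the sketch reports
a4c86a68-9123-454c-b417-1b15038a4bf2 as present only in the database. It
also shows that the PU_ tag of f6a61f2e-26bd-4b63-97c6-d66913ce48c5 already
points at a107b6c4-842e-4b40-9215-c965431a0c0f, which is consistent with a
merge that finished on storage but was not recorded by the engine.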

Any suggestions on how to fix the database to reflect the real situation on
disk?

In those cases I delete the entry from the engine DB so that it reflects the
status of the image chain.

TIA
--
gb

PGP Key: http://pgp.mit.edu/
Primary key fingerprint: C510 0765 943E EBED A4F2 69D3 16CC DC90 B9CB 0F34
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/FRAXF7POUODBM4AJ5BFC3TEK4Z323QEG/


[ovirt-users] How to create a backup in event of hardware failure of a single hosted engine?

2020-11-17 Thread ernestclydeachua
Hello everyone,
We currently have a single bare-metal server that hosts oVirt and GlusterFS,
which will later be converted to a 3-node HCI setup. Another bare-metal server
is arriving this week, and I was planning to set it up initially as a backup.

Is it possible to deploy a new hosted engine, then create a Gluster volume for
the first one as a backup domain and attach it to the new one if a hardware
failure occurs? Or is there another kind of setup that can be done?
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/EYOW4WNUACRCTWLG2IRXEGISYFKBWAU7/


[ovirt-users] Re: Deploy oVirt Engine fail behind proxy

2020-11-17 Thread Yedidyah Bar David
On Thu, Nov 12, 2020 at 9:32 AM Matteo Bonardi  wrote:
>
>
>
> On Wed, Nov 11, 2020 at 11:01 AM Yedidyah Bar David wrote:
>>
>> On Wed, Nov 11, 2020 at 11:49 AM Matteo Bonardi  wrote:
>> >
>> > Hi Didi,
>> >
>> > > On Tue, Nov 10, 2020 at 7:18 PM Strahil Nikolov via Users
>> > > > > >
>> > > I already suggested to open a bug for this earlier in this thread.
>> >
>> > I will open a bug for that.
>>
>> Thanks.
>>
>> >
>> > > I am not sure. I talked with Sandro about this, and we agreed that a
>> > > common enterprise requirement is being able to use a local _mirror_
>> > > (e.g. Foreman), mainly for disconnected use (without an Internet
>> > > connection). A proxy is technically quite different, even if in
>> > > principle can be used for the same need.
>> > >
>> >
>> > I can understand your point, but since the deploy script already checks 
>> > that the proxy is set, I don't see why it can't pass this information to 
>> > the engine.
>>
>> Sorry, where?
>
>
> Honestly, I didn't read the code, but the log contains more than one
> reference to the proxy.
> For example:
>
> 2020-11-10 18:15:48,518+0100 WARNING 
> otopi.plugins.gr_he_common.network.gateway gateway._setup:88 It seems that 
> this host is configured to use a proxy, please ensure that this host will be 
> able to reach the engine VM trough that proxy or add a specific exception.

You are right. This was added for:

https://bugzilla.redhat.com/show_bug.cgi?id=1387146

The request there wasn't for full proxy support, just a warning.

> 2020-11-10 18:15:49,637+0100 DEBUG 
> otopi.ovirt_hosted_engine_setup.ansible_utils ansible_utils.run:197 
> ansible-playbook: env: {'LS_COLORS': [..], 'USER': 'root', 'PWD': '', 'HOME': 
> '', 'SSH_CLIENT': '', 'https_proxy': 'http://10.248.2.50:3128/', 
> 'http_proxy': 'http://10.248.2.50:3128/', 'no_proxy': 
> '.', 'SSH_TTY': '/dev/pts/0', 'MAIL': 
> '/var/spool/mail/root', 'SHELL': '/bin/bash', 'TERM': 'xterm', 'SHLVL': '1', 
> 'PYTHONPATH': '/usr/share/ovirt-hosted-engine-setup/scripts/..:',[..], 
> 'HE_ANSIBLE_LOG_PATH': 
> '/var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-ansible-get_network_interfaces-20201110181549-tppuv6.log'}

This just logs all env vars when running ansible.

> 2020-11-10 18:17:11,505+0100 INFO 
> otopi.ovirt_hosted_engine_setup.ansible_utils 
> ansible_utils._process_output:111 TASK [ovirt.ovirt.hosted_engine_setup : 
> Check http/https proxy]

Similarly. Was added for:

https://bugzilla.redhat.com/1588720

And to further clarify. Adding full proxy support requires:

1. Some design and development work
2. Routinely testing deployment with a proxy
3. Fix new bugs as they are found

It's not clear this is worth it.

That said, patches are welcome :-). To consider including it, it
should obviously be written so that it also seamlessly works without a
proxy.
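
To illustrate what "seamlessly works without a proxy" could mean in
practice, here is a minimal Python sketch. It is not oVirt or
ovirt-hosted-engine-setup code; the function name and the list of variables
are assumptions made for this example. The idea is to forward only the proxy
variables that are actually set, so that an unset proxy yields an empty
mapping and nothing changes for hosts without one.

```python
import os

# Assumed set of variables worth forwarding; the first three are the ones
# used in the workaround quoted in this thread.
PROXY_VARS = ("http_proxy", "https_proxy", "ftp_proxy", "no_proxy")


def proxy_environment(environ=None):
    """Return only the proxy variables that are set and non-empty.

    With no proxy configured the result is an empty dict, so passing it
    on as an extra environment is a no-op.
    """
    source = os.environ if environ is None else environ
    return {name: source[name] for name in PROXY_VARS if source.get(name)}


if __name__ == "__main__":
    # Values taken from the environment dump in the log above.
    with_proxy = {
        "http_proxy": "http://10.248.2.50:3128/",
        "https_proxy": "http://10.248.2.50:3128/",
        "SHELL": "/bin/bash",
    }
    print(proxy_environment(with_proxy))
    print(proxy_environment({"SHELL": "/bin/bash"}))
```

A deployment step could then merge this mapping into the environment of the
package installation task, instead of hard-coding proxy URLs in the role.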

Thanks and best regards,

>
> I'm not sure at all, but the log makes me think the proxy is being checked 
> more than once.
> Of course I can be wrong.
>
>>
>> Also:
>>
>> On Wed, Nov 11, 2020 at 11:39 AM Matteo Bonardi  wrote:
>> >
>> > Thanks to all your suggestions, I finally completed the deployment
>> > successfully.
>> > The solution was to edit the file
>> > /usr/share/ansible/roles/ovirt.engine-setup/tasks/install_packages.yml
>> > and add the proxy variables:
>>
>> Which version?
>>
>> Perhaps you had 4.4.2 and then updated to 4.4.3 before retrying, or
>> something similar?
>
>
> Before retrying the deployment, I completely cleaned up the installation with:
>
> [root@ ~]# ovirt-hosted-engine-cleanup
> [root@ ~]# yum remove ovirt-*
>
> And then reinstalled everything:
>
> [root@ ~]# yum install 
> https://resources.ovirt.org/pub/yum-repo/ovirt-release44.rpm
> [root@ ~]# yum install ovirt-engine-appliance
> [root@ ~]# yum install ovirt-hosted-engine-setup
>
> I am a bit confused about how to check the version (it is embarrassing, but I
> am new to oVirt and Ansible).
> There is a mismatch between the versions on the deployment server and in the
> engine VM.
> I have attached the RPM versions of both.
> The manager web console shows 4.4.2 as the version.
>
>>
>> >
>> > - name: Install oVirt Engine package
>> >   package:
>> >     name: "ovirt-engine"
>> >     state: present
>> >   environment:
>> >     https_proxy: "http://:"
>> >     http_proxy: "http://:"
>> >     ftp_proxy: "http://:"
>> >   when: ovirt_engine_setup_product_type | lower == 'ovirt'
>> > [..]
>> >
>> > One suggestion for anyone who runs into the same problem:
>> > This file is created by the first failed run of the deployment.
>> > Until the deployment has been run, the file does not exist.
>> > So you have to run the deployment once, let it fail, and then edit the file.
>> > I know it seems like a stupid suggestion, but I think it could help someone.
>>
>> It's not stupid! :-). I think you unknowingly ran into something
>> caused by the move to ovirt-ansible-roles in 4.4.3 [1].
>>
>> Until 4.4.2, this file was in the package ovirt-ansible-engine-setup , in:
>>
>>