[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-08-15 Thread Jiří Sléžka

Hi,

finally this post helped

https://lists.ovirt.org/archives/list/users@ovirt.org/message/CL4MI3IJH6MPDXS3B23FQ3BDJXHHSKAG/

invisible locked entry is missing time_zone in HostedEngine configuration...

/usr/share/ovirt-engine/dbscripts/engine-psql.sh -c "update vm_static 
set time_zone='Etc/GMT' where vm_name='HostedEngine'"


after this I can change CPU type in cluster and gluster services 
checkbox stay checked.


Thanks for support,

Jiri


On 8/4/22 20:38, Strahil Nikolov wrote:

Go to the host running the HostedEngine VM and dump the xml via virsh.
Then power cycle the engine and check if it fixed the issue with the CPU.

Best Regards,
Strahil Nikolov

On Wed, Aug 3, 2022 at 23:58, Jiří Sléžka
 wrote:
Dne 8/3/22 v 03:06 Strahil Nikolov napsal(a):
 > I think it's related to Compute -> Clusters -> Cluster Name ->
Gluster Hooks
 >
 > I think https://access.redhat.com/solutions/6644151
<https://access.redhat.com/solutions/6644151 >should solve the
 > problem (you can use a developer subscription to access it).

thanks, I really had 5 hook conflicts

/usr/share/ovirt-engine/dbscripts/engine-psql.sh -c "select
id,name,hook_status,content_type,conflict_status from gluster_hooks
where conflict_status != 0";

                   id                  |        name        |
hook_status | content_type | conflict_status

--++-+--+-
   517462b4-104d-40d1-ac94-3f8baea8e80b | 30samba-start.sh  | ENABLED
   | TEXT        |              4
   d428056d-f6fd-4e56-a48a-ccbdd273b774 | 30samba-set.sh    | ENABLED
   | TEXT        |              4
   a1d8857a-9378-42af-81a8-89a4c75eb52e | 30samba-stop.sh    | ENABLED
   | TEXT        |              4
   af362bbf-d1ea-4d5e-ae07-492c7ce0966f | 29CTDBsetup.sh    | ENABLED
   | TEXT        |              4
   d3bdf3df-13f1-48d8-92d9-03d09989516f | 29CTDB-teardown.sh | ENABLED
   | TEXT        |              4
(5 rows)

I removed them and then synced gluster hooks in cluster

Also diagnostic step

rpm -qV glusterfs-server

revealed that on one of hosts are some hooks missing

[root@ovirt-hci01 <mailto:root@ovirt-hci01> ~]# rpm -qV glusterfs-server
.M...  c /var/lib/glusterd/glusterd.info
missing    /var/lib/glusterd/hooks/1/set/post/S30samba-set.sh
missing    /var/lib/glusterd/hooks/1/start/post/S29CTDBsetup.sh
missing    /var/lib/glusterd/hooks/1/start/post/S30samba-start.sh
missing    /var/lib/glusterd/hooks/1/stop/pre/S29CTDB-teardown.sh
missing    /var/lib/glusterd/hooks/1/stop/pre/S30samba-stop.sh

I reinstalled glusterfs-server package there

Well, I do this only to change CPU Type in cluster but now when Gluster
services are checked and I try to change CPU Type I got

"Error while executing action: Cannot update cluster because the update
triggered update of the VMs/Templates and it failed for the following:
HostedEngine. To fix the issue, please go to each of them, edit, change
the Custom Compatibility Version (or other fields changed previously in
the cluster dialog) and press OK. If the save does not pass, fix the
dialog validation. After successful cluster update, you can revert your
Custom Compatibility Version change (or other changes). If the problem
still persists, you may refer to the engine.log file for further
details."

Strange thing and probably bug - this action disables Gluster services
checkbox in cluster!!! Will try to report it...

Also I have no idea what is wrong with HostedEngine as there is (as I
can see) no custom settings on it... but I cannot change for example
memory on it because "There was an attempt to change Hosted Engine VM
values that are locked."

2022-08-03 22:32:01,436+02 WARN
[org.ovirt.engine.core.bll.UpdateVmCommand] (default task-3193)
[b93958d9-b27d-4f1b-97f0-d78312c2d346] Validation of action 'UpdateVm'
failed for user admin@internal-authz. <mailto:admin@internal-authz.>
Reasons:
VAR__ACTION__UPDATE,VAR__TYPE__VM,VM_CANNOT_UPDATE_HOSTED_ENGINE_FIELD


Cheers,

Jiri

 >
 > Best Regards,
 > Strahil Nikolov
 >
 >    On Wed, Aug 3, 2022 at 1:51, Jiří Sléžka
 >    mailto:jiri.sle...@slu.cz>> wrote:
 >    ___
 >    Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>
 >    To unsubscribe send an email to users-le...@ovirt.org
<mailto:users-le...@ovirt.org>
 >    <mailto:users-le...@ovirt.org <mailto:users-le...@ovirt.org>>
 >    Privacy Statement: https://www.ovirt.org/privacy-polic

[ovirt-users] Re: Changing Cluster Compatibility Version from 4.6 to 4.7 issue

2022-08-15 Thread Jiří Sléžka

On 8/13/22 11:38, Alexandr Mikhailov wrote:

This is solution: update vm_static set time_zone='Etc/GMT' where 
vm_name='HostedEngine';


thanks, it works also for me

Cheers,

Jiri


___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CL4MI3IJH6MPDXS3B23FQ3BDJXHHSKAG/




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/OUPVVH5ROWCXTVPY6JXWUDKDAM6NNNIW/


[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-08-03 Thread Jiří Sléžka

Dne 8/3/22 v 03:06 Strahil Nikolov napsal(a):

I think it's related to Compute -> Clusters -> Cluster Name -> Gluster Hooks

I think https://access.redhat.com/solutions/6644151 should solve the 
problem (you can use a developer subscription to access it).


thanks, I really had 5 hook conflicts

/usr/share/ovirt-engine/dbscripts/engine-psql.sh -c "select 
id,name,hook_status,content_type,conflict_status from gluster_hooks 
where conflict_status != 0";


  id  |name| 
hook_status | content_type | conflict_status

--++-+--+-
 517462b4-104d-40d1-ac94-3f8baea8e80b | 30samba-start.sh   | ENABLED 
  | TEXT |   4
 d428056d-f6fd-4e56-a48a-ccbdd273b774 | 30samba-set.sh | ENABLED 
  | TEXT |   4
 a1d8857a-9378-42af-81a8-89a4c75eb52e | 30samba-stop.sh| ENABLED 
  | TEXT |   4
 af362bbf-d1ea-4d5e-ae07-492c7ce0966f | 29CTDBsetup.sh | ENABLED 
  | TEXT |   4
 d3bdf3df-13f1-48d8-92d9-03d09989516f | 29CTDB-teardown.sh | ENABLED 
  | TEXT |   4

(5 rows)

I removed them and then synced gluster hooks in cluster

Also diagnostic step

rpm -qV glusterfs-server

revealed that on one of hosts are some hooks missing

[root@ovirt-hci01 ~]# rpm -qV glusterfs-server
.M...  c /var/lib/glusterd/glusterd.info
missing /var/lib/glusterd/hooks/1/set/post/S30samba-set.sh
missing /var/lib/glusterd/hooks/1/start/post/S29CTDBsetup.sh
missing /var/lib/glusterd/hooks/1/start/post/S30samba-start.sh
missing /var/lib/glusterd/hooks/1/stop/pre/S29CTDB-teardown.sh
missing /var/lib/glusterd/hooks/1/stop/pre/S30samba-stop.sh

I reinstalled glusterfs-server package there

Well, I do this only to change CPU Type in cluster but now when Gluster 
services are checked and I try to change CPU Type I got


"Error while executing action: Cannot update cluster because the update 
triggered update of the VMs/Templates and it failed for the following: 
HostedEngine. To fix the issue, please go to each of them, edit, change 
the Custom Compatibility Version (or other fields changed previously in 
the cluster dialog) and press OK. If the save does not pass, fix the 
dialog validation. After successful cluster update, you can revert your 
Custom Compatibility Version change (or other changes). If the problem 
still persists, you may refer to the engine.log file for further details."


Strange thing and probably bug - this action disables Gluster services 
checkbox in cluster!!! Will try to report it...


Also I have no idea what is wrong with HostedEngine as there is (as I 
can see) no custom settings on it... but I cannot change for example 
memory on it because "There was an attempt to change Hosted Engine VM 
values that are locked."


2022-08-03 22:32:01,436+02 WARN 
[org.ovirt.engine.core.bll.UpdateVmCommand] (default task-3193) 
[b93958d9-b27d-4f1b-97f0-d78312c2d346] Validation of action 'UpdateVm' 
failed for user admin@internal-authz. Reasons: 
VAR__ACTION__UPDATE,VAR__TYPE__VM,VM_CANNOT_UPDATE_HOSTED_ENGINE_FIELD



Cheers,

Jiri



Best Regards,
Strahil Nikolov

On Wed, Aug 3, 2022 at 1:51, Jiří Sléžka
 wrote:
___
Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
To unsubscribe send an email to users-le...@ovirt.org
<mailto:users-le...@ovirt.org>
Privacy Statement: https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>
oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/>
List Archives:

https://lists.ovirt.org/archives/list/users@ovirt.org/message/HNGTNEBDBB2GWBYGHSIGNVIUGL4EFWT5/

<https://lists.ovirt.org/archives/list/users@ovirt.org/message/HNGTNEBDBB2GWBYGHSIGNVIUGL4EFWT5/>





smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/MUD4TF6MT33PNGVCQPLHSABJ3VKX64YG/


[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-08-02 Thread Jiří Sléžka
but some webhook is registered on host... ovirt-hci.mch.local is 
resolvable (through /etc/hosts)


[root@ovirt-hci01 ~]# gluster-eventsapi status
Webhooks:
http://ovirt-hci.mch.local:80/ovirt-engine/services/glusterevents

+---+-+---+
|NODE   | NODE STATUS | GLUSTEREVENTSD STATUS |
+---+-+---+
| 10.0.4.12 |  UP |OK |
| 10.0.4.13 |  UP |OK |
| localhost |  UP |OK |
+---+-+---+

Jiri


Dne 8/3/22 v 00:38 Jiří Sléžka napsal(a):

Dne 7/23/22 v 23:53 Strahil Nikolov napsal(a):
Did you identify any errors in the Engine log that could provide any 
clue ?


unfortunately no.

but funny thing... today I looked into html source of cluster settings 
page (via Firefox's web developer console). Gluster checkbox has this 
html code


id="ClusterPopupView_enableGlusterService" tabindex="17" 
style="vertical-align: top;" disabled="">


when I edited and removed disabled="" part, I was able check that 
checkbox. After pressing Ok everything seems to be set but there are 
finally three relevant errors in the engine.log


2022-08-03 00:22:36,795+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6f4a736] Could not sync webhooks to gluster server 
'ovirt-hci03.mch.local': null
2022-08-03 00:22:37,842+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6471654f] Could not sync webhooks to gluster server 
'ovirt-hci01.mch.local': null
2022-08-03 00:22:39,051+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6970bf5a] Could not sync webhooks to gluster server 
'ovirt-hci02.mch.local': null


Any idea why?

few lines before first error

2022-08-03 00:22:36,501+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.GlusterServersListVDSCommand] 
(default task-2701) [7078c5b6] START, 
GlusterServersListVDSCommand(HostName = ovirt-hci01.mch.local, 
VdsIdVDSCommandParametersBase:{hostId='41722608-413e--a8bb-08ad783ec186'}), 
log id: 6083708e
2022-08-03 00:22:36,616+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.GlusterServersListVDSCommand] 
(default task-2701) [7078c5b6] FINISH, GlusterServersListVDSCommand, 
return: [10.0.3.51/24:CONNECTED, 10.0.4.12:CONNECTED, 
10.0.4.13:CONNECTED], log id: 6083708e
2022-08-03 00:22:36,619+02 INFO 
[org.ovirt.engine.core.bll.gluster.AddGlusterWebhookInternalCommand] 
(default task-2701) [6f4a736] Running command: 
AddGlusterWebhookInternalCommand internal: true. Entities affected : ID: 
d03909e7-aca1-496c-9ff6-4a513c961ae3 Type: Cluster
2022-08-03 00:22:36,624+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.AddGlusterWebhookVDSCommand] 
(default task-2701) [6f4a736] START, 
AddGlusterWebhookVDSCommand(HostName = ovirt-hci01.mch.local, 
GlusterWebhookVDSParameters:{hostId='41722608-413e--a8bb-08ad783ec186'}), 
log id: 1dfd0a44
2022-08-03 00:22:36,793+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.AddGlusterWebhookVDSCommand] 
(default task-2701) [6f4a736] FINISH, AddGlusterWebhookVDSCommand, 
return: , log id: 1dfd0a44
2022-08-03 00:22:36,795+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6f4a736] Could not sync webhooks to gluster server 
'ovirt-hci03.mch.local': null


Cheers,

Jiri




Best Regards,
Strahil Nikolov

    On Wed, Jul 20, 2022 at 16:15, Jiří Sléžka
     wrote:
    On 7/19/22 22:40, Strahil Nikolov wrote:
 > Then, just ensure that the glusterd.service is enabled on all
    hosts and
 > leave it as it is.
 >
 > If it worries you, you will have to move one of the hosts in 
another

 > cluster (probably a new one) and slowly migrate the VMs from the
    old to
 > the new one.
 > Yet, if you use only 3 hosts that can put your VMs in risk (new
    cluster
 > having a single host could lead to downtimes).

    well, it blocks me from any changes on cluster so it is serious
    problem... but personally I don't like this "new cluster and 
migration"

    approach :-(

 > To be honest, I wouldn't change DB if it's a productive cluster.
    If you
 > decide to go that one -> make an engine backup before that.

    Would anyone from oVirt/gluster developers have a look?

    Thanks in advance,

    Jiri

 >
 > Best Regards,
     > Strahil Nikolov
 >
 >
 >
 >
 >
 >    On Tue, Jul 19, 2022 at 12:25, Jiří Sléžka
 >    mailto:jiri.sle...@slu.cz>> wrote:
 >    On 7/16/22 07:53, Strahil Nikolov wrote:
 >      > Try first with a single host. Set it into maintenance and
    check
 >    if the
 >      > checkmark is available.
 >
 >    setting single host to maintenance didn't change state

[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-08-02 Thread Jiří Sléžka

Dne 7/23/22 v 23:53 Strahil Nikolov napsal(a):

Did you identify any errors in the Engine log that could provide any clue ?


unfortunately no.

but funny thing... today I looked into html source of cluster settings 
page (via Firefox's web developer console). Gluster checkbox has this 
html code


id="ClusterPopupView_enableGlusterService" tabindex="17" 
style="vertical-align: top;" disabled="">


when I edited and removed disabled="" part, I was able check that 
checkbox. After pressing Ok everything seems to be set but there are 
finally three relevant errors in the engine.log


2022-08-03 00:22:36,795+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6f4a736] Could not sync webhooks to gluster server 
'ovirt-hci03.mch.local': null
2022-08-03 00:22:37,842+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6471654f] Could not sync webhooks to gluster server 
'ovirt-hci01.mch.local': null
2022-08-03 00:22:39,051+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6970bf5a] Could not sync webhooks to gluster server 
'ovirt-hci02.mch.local': null


Any idea why?

few lines before first error

2022-08-03 00:22:36,501+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.GlusterServersListVDSCommand] 
(default task-2701) [7078c5b6] START, 
GlusterServersListVDSCommand(HostName = ovirt-hci01.mch.local, 
VdsIdVDSCommandParametersBase:{hostId='41722608-413e--a8bb-08ad783ec186'}), 
log id: 6083708e
2022-08-03 00:22:36,616+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.GlusterServersListVDSCommand] 
(default task-2701) [7078c5b6] FINISH, GlusterServersListVDSCommand, 
return: [10.0.3.51/24:CONNECTED, 10.0.4.12:CONNECTED, 
10.0.4.13:CONNECTED], log id: 6083708e
2022-08-03 00:22:36,619+02 INFO 
[org.ovirt.engine.core.bll.gluster.AddGlusterWebhookInternalCommand] 
(default task-2701) [6f4a736] Running command: 
AddGlusterWebhookInternalCommand internal: true. Entities affected : 
ID: d03909e7-aca1-496c-9ff6-4a513c961ae3 Type: Cluster
2022-08-03 00:22:36,624+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.AddGlusterWebhookVDSCommand] 
(default task-2701) [6f4a736] START, 
AddGlusterWebhookVDSCommand(HostName = ovirt-hci01.mch.local, 
GlusterWebhookVDSParameters:{hostId='41722608-413e--a8bb-08ad783ec186'}), 
log id: 1dfd0a44
2022-08-03 00:22:36,793+02 INFO 
[org.ovirt.engine.core.vdsbroker.gluster.AddGlusterWebhookVDSCommand] 
(default task-2701) [6f4a736] FINISH, AddGlusterWebhookVDSCommand, 
return: , log id: 1dfd0a44
2022-08-03 00:22:36,795+02 ERROR 
[org.ovirt.engine.core.bll.InitGlusterCommandHelper] (default task-2701) 
[6f4a736] Could not sync webhooks to gluster server 
'ovirt-hci03.mch.local': null


Cheers,

Jiri




Best Regards,
Strahil Nikolov

On Wed, Jul 20, 2022 at 16:15, Jiří Sléžka
 wrote:
On 7/19/22 22:40, Strahil Nikolov wrote:
 > Then, just ensure that the glusterd.service is enabled on all
hosts and
 > leave it as it is.
 >
 > If it worries you, you will have to move one of the hosts in another
 > cluster (probably a new one) and slowly migrate the VMs from the
old to
 > the new one.
 > Yet, if you use only 3 hosts that can put your VMs in risk (new
cluster
 > having a single host could lead to downtimes).

well, it blocks me from any changes on cluster so it is serious
problem... but personally I don't like this "new cluster and migration"
approach :-(

 > To be honest, I wouldn't change DB if it's a productive cluster.
If you
 > decide to go that one -> make an engine backup before that.

Would anyone from oVirt/gluster developers have a look?

Thanks in advance,

Jiri

 >
 > Best Regards,
     > Strahil Nikolov
 >
 >
 >
 >
 >
 >    On Tue, Jul 19, 2022 at 12:25, Jiří Sléžka
 >    mailto:jiri.sle...@slu.cz>> wrote:
 >    On 7/16/22 07:53, Strahil Nikolov wrote:
 >      > Try first with a single host. Set it into maintenance and
check
 >    if the
 >      > checkmark is available.
 >
 >    setting single host to maintenance didn't change state of the
gluster
 >    services checkbox in cluster settings.
 >
 >      > If not, try to 'reinstall' (UI, Hosts, Installation,
Reinstall) the
 >      > host. During the setup, it should give you to update if
the host
 >    can run
 >      > the HE and it should allow you to select the checkmark for
Gluster.
 >
 >    well, in my oVirt install there is no way to setup glusterfs
services
 >    during host reinstall. There are only choices to configure
firewall,
 >    activate host after install, reboot host after install and
 >    dep

[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-20 Thread Jiří Sléžka

On 7/19/22 22:40, Strahil Nikolov wrote:
Then, just ensure that the glusterd.service is enabled on all hosts and 
leave it as it is.


If it worries you, you will have to move one of the hosts in another 
cluster (probably a new one) and slowly migrate the VMs from the old to 
the new one.
Yet, if you use only 3 hosts that can put your VMs in risk (new cluster 
having a single host could lead to downtimes).


well, it blocks me from any changes on cluster so it is serious 
problem... but personally I don't like this "new cluster and migration" 
approach :-(


To be honest, I wouldn't change DB if it's a productive cluster. If you 
decide to go that one -> make an engine backup before that.


Would anyone from oVirt/gluster developers have a look?

Thanks in advance,

Jiri



Best Regards,
Strahil Nikolov





On Tue, Jul 19, 2022 at 12:25, Jiří Sléžka
 wrote:
On 7/16/22 07:53, Strahil Nikolov wrote:
 > Try first with a single host. Set it into maintenance and check
if the
 > checkmark is available.

setting single host to maintenance didn't change state of the gluster
services checkbox in cluster settings.

 > If not, try to 'reinstall' (UI, Hosts, Installation, Reinstall) the
 > host. During the setup, it should give you to update if the host
can run
 > the HE and it should allow you to select the checkmark for Gluster.

well, in my oVirt install there is no way to setup glusterfs services
during host reinstall. There are only choices to configure firewall,
activate host after install, reboot host after install and
deploy/undeploy hosted engine...

I think that gluster related stuff is installed automatically as it is
configured on cluster level (where in my case are gluster services
disabled).

 > Let's work with a single node before being so drastic and
outage-ing a
 > cluster.


Cheers,

Jiri

 >
 > Best Regards,
 > Strahil Nikolov
 >
 >    On Thu, Jul 14, 2022 at 23:03, Jiří Sléžka
 >    mailto:jiri.sle...@slu.cz>> wrote:
 >    Dne 7/14/22 v 21:21 Strahil Nikolov napsal(a):
 >      > Go to the UI, select the volume , pres 'Start' and mark the
 >    checkbox for
 >      > 'Force'-fully start .
 >
 >    well, it worked :-) Now all bricks are in UP state. In fact from
 >    commandline point of view all volumes were active and all
bricks up all
 >    the time.
 >
 >      > At least it should update the engine that everything is
running .
 >      > Have you checked if the checkmark for the Gluster service is
 >    available
 >      > if you set the Host into maintenance?
 >
 >    which host do you mean? If all hosts in the cluster I have to
plan an
 >    outage... will try...
 >
 >    Thanks,
 >
 >    Jiri
 >
     >      >
 >      > Best Regards,
 >      > Strahil Nikolov
 >      >
 >      >    On Thu, Jul 14, 2022 at 16:08, Jiří Sléžka
 >      >    mailto:jiri.sle...@slu.cz>
<mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
 >      >    ___
 >      >    Users mailing list -- users@ovirt.org
<mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>
 >    <mailto:users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>>
 >      >    To unsubscribe send an email to users-le...@ovirt.org
<mailto:users-le...@ovirt.org>
 >    <mailto:users-le...@ovirt.org <mailto:users-le...@ovirt.org>>
 >
 >      >    <mailto:users-le...@ovirt.org
<mailto:users-le...@ovirt.org> <mailto:users-le...@ovirt.org
<mailto:users-le...@ovirt.org>>>
 >      >    Privacy Statement:
https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>
 >    <https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>>
 >      >    <https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>
 >    <https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>>>
 >      >    oVirt Code of Conduct:
 >      >
https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/>
 >    <https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/>>
 >      >   
<https://w

[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-19 Thread Jiří Sléžka

On 7/16/22 07:53, Strahil Nikolov wrote:
Try first with a single host. Set it into maintenance and check if the 
checkmark is available.


setting single host to maintenance didn't change state of the gluster 
services checkbox in cluster settings.


If not, try to 'reinstall' (UI, Hosts, Installation, Reinstall) the 
host. During the setup, it should give you to update if the host can run 
the HE and it should allow you to select the checkmark for Gluster.


well, in my oVirt install there is no way to setup glusterfs services 
during host reinstall. There are only choices to configure firewall, 
activate host after install, reboot host after install and 
deploy/undeploy hosted engine...


I think that gluster related stuff is installed automatically as it is 
configured on cluster level (where in my case are gluster services 
disabled).


Let's work with a single node before being so drastic and outage-ing a 
cluster.



Cheers,

Jiri



Best Regards,
Strahil Nikolov

On Thu, Jul 14, 2022 at 23:03, Jiří Sléžka
 wrote:
Dne 7/14/22 v 21:21 Strahil Nikolov napsal(a):
 > Go to the UI, select the volume , pres 'Start' and mark the
checkbox for
 > 'Force'-fully start .

well, it worked :-) Now all bricks are in UP state. In fact from
commandline point of view all volumes were active and all bricks up all
the time.

 > At least it should update the engine that everything is running .
 > Have you checked if the checkmark for the Gluster service is
available
 > if you set the Host into maintenance?

which host do you mean? If all hosts in the cluster I have to plan an
outage... will try...

Thanks,

Jiri

 >
 > Best Regards,
 > Strahil Nikolov
 >
 >    On Thu, Jul 14, 2022 at 16:08, Jiří Sléžka
 >    mailto:jiri.sle...@slu.cz>> wrote:
 >    ___
 >    Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>
 >    To unsubscribe send an email to users-le...@ovirt.org
<mailto:users-le...@ovirt.org>

 >    <mailto:users-le...@ovirt.org <mailto:users-le...@ovirt.org>>
 >    Privacy Statement: https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>
 >    <https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>>
 >    oVirt Code of Conduct:
 > https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/>
 >    <https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/>>
 >    List Archives:
 >

https://lists.ovirt.org/archives/list/users@ovirt.org/message/624NH3C5REFDV55K4NPKF6IU4IHG6FPK/

<https://lists.ovirt.org/archives/list/users@ovirt.org/message/624NH3C5REFDV55K4NPKF6IU4IHG6FPK/>
 >   
<https://lists.ovirt.org/archives/list/users@ovirt.org/message/624NH3C5REFDV55K4NPKF6IU4IHG6FPK/


<https://lists.ovirt.org/archives/list/users@ovirt.org/message/624NH3C5REFDV55K4NPKF6IU4IHG6FPK/>>
 >





smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/BPORMHV6GU3VVKK4LTFJEYR27B76ZEWB/


[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-14 Thread Jiří Sléžka

Dne 7/14/22 v 21:21 Strahil Nikolov napsal(a):
Go to the UI, select the volume , pres 'Start' and mark the checkbox for 
'Force'-fully start .


well, it worked :-) Now all bricks are in UP state. In fact from 
commandline point of view all volumes were active and all bricks up all 
the time.



At least it should update the engine that everything is running .
Have you checked if the checkmark for the Gluster service is available 
if you set the Host into maintenance?


which host do you mean? If all hosts in the cluster I have to plan an 
outage... will try...


Thanks,

Jiri



Best Regards,
Strahil Nikolov

On Thu, Jul 14, 2022 at 16:08, Jiří Sléžka
 wrote:
___
Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
To unsubscribe send an email to users-le...@ovirt.org
<mailto:users-le...@ovirt.org>
Privacy Statement: https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>
oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/>
List Archives:

https://lists.ovirt.org/archives/list/users@ovirt.org/message/624NH3C5REFDV55K4NPKF6IU4IHG6FPK/

<https://lists.ovirt.org/archives/list/users@ovirt.org/message/624NH3C5REFDV55K4NPKF6IU4IHG6FPK/>





smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/YEM4HECQBEMRZK2UGVZYVUAQBHYR4I2C/


[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-14 Thread Jiří Sléžka

On 7/14/22 14:30, Jiří Sléžka wrote:

On 7/14/22 00:34, Strahil Nikolov wrote:

Well... not yet.
Check if the engine detects the volumes and verify again that all 
glustereventsd work.


I would even consider restarting the engine, just to be on the safe side.


engine restarted (I also yum updated it before), glustereventsd is 
running on all hosts, selinux's port label is set. Still only one brick 
is up and two are in unknown state in manager. See the screenshot.



What is your oVirt version ? Maybe an update could solve your problem.


latest in 4.4 version -> 4.4.10.7-1.el8. Engine and hosts are Rocky 
Linux 8.6 based.


ale here is output of select * from gluster_volume_bricks; I dont know 
if it is relevant but it shows old (2022-06-12 14:32:46.476558+02) dates 
in _update_date filed in part of UNKNOWN state bricks. Also UP state 
bricks has timestamp i past (around 1 day), is it normal?


https://pastebin.com/hQnj1en3

Could you suggest some relevant strings in engine.log | vdsm.log I could 
looks for? My blind grepping just reveleated


grep "GLUSTER" /var/log/ovirt-engine/engine.log

2022-07-13 13:10:19,263+02 WARN 
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(default task-3) [] EVENT_ID: GLUSTER_BRICK_STATUS_CHANGED(4,086), 
Detected change in status of brick 
10.0.4.11:/gluster_bricks/engine/engine of volume engine of cluster 
McHosting from UNKNOWN to UP via gluster event.
2022-07-13 13:10:19,492+02 WARN 
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(default task-3) [] EVENT_ID: GLUSTER_BRICK_STATUS_CHANGED(4,086), 
Detected change in status of brick 10.0.4.11:/gluster_bricks/vms/vms of 
volume vms of cluster McHosting from UNKNOWN to UP via gluster event.
2022-07-13 13:10:21,185+02 WARN 
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(default task-3) [] EVENT_ID: GLUSTER_BRICK_STATUS_CHANGED(4,086), 
Detected change in status of brick 10.0.4.11:/gluster_bricks/vms2/vms2 
of volume vms of cluster McHosting from UNKNOWN to UP via gluster event.


it match the engine log and also time I run glustereventsd service on 
host 10.0.4.11


but I see also repeatedly

2022-07-13 14:14:14,440+02 WARN 
[org.ovirt.engine.core.bll.UpdateClusterCommand] 
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-88) 
[5fb948bc] Validation of action 'UpdateCluster' failed for user SYSTEM. 
Reasons: 
VAR__TYPE__CLUSTER,VAR__ACTION__UPDATE,CLUSTER_CANNOT_DISABLE_GLUSTER_WHEN_CLUSTER_CONTAINS_VOLUMES 



so it looks to me like there is some action running which wants disable 
gluster services on cluster and it cannot (and it is right!). But it 
probably blocks that gluster services checkbox in cluster settings in 
manager. What do you think?


sorry, date implies it is my yesterday's testing invoking of cluster 
edit window... there are no running jobs or tasks in db (as I can see)...


so still dont know what is wrong :-(

Cheers,

Jiri




Cheers, Jiri



Best Regards,
Strahil Nikolov

    On Wed, Jul 13, 2022 at 17:05, Jiří Sléžka
     wrote:
    On 7/13/22 14:53, Jiří Sléžka wrote:
 > On 7/12/22 22:28, Strahil Nikolov wrote:
 >> glustereventad will notify the engine when something changes -
    like a
 >> new volume is created from the cli (or bad things happened ;) ),
    so it
 >> should be running. >
 >> You can use the workaround from the github issue and reatart the
 >> glustereventsd service.
 >
 > ok, workaround applied, glustereventsd service enabled and
    started on
 > all hosts.
 >
 > I can see this log entry in volume Events
 >
 > Detected change in status of brick
 > 10.0.4.11:/gluster_bricks/engine/engine of volume engine of 
cluster

 > McHosting from UNKNOWN to UP via gluster event.
 >
 > but Bricks tab shows still two (.12 and .13) of three bricks in
    Unknown
 > state. From command line point of view all bricks are up and 
healthy.

 >
 > it looks like engine thinks that gluster service is disabled in
    cluster
 > but I cannot enable it because checkbox is disabled. In my 
other (FC

 > based) oVirt instance Gluster Service checkbox is not selected
    but not
 > disabled. So I am interested what could make that checkbox
    inactive...

    well, on db side it looks like cluster has gluster_service 
disabled...


    engine=# select virt_service, gluster_service from cluster;
   virt_service | gluster_service
    --+-
   t            | f
    (1 row)

    still don't know why the checkbox is disabled. Would it be safe to
    enabled gluster_service directly in db? I suppose no... :-)

    Cheers,

    Jiri


 >
 >> For the vdsm, you can always run
 >> '/usr/libexec/vdsm/vdsmd_init_common.sh --pre-start' which is
    executed
 &

[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-14 Thread Jiří Sléžka

On 7/14/22 00:34, Strahil Nikolov wrote:

Well... not yet.
Check if the engine detects the volumes and verify again that all 
glustereventsd work.


I would even consider restarting the engine, just to be on the safe side.


engine restarted (I also yum updated it before), glustereventsd is 
running on all hosts, selinux's port label is set. Still only one brick 
is up and two are in unknown state in manager. See the screenshot.



What is your oVirt version ? Maybe an update could solve your problem.


latest in 4.4 version -> 4.4.10.7-1.el8. Engine and hosts are Rocky 
Linux 8.6 based.


ale here is output of select * from gluster_volume_bricks; I dont know 
if it is relevant but it shows old (2022-06-12 14:32:46.476558+02) dates 
in _update_date filed in part of UNKNOWN state bricks. Also UP state 
bricks has timestamp i past (around 1 day), is it normal?


https://pastebin.com/hQnj1en3

Could you suggest some relevant strings in engine.log | vdsm.log I could 
looks for? My blind grepping just reveleated


grep "GLUSTER" /var/log/ovirt-engine/engine.log

2022-07-13 13:10:19,263+02 WARN 
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(default task-3) [] EVENT_ID: GLUSTER_BRICK_STATUS_CHANGED(4,086), 
Detected change in status of brick 
10.0.4.11:/gluster_bricks/engine/engine of volume engine of cluster 
McHosting from UNKNOWN to UP via gluster event.
2022-07-13 13:10:19,492+02 WARN 
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(default task-3) [] EVENT_ID: GLUSTER_BRICK_STATUS_CHANGED(4,086), 
Detected change in status of brick 10.0.4.11:/gluster_bricks/vms/vms of 
volume vms of cluster McHosting from UNKNOWN to UP via gluster event.
2022-07-13 13:10:21,185+02 WARN 
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] 
(default task-3) [] EVENT_ID: GLUSTER_BRICK_STATUS_CHANGED(4,086), 
Detected change in status of brick 10.0.4.11:/gluster_bricks/vms2/vms2 
of volume vms of cluster McHosting from UNKNOWN to UP via gluster event.


it match the engine log and also time I run glustereventsd service on 
host 10.0.4.11


but I see also repeatedly

2022-07-13 14:14:14,440+02 WARN 
[org.ovirt.engine.core.bll.UpdateClusterCommand] 
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-88) 
[5fb948bc] Validation of action 'UpdateCluster' failed for user SYSTEM. 
Reasons: 
VAR__TYPE__CLUSTER,VAR__ACTION__UPDATE,CLUSTER_CANNOT_DISABLE_GLUSTER_WHEN_CLUSTER_CONTAINS_VOLUMES


so it looks to me like there is some action running which wants disable 
gluster services on cluster and it cannot (and it is right!). But it 
probably blocks that gluster services checkbox in cluster settings in 
manager. What do you think?



Cheers, Jiri



Best Regards,
Strahil Nikolov

On Wed, Jul 13, 2022 at 17:05, Jiří Sléžka
 wrote:
On 7/13/22 14:53, Jiří Sléžka wrote:
 > On 7/12/22 22:28, Strahil Nikolov wrote:
 >> glustereventad will notify the engine when something changes -
like a
 >> new volume is created from the cli (or bad things happened ;) ),
so it
 >> should be running. >
 >> You can use the workaround from the github issue and reatart the
 >> glustereventsd service.
 >
 > ok, workaround applied, glustereventsd service enabled and
started on
 > all hosts.
 >
 > I can see this log entry in volume Events
 >
 > Detected change in status of brick
 > 10.0.4.11:/gluster_bricks/engine/engine of volume engine of cluster
 > McHosting from UNKNOWN to UP via gluster event.
 >
 > but Bricks tab shows still two (.12 and .13) of three bricks in
Unknown
 > state. From command line point of view all bricks are up and healthy.
 >
 > it looks like engine thinks that gluster service is disabled in
cluster
 > but I cannot enable it because checkbox is disabled. In my other (FC
 > based) oVirt instance Gluster Service checkbox is not selected
but not
 > disabled. So I am interested what could make that checkbox
inactive...

well, on db side it looks like cluster has gluster_service disabled...

engine=# select virt_service, gluster_service from cluster;
   virt_service | gluster_service
--+-
   t            | f
(1 row)

still don't know why the checkbox is disabled. Would it be safe to
enabled gluster_service directly in db? I suppose no... :-)

Cheers,

Jiri


 >
 >> For the vdsm, you can always run
 >> '/usr/libexec/vdsm/vdsmd_init_common.sh --pre-start' which is
executed
 >> by the vdsmd.service before every start (ExecStartPre stanza)
and see
 >> if it complains about something.
 >
 > [root@ovirt-hci03 <mailto:root@ovirt-hci03> ~]#
/usr/libexec/vdsm/vdsmd_init_common.s

[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-13 Thread Jiří Sléžka

On 7/13/22 14:53, Jiří Sléžka wrote:

On 7/12/22 22:28, Strahil Nikolov wrote:
glustereventad will notify the engine when something changes - like a 
new volume is created from the cli (or bad things happened ;) ), so it 
should be running. >
You can use the workaround from the github issue and reatart the 
glustereventsd service.


ok, workaround applied, glustereventsd service enabled and started on 
all hosts.


I can see this log entry in volume Events

Detected change in status of brick 
10.0.4.11:/gluster_bricks/engine/engine of volume engine of cluster 
McHosting from UNKNOWN to UP via gluster event.


but Bricks tab shows still two (.12 and .13) of three bricks in Unknown 
state. From command line point of view all bricks are up and healthy.


it looks like engine thinks that gluster service is disabled in cluster 
but I cannot enable it because checkbox is disabled. In my other (FC 
based) oVirt instance Gluster Service checkbox is not selected but not 
disabled. So I am interested what could make that checkbox inactive...


well, on db side it looks like cluster has gluster_service disabled...

engine=# select virt_service, gluster_service from cluster;
 virt_service | gluster_service
--+-
 t| f
(1 row)

still don't know why the checkbox is disabled. Would it be safe to 
enabled gluster_service directly in db? I suppose no... :-)


Cheers,

Jiri



For the vdsm, you can always run 
'/usr/libexec/vdsm/vdsmd_init_common.sh --pre-start' which is executed 
by the vdsmd.service before every start (ExecStartPre stanza) and see 
if it complains about something.


[root@ovirt-hci03 ~]# /usr/libexec/vdsm/vdsmd_init_common.sh --pre-start
vdsm: Running mkdirs
vdsm: Running configure_vdsm_logs
vdsm: Running run_init_hooks
vdsm: Running check_is_configured
sanlock is configured for vdsm
lvm is configured for vdsm
abrt is already configured for vdsm
Managed volume database is already configured
Current revision of multipath.conf detected, preserving
libvirt is already configured for vdsm
vdsm: Running validate_configuration
SUCCESS: ssl configured to true. No conflicts
vdsm: Running prepare_transient_repository
vdsm: Running syslog_available
vdsm: Running nwfilter
vdsm: Running dummybr
vdsm: Running tune_system
vdsm: Running test_space
vdsm: Running test_lo

retcode 0, all looks ok...

Cheers,

Jiri




Best Regards,
Strahil Nikolov

    On Tue, Jul 12, 2022 at 11:12, Jiří Sléžka
     wrote:
    On 7/11/22 16:22, Jiří Sléžka wrote:
 > On 7/11/22 15:57, Strahil Nikolov wrote:
 >> Can you check for AVC denials and the error message like the
    described
 >> in
 >>

https://github.com/gluster/glusterfs-selinux/issues/27#issue-1097225183

<https://github.com/gluster/glusterfs-selinux/issues/27#issue-1097225183

 >?
 >
 > thanks for reply, there are two unrelated (qemu-kvm) avc denials
    logged
 > (related probably to sanlock recovery)
 >
 > also I cannot find glustereventsd in any related log... is it 
really

 > used by vdsm-gluster?
 >
 > this service runs on no hosts
 >
 > systemctl status glustereventsd
 > ● glustereventsd.service - Gluster Events Notifier
 >     Loaded: loaded 
(/usr/lib/systemd/system/glustereventsd.service;

 > disabled; vendor preset: disabled)
 >     Active: inactive (dead)

    it looks like root of the problem is that Gluster service is
    disabled in
    cluster settings and cannot be enabled. But it was enabled before...
    also I have to manually install vdsm-gluster when I (re)install new
    host, but bricks from this host are in unknown state in admin. Maybe
    vdsm-gluster is not correctly configured? Maybe glustereventsd is not
    running? I am just guessing...

    I have no access to other HCI installation so I cannot compare
    differences.

    I would be really happy if someone could tell me what circumstances
    could disable Gluster service checkbox in admin and how to enable it
    again...

    Cheers,

    Jiri


 >
 > Cheers,
 >
 > Jiri
 >
 >
 >>
 >>
 >> Best Regards,
 >> Strahil Nikolov
 >>
 >>     On Mon, Jul 11, 2022 at 16:44, Jiří Sléžka
 >>     mailto:jiri.sle...@slu.cz>> wrote:
 >>     Hello,
 >>
 >>     On 7/11/22 14:34, Strahil Nikolov wrote:
 >>  > Can you check something on the host:
 >>  > cat /etc/glusterfs/eventsconfig.json
 >>
 >>     cat /etc/glusterfs/eventsconfig.json
 >>     {
 >>      "log-level": "INFO",
 >>      "port": 24009,
 >>      "disable-events-log": false
 >>     }
 >>
 >>
 >>  > semanage por

[ovirt-users] Re: Stuck in Manager upgrade. Can't set Cluster to maintenance mode.

2022-07-13 Thread Jiří Sléžka

On 7/13/22 12:44, Johannes Lutz wrote:

Can anybody help me?
Or is the solution to build a new hosted engine and try recreating it from a 
backup?

What do i have to do to get the Hosted Engine in Global Maintenance Mode when 
"hosted-engine --set-maintenance --mode=global" does not work ...

Its very frustrating ...


Yes, it is. Certificate refreshing should be automagic job on background...

If I understand it right, your hosted engine is running?

hosted-engine --vm-status

on any hosts state that cluster is in global maintenance mode? You 
should see this line at the end of output


!! Cluster is in GLOBAL MAINTENANCE mode !!

Cheers, Jiri




Best Regards
J.Lutz
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/AUCHVGS5EZ3YZ5WDMERFCB3TS3KWBD3L/




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5INMXCEMYE3JR5ECAHNWITBYSHK5H4GF/


[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-13 Thread Jiří Sléžka

On 7/12/22 22:28, Strahil Nikolov wrote:
glustereventad will notify the engine when something changes - like a 
new volume is created from the cli (or bad things happened ;) ), so it 
should be running. >
You can use the workaround from the github issue and reatart the 
glustereventsd service.


ok, workaround applied, glustereventsd service enabled and started on 
all hosts.


I can see this log entry in volume Events

Detected change in status of brick 
10.0.4.11:/gluster_bricks/engine/engine of volume engine of cluster 
McHosting from UNKNOWN to UP via gluster event.


but Bricks tab shows still two (.12 and .13) of three bricks in Unknown 
state. From command line point of view all bricks are up and healthy.


it looks like engine thinks that gluster service is disabled in cluster 
but I cannot enable it because checkbox is disabled. In my other (FC 
based) oVirt instance Gluster Service checkbox is not selected but not 
disabled. So I am interested what could make that checkbox inactive...


For the vdsm, you can always run '/usr/libexec/vdsm/vdsmd_init_common.sh 
--pre-start' which is executed by the vdsmd.service before every start 
(ExecStartPre stanza) and see if it complains about something.


[root@ovirt-hci03 ~]# /usr/libexec/vdsm/vdsmd_init_common.sh --pre-start
vdsm: Running mkdirs
vdsm: Running configure_vdsm_logs
vdsm: Running run_init_hooks
vdsm: Running check_is_configured
sanlock is configured for vdsm
lvm is configured for vdsm
abrt is already configured for vdsm
Managed volume database is already configured
Current revision of multipath.conf detected, preserving
libvirt is already configured for vdsm
vdsm: Running validate_configuration
SUCCESS: ssl configured to true. No conflicts
vdsm: Running prepare_transient_repository
vdsm: Running syslog_available
vdsm: Running nwfilter
vdsm: Running dummybr
vdsm: Running tune_system
vdsm: Running test_space
vdsm: Running test_lo

retcode 0, all looks ok...

Cheers,

Jiri




Best Regards,
Strahil Nikolov

On Tue, Jul 12, 2022 at 11:12, Jiří Sléžka
 wrote:
On 7/11/22 16:22, Jiří Sléžka wrote:
 > On 7/11/22 15:57, Strahil Nikolov wrote:
 >> Can you check for AVC denials and the error message like the
described
 >> in
 >>
https://github.com/gluster/glusterfs-selinux/issues/27#issue-1097225183
<https://github.com/gluster/glusterfs-selinux/issues/27#issue-1097225183
 >?
 >
 > thanks for reply, there are two unrelated (qemu-kvm) avc denials
logged
 > (related probably to sanlock recovery)
 >
 > also I cannot find glustereventsd in any related log... is it really
 > used by vdsm-gluster?
 >
 > this service runs on no hosts
 >
 > systemctl status glustereventsd
 > ● glustereventsd.service - Gluster Events Notifier
 >     Loaded: loaded (/usr/lib/systemd/system/glustereventsd.service;
 > disabled; vendor preset: disabled)
 >     Active: inactive (dead)

it looks like root of the problem is that Gluster service is
disabled in
cluster settings and cannot be enabled. But it was enabled before...
also I have to manually install vdsm-gluster when I (re)install new
host, but bricks from this host are in unknown state in admin. Maybe
vdsm-gluster is not correctly configured? Maybe glustereventsd is not
running? I am just guessing...

I have no access to other HCI installation so I cannot compare
differences.

I would be really happy if someone could tell me what circumstances
could disable Gluster service checkbox in admin and how to enable it
again...

Cheers,

Jiri


 >
 > Cheers,
 >
 > Jiri
 >
 >
 >>
 >>
 >> Best Regards,
 >> Strahil Nikolov
 >>
 >>     On Mon, Jul 11, 2022 at 16:44, Jiří Sléžka
 >>     mailto:jiri.sle...@slu.cz>> wrote:
 >>     Hello,
 >>
 >>     On 7/11/22 14:34, Strahil Nikolov wrote:
 >>  > Can you check something on the host:
 >>  > cat /etc/glusterfs/eventsconfig.json
 >>
 >>     cat /etc/glusterfs/eventsconfig.json
 >>     {
 >>      "log-level": "INFO",
 >>      "port": 24009,
 >>      "disable-events-log": false
 >>     }
 >>
 >>
 >>  > semanage port -l | grep $(awk -F ':' '/port/
{gsub(",","",$2);
 >> print
 >>  > $2}' /etc/glusterfs/eventsconfig.json)
 >>
 >>     semanage port -l | grep 24009
 >>
 >>     returns empty set, it looks like this port is not labeled
 >>
 >>     Cheers,
 >>
 >>     Jiri
 >>
 >>  >
  

[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-12 Thread Jiří Sléžka

On 7/11/22 16:22, Jiří Sléžka wrote:

On 7/11/22 15:57, Strahil Nikolov wrote:
Can you check for AVC denials and the error message like the described 
in 
https://github.com/gluster/glusterfs-selinux/issues/27#issue-1097225183 ?


thanks for reply, there are two unrelated (qemu-kvm) avc denials logged 
(related probably to sanlock recovery)


also I cannot find glustereventsd in any related log... is it really 
used by vdsm-gluster?


this service runs on no hosts

systemctl status glustereventsd
● glustereventsd.service - Gluster Events Notifier
    Loaded: loaded (/usr/lib/systemd/system/glustereventsd.service; 
disabled; vendor preset: disabled)

    Active: inactive (dead)


it looks like root of the problem is that Gluster service is disabled in 
cluster settings and cannot be enabled. But it was enabled before... 
also I have to manually install vdsm-gluster when I (re)install new 
host, but bricks from this host are in unknown state in admin. Maybe 
vdsm-gluster is not correctly configured? Maybe glustereventsd is not 
running? I am just guessing...


I have no access to other HCI installation so I cannot compare differences.

I would be really happy if someone could tell me what circumstances 
could disable Gluster service checkbox in admin and how to enable it 
again...


Cheers,

Jiri



Cheers,

Jiri





Best Regards,
Strahil Nikolov

    On Mon, Jul 11, 2022 at 16:44, Jiří Sléžka
     wrote:
    Hello,

    On 7/11/22 14:34, Strahil Nikolov wrote:
 > Can you check something on the host:
 > cat /etc/glusterfs/eventsconfig.json

    cat /etc/glusterfs/eventsconfig.json
    {
     "log-level": "INFO",
     "port": 24009,
     "disable-events-log": false
    }


 > semanage port -l | grep $(awk -F ':' '/port/ {gsub(",","",$2); 
print

 > $2}' /etc/glusterfs/eventsconfig.json)

    semanage port -l | grep 24009

    returns empty set, it looks like this port is not labeled

    Cheers,

    Jiri

 >
 > Best Regards,
 > Strahil Nikolov
 > В понеделник, 11 юли 2022 г., 02:18:57 ч. Гринуич+3, Jiří Sléžka
 > mailto:jiri.sle...@slu.cz>> написа:
 >
 >
 > Hi,
 >
 > I would like to change CPU Type in my oVirt 4.4.10 HCI cluster
    (based on
 > 3 glusterfs/virt hosts). When I try to I got this error
 >
 > Error while executing action: Cannot disable gluster service on 
the

 > cluster as it contains volumes.
 >
 > As I remember I had Gluster Service enabled on this cluster but
    now both
 > (Enable Virt Services and Enable Gluster Service) checkboxes are
    grayed
 > out and Gluster Service is unchecked.
 >
 > Also Storage / Volumes displays my volumes... well, displays one
    brick
 > on particular host in unknown state (? mark) which is new
    situation. As
 > I can see from command line all bricks are online, no healing in
 > progress, all looks good...
 >
 > I am not sure if the second issue is relevant to first one so main
 > question is how can I (re)enable gluster service in my cluster?
 >
 > Thanks in advance,
 >
 > Jiri
 > ___
 > Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
    <mailto:users@ovirt.org <mailto:users@ovirt.org>>
 > To unsubscribe send an email to users-le...@ovirt.org
    <mailto:users-le...@ovirt.org>

 > <mailto:users-le...@ovirt.org <mailto:users-le...@ovirt.org>>
 > Privacy Statement: https://www.ovirt.org/privacy-policy.html
    <https://www.ovirt.org/privacy-policy.html >
 > <https://www.ovirt.org/privacy-policy.html
    <https://www.ovirt.org/privacy-policy.html>>
 > oVirt Code of Conduct:
 > https://www.ovirt.org/community/about/community-guidelines/
    <https://www.ovirt.org/community/about/community-guidelines/ >
 > <https://www.ovirt.org/community/about/community-guidelines/
    <https://www.ovirt.org/community/about/community-guidelines/>>
 > List Archives:
 >

https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/ 


<https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/ 


 >
 >

<https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/ 


<https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/>> 






___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www

[ovirt-users] Re: gluster service on the cluster is unchecked on hci cluster

2022-07-11 Thread Jiří Sléžka

On 7/11/22 15:57, Strahil Nikolov wrote:
Can you check for AVC denials and the error message like the described 
in https://github.com/gluster/glusterfs-selinux/issues/27#issue-1097225183 ?


thanks for reply, there are two unrelated (qemu-kvm) avc denials logged 
(related probably to sanlock recovery)


also I cannot find glustereventsd in any related log... is it really 
used by vdsm-gluster?


this service runs on no hosts

systemctl status glustereventsd
● glustereventsd.service - Gluster Events Notifier
   Loaded: loaded (/usr/lib/systemd/system/glustereventsd.service; 
disabled; vendor preset: disabled)

   Active: inactive (dead)

Cheers,

Jiri





Best Regards,
Strahil Nikolov

On Mon, Jul 11, 2022 at 16:44, Jiří Sléžka
 wrote:
Hello,

On 7/11/22 14:34, Strahil Nikolov wrote:
 > Can you check something on the host:
 > cat /etc/glusterfs/eventsconfig.json

cat /etc/glusterfs/eventsconfig.json
{
     "log-level": "INFO",
     "port": 24009,
     "disable-events-log": false
}


 > semanage port -l | grep $(awk -F ':' '/port/ {gsub(",","",$2); print
 > $2}' /etc/glusterfs/eventsconfig.json)

semanage port -l | grep 24009

returns empty set, it looks like this port is not labeled

Cheers,

Jiri

 >
 > Best Regards,
 > Strahil Nikolov
 > В понеделник, 11 юли 2022 г., 02:18:57 ч. Гринуич+3, Jiří Sléžka
 > mailto:jiri.sle...@slu.cz>> написа:
 >
 >
 > Hi,
 >
 > I would like to change CPU Type in my oVirt 4.4.10 HCI cluster
(based on
 > 3 glusterfs/virt hosts). When I try to I got this error
 >
 > Error while executing action: Cannot disable gluster service on the
 > cluster as it contains volumes.
 >
 > As I remember I had Gluster Service enabled on this cluster but
now both
 > (Enable Virt Services and Enable Gluster Service) checkboxes are
grayed
 > out and Gluster Service is unchecked.
 >
 > Also Storage / Volumes displays my volumes... well, displays one
brick
 > on particular host in unknown state (? mark) which is new
situation. As
 > I can see from command line all bricks are online, no healing in
 > progress, all looks good...
 >
 > I am not sure if the second issue is relevant to first one so main
 > question is how can I (re)enable gluster service in my cluster?
 >
 > Thanks in advance,
 >
 > Jiri
 > ___
 > Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>
 > To unsubscribe send an email to users-le...@ovirt.org
<mailto:users-le...@ovirt.org>

 > <mailto:users-le...@ovirt.org <mailto:users-le...@ovirt.org>>
 > Privacy Statement: https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html >
 > <https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/privacy-policy.html>>
 > oVirt Code of Conduct:
 > https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/ >
 > <https://www.ovirt.org/community/about/community-guidelines/
<https://www.ovirt.org/community/about/community-guidelines/>>
 > List Archives:
 >

https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/

<https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/
 >
 >

<https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/

<https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/>>





smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/KJZMKJCPGC2E2PEBYEZWLPX5YUDHD76A/


[ovirt-users] gluster service on the cluster is unchecked on hci cluster

2022-07-10 Thread Jiří Sléžka

Hi,

I would like to change CPU Type in my oVirt 4.4.10 HCI cluster (based on 
3 glusterfs/virt hosts). When I try to I got this error


Error while executing action: Cannot disable gluster service on the 
cluster as it contains volumes.


As I remember I had Gluster Service enabled on this cluster but now both 
(Enable Virt Services and Enable Gluster Service) checkboxes are grayed 
out and Gluster Service is unchecked.


Also Storage / Volumes displays my volumes... well, displays one brick 
on particular host in unknown state (? mark) which is new situation. As 
I can see from command line all bricks are online, no healing in 
progress, all looks good...


I am not sure if the second issue is relevant to first one so main 
question is how can I (re)enable gluster service in my cluster?


Thanks in advance,

Jiri


smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/S4NVCQ33ZSJSHR7P7K7OICSA5F253BVA/


[ovirt-users] Re: install new ovirt 4.4 host

2022-06-04 Thread Jiří Sléžka
just for record, appstream must be enabled, I just exclude these 
packages (RockyLinux 8.6, appstream repo, 
/etc/yum.repos.d/Rocky-AppStream.repo)


excludepkgs=
  qemu-kvm*
  qemu-img

I am not sure if it is necessary but then I was able reinstall and 
upgrade 4.4 host (because of hw upgrade)


Also I have to remove from includepkgs=

 ansible
 ansible-doc

in ovirt-4.4-epel repo (defined in 
/etc/yum.repos.d/ovirt-4.4-dependencies.repo) because unresolvable 
dependencies (looks like this repo serve 5.x version but <2.10 is needed)


Cheers, Jiri

On 6/3/22 07:12, Jiří Sléžka wrote:

Hi,

sorry for noise, it looks like packages from (rocky's) appstream hides 
qemu-kvm* from ovirt-4.4-centos-advanced-virtualization. When I disable 
appstream repo I can see them...


yum info qemu-kvm --disablerepo appstream

Will try later today but probably I should disable appstream at all, right?

Cheers,

Jiri

On 6/2/22 16:05, Jiří Sléžka wrote:

Hi,

I am around to install new ovirt 4.4 host on RockyLinux8 based server. 
I just install


yum install http://mirror.slu.cz/ovirt/yum-repo/ovirt-release44.rpm

but when I search for (for example) qemu-kvm package

yum info qemu-kvm

I can see only qemu-kvm-6.2.0-11.module+el8.6.0+847+b490afdd from 
appstream (RockyLinux slips to 8.6).


It looks like ovirt-4.4-centos-advanced-virtualization repo does not 
have qemu-kvm at all


yum list | grep ovirt-4.4-centos-advanced-virtualization

Shoud I try to install new host from manager and hope or should I do 
some magic before?


Thanks in advance,

Jiri


___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/NDESY5DZQZWUYHPY2UWUKK7PXZ27RY6G/ 




___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/TKBKNPQYLPANROBVECYC6H4B76TT7MLE/

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/L4JMKYQAICOFFLFWEK6OI3AFRUAG6ZK5/


[ovirt-users] Re: install new ovirt 4.4 host

2022-06-02 Thread Jiří Sléžka

Hi,

sorry for noise, it looks like packages from (rocky's) appstream hides 
qemu-kvm* from ovirt-4.4-centos-advanced-virtualization. When I disable 
appstream repo I can see them...


yum info qemu-kvm --disablerepo appstream

Will try later today but probably I should disable appstream at all, right?

Cheers,

Jiri

On 6/2/22 16:05, Jiří Sléžka wrote:

Hi,

I am around to install new ovirt 4.4 host on RockyLinux8 based server. I 
just install


yum install http://mirror.slu.cz/ovirt/yum-repo/ovirt-release44.rpm

but when I search for (for example) qemu-kvm package

yum info qemu-kvm

I can see only qemu-kvm-6.2.0-11.module+el8.6.0+847+b490afdd from 
appstream (RockyLinux slips to 8.6).


It looks like ovirt-4.4-centos-advanced-virtualization repo does not 
have qemu-kvm at all


yum list | grep ovirt-4.4-centos-advanced-virtualization

Shoud I try to install new host from manager and hope or should I do 
some magic before?


Thanks in advance,

Jiri


___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/NDESY5DZQZWUYHPY2UWUKK7PXZ27RY6G/




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/TKBKNPQYLPANROBVECYC6H4B76TT7MLE/


[ovirt-users] install new ovirt 4.4 host

2022-06-02 Thread Jiří Sléžka

Hi,

I am around to install new ovirt 4.4 host on RockyLinux8 based server. I 
just install


yum install http://mirror.slu.cz/ovirt/yum-repo/ovirt-release44.rpm

but when I search for (for example) qemu-kvm package

yum info qemu-kvm

I can see only qemu-kvm-6.2.0-11.module+el8.6.0+847+b490afdd from 
appstream (RockyLinux slips to 8.6).


It looks like ovirt-4.4-centos-advanced-virtualization repo does not 
have qemu-kvm at all


yum list | grep ovirt-4.4-centos-advanced-virtualization

Shoud I try to install new host from manager and hope or should I do 
some magic before?


Thanks in advance,

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/NDESY5DZQZWUYHPY2UWUKK7PXZ27RY6G/


[ovirt-users] Re: vnc certificate renew

2022-05-05 Thread Jiří Sléžka

On 5/5/22 10:42, si...@justconnect.ie wrote:

Hi Jiri,

I understand the libvirt-vnc part of this thread but can you explain the 
following in more detail please:

"when you update also CA then

cp /etc/pki/vdsm/certs/cacert.pem /etc/pki/vdsm/libvirt-vnc/ca-cert.pem"


sorry, it is probably not necessary.

In my particular case I had expired engine.cer so I have regenerate it 
during engine-setup process. Then I enroll certificates on all hosts. 
After that I mentioned that migrations to some hosts fails. Qemu log shows


2022-05-02T13:55:05.987598Z qemu-kvm: Our own certificate 
/etc/pki/vdsm/libvirt-vnc/server-cert.pem failed validation against 
/etc/pki/vdsm/libvirt-vnc/ca-cert.pem: The certificate hasn't got a 
known issuer


so I copied key, cert and also cacert.pem to libvirt-vnc which solves my 
issue.



When does /etc/pki/vdsm/certs/cacert.pem get updated (checked mine and it's 
2021) if not by the 'Enroll Certificate' action?


I believe cacert could be updated during engine-setup process but I am 
not sure about this. In my case CA was not renewed


openssl x509 -in /etc/pki/ovirt-engine/ca.pem -noout -text

Validity
Not Before: Aug 30 14:45:05 2015 GMT
Not After : Aug 28 14:45:05 2025 GMT

so I have no idea why /etc/pki/vdsm/libvirt-vnc/server-cert.pem cannot 
be validated against /etc/pki/vdsm/libvirt-vnc/ca-cert.pem on host. 
Copying /etc/pki/vdsm/certs/cacert.pem to 
/etc/pki/vdsm/libvirt-vnc/ca-cert.pem solved this issue...


Cheers,

Jiri




Kind Regards

Simon...
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/HVT3KMVESR5ND7S4LMI6PJDVZRUN63QE/




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/A5YBO3JR3DLOIIKKA46XQXX7U46QPFQ4/


[ovirt-users] Re: vnc certificate renew

2022-05-02 Thread Jiří Sléžka

Hi,

Dne 5/2/22 v 17:58 csab...@freemail.hu napsal(a):

Hi,

LAst month a renewed our hosts certificates by the "Enroll certificates" method.
The "/etc/pki/vdsm/libvirt-vnc/server-cert.pem" certificate wasn't renewed on 
my nodes (other certificates were).

How can i renew this certificate too?


on host just copy renewed vdsm key and cert to libvirt-vnc

cp /etc/pki/vdsm/certs/vdsmcert.pem 
/etc/pki/vdsm/libvirt-vnc/server-cert.pem

cp /etc/pki/vdsm/keys/vdsmkey.pem /etc/pki/vdsm/libvirt-vnc/server-key.pem

when you update also CA then

cp /etc/pki/vdsm/certs/cacert.pem /etc/pki/vdsm/libvirt-vnc/ca-cert.pem

Cheers,

Jiri



thanks
csabany
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/L3HRCMX6NMF2TC7ZVF4ED3TNS6KRIXCN/




smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/BHHVL64ZBXRXVNC5RUQJ36HRIUJXFWTT/


[ovirt-users] Re: recovery from expired engine.cer certificate

2022-05-02 Thread Jiří Sléžka

Hi,

will answer myself... but if you have comments or have better solution 
please levae comment


ovirt-engine-setup log logs SELECT statements to test global maintenance 
state. In my case


engine=# SELECT vm_guid, run_on_vds FROM vms  WHERE vm_name = 
'HostedEngine';
   vm_guid|  run_on_vds 


--+--
 96a6b6a7-75a9-472a-9d4f-1502b415470a | 
e24f0dcc-51f3-4d1a-acf5-2833a9dc584a

(1 row)

and

engine=# SELECT vds_id, ha_global_maintenance FROM vds_statistics WHERE 
vds_id = 'e24f0dcc-51f3-4d1a-acf5-2833a9dc584a';

vds_id| ha_global_maintenance
--+---
 e24f0dcc-51f3-4d1a-acf5-2833a9dc584a | f
(1 row)

because I believe global maintenance is really enabled I have updated 
ha_global_maintenance state with


engine=# UPDATE vds_statistics SET ha_global_maintenance = true WHERE 
vds_id = 'e24f0dcc-51f3-4d1a-acf5-2833a9dc584a';

UPDATE 1

after that I run

engine-setup --offline

and choose Renew certificates? (Yes, No) [No]: Yes

after that all hosts becomes up and vms were recovered (except that vms 
on failed and restarted host)


Cheers,

Jiri


On 5/2/22 11:16, Jiří Sléžka wrote:

Hello,

I am stuck in this situation...

It looks like engine certificate (engine.cer) expired few days ago

[root@ovirt ~]# openssl x509 -in /etc/pki/ovirt-engine/certs/engine.cer 
-noout -dates

notBefore=Mar 23 21:34:19 2021 GMT
notAfter=Apr 26 21:34:19 2022 GMT

CA and other certs are still valid

Yesterday I had one host outage and HE restarted on other host. But it 
cannot communicate with all hosts due to certificate expiration


lnav /var/log/ovirt-engine/engine.log

...
2022-05-02 11:02:29,127+02 ERROR 
[org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] 
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-43) 
[] Unable to RefreshCapabilities: VDSNetworkException: 
VDSGenericException: VDSNetworkException: Received fatal alert: 
certificate_expired

...

There are vms still running on hosts.

Is there way how to (manualy?) renew engine cert and recover from this 
situation?


I have tried run engine-setup (and select renew certificate during install)

[root@ovirt ~]# engine-setup --offline

but it fails with

[ ERROR ] It seems that you are running your engine inside of the 
hosted-engine VM and are not in "Global Maintenance" mode.
  In that case you should put the system into the "Global 
Maintenance" mode before running engine-setup, or the hosted-engine HA 
agent might kill the machine, which might corrupt your data.


[ ERROR ] Failed to execute stage 'Setup validation': Hosted Engine 
setup detected, but Global Maintenance is not set.


But global maintenance is enabled on host...

[root@ovirt06 ~]# hosted-engine --vm-status

!! Cluster is in GLOBAL MAINTENANCE mode !!

--== Host ovirt05.net.slu.cz (id: 1) status ==--

Host ID    : 1
Host timestamp : 38627
Score  : 3400
Engine status  : {"vm": "down_unexpected", "health": 
"bad", "detail": "Down", "reason": "bad vm status"}

Hostname   : ovirt05.net.slu.cz
Local maintenance  : False
stopped    : False
crc32  : b719664d
conf_on_shared_storage : True
local_conf_timestamp   : 38627
Status up-to-date  : True
Extra metadata (valid at timestamp):
 metadata_parse_version=1
 metadata_feature_version=1
 timestamp=38627 (Mon May  2 10:55:43 2022)
 host-id=1
 score=3400
 vm_conf_refresh_time=38627 (Mon May  2 10:55:43 2022)
 conf_on_shared_storage=True
 maintenance=False
 state=EngineDown
 stopped=False

--== Host ovirt06.net.slu.cz (id: 2) status ==--

Host ID    : 2
Host timestamp : 8858161
Score  : 3400
Engine status  : {"vm": "up", "health": "good", 
"detail": "Up"}

Hostname   : ovirt06.net.slu.cz
Local maintenance  : False
stopped    : False
crc32  : 414a980b
conf_on_shared_storage : True
local_conf_timestamp   : 8858161
Status up-to-date  : True
Extra metadata (valid at timestamp):
 metadata_parse_version=1
 metadata_feature_version=1
 timestamp=8858161 (Mon May  2 10:55:48 2022)
 host-id=2
 score=3400
 vm_conf_refresh_time=8858161 (Mon May  2 10:55:48 2022)
 conf_on_shared_storage=True
 maintenance=False
 state=GlobalMai

[ovirt-users] recovery from expired engine.cer certificate

2022-05-02 Thread Jiří Sléžka

Hello,

I am stuck in this situation...

It looks like engine certificate (engine.cer) expired few days ago

[root@ovirt ~]# openssl x509 -in /etc/pki/ovirt-engine/certs/engine.cer 
-noout -dates

notBefore=Mar 23 21:34:19 2021 GMT
notAfter=Apr 26 21:34:19 2022 GMT

CA and other certs are still valid

Yesterday I had one host outage and HE restarted on other host. But it 
cannot communicate with all hosts due to certificate expiration


lnav /var/log/ovirt-engine/engine.log

...
2022-05-02 11:02:29,127+02 ERROR 
[org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] 
(EE-ManagedScheduledExecutorService-engineScheduledThreadPool-Thread-43) 
[] Unable to RefreshCapabilities: VDSNetworkException: 
VDSGenericException: VDSNetworkException: Received fatal alert: 
certificate_expired

...

There are vms still running on hosts.

Is there way how to (manualy?) renew engine cert and recover from this 
situation?


I have tried run engine-setup (and select renew certificate during install)

[root@ovirt ~]# engine-setup --offline

but it fails with

[ ERROR ] It seems that you are running your engine inside of the 
hosted-engine VM and are not in "Global Maintenance" mode.
 In that case you should put the system into the "Global 
Maintenance" mode before running engine-setup, or the hosted-engine HA 
agent might kill the machine, which might corrupt your data.


[ ERROR ] Failed to execute stage 'Setup validation': Hosted Engine 
setup detected, but Global Maintenance is not set.


But global maintenance is enabled on host...

[root@ovirt06 ~]# hosted-engine --vm-status

!! Cluster is in GLOBAL MAINTENANCE mode !!

--== Host ovirt05.net.slu.cz (id: 1) status ==--

Host ID: 1
Host timestamp : 38627
Score  : 3400
Engine status  : {"vm": "down_unexpected", "health": 
"bad", "detail": "Down", "reason": "bad vm status"}

Hostname   : ovirt05.net.slu.cz
Local maintenance  : False
stopped: False
crc32  : b719664d
conf_on_shared_storage : True
local_conf_timestamp   : 38627
Status up-to-date  : True
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=38627 (Mon May  2 10:55:43 2022)
host-id=1
score=3400
vm_conf_refresh_time=38627 (Mon May  2 10:55:43 2022)
conf_on_shared_storage=True
maintenance=False
state=EngineDown
stopped=False

--== Host ovirt06.net.slu.cz (id: 2) status ==--

Host ID: 2
Host timestamp : 8858161
Score  : 3400
Engine status  : {"vm": "up", "health": "good", 
"detail": "Up"}

Hostname   : ovirt06.net.slu.cz
Local maintenance  : False
stopped: False
crc32  : 414a980b
conf_on_shared_storage : True
local_conf_timestamp   : 8858161
Status up-to-date  : True
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=8858161 (Mon May  2 10:55:48 2022)
host-id=2
score=3400
vm_conf_refresh_time=8858161 (Mon May  2 10:55:48 2022)
conf_on_shared_storage=True
maintenance=False
state=GlobalMaintenance
stopped=False

!! Cluster is in GLOBAL MAINTENANCE mode !!

relevant lines from ovirt-engine-setup log are

...
2022-05-02 11:08:02,194+0200 DEBUG 
otopi.ovirt_engine_setup.engine_common.database database.execute:239 
Creating own connection
2022-05-02 11:08:02,233+0200 DEBUG 
otopi.ovirt_engine_setup.engine_common.database database.execute:284 
Result: [{'vm_guid': '96a6b6a7-75a9-472a-9d4f-1502b415470a', 
'run_on_vds': 'e24f0dcc-51f3-4d1a-acf5-2833a9dc584a'}]
2022-05-02 11:08:02,234+0200 DEBUG 
otopi.ovirt_engine_setup.engine_common.database database.execute:234 
Database: 'None', Statement: '

SELECT vds_id, ha_global_maintenance
FROM vds_statistics
WHERE vds_id = %(VdsId)s;
', args: {'VdsId': 
'e24f0dcc-51f3-4d1a-acf5-2833a9dc584a'}
2022-05-02 11:08:02,234+0200 DEBUG 
otopi.ovirt_engine_setup.engine_common.database database.execute:239 
Creating own connection
2022-05-02 11:08:02,250+0200 DEBUG 
otopi.ovirt_engine_setup.engine_common.database database.execute:284 
Result: [{'vds_id': 'e24f0dcc-51f3-4d1a-acf5-2833a9dc584a', 
'ha_global_maintenance': False}]
2022-05-02 11:08:02,250+0200 ERROR 
otopi.plugins.ovirt_engine_common.ovirt_engine.system.he 
he._validate:114 It seems that you are running your engine inside of the 
hosted-engine VM and are not in "Global Maintenance" mode.
In that case you should put the 

[ovirt-users] Re: Gluster issue with brick going down

2022-03-22 Thread Jiří Sléžka

Hi,

On 3/21/22 14:12, Chris Adams wrote:

I have a hyper-converged cluster running oVirt 4.4.10 and Gluster 8.6.
Periodically, one brick of one volume will drop out, but it's seemingly
random as to which volume and brick is affected.  All I see in the brick
log is:

[2022-03-19 13:27:36.360727] W [MSGID: 113075] 
[posix-helpers.c:2135:posix_fs_health_check] 0-vmstore-posix: 
aio_read_cmp_buf() on /gluster_bricks/vmstore/vmstore/.glusterfs/health_check 
returned ret is -1 error is Structure needs cleaning
[2022-03-19 13:27:36.361160] M [MSGID: 113075] 
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-vmstore-posix: 
health-check failed, going down
[2022-03-19 13:27:36.361395] M [MSGID: 113075] 
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-vmstore-posix: still 
alive! -> SIGTERM

Searching around, I see references to similar issues, but no real
solutions.  I see a suggestion that changing the health-check-interval
from 10 to 30 seconds helps, but it looks like 30 seconds is the default
with this version of Gluster (and I don't see it explicitly set for any
of my volumes).

While "Structure needs cleaning" appears to be an XFS filesystem error,
I don't see any XFS errors from the kernel.

This is a low I/O cluster - the storage network is on two 10 gig
switches with a two-port LAG to each server, but typically is only
seeing a few tens of megabits per second.


I experience the same behavior. Workaround could be disabling 
health-check like


gluster volume set  storage.health-check-interval 0

In my case it helped with bricks randomly going offline.

There is something else broken in my hci cluster because I have also 
problem with sanlock which time to time cannot renew lock and wdmd 
reboots one or two hosts. I still cannot find the root of this behavior 
but it is probably hw related.


Cheers,

Jiri







smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/G4KBSP3Y7WEBTB6XV7L2P2RJPMD2E5ZA/


[ovirt-users] Re: Random reboots

2022-02-17 Thread Jiří Sléžka

On 2/16/22 23:37, Nir Soffer wrote:

On Wed, Feb 16, 2022 at 9:18 PM Nir Soffer  wrote:


On Wed, Feb 16, 2022 at 5:12 PM Nir Soffer  wrote:


On Wed, Feb 16, 2022 at 10:10 AM Pablo Olivera  wrote:


Hi community,

We're dealing with an issue as we occasionally have random reboots on
any of our hosts.
We're using ovirt 4.4.3 in production with about 60 VM distributed over
5 hosts. We've a virtualized engine and a DRBD storage mounted by NFS.
The infrastructure is interconnected by a Cisco 9000 switch.
The last random reboot was yesterday February 14th at 03:03 PM (in the
log it appears as: 15:03 due to our time configuration) of the host:
'nodo1'.
At the moment of the reboot we detected in the log of the switch a
link-down in the port where the host is connected.
I attach log of the engine and host 'nodo1' in case you can help us to
find the cause of these random reboots.



According to messages:

1. Sanlock could not renew the lease for 80 seconds:

Feb 14 15:03:06 nodo1 sanlock[2017]: 2022-02-14 15:03:06 1655257
[2017]: s1 check_our_lease failed 80


2. In this case sanlock must terminate the processes holding a lease
on the that storage - I guess that pid 6398 is vdsm.

Feb 14 15:03:06 nodo1 sanlock[2017]: 2022-02-14 15:03:06 1655257
[2017]: s1 kill 6398 sig 15 count 1
Feb 14 15:03:06 nodo1 sanlock[2017]: 2022-02-14 15:03:06 1655258
[2017]: s1 kill 6398 sig 15 count 2


pid 6398 is not vdsm:

Feb 14 15:02:51 nodo1 vdsm[4338]

The fact that we see "sig 15" means sanlock is trying to send SIGTERM.
If pid 6398 is a VM (hosted engine vm?) we would expect to see:


[2017]: s1 kill 6398 sig 100 count 1


Exactly once - which means run the killpath program registered by libvirt,
which will terminate the vm.


I reproduce this issue locally - we never use killpath program, because we
don't configure libvirt on_lockfailure in the domain xml.

So we get the default behavior, which is sanlock terminating the vm.



So my guess is that this is not a VM, so the only other option is hosted
engine broker, using a lease on the whiteboard.


...
Feb 14 15:03:36 nodo1 sanlock[2017]: 2022-02-14 15:03:36 1655288
[2017]: s1 kill 6398 sig 15 count 32

3. Terminating pid 6398 stopped here, and we see:

Feb 14 15:03:36 nodo1 wdmd[2033]: test failed rem 19 now 1655288 ping
1655237 close 1655247 renewal 1655177 expire 1655257 client 2017
sanlock_a5c35d19-4c34-4571-ac77-1b10de484426:1


According to David, this means we have 19 more attempts to kill the process
holding the lease.



4. So it looks like wdmd rebooted the host.

Feb 14 15:08:09 nodo1 kernel: Linux version
4.18.0-193.28.1.el8_2.x86_64 (mockbu...@kbuilder.bsys.centos.org) (gcc
version 8.3.1 20191121 (Red Hat 8.3.1-5) (GCC)) #1 SMP Thu Oct 22
00:20:22 UTC 2020


This is strange, since sanlock should try to kill pid 6398 40 times,
and then switch
to SIGKILL. The watchdog should not reboot the host before sanlock
finish the attempt to kill the processes.

David, do you think this is expected? do we have any issue in sanlock?


I discussed it with David (sanlock author). What we see here may be truncated
logs when a host is rebooted by the watchdog. The last time logs were synced
to storage was probably Feb 14 15:03:36. Any message written after that was
lost in the host page cache.



It is possible that sanlock will not be able to terminate a process if
the process is blocked on inaccessible storage. This seems to be the
case here.

In vdsm log we see that storage is indeed inaccessible:

2022-02-14 15:03:03,149+0100 WARN  (check/loop) [storage.check]
Checker 
'/rhev/data-center/mnt/newstoragedrbd.andromeda.com:_var_nfsshare_data/a5c35d19-4c34-4571-ac77-1b10de484426/dom_md/metadata'
is blocked for 60.00 seconds (check:282)

But we don't see any termination request - so this host is not the SPM.

I guess this host was running the hosted engine vm, which uses a storage lease.
If you lose access to storage, sanlcok will kill the hosted engine vm,
so the system
can start it elsewhere. If the hosted engine vm is stuck on storage, sanlock
cannot kill it and it will reboot the host.


Pablo, can you locate the process with pid 6398?

Looking in hosted engine logs and other logs on the system may reveal what
was this process. When we find the process, we can check the source to
understand
why it was not terminating - likely blocked on the inaccessible NFS server.


The process is most likely a VM - I reproduced the exact scenario locally.

You can file a vdsm bug for this. The system behave as designed, but the design
is problematic; one VM with a lease stuck on NFS server can cause the
entire host
to be rebooted.

>

With block storage we don't have this issue, since we have exact
control over multipath
timeouts. Multipath will fail I/O in 80 seconds, after sanlock failed
to renew the lease.
When I/O fails, the process block on storage will unblocked an will be
terminated
by the kernel.


I observe this or similar behavior also in my glusterfs HCI cluster (but 
not on 

[ovirt-users] Re: how to debug sanlock issue

2022-01-16 Thread Jiří Sléžka

Hi,

thanks for reply

Dne 1/13/22 v 16:04 Vojtech Juranek napsal(a):

Hi,


Hello,

I have 3 node HCI cluster with glusterfs. oVirt 4.4.9.5-1. In last 2
weeks I experience 2 outages where HE and all/some vms were restarted.
While digging in logs I can see that sanlock cannot renew leases and it
leads to killing vms as is very good described in [1].

It looks to me like some hw issue with one of the hosts but cannot find
which one.


when you check the sanlock logs (/var/log/sanlock.log) around the time of
outage, you should be able to see which of the host failed to renew its
sanlock leases. It could be on some of them (could be some issue with these
host(s)) or on all of them (in this case is more likely a network issue or
storage issue).


it looks like all of hosts had renewal issues, first was ovirt-hci02

Jan 13 08:27:25 ovirt-hci02 sanlock[1263]: 2022-01-13 08:27:25 1416706 
[341378]: s7 delta_renew read timeout 10 sec offset 0 
/rhev/data-center/mnt/glusterSD/10.0.4.11:_vms/6de5ae6d-c7cc-4292-bdbf-10495a38837b/dom_md/ids
Jan 13 08:27:25 ovirt-hci02 sanlock[1263]: 2022-01-13 08:27:25 1416706 
[341378]: s7 renewal error -202 delta_length 10 last_success 1416676
Jan 13 08:27:31 ovirt-hci01 sanlock[1375]: 2022-01-13 08:27:31 1420170 
[766769]: s7 delta_renew long write time 20 sec
Jan 13 08:27:32 ovirt-hci03 sanlock[1457]: 2022-01-13 08:27:32 1412428 
[761241]: s6 delta_renew long write time 11 sec
Jan 13 08:27:32 ovirt-hci01 sanlock[1375]: 2022-01-13 08:27:32 1420171 
[764099]: s6 delta_renew long write time 18 sec
Jan 13 08:27:42 ovirt-hci02 sanlock[1263]: 2022-01-13 08:27:42 1416723 
[341376]: s6 delta_renew long write time 21 sec
Jan 13 08:27:42 ovirt-hci02 sanlock[1263]: 2022-01-13 08:27:42 1416723 
[341376]: s6 renewed 1416702 delta_length 22 too long
Jan 13 08:27:42 ovirt-hci01 sanlock[1375]: 2022-01-13 08:27:42 1420181 
[766769]: s7 delta_renew long write time 11 sec
Jan 13 08:27:44 ovirt-hci03 sanlock[1457]: 2022-01-13 08:27:44 1412440 
[761233]: s5 delta_renew long write time 30 sec
Jan 13 08:27:44 ovirt-hci03 sanlock[1457]: 2022-01-13 08:27:44 1412440 
[761233]: s5 renewed 1412410 delta_length 30 too long

...

good point is that it could be a network issue but I have no proof of 
it. Switches (10GE) were not restarted, no errors on interfaces, no log 
entries, no excessive traffic on any interface...


also I am confused with line "...read timeout 10 sec offset 0 
/rhev/data-center/mnt/glusterSD/10.0.4.11:_vms/6de5ae6d-c7cc-4292-bdbf-10495a38837b/dom_md/ids". 
If I understand it correctly it logs just mount info, not exact ip of 
gluster host from which cannot read host ovirt-hci02 lock data, right?



Also, if you want only find out which hosts wasn't able to renew the leases,
it's even more easy - it was the host whose VMs were killed. If the host runs
HA VMs and host is not able to renew its leases, sanlock will kill VMs running
on this host.


vms were killed on ovirt-hci01 and ovirt-hci02 (I believe ovirt-hci01 
hosted also HE in that time). It looks like vms were first killed (or 
better started to kill) on ovirt-hci02 at 8:28:15, then on ovirt-hci01 
at 8:28:51.


Cheers,

Jiri



Vojta
  

for example today's outage restarted vms on hosts 1 and 2 but not 3.
Sanlock logs

there are these lines in /var/log/messages on host 2 (ovirt-hci02)

Jan 13 08:27:25 ovirt-hci02 sanlock[1263]: 2022-01-13 08:27:25 1416706
[341378]: s7 delta_renew read timeout 10 sec offset 0
/rhev/data-center/mnt/glusterSD/10.0.4.11:_vms/6de5ae6d-c7cc-4292-bdbf-10495
a38837b/dom_md/ids Jan 13 08:28:59 ovirt-hci02 sanlock[1263]: 2022-01-13
08:28:59 1416800 [341257]: write_sectors delta_leader offset 1024 rv -202
/rhev/data-center/mnt/glusterSD/10.0.4.11:_engine/816a3d0b-2e10-4900-b3cb-4a
9b5cd0dd5d/dom_md/ids Jan 13 08:29:27 ovirt-hci02 sanlock[1263]: 2022-01-13
08:29:27 1416828 [4189968]: write_sectors delta_leader offset 1024 rv -202
/rhev/data-center/mnt/glusterSD/10.0.4.11:_engine/816a3d0b-2e10-4900-b3cb-4a
9b5cd0dd5d/dom_md/ids

but not on hosts 1 and 3. Could it indicate that there could be storage
related problem on host 1?

could you please suggest further/better debugging approach?

Thanx a lot,

Jiri

[1] https://www.ovirt.org/develop/developer-guide/vdsm/sanlock.html



___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/UJYSBUBM3CGP762Q4WRZSF67KJNGGIVC/





smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List 

[ovirt-users] how to debug sanlock issue

2022-01-13 Thread Jiří Sléžka

Hello,

I have 3 node HCI cluster with glusterfs. oVirt 4.4.9.5-1. In last 2 
weeks I experience 2 outages where HE and all/some vms were restarted. 
While digging in logs I can see that sanlock cannot renew leases and it 
leads to killing vms as is very good described in [1].


It looks to me like some hw issue with one of the hosts but cannot find 
which one.


for example today's outage restarted vms on hosts 1 and 2 but not 3. 
Sanlock logs


there are these lines in /var/log/messages on host 2 (ovirt-hci02)

Jan 13 08:27:25 ovirt-hci02 sanlock[1263]: 2022-01-13 08:27:25 1416706 
[341378]: s7 delta_renew read timeout 10 sec offset 0 
/rhev/data-center/mnt/glusterSD/10.0.4.11:_vms/6de5ae6d-c7cc-4292-bdbf-10495a38837b/dom_md/ids
Jan 13 08:28:59 ovirt-hci02 sanlock[1263]: 2022-01-13 08:28:59 1416800 
[341257]: write_sectors delta_leader offset 1024 rv -202 
/rhev/data-center/mnt/glusterSD/10.0.4.11:_engine/816a3d0b-2e10-4900-b3cb-4a9b5cd0dd5d/dom_md/ids
Jan 13 08:29:27 ovirt-hci02 sanlock[1263]: 2022-01-13 08:29:27 1416828 
[4189968]: write_sectors delta_leader offset 1024 rv -202 
/rhev/data-center/mnt/glusterSD/10.0.4.11:_engine/816a3d0b-2e10-4900-b3cb-4a9b5cd0dd5d/dom_md/ids


but not on hosts 1 and 3. Could it indicate that there could be storage 
related problem on host 1?


could you please suggest further/better debugging approach?

Thanx a lot,

Jiri

[1] https://www.ovirt.org/develop/developer-guide/vdsm/sanlock.html



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/WGDE6NC6RE7NIPHXZOQCHSARJU6F55V2/


[ovirt-users] Re: Installin ovirt and ovirt host on orcale 8.5 linux issues

2022-01-11 Thread Jiří Sléžka

On 1/11/22 13:57, Yedidyah Bar David wrote:
On Tue, Jan 11, 2022 at 2:50 PM Nazeem Durgahee 
> wrote:


Hi,

__ __

We made it work.

__ __

The workaround is to create a symlink between /usr/bin/python3 to
/usr/bin/python2 and it worked.

__ __

We created an empty /usr/bin/python2. Now we will try same on the
other two hosts that we need to install.


Thanks for the update.

I now pushed this patch:

https://gerrit.ovirt.org/c/ovirt-engine/+/118229 



Can you please verify it on Oracle Linux? I'd like to get volunteers
to verify also on AlmaLinux and Rocky Linux, to make sure we do not
break these.


setup module on Rocky Linux 8 host returns

...
"ansible_distribution": "Rocky",
"ansible_distribution_file_parsed": true,
"ansible_distribution_file_path": "/etc/redhat-release",
"ansible_distribution_file_variety": "RedHat",
"ansible_distribution_major_version": "8",
"ansible_distribution_release": "Green Obsidian",
"ansible_distribution_version": "8.5",
...
"ansible_os_family": "RedHat",
...

so patch looks good to me

Cheers, Jiri




Thanks and best regards,
--
Didi

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/2VW3KR3SX6AON3QTTCERI6CRZS6ZQVH7/





smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/IP3XBN2W6WVIKQTO4E6CNXN3HTZ4E47B/


[ovirt-users] Re: Suggested upgrading path from CentOS based 4.4.8 to 4.4.9

2021-12-28 Thread Jiří Sléžka

Hi,

just for info, recently I have migrated two ovirt 4.4.8 installations to 
Rocky Linux 8.5. Everything went smoothly and it was as simple as


curl 
https://raw.githubusercontent.com/rocky-linux/rocky-tools/main/migrate2rocky/migrate2rocky.sh 
-o migrate2rocky.sh

chmod u+x migrate2rocky.sh
./migrate2rocky.sh -r

on hosts and after that the same on HE (in my case also with upgrade to 
4.4.9)


One of these systems was HCI and everything looks good.

Happy new year to all of oVirt (and Gluster, CentOS, Rocky,...) teams ;-)

Cheers, Jiri


On 11/9/21 09:46, Jiří Sléžka wrote:

Hi,

On 11/8/21 21:38, Christoph Timm wrote:

Hi Gianluca,

I have a self hosted platform with plain CentOS 8.4 host. Also my 
engine is currently running CentOS 8.4.

I updated my whole system from 4.4.7 to 4.4.9.

As 4.4.9 does not support CentOS hosts I had to migrate them to CentOS 
Stream 8 as well.
Due to dependency issues I have put the host into maintenance mode and 
did the rest through CLI.


# dnf swap centos-linux-repos centos-stream-repos
# dnf update ovirt-release44
# dnf distro-sync
# reboot
Done


Thanks for this. I think this should be documented better because it 
confuses me a bit.


I run small HCI cluster on CentOS8.4 hosts and would like to know valid 
migration paths. Sure, one is CentOS Strem. Is onother one RHEL 8.4? It 
looks like not (from release notes for 4.4.9: "This release is available 
now for Red Hat Enterprise Linux 8.5 Beta (or similar) and CentOS 
Stream"). Did anybody even try migrate CentOS hosts to Rocky Linux?


In this case I prefer to migrate to stabe RHEL clone distro and stay a 
bit behind oVirt releases.


What is your opinion, does it make sense?

Thanks in advance,

Jiri



I did not change the engine OS aa CentOS 8.4 is still supported here.

Also I did the update in multiple steps over a couple of days which 
did not cause any issue on my side.


Hope this helps.

Best regards
Christoph

Am 08.11.21 um 13:41 schrieb Gianluca Cecchi:
I have a lab with an environment based on 4.4.8.6-1, with 3 CentOS 
Linux 8.4 hosts and a CentOS 8.4 external engine system (that is a VM 
on vSphere, so that I can leverage a snapshot methodology for the 
process...).
I would like to pass to 4.4.9 and retain a full plain OS on hosts for 
the moment, without going through oVirt nodes, but standing the repo 
problems and CentOS 8.x going through EOL this is what I'm planning 
to do:


1. stop engine service on engine system

2. convert engine to CentOS Stream
This step needs some confirmation.
Could you provide an official link about the process?
I'm not able to find it again. Is it a problem of mine or all (CentOS 
website, RHEL website) seem to point only to conversion from CentOS 
Linux to RHEL??
Apart external websites provided workflows, I was only able to find a 
mid January youtube video, when CentOS was based on 8.3, with these 
steps:

yum install centos-release-stream
yum swap centos-{linux,stream}-repos
yum repolist
yum distro-sync
reboot
The video link is here:
https://www.youtube.com/watch?v=Ba2ytp_8x7s

No mention at
https://www.redhat.com/en/blog/faq-centos-stream-updates

And on CentOS page I only found this:
https://centos.org/distro-faq/
with Q7 containing only the two instructions:
dnf swap centos-linux-repos centos-stream-repos
dnf distro-sync

What to use safely?
Is it possible to include some sort of documentation or links on 
oVirt page, to migrate from CentOS Linux to CentOS Stream for oVirt 
upgrade purposes?


3. After reboot implied, I think, in step 2., use the usual steps to 
update engine to 4.4.9


4. update the first out of three hosts from CentOS Linux to CentOS 
Stream and to 4.4.9.


4.a follow the same approach of engine (when defined) and pass it to 
Stream retaining the 4.4.8.

4.b upgrade from the web admin gui to 4.4.9

5. Do the same for second host and third hosts

Any hints, comments, limitations in having mixed 4.4.8 and 4.4.9 
hosts for a while and such?


Thanks,
Gianluca



___
Users mailing list --users@ovirt.org
To unsubscribe send an email tousers-le...@ovirt.org
Privacy Statement:https://www.ovirt.org/privacy-policy.html
oVirt Code of 
Conduct:https://www.ovirt.org/community/about/community-guidelines/
List 
Archives:https://lists.ovirt.org/archives/list/users@ovirt.org/message/L3CRTNR2IUGNRZNVQMROQABHHBKMEYPP/ 




___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RGF6HG7LSJOYTV3AVYK4VP7HPQ4UAK2A/ 






___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-p

[ovirt-users] Re: Suggested upgrading path from CentOS based 4.4.8 to 4.4.9

2021-11-09 Thread Jiří Sléžka

Hi,

On 11/8/21 21:38, Christoph Timm wrote:

Hi Gianluca,

I have a self hosted platform with plain CentOS 8.4 host. Also my engine 
is currently running CentOS 8.4.

I updated my whole system from 4.4.7 to 4.4.9.

As 4.4.9 does not support CentOS hosts I had to migrate them to CentOS 
Stream 8 as well.
Due to dependency issues I have put the host into maintenance mode and 
did the rest through CLI.


# dnf swap centos-linux-repos centos-stream-repos
# dnf update ovirt-release44
# dnf distro-sync
# reboot
Done


Thanks for this. I think this should be documented better because it 
confuses me a bit.


I run small HCI cluster on CentOS8.4 hosts and would like to know valid 
migration paths. Sure, one is CentOS Strem. Is onother one RHEL 8.4? It 
looks like not (from release notes for 4.4.9: "This release is available 
now for Red Hat Enterprise Linux 8.5 Beta (or similar) and CentOS 
Stream"). Did anybody even try migrate CentOS hosts to Rocky Linux?


In this case I prefer to migrate to stabe RHEL clone distro and stay a 
bit behind oVirt releases.


What is your opinion, does it make sense?

Thanks in advance,

Jiri



I did not change the engine OS aa CentOS 8.4 is still supported here.

Also I did the update in multiple steps over a couple of days which did 
not cause any issue on my side.


Hope this helps.

Best regards
Christoph

Am 08.11.21 um 13:41 schrieb Gianluca Cecchi:
I have a lab with an environment based on 4.4.8.6-1, with 3 CentOS 
Linux 8.4 hosts and a CentOS 8.4 external engine system (that is a VM 
on vSphere, so that I can leverage a snapshot methodology for the 
process...).
I would like to pass to 4.4.9 and retain a full plain OS on hosts for 
the moment, without going through oVirt nodes, but standing the repo 
problems and CentOS 8.x going through EOL this is what I'm planning to do:


1. stop engine service on engine system

2. convert engine to CentOS Stream
This step needs some confirmation.
Could you provide an official link about the process?
I'm not able to find it again. Is it a problem of mine or all (CentOS 
website, RHEL website) seem to point only to conversion from CentOS 
Linux to RHEL??
Apart external websites provided workflows, I was only able to find a 
mid January youtube video, when CentOS was based on 8.3, with these steps:

yum install centos-release-stream
yum swap centos-{linux,stream}-repos
yum repolist
yum distro-sync
reboot
The video link is here:
https://www.youtube.com/watch?v=Ba2ytp_8x7s

No mention at
https://www.redhat.com/en/blog/faq-centos-stream-updates

And on CentOS page I only found this:
https://centos.org/distro-faq/
with Q7 containing only the two instructions:
dnf swap centos-linux-repos centos-stream-repos
dnf distro-sync

What to use safely?
Is it possible to include some sort of documentation or links on oVirt 
page, to migrate from CentOS Linux to CentOS Stream for oVirt upgrade 
purposes?


3. After reboot implied, I think, in step 2., use the usual steps to 
update engine to 4.4.9


4. update the first out of three hosts from CentOS Linux to CentOS 
Stream and to 4.4.9.


4.a follow the same approach of engine (when defined) and pass it to 
Stream retaining the 4.4.8.

4.b upgrade from the web admin gui to 4.4.9

5. Do the same for second host and third hosts

Any hints, comments, limitations in having mixed 4.4.8 and 4.4.9 hosts 
for a while and such?


Thanks,
Gianluca



___
Users mailing list --users@ovirt.org
To unsubscribe send an email tousers-le...@ovirt.org
Privacy Statement:https://www.ovirt.org/privacy-policy.html
oVirt Code of 
Conduct:https://www.ovirt.org/community/about/community-guidelines/
List 
Archives:https://lists.ovirt.org/archives/list/users@ovirt.org/message/L3CRTNR2IUGNRZNVQMROQABHHBKMEYPP/



___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RGF6HG7LSJOYTV3AVYK4VP7HPQ4UAK2A/





smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/V6CZM33TTWB6ZFML4JN67MG3KQ455FNB/


[ovirt-users] Re: GlusterFS Monitoring/Alerting

2021-09-07 Thread Jiří Sléžka

Hi,

On 9/7/21 1:05 PM, si...@justconnect.ie wrote:

Hi All,

Does anyone have recommendations for GlusterFS monitoring/alerting software and 
or plugins.


I am using Zabbix and this simple plugin

https://github.com/Lelik13a/Zabbix-GluserFS

there are probably more sophisticated solutions but this one serves me well

Cheers,

Jiri



Kind regards

Simon...
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/M6AKHWOD7GLBHJRSCWNZRMM7OOXMKFOY/






smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/VX5OS3D2LVO5NCKL2UQBEQ523IZVM7BP/


[ovirt-users] mount options change

2021-09-01 Thread Jiří Sléžka

Hi,

is there any way to change mount options on active storage?

I started using my gluster volume in replica 3, arbiter 1 configuration 
with 10.0.4.11:/vms mount point and mount options


backup-volfile-servers=10.0.4.13

I would like to add also 10.0.4.12 as backup volfile as I am on full 
replica 3 now.


Yes, I suppose I can shut down all vms, switch domain to maintenance and 
then change mount options but is there any other way? For example change 
it directly in db and then restart all hosts one by one (while vms are 
running)?


Cheers,

Jiri




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/ZQZEZDWO362VGJB7RCVNE2W3GNUU7OZA/


[ovirt-users] Re: glusterfs health-check failed, (brick) going down

2021-07-11 Thread Jiří Sléžka

Hi Jayme,

On 7/8/21 12:54 PM, Jayme wrote:
I have observed this behaviour recently and in the past on 4.3 and 4.4, 
and in my case it’s almost always following an ovirt upgrade. After 
upgrade (especially upgrades involving glusterfs) I’d have bricks 
randomly go down like your describing for about a week or so after 
upgrade and I’d have to manually start them. At some point it just 
corrects itself and is stable again. I really have no idea why it occurs 
and what’s happening that eventually stops it from happening.


well, I agree that this issue probably follows oVirt upgrade. Until now 
no brick has failed but 4.4.7 is out and now I'm hesitant if I want to 
upgrade ;-) Of course I will, but probably a bit later.


In the gluster list, there is at least one other user who observe this 
behavior, unfortunately there is no known fix for this :-(


Cheers,

Jiri



On Wed, Jul 7, 2021 at 4:10 PM Jiří Sléžka <mailto:jiri.sle...@slu.cz>> wrote:


Hello,

I have 3 node HCI cluster with oVirt 4.4.6 and CentOS8.

For time to time (I belive) random brick on random host goes down
because health-check. It looks like

[root@ovirt-hci02 ~]# grep "posix_health_check"
/var/log/glusterfs/bricks/*
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07
07:13:37.408184] M [MSGID: 113075]
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-vms-posix:
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07
07:13:37.408407] M [MSGID: 113075]
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-vms-posix:
still
alive! -> SIGTERM
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07
16:11:14.518971] M [MSGID: 113075]
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-vms-posix:
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07
16:11:14.519200] M [MSGID: 113075]
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-vms-posix:
still
alive! -> SIGTERM

on other host

[root@ovirt-hci01 ~]# grep "posix_health_check"
/var/log/glusterfs/bricks/*
/var/log/glusterfs/bricks/gluster_bricks-engine-engine.log:[2021-07-05
13:15:51.983327] M [MSGID: 113075]
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-engine-posix:
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-engine-engine.log:[2021-07-05
13:15:51.983728] M [MSGID: 113075]
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-engine-posix:
still alive! -> SIGTERM
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-05
01:53:35.769129] M [MSGID: 113075]
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-vms-posix:
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-05
01:53:35.769819] M [MSGID: 113075]
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-vms-posix:
still
alive! -> SIGTERM

I cannot link these errors to any storage/fs issue (in dmesg or
/var/log/messages), brick devices looks healthy (smartd).

I can force start brick with

gluster volume start vms|engine force

and after some healing all works fine for few days

Did anybody observe this behavior?

vms volume has this structure (two bricks per host, each is separate
JBOD ssd disk), engine volume has one brick on each host...

gluster volume info vms

Volume Name: vms
Type: Distributed-Replicate
Volume ID: 52032ec6-99d4-4210-8fb8-ffbd7a1e0bf7
Status: Started
Snapshot Count: 0
Number of Bricks: 2 x 3 = 6
Transport-type: tcp
Bricks:
Brick1: 10.0.4.11:/gluster_bricks/vms/vms
Brick2: 10.0.4.13:/gluster_bricks/vms/vms
Brick3: 10.0.4.12:/gluster_bricks/vms/vms
Brick4: 10.0.4.11:/gluster_bricks/vms2/vms2
Brick5: 10.0.4.13:/gluster_bricks/vms2/vms2
Brick6: 10.0.4.12:/gluster_bricks/vms2/vms2
Options Reconfigured:
cluster.granular-entry-heal: enable
performance.stat-prefetch: off
cluster.eager-lock: enable
performance.io-cache: off
performance.read-ahead: off
performance.quick-read: off
user.cifs: off
network.ping-timeout: 30
network.remote-dio: off
performance.strict-o-direct: on
performance.low-prio-threads: 32
features.shard: on
storage.owner-gid: 36
storage.owner-uid: 36
transport.address-family: inet
storage.fips-mode-rchecksum: on
nfs.disable: on
performance.client-io-threads: off

Cheers,

Jiri
___
Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
To unsubscribe send an email to users-le...@ovirt.org
<mailto:users-le...@ovirt.org>
Privacy Statement: https://www.ovirt.org/privacy-policy.html
<https://www.ovirt.org/p

[ovirt-users] glusterfs health-check failed, (brick) going down

2021-07-07 Thread Jiří Sléžka

Hello,

I have 3 node HCI cluster with oVirt 4.4.6 and CentOS8.

For time to time (I belive) random brick on random host goes down 
because health-check. It looks like


[root@ovirt-hci02 ~]# grep "posix_health_check" /var/log/glusterfs/bricks/*
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07 
07:13:37.408184] M [MSGID: 113075] 
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-vms-posix: 
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07 
07:13:37.408407] M [MSGID: 113075] 
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-vms-posix: still 
alive! -> SIGTERM
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07 
16:11:14.518971] M [MSGID: 113075] 
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-vms-posix: 
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-07 
16:11:14.519200] M [MSGID: 113075] 
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-vms-posix: still 
alive! -> SIGTERM


on other host

[root@ovirt-hci01 ~]# grep "posix_health_check" /var/log/glusterfs/bricks/*
/var/log/glusterfs/bricks/gluster_bricks-engine-engine.log:[2021-07-05 
13:15:51.983327] M [MSGID: 113075] 
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-engine-posix: 
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-engine-engine.log:[2021-07-05 
13:15:51.983728] M [MSGID: 113075] 
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-engine-posix: 
still alive! -> SIGTERM
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-05 
01:53:35.769129] M [MSGID: 113075] 
[posix-helpers.c:2214:posix_health_check_thread_proc] 0-vms-posix: 
health-check failed, going down
/var/log/glusterfs/bricks/gluster_bricks-vms2-vms2.log:[2021-07-05 
01:53:35.769819] M [MSGID: 113075] 
[posix-helpers.c:2232:posix_health_check_thread_proc] 0-vms-posix: still 
alive! -> SIGTERM


I cannot link these errors to any storage/fs issue (in dmesg or 
/var/log/messages), brick devices looks healthy (smartd).


I can force start brick with

gluster volume start vms|engine force

and after some healing all works fine for few days

Did anybody observe this behavior?

vms volume has this structure (two bricks per host, each is separate 
JBOD ssd disk), engine volume has one brick on each host...


gluster volume info vms

Volume Name: vms
Type: Distributed-Replicate
Volume ID: 52032ec6-99d4-4210-8fb8-ffbd7a1e0bf7
Status: Started
Snapshot Count: 0
Number of Bricks: 2 x 3 = 6
Transport-type: tcp
Bricks:
Brick1: 10.0.4.11:/gluster_bricks/vms/vms
Brick2: 10.0.4.13:/gluster_bricks/vms/vms
Brick3: 10.0.4.12:/gluster_bricks/vms/vms
Brick4: 10.0.4.11:/gluster_bricks/vms2/vms2
Brick5: 10.0.4.13:/gluster_bricks/vms2/vms2
Brick6: 10.0.4.12:/gluster_bricks/vms2/vms2
Options Reconfigured:
cluster.granular-entry-heal: enable
performance.stat-prefetch: off
cluster.eager-lock: enable
performance.io-cache: off
performance.read-ahead: off
performance.quick-read: off
user.cifs: off
network.ping-timeout: 30
network.remote-dio: off
performance.strict-o-direct: on
performance.low-prio-threads: 32
features.shard: on
storage.owner-gid: 36
storage.owner-uid: 36
transport.address-family: inet
storage.fips-mode-rchecksum: on
nfs.disable: on
performance.client-io-threads: off

Cheers,

Jiri
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/BPXG53NG34QKCABYJ35UYIWPNNWTKXW4/


[ovirt-users] Re: upgrading gluster brick os from CentOS7 to 8

2021-07-07 Thread Jiří Sléžka

Hello,

just for info... I decided to try this approach and it worked well. I 
reinstalled old CentOS7 glusterfs node to CentOS8, enabled ovirt44 
repos, installed the same version of glusterfs, mounted bricks, restored 
glusterfs and volume (/var/lib/glusterd) configuration. Gluster started 
healing and it end soon as brick was offline only for around 1 hour. 
Then I add this server as oVirt host and enabled glusterfs service in 
cluster.


So far everything looks good (except for probably the unrelated problem 
I'll describe in separate mail)


Cheers,

Jiri


On 5/19/21 9:18 PM, Jiří Sléžka wrote:

Hello,

I'm in progress of moving my oVirt HCI cluster to production. It was a
bit complicated process because I had two new servers to install oVirt
from scratch in the lab and three old servers with standalone kvm
hypervisors in production. One of them (installed with CentOS7) was good
enough to join HCI cluster. All three old kvm servers had production vms
running on them.

New cluster was installed with CentOS8 and oVirt4.4, I started with
single node, then expanded it to two hosts with one another which acts
as arbiter (just for stability in lab envirnment).

After some testing and tuning I moved this two node cluster (without
arbiter node) to my server housing. Then I prepared gluster brick on
that one old server which I want reuse and join it to gluster storage so
now it is replica 3 - two nodes are CentOS8 based and acts as oVirt
hosts, one is CentOS7 and acts also as standalone kvm hypervisor.

Then I migrated all vms from standalone kvm hypervisors to oVirt, then
switched off two oldest hosts.

Now I would like to reinstall CentOS7 node to oVirt host. My plan is to
keep gluster brick fs and backup configuration and restore it after
reinstall to CentOS8 as mentioned in
https://mjanja.ch/2018/08/migrate-glusterfs-to-a-new-operating-system/.
I want to speed up gluster healing. Does it make sense?

Now the questin: Brick fs was created on CentOS7 and don't have new xfs
features FINOBT,SPARSE_INODES,REFLINK. Could it be problem? Will be
better to recreate fs in CentOS8 and then do full heal? I am afraid that
it will take a long time and make big load to production hosts.

TL;DR: Does gluster/oVirt4.4 make use of FINOBT,SPARSE_INODES,REFLINK
xfs features?


btw. oVirt is great product and HCI looks really functional and usable
(with 10GE network and SSDsof course). Big thanks to developers and
community!

Cheers,

Jiri



___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/PHPCYOGYL4A3PHVB56E3LL4KWR4JVM45/


___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/XPAXKTFC6CSRTEEHLH5HFDWES5GGSFJE/


[ovirt-users] upgrading gluster brick os from CentOS7 to 8

2021-05-19 Thread Jiří Sléžka
Hello,

I'm in progress of moving my oVirt HCI cluster to production. It was a
bit complicated process because I had two new servers to install oVirt
from scratch in the lab and three old servers with standalone kvm
hypervisors in production. One of them (installed with CentOS7) was good
enough to join HCI cluster. All three old kvm servers had production vms
running on them.

New cluster was installed with CentOS8 and oVirt4.4, I started with
single node, then expanded it to two hosts with one another which acts
as arbiter (just for stability in lab envirnment).

After some testing and tuning I moved this two node cluster (without
arbiter node) to my server housing. Then I prepared gluster brick on
that one old server which I want reuse and join it to gluster storage so
now it is replica 3 - two nodes are CentOS8 based and acts as oVirt
hosts, one is CentOS7 and acts also as standalone kvm hypervisor.

Then I migrated all vms from standalone kvm hypervisors to oVirt, then
switched off two oldest hosts.

Now I would like to reinstall CentOS7 node to oVirt host. My plan is to
keep gluster brick fs and backup configuration and restore it after
reinstall to CentOS8 as mentioned in
https://mjanja.ch/2018/08/migrate-glusterfs-to-a-new-operating-system/.
I want to speed up gluster healing. Does it make sense?

Now the questin: Brick fs was created on CentOS7 and don't have new xfs
features FINOBT,SPARSE_INODES,REFLINK. Could it be problem? Will be
better to recreate fs in CentOS8 and then do full heal? I am afraid that
it will take a long time and make big load to production hosts.

TL;DR: Does gluster/oVirt4.4 make use of FINOBT,SPARSE_INODES,REFLINK
xfs features?


btw. oVirt is great product and HCI looks really functional and usable
(with 10GE network and SSDsof course). Big thanks to developers and
community!

Cheers,

Jiri



___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/PHPCYOGYL4A3PHVB56E3LL4KWR4JVM45/


[ovirt-users] Re: oVirt 2021 Spring survey questions

2021-04-27 Thread Jiří Sléžka
Hi,

On 4/27/21 10:13 AM, Sandro Bonazzola wrote:
> Hi,
> it's about the usual time of the year when we ask the community to
> provide feedback with a survey.
> Any questions you'd like to be asked?

maybe something about most wanted new feature?

Cheers,

Jiri

> 
> -- 
> 
> Sandro Bonazzola
> 
> MANAGER, SOFTWARE ENGINEERING, EMEA R RHV
> 
> Red Hat EMEA 
> 
> sbona...@redhat.com    
> 
>  
> 
> **
> *Red Hat respects your work life balance. Therefore there is no need to
> answer this email out of your office hours.
> *
> *
> 
> *
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/S25LWSV7WLARKMJOYVQVSRLXV7O4LVUF/
> 
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/HPZIWWYYHHNCAISGYX4XE6VG6IPPICLY/


[ovirt-users] Re: oVirt4.4.5 and gluster op-version

2021-03-20 Thread Jiří Sléžka
Hello,

On 3/20/21 5:39 PM, Strahil Nikolov wrote:
> If all fuse clients are using gluster v8 , yes.
> 
> Most probably, your oVirt nodes are the only clients, but you can always
> verify with list-client volume command.

yes, oVirt nodes are the only clients and supports 8 op-version

gluster volume status all clients
...

I set new op-version to 8 without problems

gluster volume set all cluster.op-version 8

Thanx,

Jiri


> 
> Best Regards,
> Strahil Nikolov
> 
> On Fri, Mar 19, 2021 at 21:50, Jiří Sléžka
>  wrote:
> Hello,
> 
> I have just upgraded my 2hosts+1arbiter hci cluster to 4.4.5. Gluster is
> not managed by oVirt and currently is in op.version 70200
> 
> gluster volume get all cluster.op-version
> cluster.op-version                      70200
> 
> after gluster upgrade max-op-version is
> 
> gluster volume get all cluster.max-op-version
> cluster.max-op-version                  8
> 
> can I (or should I) switch to 8 op-version?
> 
> Thanks,
> Jiri
> ___
> Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
> To unsubscribe send an email to users-le...@ovirt.org
> <mailto:users-le...@ovirt.org>
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> <https://www.ovirt.org/privacy-policy.html>
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> <https://www.ovirt.org/community/about/community-guidelines/>
> List Archives:
> 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/7REZ46TPMMT4XQWN7VYHJAQ6MNIF7ROF/
> 
> <https://lists.ovirt.org/archives/list/users@ovirt.org/message/7REZ46TPMMT4XQWN7VYHJAQ6MNIF7ROF/>
> 
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5TLYX3YTLJYL3U456AGYUKUYVNQHXTLQ/


[ovirt-users] oVirt4.4.5 and gluster op-version

2021-03-19 Thread Jiří Sléžka
Hello,

I have just upgraded my 2hosts+1arbiter hci cluster to 4.4.5. Gluster is
not managed by oVirt and currently is in op.version 70200

gluster volume get all cluster.op-version
cluster.op-version  70200

after gluster upgrade max-op-version is

gluster volume get all cluster.max-op-version
cluster.max-op-version  8

can I (or should I) switch to 8 op-version?

Thanks,
Jiri
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/7REZ46TPMMT4XQWN7VYHJAQ6MNIF7ROF/


[ovirt-users] Re: oVirt + Proxmox Backup Server

2021-02-02 Thread Jiří Sléžka

Hi,

On 2/2/21 9:54 AM, Mustapha Aissat wrote:

Hi,

You Can try BareOS https://www.bareos.org/en/ 
It has recently added support of Ovirt.
https://docs.bareos.org/TasksAndConcepts/Plugins.html#ovirt-plugin 



thanks for info, it looks promising.

Cheers, Jiri



Best regards,

On Thu, Jan 28, 2021 at 9:56 PM Leo David > wrote:


Hi,
I think that as long as you can manage to have the pbs client
installed on your nodes, you can setup some functional cli based
backup strategy.
Cheers,

Leo

On Thu, Jan 28, 2021, 21:17 mailto:jorgevisent...@gmail.com>> wrote:

So... I would like to know if works hehe

It seems to me to be a very interesting solution, and it is open
source

Buuut, they are not say if works with kvm or other integration.
___
Users mailing list -- users@ovirt.org 
To unsubscribe send an email to users-le...@ovirt.org

Privacy Statement: https://www.ovirt.org/privacy-policy.html

oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/

List Archives:

https://lists.ovirt.org/archives/list/users@ovirt.org/message/PEOSANI7NZ3SORP66WW5JUZ6DUFDPAZY/



___
Users mailing list -- users@ovirt.org 
To unsubscribe send an email to users-le...@ovirt.org

Privacy Statement: https://www.ovirt.org/privacy-policy.html

oVirt Code of Conduct:
https://www.ovirt.org/community/about/community-guidelines/

List Archives:

https://lists.ovirt.org/archives/list/users@ovirt.org/message/AJVZNZVQGJBCMMBYZ3TJBSUYTGCDPAEW/




___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/IJTKGIWG3A2YHPSDT3V4QNMW46IKB2P6/






smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/D4WGV3VQLL2J62QL5ZWDYWSYL64GPD4L/


[ovirt-users] Timeout in "Add HE disks" task

2021-01-09 Thread Jiří Sléžka
Hello,

I am trying to upgrade RHV4.3.10 (standalone) to RHV4.4.3 (HE) in one
step which is probably not supported but in case of our oVirt cluster it
worked well.

I am using hosted-engine --deploy with answer file

hosted-engine --deploy --restore-from-file=backup/backup.tar
--config-append=answer-file_pontus.txt

I can pass until this task

[ INFO  ] TASK [ovirt.ovirt.hosted_engine_setup : Add HE disks]

then ansible fails with a "Timeout exceed while waiting on result state
of the entity."

In the engine I can see that he_virtio_disk is successfuly created and
in OK state but he_sanlock is in LOCKED state. Shared storage domain is FC.

Here is part of ovirt-hosted-engine-setup-ansible-create_target_vm.log

...
{
"_ansible_item_label": {
"content": "hosted_engine_sanlock",
"description": "Hosted-Engine sanlock disk",
"format": "raw",
"name": "he_sanlock",
"size": "1GiB",
"sparse": false
},
"_ansible_no_log": false,
"ansible_loop_var": "item",
"changed": false,
"exception": "Traceback (most recent call last):\n  File
\"/tmp/ansible_ovirt_disk_payload_sy1l4b1p/ansible_ovirt_disk_payload.zip/ansible_collections/ovirt/ovirt/plugins/modules/ovirt_disk.py\",
line 802, in main\n  File
\"/tmp/ansible_ovirt_disk_payload_sy1l4b1p/ansible_ovirt_disk_payload.zip/ansible_collections/redhat/rhv/plugins/module_utils/ovirt.py\",
line 652, in create\n
poll_interval=self._module.params['poll_interval'],\n  File
\"/tmp/ansible_ovirt_disk_payload_sy1l4b1p/ansible_ovirt_disk_payload.zip/ansible_collections/redhat/rhv/plugins/module_utils/ovirt.py\",
line 370, in wait\nraise Exception(\"Timeout exceed while waiting on
result state of the entity.\")\nException: Timeout exceed while waiting
on result state of the entity.\n",
"failed": true,
...

Do have anybody an idea where could be problem?

Thanks in advance,

Jiri
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/AWPSURIJXPABFJ24HVGEIXWBYQAV43SK/


[ovirt-users] Re: Q: Hybrid GlusterFS / local storage setup?

2020-10-18 Thread Jiří Sléžka
Hi,

On 10/18/20 11:16 AM, Gilboa Davara wrote:
> 
> On Thu, Oct 15, 2020 at 3:27 PM Nir Soffer  > wrote:
> 
> On Thu, Oct 15, 2020 at 3:20 PM Gilboa Davara  > wrote:
> >
> > On Thu, Oct 15, 2020 at 2:38 PM Nir Soffer  > wrote:
> >>
> >> > I've got room to spare.
> >> > Any documentation on how to achieve this (or some pointers
> where to look)?
> >>
> >> It should be documented in ovirt.org , and in
> RHV documentation:
> >>
> 
> https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.4/html/administration_guide/
> >>
> >> > I couldn't find LVM / block device under host devices / storage
> domain / etc and Google search returned irrelevant results.
> >>
> >> I tested locally, LVM devices are not available in:
> >> Compute > Hosts > {hostname} > Host Devices
> >>
> >> Looks like libvirt does not support device mapper devices. You
> can try:
> >> # virsh -r nodedev-list
> >>
> >> To see supported devices. The list seems to match what oVirt displays
> >> in the Host Devices tab.
> >>
> >> So you only option it to attach the entire local device to the
> VM, either using
> >> pci passthrough or as a scsi disk.
> >>
> >> Nir
> >
> >
> > Full SCSI passthrough per "desktop" VM is an overkill for this
> user case. (Plus, I don't see MD devices in the list, only pure
> SATA/SAS devices).
> > Any idea if there are plans to add support for LVM devices (or any
> other block device)?
> 
> I don't think there is such a plan, but it makes sense to support
> such usage.
> 
> Please file ovit-engine RFE explaining the use case, and we can consider
> it for a future version.
> 
> Nir
> 
> 
> Done.
> https://bugzilla.redhat.com/show_bug.cgi?id=1889138

I also wote for this feature. There is also this RFE which could allow
similar functionality (allow use of local disk storages in shared
storage cluster).

https://bugzilla.redhat.com/show_bug.cgi?id=1406412

Cheers,

Jiri

> 
> Thanks again for the help!
> - Gilboa
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/NQOCWZ2SYPS2LXH2MRJLGICWOVZXXMCB/
> 



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/24NC357DKZ4JTHUB3HQQPNEQ3T2K6XQT/


[ovirt-users] too many glitches while moving storage domain

2020-10-10 Thread Jiří Sléžka
Hi,

today I started one not so common operation and I would like to share my
(not so good) experience.

I have 4 old Opteron G3 hosts which creates 1 DC with 1 cluster with 4.2
compatibility. Then I got 2 newer Intel based hosts which creates
separate DC and cluster. I use one shared FC storage with few LUNs for
all the stuff. Intel cluster is selfhosted with oVirt 4.4.2 where I
migrated original standalone oVirt 4.3.10 manager.

So I have 2 DCs, one with 4.2 and one with 4.4 compatibility.

Now the funny part... I had 2 LUNs connected to old 4.2 DC. My intention
was detach one of them and attach it to the new DC.

first pain was that some of vms had ther disks on both LUNs. It is
indicated late in the process and without hint which vms they are. So I
had to activate LUN in old DC and tried to find that vms (search string
in vms tab "Storage = one and Storage = two" seems not working). Ok, it
took two or three rounds... then, also late in the process there was
problem that one vm had previewed snapshot so another round with
activation of LUN in old DC... Then I was able detach and import LUN to
new DC. Nice, but with warning that LUN has 4.2 compatibility which will
be converted to 4.4 and there is no way back to connect it to old DC...
It is logical but very scary if something went wrong... but it did not
in my case :-)

LUN is connected in new DC. Now I had to import vms. Most vms were
imported ok but two of them were based on template wich resides on other
LUN. It was not indicated during detaching! It looks like I cannot move
template from storage to storage other way than through Export storage
(which I don't have at this moment) or through OVA export for which I
have not enough free storage space on hosts. Its a trap! :-) Btw. there
is no check for free space while starting export to OVA (template uses
preallocated disk). Exporting task still runs but there is no free space
at the host... and probably no way to cancel it from manager :-(

Ok, I had most of the vms imported. Last really strange thing is that I
lost one vm during import. It is not listed in VirtualMachines nor in VM
import tab on starage nor in Disk tab... that vm was an Aruba migration
tool and was imported from OVN image.

In fact there are two disks in Disk import tab, one of them has no
Alias/description and was created today around time I started work on
this migration. The second one has alias "vmdisk1" and is few months
older but I have no idea if it is the lost vm...

Sorry for long story, TL;DR version could be: There are glitches in some
(not so common) workflows...

Cheers,

Jiri




smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/3RSL5Y2BDX2OWYXHVWJXZV4C2XA5VMBX/


[ovirt-users] Re: ldap auth problem after upgrade from 4.4.1 to 4.4.2

2020-10-02 Thread Jiří Sléžka
On 10/1/20 9:41 PM, Martin Perina wrote:
> 
> 
> On Thu, Oct 1, 2020 at 3:18 PM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi,
> 
> On 10/1/20 2:53 PM, Martin Perina wrote:
> > Hi,
> >
> > it seems that you are affected by
> > https://bugzilla.redhat.com/show_bug.cgi?id=1880149
> > Could you please try the workaround mentioned there?
> 
> bingo! Thanks a lot!
> 
> It is interesting behavior as my engine has no public ipv6 address (ipv6
> is set to ignore in nm).
> 
> also
> 
> [root@ovirt ~]# ping6 google.com <http://google.com>
> connect: Network is unreachable
> 
> but ok, problem is solved :-)
> 
> 
> Most probably your LDAP server can be resolved to both IPv4 and IPv6
> addresses and we choose a random resolved address in aaa-ldap when
> connecting. Enabling IPv6 by default was introduced in
> https://bugzilla.redhat.com/1726189 but unfortunately we have missed
> this scenario (engine IPv4, LDAP dual IPv4/IPv6) during testing ...

yes, this is exactly our case. No problem, it is really hard to catch
all variants.

Cheers,

Jiri


> 
> 
> Jiri
> 
> 
> >
> > Thanks,
> > Martin
> >
> >
> > On Thu, Oct 1, 2020 at 11:17 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>
> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >
> >     Hi,
> >
> >     I just upgraded my HE to 4.4.2 but now I cannot login using my
> ldap aaa
> >     profile anymore.
> >
> >     We are using Novell/NetIQ E-directory (load ballanced by haproxy,
> >     probably not important...)
> >
> >     In 4.4.1 I was hit by removed TLSv1 (which is the newest protocol
> >     supported by our edir) from default crypto policies but I was able
> >     revert it by
> >
> >     update-crypto-policies --set LEGACY
> >
> >     after upgrade to 4.4.2 the error is
> >
> >     server_error: An error occurred while attempting to connect to
> server
> >     ldap1.slu.cz:389 <http://ldap1.slu.cz:389>
> <http://ldap1.slu.cz:389>:
> >     IOException(LDAPException(resultCode=91 (connect
> >     error), errorMessage='An error occurred while attempting to
> establish a
> >     connection to server ldap1.slu.cz/193.84.206.212:389
> <http://ldap1.slu.cz/193.84.206.212:389>
> >     <http://ldap1.slu.cz/193.84.206.212:389>:
> >     SocketException(Network is unreachable (connect failed)),
> >     ldapSDKVersion=4.0.14,
> >     revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb'))
> >
> >     but our ldap server is reachable from ovirt, I tested it via
> (also ldaps
> >     and startls variants are working)
> >
> >     ldapsearch -H ldap://ldap1.slu.cz <http://ldap1.slu.cz>
> <http://ldap1.slu.cz> -x -D
> >     cn=*,ou=**,o=su -w
> >     '' -b 'o=su'
> >
> >     As a workaround I tried to set plain ldap protocol in profile
> >
> >     cat /etc/ovirt-engine/aaa/CRO.properties
> >
> >
> >     include = 
> >
> >     vars.server = ldap1.slu.cz <http://ldap1.slu.cz>
> <http://ldap1.slu.cz>
> >     vars.port = 389
> >     vars.user = cn=*,ou=**,o=su
> >     vars.password = **
> >
> >     pool.default.serverset.single.server = ${global:vars.server}
> >     pool.default.serverset.single.port = ${global:vars.port}
> >     pool.default.auth.simple.bindDN = ${global:vars.user}
> >     pool.default.auth.simple.password = ${global:vars.password}
> >
> >     pool.default.ssl.startTLS = false
> >     pool.default.ssl.enable = false
> >     #pool.default.ssl.protocol = TLSv1
> >     #pool.default.ssl.startTLSProtocol = TLSv1
> >     #pool.default.ssl.insecure = true
> >
> >     sequence-init.init.100-my-edir-init-vars = my-edir-init-vars
> >     sequence.my-edir-init-vars.010.description = set baseDN
> >     sequence.my-edir-init-vars.010.type = var-set
> >     sequence.my-edir-init-vars.010.var-set.variable = simple_baseDN
> >     sequence.my-edir-init-vars.010.var-set.value = o=su
> >
> >     #search.default.search-request.derefPolicy = ALWAYS
> >
> >
> >     but the e

[ovirt-users] Re: ldap auth problem after upgrade from 4.4.1 to 4.4.2

2020-10-01 Thread Jiří Sléžka
Hi,

On 10/1/20 2:53 PM, Martin Perina wrote:
> Hi,
> 
> it seems that you are affected by
> https://bugzilla.redhat.com/show_bug.cgi?id=1880149
> Could you please try the workaround mentioned there?

bingo! Thanks a lot!

It is interesting behavior as my engine has no public ipv6 address (ipv6
is set to ignore in nm).

also

[root@ovirt ~]# ping6 google.com
connect: Network is unreachable

but ok, problem is solved :-)

Jiri


> 
> Thanks,
> Martin
> 
> 
> On Thu, Oct 1, 2020 at 11:17 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi,
> 
> I just upgraded my HE to 4.4.2 but now I cannot login using my ldap aaa
> profile anymore.
> 
> We are using Novell/NetIQ E-directory (load ballanced by haproxy,
> probably not important...)
> 
> In 4.4.1 I was hit by removed TLSv1 (which is the newest protocol
> supported by our edir) from default crypto policies but I was able
> revert it by
> 
> update-crypto-policies --set LEGACY
> 
> after upgrade to 4.4.2 the error is
> 
> server_error: An error occurred while attempting to connect to server
> ldap1.slu.cz:389 <http://ldap1.slu.cz:389>:
> IOException(LDAPException(resultCode=91 (connect
> error), errorMessage='An error occurred while attempting to establish a
> connection to server ldap1.slu.cz/193.84.206.212:389
> <http://ldap1.slu.cz/193.84.206.212:389>:
> SocketException(Network is unreachable (connect failed)),
> ldapSDKVersion=4.0.14,
> revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb'))
> 
> but our ldap server is reachable from ovirt, I tested it via (also ldaps
> and startls variants are working)
> 
> ldapsearch -H ldap://ldap1.slu.cz <http://ldap1.slu.cz> -x -D
> cn=*,ou=**,o=su -w
> '' -b 'o=su'
> 
> As a workaround I tried to set plain ldap protocol in profile
> 
> cat /etc/ovirt-engine/aaa/CRO.properties
> 
> 
> include = 
> 
> vars.server = ldap1.slu.cz <http://ldap1.slu.cz>
> vars.port = 389
> vars.user = cn=*,ou=**,o=su
> vars.password = **
> 
> pool.default.serverset.single.server = ${global:vars.server}
> pool.default.serverset.single.port = ${global:vars.port}
> pool.default.auth.simple.bindDN = ${global:vars.user}
> pool.default.auth.simple.password = ${global:vars.password}
> 
> pool.default.ssl.startTLS = false
> pool.default.ssl.enable = false
> #pool.default.ssl.protocol = TLSv1
> #pool.default.ssl.startTLSProtocol = TLSv1
> #pool.default.ssl.insecure = true
> 
> sequence-init.init.100-my-edir-init-vars = my-edir-init-vars
> sequence.my-edir-init-vars.010.description = set baseDN
> sequence.my-edir-init-vars.010.type = var-set
> sequence.my-edir-init-vars.010.var-set.variable = simple_baseDN
> sequence.my-edir-init-vars.010.var-set.value = o=su
> 
> #search.default.search-request.derefPolicy = ALWAYS
> 
> 
> but the error is the same...
> 
> ovirt-engine-extensions-tool aaa login-user --profile=CRO
> --user-name=my_user
> 
> 
> WARNING: [ovirt-engine-extension-aaa-ldap.authn::SU-LDAP-authentication]
> TLS/SSL insecure mode
> ...
> WARNING: [ovirt-engine-extension-aaa-ldap.authn::auth.CRO.slu.cz
> <http://auth.CRO.slu.cz>] Cannot
> initialize LDAP framework, deferring initialization. Error: An error
> occurred while attempting to connect to server ldap1.slu.cz:389
> <http://ldap1.slu.cz:389>:
> IOException(LDAPException(resultCode=91 (connect error),
> errorMessage='An error occurred while attempting to establish a
> connection to server ldap1.slu.cz/193.84.206.212:389
> <http://ldap1.slu.cz/193.84.206.212:389>:
> SocketException(Network is unreachable (connect failed)),
> ldapSDKVersion=4.0.14,
> revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb'))
> ...
> INFO: API: -->Authn.InvokeCommands.AUTHENTICATE_CREDENTIALS
> profile='CRO' user='my_user'
> Password:
> ...
> WARNING: [ovirt-engine-extension-aaa-ldap.authn::auth.CRO.slu.cz
> <http://auth.CRO.slu.cz>] Cannot
> initialize LDAP framework, deferring initialization. Error: An error
> occurred while attempting to connect to server ldap1.slu.cz:389
> <http://ldap1.slu.cz:389>:
> IOException(LDAPException(resultCode=91 (connect error),
> errorMessage='An error occurred while attempting to establish a
> connection to server ldap1.slu.cz/193.84.206.212:389
> <http://ldap1.slu.cz/193.84.206.212:389>:
> SocketException(N

[ovirt-users] ldap auth problem after upgrade from 4.4.1 to 4.4.2

2020-10-01 Thread Jiří Sléžka
Hi,

I just upgraded my HE to 4.4.2 but now I cannot login using my ldap aaa
profile anymore.

We are using Novell/NetIQ E-directory (load ballanced by haproxy,
probably not important...)

In 4.4.1 I was hit by removed TLSv1 (which is the newest protocol
supported by our edir) from default crypto policies but I was able
revert it by

update-crypto-policies --set LEGACY

after upgrade to 4.4.2 the error is

server_error: An error occurred while attempting to connect to server
ldap1.slu.cz:389: IOException(LDAPException(resultCode=91 (connect
error), errorMessage='An error occurred while attempting to establish a
connection to server ldap1.slu.cz/193.84.206.212:389:
SocketException(Network is unreachable (connect failed)),
ldapSDKVersion=4.0.14, revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb'))

but our ldap server is reachable from ovirt, I tested it via (also ldaps
and startls variants are working)

ldapsearch -H ldap://ldap1.slu.cz -x -D cn=*,ou=**,o=su -w
'' -b 'o=su'

As a workaround I tried to set plain ldap protocol in profile

cat /etc/ovirt-engine/aaa/CRO.properties


include = 

vars.server = ldap1.slu.cz
vars.port = 389
vars.user = cn=*,ou=**,o=su
vars.password = **

pool.default.serverset.single.server = ${global:vars.server}
pool.default.serverset.single.port = ${global:vars.port}
pool.default.auth.simple.bindDN = ${global:vars.user}
pool.default.auth.simple.password = ${global:vars.password}

pool.default.ssl.startTLS = false
pool.default.ssl.enable = false
#pool.default.ssl.protocol = TLSv1
#pool.default.ssl.startTLSProtocol = TLSv1
#pool.default.ssl.insecure = true

sequence-init.init.100-my-edir-init-vars = my-edir-init-vars
sequence.my-edir-init-vars.010.description = set baseDN
sequence.my-edir-init-vars.010.type = var-set
sequence.my-edir-init-vars.010.var-set.variable = simple_baseDN
sequence.my-edir-init-vars.010.var-set.value = o=su

#search.default.search-request.derefPolicy = ALWAYS


but the error is the same...

ovirt-engine-extensions-tool aaa login-user --profile=CRO
--user-name=my_user


WARNING: [ovirt-engine-extension-aaa-ldap.authn::SU-LDAP-authentication]
TLS/SSL insecure mode
...
WARNING: [ovirt-engine-extension-aaa-ldap.authn::auth.CRO.slu.cz] Cannot
initialize LDAP framework, deferring initialization. Error: An error
occurred while attempting to connect to server ldap1.slu.cz:389:
IOException(LDAPException(resultCode=91 (connect error),
errorMessage='An error occurred while attempting to establish a
connection to server ldap1.slu.cz/193.84.206.212:389:
SocketException(Network is unreachable (connect failed)),
ldapSDKVersion=4.0.14, revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb'))
...
INFO: API: -->Authn.InvokeCommands.AUTHENTICATE_CREDENTIALS
profile='CRO' user='my_user'
Password:
...
WARNING: [ovirt-engine-extension-aaa-ldap.authn::auth.CRO.slu.cz] Cannot
initialize LDAP framework, deferring initialization. Error: An error
occurred while attempting to connect to server ldap1.slu.cz:389:
IOException(LDAPException(resultCode=91 (connect error),
errorMessage='An error occurred while attempting to establish a
connection to server ldap1.slu.cz/193.84.206.212:389:
SocketException(Network is unreachable (connect failed)),
ldapSDKVersion=4.0.14, revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb'))
Oct 01, 2020 10:57:37 AM
org.ovirt.engine.exttool.core.ExtensionsToolExecutor main
SEVERE: An error occurred while attempting to connect to server
ldap1.slu.cz:389:  IOException(LDAPException(resultCode=91 (connect
error), errorMessage='An error occurred while attempting to establish a
connection to server ldap1.slu.cz/193.84.206.212:389:
SocketException(Network is unreachable (connect failed)),
ldapSDKVersion=4.0.14, revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb'))

debug with tcpdump reveals only that connection is made and there are
only "bindRequest" and "bindResponse success" messages visible (with
correct tcp handshake and close) and nothing more

any help would be appreciated

Cheers,

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/M4MFGXGJ33R5DFX66HHGENOROHGOTF2D/


[ovirt-users] ovirt4.4 and ldap auth with starttls

2020-08-07 Thread Jiří Sléžka
Hello,

better start new thread...

it looks like tls1.0 is not supported anymore in
ovirt-engine-extension-aaa-ldap

I just migrated engine from 4.3 to 4.4 and cannot use my ldap profile
because

server_error: The connection reader was unable to successfully complete
TLS negotiation: SSLHandshakeException(The server selected protocol
version TLS10 is not accepted by client preferences [TLS12]),
ldapSDKVersion=4.0.14, revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb

but when I try to force tls 1.0 by setting

...
pool.default.ssl.startTLS = true
pool.default.ssl.startTLSProtocol = TLSv1
...

I got

server_error: The connection reader was unable to successfully complete
TLS negotiation: SSLHandshakeException(No appropriate protocol (protocol
is disabled or cipher suites are inappropriate)), ldapSDKVersion=4.0.14,
revision=c0fb784eebf9d36a67c736d0428fb3577f2e25bb

I can't switch to something better on server side, is it possible to
allow weak ciphers/protocols on client side?

Thanks in advance,

Jiri




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CBVIAEO3R4BQNJ5453O2D5NJH7FQ7YGR/


[ovirt-users] Re: migrating standalone engine to selfhosted and upgrade from 4.3 to 4.4 in one step

2020-08-07 Thread Jiří Sléžka
On 8/7/20 9:50 AM, Jiří Sléžka wrote:
> On 8/5/20 2:07 PM, Jiří Sléžka wrote:
>> On 8/3/20 11:12 AM, Jiří Sléžka wrote:
>>> Hello,
>>>
>>> I have 4 host cluster managed with standalone engine in version 4.3 and
>>> I would like to migrate this standalone engine to 4.4 as hosted engine.
>>>
>>> I have two new hosts which I would like to use as base for new HE
>>> cluster. (new hosts are Intel based, old ones are AMD Opteron based -
>>> new cluster will have 4.4 compatibility, old one have to stay at 4.2
>>> compatibility level).
>>>
>>> I red this
>>>
>>> https://www.ovirt.org/documentation/migrating_from_a_standalone_manager_to_a_self-hosted_engine/
>>>
>>> but the question is: Can I migrate and upgrade in one step? Have anybody
>>> did that already? If it is not possible what is a suggested approach?
>>
>> I just tried it. It looks like it could work at least until installation
>> process want to login into engine. It looks like it does not use valid
>> login name nor password.
>>
>> [ INFO  ] TASK [ovirt.hosted_engine_setup : Expose engine VM webui over
>> a local port via ssh port forwarding]
>> [ INFO  ] changed: [localhost]
>> [ INFO  ] TASK [ovirt.hosted_engine_setup : Evaluate temporary bootstrap
>> engine URL]
>> [ INFO  ] ok: [localhost]
>> [ INFO  ] The bootstrap engine is temporary accessible over
>> https://ovirt05.net.slu.cz:6900/ovirt-engine/
>> [ INFO  ] TASK [ovirt.hosted_engine_setup : Detect VLAN ID]
>> [ INFO  ] changed: [localhost]
>> [ INFO  ] TASK [ovirt.hosted_engine_setup : Set Engine public key as
>> authorized key without validating the TLS/SSL certificates]
>> [ INFO  ] changed: [localhost]
>> [ INFO  ] TASK [ovirt.hosted_engine_setup : include_tasks]
>> [ INFO  ] ok: [localhost]
>> [ INFO  ] TASK [ovirt.hosted_engine_setup : Obtain SSO token using
>> username/password credentials]
>> [ INFO  ] ok: [localhost]
>> [ INFO  ] TASK [ovirt.hosted_engine_setup : Ensure that the target
>> datacenter is present]
>> [ ERROR ] ovirtsdk4.AuthError: Error during SSO authentication
>> access_denied : Cannot authenticate user 'None@N/A': No valid profile
>> found in credentials..
>> [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg":
>> "Error during SSO authentication access_denied : Cannot authenticate
>> user 'None@N/A': No valid profile found in credentials.."}
>>
>> I tried to login to https://ovirt05.net.slu.cz:6900/ovirt-engine/ and it
>> probably accept username admin@internal and new password entered during
>> hosted engine deploy but then it display error "The provided
>> authorization grant for the auth code has expired."
>>
>> Maybe it is related to this bug (and custom 3rd party Apache certificate)
>>
>> https://bugzilla.redhat.com/show_bug.cgi?id=1715767
>>
>> in my case it looks like on engine vm in file
>>
>> /etc/pki/ovirt-engine/apache-ca.pem
>>
>> is original certificate from backup which is for ovirt.slu.cz fqdn. For
>> new hosted engine I use new fqdn ovirt.net.slu.cz. Should I change
>> ovirt.slu.cz record to point to new ip address (it have to be one from
>> ovirtmgmt subnet) and then try restore? Documentation is not much clear
>> in this particular subject.
> 
> well, I will answer myself
> 
> * setting fqdn is not probably important at this time, self hosted
> engine is prepared with modified /etc/hosts
> 
> * main problem was that I am using 3rd party certificate for long time
> so I didn't mention this documentation section
> 
> https://ovirt.org/documentation/administration_guide/#Replacing_the_Manager_CA_Certificate
> 
> especially section 14 which describe how to configure engine-backup to
> backup also custom CA certificate. But this part is badly formatted as
> described in
> 
> https://bugzilla.redhat.com/show_bug.cgi?id=1859505
> 
> relevant BZ is also https://bugzilla.redhat.com/show_bug.cgi?id=1841203
> which point me to the right direction

just for record.

I had to change dns record for fqdn during deploy process - after HE vm
was copied to shared storage (FC in my case) and before or during "
Check engine VM health"

...
[ INFO  ] TASK [ovirt.hosted_engine_setup : Start ovirt-ha-agent service
on the host]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : Exit HE maintenance mode]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : Check engine VM health]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : Get

[ovirt-users] Re: migrating standalone engine to selfhosted and upgrade from 4.3 to 4.4 in one step

2020-08-07 Thread Jiří Sléžka
On 8/5/20 2:07 PM, Jiří Sléžka wrote:
> On 8/3/20 11:12 AM, Jiří Sléžka wrote:
>> Hello,
>>
>> I have 4 host cluster managed with standalone engine in version 4.3 and
>> I would like to migrate this standalone engine to 4.4 as hosted engine.
>>
>> I have two new hosts which I would like to use as base for new HE
>> cluster. (new hosts are Intel based, old ones are AMD Opteron based -
>> new cluster will have 4.4 compatibility, old one have to stay at 4.2
>> compatibility level).
>>
>> I red this
>>
>> https://www.ovirt.org/documentation/migrating_from_a_standalone_manager_to_a_self-hosted_engine/
>>
>> but the question is: Can I migrate and upgrade in one step? Have anybody
>> did that already? If it is not possible what is a suggested approach?
> 
> I just tried it. It looks like it could work at least until installation
> process want to login into engine. It looks like it does not use valid
> login name nor password.
> 
> [ INFO  ] TASK [ovirt.hosted_engine_setup : Expose engine VM webui over
> a local port via ssh port forwarding]
> [ INFO  ] changed: [localhost]
> [ INFO  ] TASK [ovirt.hosted_engine_setup : Evaluate temporary bootstrap
> engine URL]
> [ INFO  ] ok: [localhost]
> [ INFO  ] The bootstrap engine is temporary accessible over
> https://ovirt05.net.slu.cz:6900/ovirt-engine/
> [ INFO  ] TASK [ovirt.hosted_engine_setup : Detect VLAN ID]
> [ INFO  ] changed: [localhost]
> [ INFO  ] TASK [ovirt.hosted_engine_setup : Set Engine public key as
> authorized key without validating the TLS/SSL certificates]
> [ INFO  ] changed: [localhost]
> [ INFO  ] TASK [ovirt.hosted_engine_setup : include_tasks]
> [ INFO  ] ok: [localhost]
> [ INFO  ] TASK [ovirt.hosted_engine_setup : Obtain SSO token using
> username/password credentials]
> [ INFO  ] ok: [localhost]
> [ INFO  ] TASK [ovirt.hosted_engine_setup : Ensure that the target
> datacenter is present]
> [ ERROR ] ovirtsdk4.AuthError: Error during SSO authentication
> access_denied : Cannot authenticate user 'None@N/A': No valid profile
> found in credentials..
> [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg":
> "Error during SSO authentication access_denied : Cannot authenticate
> user 'None@N/A': No valid profile found in credentials.."}
> 
> I tried to login to https://ovirt05.net.slu.cz:6900/ovirt-engine/ and it
> probably accept username admin@internal and new password entered during
> hosted engine deploy but then it display error "The provided
> authorization grant for the auth code has expired."
> 
> Maybe it is related to this bug (and custom 3rd party Apache certificate)
> 
> https://bugzilla.redhat.com/show_bug.cgi?id=1715767
> 
> in my case it looks like on engine vm in file
> 
> /etc/pki/ovirt-engine/apache-ca.pem
> 
> is original certificate from backup which is for ovirt.slu.cz fqdn. For
> new hosted engine I use new fqdn ovirt.net.slu.cz. Should I change
> ovirt.slu.cz record to point to new ip address (it have to be one from
> ovirtmgmt subnet) and then try restore? Documentation is not much clear
> in this particular subject.

well, I will answer myself

* setting fqdn is not probably important at this time, self hosted
engine is prepared with modified /etc/hosts

* main problem was that I am using 3rd party certificate for long time
so I didn't mention this documentation section

https://ovirt.org/documentation/administration_guide/#Replacing_the_Manager_CA_Certificate

especially section 14 which describe how to configure engine-backup to
backup also custom CA certificate. But this part is badly formatted as
described in

https://bugzilla.redhat.com/show_bug.cgi?id=1859505

relevant BZ is also https://bugzilla.redhat.com/show_bug.cgi?id=1841203
which point me to the right direction

Cheers,

Jiri



> 
> Cheers,
> 
> Jiri
> 
>>
>> Thanks for help
>>
>> Jiri
>>
>>
>> ___
>> Users mailing list -- users@ovirt.org
>> To unsubscribe send an email to users-le...@ovirt.org
>> Privacy Statement: https://www.ovirt.org/privacy-policy.html
>> oVirt Code of Conduct: 
>> https://www.ovirt.org/community/about/community-guidelines/
>> List Archives: 
>> https://lists.ovirt.org/archives/list/users@ovirt.org/message/YH4J7GG7WLOLUFIADZPL6JOPDETJ23CZ/
>>
> 
> 
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archi

[ovirt-users] Re: migrating standalone engine to selfhosted and upgrade from 4.3 to 4.4 in one step

2020-08-05 Thread Jiří Sléžka
On 8/3/20 11:12 AM, Jiří Sléžka wrote:
> Hello,
> 
> I have 4 host cluster managed with standalone engine in version 4.3 and
> I would like to migrate this standalone engine to 4.4 as hosted engine.
> 
> I have two new hosts which I would like to use as base for new HE
> cluster. (new hosts are Intel based, old ones are AMD Opteron based -
> new cluster will have 4.4 compatibility, old one have to stay at 4.2
> compatibility level).
> 
> I red this
> 
> https://www.ovirt.org/documentation/migrating_from_a_standalone_manager_to_a_self-hosted_engine/
> 
> but the question is: Can I migrate and upgrade in one step? Have anybody
> did that already? If it is not possible what is a suggested approach?

I just tried it. It looks like it could work at least until installation
process want to login into engine. It looks like it does not use valid
login name nor password.

[ INFO  ] TASK [ovirt.hosted_engine_setup : Expose engine VM webui over
a local port via ssh port forwarding]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : Evaluate temporary bootstrap
engine URL]
[ INFO  ] ok: [localhost]
[ INFO  ] The bootstrap engine is temporary accessible over
https://ovirt05.net.slu.cz:6900/ovirt-engine/
[ INFO  ] TASK [ovirt.hosted_engine_setup : Detect VLAN ID]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : Set Engine public key as
authorized key without validating the TLS/SSL certificates]
[ INFO  ] changed: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : include_tasks]
[ INFO  ] ok: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : Obtain SSO token using
username/password credentials]
[ INFO  ] ok: [localhost]
[ INFO  ] TASK [ovirt.hosted_engine_setup : Ensure that the target
datacenter is present]
[ ERROR ] ovirtsdk4.AuthError: Error during SSO authentication
access_denied : Cannot authenticate user 'None@N/A': No valid profile
found in credentials..
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg":
"Error during SSO authentication access_denied : Cannot authenticate
user 'None@N/A': No valid profile found in credentials.."}

I tried to login to https://ovirt05.net.slu.cz:6900/ovirt-engine/ and it
probably accept username admin@internal and new password entered during
hosted engine deploy but then it display error "The provided
authorization grant for the auth code has expired."

Maybe it is related to this bug (and custom 3rd party Apache certificate)

https://bugzilla.redhat.com/show_bug.cgi?id=1715767

in my case it looks like on engine vm in file

/etc/pki/ovirt-engine/apache-ca.pem

is original certificate from backup which is for ovirt.slu.cz fqdn. For
new hosted engine I use new fqdn ovirt.net.slu.cz. Should I change
ovirt.slu.cz record to point to new ip address (it have to be one from
ovirtmgmt subnet) and then try restore? Documentation is not much clear
in this particular subject.

Cheers,

Jiri

> 
> Thanks for help
> 
> Jiri
> 
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/YH4J7GG7WLOLUFIADZPL6JOPDETJ23CZ/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/SWKF5CF3UHVRDE2NA2R3EW3S6642S2HA/


[ovirt-users] migrating standalone engine to selfhosted and upgrade from 4.3 to 4.4 in one step

2020-08-03 Thread Jiří Sléžka
Hello,

I have 4 host cluster managed with standalone engine in version 4.3 and
I would like to migrate this standalone engine to 4.4 as hosted engine.

I have two new hosts which I would like to use as base for new HE
cluster. (new hosts are Intel based, old ones are AMD Opteron based -
new cluster will have 4.4 compatibility, old one have to stay at 4.2
compatibility level).

I red this

https://www.ovirt.org/documentation/migrating_from_a_standalone_manager_to_a_self-hosted_engine/

but the question is: Can I migrate and upgrade in one step? Have anybody
did that already? If it is not possible what is a suggested approach?

Thanks for help

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/YH4J7GG7WLOLUFIADZPL6JOPDETJ23CZ/


[ovirt-users] Re: 10GB disk created on glusterfs storage has only 4096B in vm

2020-07-29 Thread Jiří Sléžka
On 7/29/20 7:37 PM, Gianluca Cecchi wrote:
> On Wed, Jul 29, 2020 at 7:13 PM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> 
> 
> thanks for reply. In my case it loks like I have not
> "performance.stat-prefetch" enabled...
> 
> [root@ovirt-hci01 ~]# gluster volume info vms
> 
> Volume Name: vms
> Type: Replicate
> Volume ID: ba5fd3b8-0704-4462-8257-94b77a1222c4
> Status: Started
> Snapshot Count: 0
> Number of Bricks: 1 x 2 = 2
> Transport-type: tcp
> Bricks:
> Brick1: 10.0.4.13:/gluster_bricks/vms/vms
> Brick2: 10.0.4.11:/gluster_bricks/vms/vms
> Options Reconfigured:
> performance.client-io-threads: off
> cluster.eager-lock: enable
> performance.io-cache: off
> performance.read-ahead: off
> performance.quick-read: off
> user.cifs: off
> network.ping-timeout: 30
> network.remote-dio: off
> performance.strict-o-direct: on
> performance.low-prio-threads: 32
> features.shard: on
> storage.owner-gid: 36
> storage.owner-uid: 36
> transport.address-family: inet
> storage.fips-mode-rchecksum: on
> nfs.disable: on
> cluster.self-heal-daemon: enable
> 
> 
> BTW: you can run this command to see all options settings, default ones
> and not for the vms volume:
> 
> gluster volume get vms all

good point!

Cheers,

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RSGDDY32V664VCF2ZZT3FJQPYF3GWDQM/


[ovirt-users] Re: 10GB disk created on glusterfs storage has only 4096B in vm

2020-07-29 Thread Jiří Sléžka
hello,

On 7/29/20 6:33 PM, shadow emy wrote:
> I had a similar problem, after i migrate the disk from a "Storage Domain"  to 
> a second "Storage Domain"  the disk size was different and i could not start 
> the vm anymore.
> The Ovirt error was : "Unable to get volume size for domain"
> Somehow gluster has some errors when i migrate disks and the disk have 
> different size.
> What i did was to disable  performance.stat-prefetch for the gluster volume(I 
> read it has some bugs if is enabled in gluster 7.x)

thanks for reply. In my case it loks like I have not
"performance.stat-prefetch" enabled...

[root@ovirt-hci01 ~]# gluster volume info vms

Volume Name: vms
Type: Replicate
Volume ID: ba5fd3b8-0704-4462-8257-94b77a1222c4
Status: Started
Snapshot Count: 0
Number of Bricks: 1 x 2 = 2
Transport-type: tcp
Bricks:
Brick1: 10.0.4.13:/gluster_bricks/vms/vms
Brick2: 10.0.4.11:/gluster_bricks/vms/vms
Options Reconfigured:
performance.client-io-threads: off
cluster.eager-lock: enable
performance.io-cache: off
performance.read-ahead: off
performance.quick-read: off
user.cifs: off
network.ping-timeout: 30
network.remote-dio: off
performance.strict-o-direct: on
performance.low-prio-threads: 32
features.shard: on
storage.owner-gid: 36
storage.owner-uid: 36
transport.address-family: inet
storage.fips-mode-rchecksum: on
nfs.disable: on
cluster.self-heal-daemon: enable


...well, If it is not a default setting... so I set it off...

[root@ovirt-hci01 ~]# gluster volume set vms performance.stat-prefetch off
volume set: success

...and unbeliavable, it works now!

Is it bug in glusterfs 7.6-1.el8 (link?) or something else?

Should not be this setting default one?

> Also try to look at Disk Snapshots, see if you have a snapshot that does not 
> exist.
> In my case i had this snapshot problem after migration  and i had to delete 
> the snapshot from the database and set the main image disk id as default in 
> the vm.

I have no snapshots of this vm. I just created new vm and try to install
it from ISO.

> Maybe in you case is different.

Well, it was exactly this case, thanks!

Cheers,

Jiri

>  
> Emy
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/WO7S2ZFTSRPRY22NX6TCGEYE3BL3YKZX/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/KTVKYB6WHC7SPC2OMVWWVRUS76WAOPHB/


[ovirt-users] 10GB disk created on glusterfs storage has only 4096B in vm

2020-07-29 Thread Jiří Sléžka
Hi,

I have another spooky issue. CentOS 8.2, oVirt 4.4.1.10-1

I am trying to create vm on my HCI cluster (which in fact has only two
nodes now, the third will be added later). I have created new 10GB disk
on gluster storage (it behaves the same way when I chose prealocated or
thin provision disk). Everything looks good (I was surprised how fast
was this disk created) but when I run the vm then disk has only 4096 bytes.

see http://mirror.slu.cz/tmp/ovirt_small_disk.png

disk creation part of engine.log is here

https://paste.slu.cz/?0e33e800609898d8#CEC2maoe7zk9b1zmnHaAENjaAnpiyzzGzPFSe8sZ3JUc

the same part in vdsm.log on SPM host

https://paste.slu.cz/?e89fdce800c39b28#3UWw7dXxQAmZysMSKp5NbeTWMNDghZVh2NLEmfb1e4rb

where could be problem? Did anybody experience this behavior?

Cheers,

Jiri




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/GM4SOUOAHZDFAXI6476DEPEVFRSOQ6WV/


[ovirt-users] Re: problem with custom bond options

2020-07-27 Thread Jiří Sléžka
On 7/24/20 5:23 PM, Strahil Nikolov wrote:
> Hi Jiri,
> 
> you are the second person who mentions it.  Can you open a bug at 
> bugzilla.redhat.com  about that  ?

sure, here it is

https://bugzilla.redhat.com/show_bug.cgi?id=1860843

Best Regards,

Jiri Slezka


> 
> Best Regards,
> Strahil Nikolov
> 
> На 24 юли 2020 г. 16:30:02 GMT+03:00, "Jiří Sléžka"  
> написа:
>> On 7/24/20 11:36 AM, Jiří Sléžka wrote:
>>> On 7/24/20 10:56 AM, Ales Musil wrote:
>>>>
>>>>
>>>> On Fri, Jul 24, 2020 at 10:40 AM Jiří Sléžka >>> <mailto:jiri.sle...@slu.cz>> wrote:
>>>>
>>>> On 7/23/20 2:07 PM, Jiří Sléžka wrote:
>>>> > On 7/23/20 12:35 PM, Ales Musil wrote:
>>>> >>
>>>> >>
>>>> >> On Thu, Jul 23, 2020 at 11:50 AM Jiří Sléžka
>> >>> <mailto:jiri.sle...@slu.cz>
>>>> >> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>
>> wrote:
>>>> >>
>>>> >>     On 7/23/20 11:03 AM, Ales Musil wrote:
>>>> >>     >
>>>> >>     >
>>>> >>     > On Thu, Jul 23, 2020 at 10:35 AM Jiří Sléžka
>>>> mailto:jiri.sle...@slu.cz>
>>>> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
>>>> >>     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
>>>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
>>>> >>     >
>>>> >>     >     Hi,
>>>> >>     >
>>>> >>     >     On 7/23/20 8:38 AM, Ales Musil wrote:
>>>> >>     >     >
>>>> >>     >     >
>>>> >>     >     > On Wed, Jul 22, 2020 at 9:41 PM Jiří Sléžka
>>>> >>     mailto:jiri.sle...@slu.cz>
>>>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
>>>> >>     >     <mailto:jiri.sle...@slu.cz
>> <mailto:jiri.sle...@slu.cz>
>>>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>
>>>> >>     >     > <mailto:jiri.sle...@slu.cz
>>>> <mailto:jiri.sle...@slu.cz> <mailto:jiri.sle...@slu.cz
>>>> <mailto:jiri.sle...@slu.cz>>
>>>> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
>>>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>>>
>> wrote:
>>>> >>     >     >
>>>> >>     >     >     Hello,
>>>> >>     >     >
>>>> >>     >     >
>>>> >>     >     > Hi,
>>>> >>     >     >
>>>> >>     >     >
>>>> >>     >     >     CentOS8, oVirt 4.4.1.10-1.el8
>>>> >>     >     >
>>>> >>     >     >     I am trying to setup active-backup (mode=1)
>>>> bonding mode
>>>> >>     with
>>>> >>     >     custom
>>>> >>     >     >     properties. I have one 10GE switch, the
>> second is
>>>> just 1G.
>>>> >>     >     10GE link is
>>>> >>     >     >     the primary one.
>>>> >>     >     >
>>>> >>     >     >     cat
>> /etc/sysconfig/network-scripts/ifcfg-bond0
>>>> >>     >     >
>>>> >>     >     >
>>>> >>     >     > first of all in oVirt 4.4 the network-scripts are
>> not
>>>> relevant
>>>> >>     >     anymore.
>>>> >>     >     > More relevant is output from 'nmstatectl show'.
>>>> >>     >
>>>> >>     >     thanks, I believed that ifcfg files still describes
>> saved
>>>> >>     interface
>>>> >>     >     configuration (even on nm managed interfaces)...
>>>> >>     >
>>>> >>     >
>>>> >>     > It does but it might not be that detailed as we would
>> have
>>>> hoped for.
>&g

[ovirt-users] Re: problem with custom bond options

2020-07-24 Thread Jiří Sléžka
On 7/24/20 11:36 AM, Jiří Sléžka wrote:
> On 7/24/20 10:56 AM, Ales Musil wrote:
>>
>>
>> On Fri, Jul 24, 2020 at 10:40 AM Jiří Sléžka > <mailto:jiri.sle...@slu.cz>> wrote:
>>
>> On 7/23/20 2:07 PM, Jiří Sléžka wrote:
>> > On 7/23/20 12:35 PM, Ales Musil wrote:
>> >>
>> >>
>> >> On Thu, Jul 23, 2020 at 11:50 AM Jiří Sléžka > <mailto:jiri.sle...@slu.cz>
>> >> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
>> >>
>> >>     On 7/23/20 11:03 AM, Ales Musil wrote:
>> >>     >
>> >>     >
>> >>     > On Thu, Jul 23, 2020 at 10:35 AM Jiří Sléžka
>> mailto:jiri.sle...@slu.cz>
>> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
>> >>     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
>> >>     >
>> >>     >     Hi,
>> >>     >
>> >>     >     On 7/23/20 8:38 AM, Ales Musil wrote:
>> >>     >     >
>> >>     >     >
>> >>     >     > On Wed, Jul 22, 2020 at 9:41 PM Jiří Sléžka
>> >>     mailto:jiri.sle...@slu.cz>
>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
>> >>     >     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>
>> >>     >     > <mailto:jiri.sle...@slu.cz
>> <mailto:jiri.sle...@slu.cz> <mailto:jiri.sle...@slu.cz
>> <mailto:jiri.sle...@slu.cz>>
>> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>>> wrote:
>> >>     >     >
>> >>     >     >     Hello,
>> >>     >     >
>> >>     >     >
>> >>     >     > Hi,
>> >>     >     >
>> >>     >     >
>> >>     >     >     CentOS8, oVirt 4.4.1.10-1.el8
>> >>     >     >
>> >>     >     >     I am trying to setup active-backup (mode=1)
>> bonding mode
>> >>     with
>> >>     >     custom
>> >>     >     >     properties. I have one 10GE switch, the second is
>> just 1G.
>> >>     >     10GE link is
>> >>     >     >     the primary one.
>> >>     >     >
>> >>     >     >     cat /etc/sysconfig/network-scripts/ifcfg-bond0
>> >>     >     >
>> >>     >     >
>> >>     >     > first of all in oVirt 4.4 the network-scripts are not
>> relevant
>> >>     >     anymore.
>> >>     >     > More relevant is output from 'nmstatectl show'.
>> >>     >
>> >>     >     thanks, I believed that ifcfg files still describes saved
>> >>     interface
>> >>     >     configuration (even on nm managed interfaces)...
>> >>     >
>> >>     >
>> >>     > It does but it might not be that detailed as we would have
>> hoped for.
>> >>     > Another reason why I said that it is not relevant is of
>> course if
>> >>     > someone tries
>> >>     > reconfigure the interface through network-scripts.
>> >>
>> >>     well, honestly I did that (modified ifcfg and then use nmcli con
>> >>     reload). So right way is using nmcli con modify command?
>> >>
>> >>
>> >> Yes or nmstate. Just be aware that anything that you do to interface
>> >> outside of oVirt can have harmful impacts on the host and overall
>> oVirt
>> >> state.
>> >>  
>> >>
>> >>
>> >>     >     from nmstatectl show I can see that bond0 has specified mac
>> >>     address
>> >>     >
>> >>     >   
>> >>   
>>   
>> https://paste.slu.cz/?d363cf2c029f6b83#Ew2rCiYyNGrdfffy6bvzSjbb8x4jJsaUdhxkjwThMFka
>

[ovirt-users] Re: problem with custom bond options

2020-07-24 Thread Jiří Sléžka
On 7/24/20 10:56 AM, Ales Musil wrote:
> 
> 
> On Fri, Jul 24, 2020 at 10:40 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> On 7/23/20 2:07 PM, Jiří Sléžka wrote:
> > On 7/23/20 12:35 PM, Ales Musil wrote:
> >>
> >>
> >> On Thu, Jul 23, 2020 at 11:50 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>
> >> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >>
> >>     On 7/23/20 11:03 AM, Ales Musil wrote:
> >>     >
> >>     >
> >>     > On Thu, Jul 23, 2020 at 10:35 AM Jiří Sléžka
> mailto:jiri.sle...@slu.cz>
> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
> >>     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
>     >>     >
> >>     >     Hi,
> >>     >
> >>     >     On 7/23/20 8:38 AM, Ales Musil wrote:
> >>     >     >
> >>     >     >
> >>     >     > On Wed, Jul 22, 2020 at 9:41 PM Jiří Sléžka
> >>     mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
> >>     >     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>
> >>     >     > <mailto:jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz> <mailto:jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz>>
> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>>> wrote:
> >>     >     >
> >>     >     >     Hello,
> >>     >     >
> >>     >     >
> >>     >     > Hi,
> >>     >     >
> >>     >     >
> >>     >     >     CentOS8, oVirt 4.4.1.10-1.el8
> >>     >     >
> >>     >     >     I am trying to setup active-backup (mode=1)
> bonding mode
> >>     with
> >>     >     custom
> >>     >     >     properties. I have one 10GE switch, the second is
> just 1G.
> >>     >     10GE link is
> >>     >     >     the primary one.
> >>     >     >
> >>     >     >     cat /etc/sysconfig/network-scripts/ifcfg-bond0
> >>     >     >
> >>     >     >
> >>     >     > first of all in oVirt 4.4 the network-scripts are not
> relevant
> >>     >     anymore.
> >>     >     > More relevant is output from 'nmstatectl show'.
> >>     >
> >>     >     thanks, I believed that ifcfg files still describes saved
> >>     interface
> >>     >     configuration (even on nm managed interfaces)...
> >>     >
> >>     >
> >>     > It does but it might not be that detailed as we would have
> hoped for.
> >>     > Another reason why I said that it is not relevant is of
> course if
> >>     > someone tries
> >>     > reconfigure the interface through network-scripts.
> >>
> >>     well, honestly I did that (modified ifcfg and then use nmcli con
> >>     reload). So right way is using nmcli con modify command?
> >>
> >>
> >> Yes or nmstate. Just be aware that anything that you do to interface
> >> outside of oVirt can have harmful impacts on the host and overall
> oVirt
> >> state.
> >>  
> >>
> >>
> >>     >     from nmstatectl show I can see that bond0 has specified mac
> >>     address
> >>     >
> >>     >   
> >>   
>   
> https://paste.slu.cz/?d363cf2c029f6b83#Ew2rCiYyNGrdfffy6bvzSjbb8x4jJsaUdhxkjwThMFka
> >>     >
> >>     >     >     BONDING_OPTS="active_slave=ens5 downdelay=0
> miimon=100
> >>     >     >     mode=active-backup primary=ens5 updelay=0"
> >>     >     >     TYPE=Bond
> >>     >     >     BONDING_MASTER=yes
> >>     >     >     PROXY_METHOD=none
> >>     >     >   

[ovirt-users] Re: problem with custom bond options

2020-07-24 Thread Jiří Sléžka
On 7/23/20 2:07 PM, Jiří Sléžka wrote:
> On 7/23/20 12:35 PM, Ales Musil wrote:
>>
>>
>> On Thu, Jul 23, 2020 at 11:50 AM Jiří Sléžka > <mailto:jiri.sle...@slu.cz>> wrote:
>>
>> On 7/23/20 11:03 AM, Ales Musil wrote:
>> >
>> >
>> > On Thu, Jul 23, 2020 at 10:35 AM Jiří Sléžka > <mailto:jiri.sle...@slu.cz>
>> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
>> >
>> >     Hi,
>> >
>> >     On 7/23/20 8:38 AM, Ales Musil wrote:
>> >     >
>> >     >
>> >     > On Wed, Jul 22, 2020 at 9:41 PM Jiří Sléžka
>> mailto:jiri.sle...@slu.cz>
>> >     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
>> >     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
>> >     >
>> >     >     Hello,
>> >     >
>> >     >
>> >     > Hi,
>> >     >
>> >     >
>> >     >     CentOS8, oVirt 4.4.1.10-1.el8
>> >     >
>> >     >     I am trying to setup active-backup (mode=1) bonding mode
>> with
>> >     custom
>> >     >     properties. I have one 10GE switch, the second is just 1G.
>> >     10GE link is
>> >     >     the primary one.
>> >     >
>> >     >     cat /etc/sysconfig/network-scripts/ifcfg-bond0
>> >     >
>> >     >
>> >     > first of all in oVirt 4.4 the network-scripts are not relevant
>> >     anymore.
>> >     > More relevant is output from 'nmstatectl show'.
>> >
>> >     thanks, I believed that ifcfg files still describes saved
>> interface
>> >     configuration (even on nm managed interfaces)...
>> >
>> >
>> > It does but it might not be that detailed as we would have hoped for.
>> > Another reason why I said that it is not relevant is of course if
>> > someone tries
>> > reconfigure the interface through network-scripts.
>>
>> well, honestly I did that (modified ifcfg and then use nmcli con
>> reload). So right way is using nmcli con modify command?
>>
>>
>> Yes or nmstate. Just be aware that anything that you do to interface
>> outside of oVirt can have harmful impacts on the host and overall oVirt
>> state.
>>  
>>
>>
>> >     from nmstatectl show I can see that bond0 has specified mac
>> address
>> >
>> >   
>>  
>> https://paste.slu.cz/?d363cf2c029f6b83#Ew2rCiYyNGrdfffy6bvzSjbb8x4jJsaUdhxkjwThMFka
>> >
>> >     >     BONDING_OPTS="active_slave=ens5 downdelay=0 miimon=100
>> >     >     mode=active-backup primary=ens5 updelay=0"
>> >     >     TYPE=Bond
>> >     >     BONDING_MASTER=yes
>> >     >     PROXY_METHOD=none
>> >     >     BROWSER_ONLY=no
>> >     >     IPV4_FAILURE_FATAL=no
>> >     >     IPV6_DISABLED=yes
>> >     >     IPV6INIT=no
>> >     >     NAME=bond0
>> >     >     UUID=c054364e-47cf-47ee-a7fc-70b37c9977e7
>> >     >     DEVICE=bond0
>> >     >     ONBOOT=yes
>> >     >     MTU=9000
>> >     >
>> >     >     When I try to add a custom parameter "fail_over_mac=active"
>> >     (which I
>> >     >     believe could solve my problems with stalled mac
>> addresses in
>> >     switch's
>> >     >     cam table in case of failover) I got...
>> >     >
>> >     >     "Error while executing action HostSetupNetworks: Unexpected
>> >     exception"
>> >     >
>> >     >     ...in manager. In the engine.log it looks like
>> >     >
>> >     >     2020-07-22 21:20:35,774+02 WARN
>> >     >   
>> >   
>>   [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
>> >     >     (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0]
>> Unexpected
>> >   

[ovirt-users] Re: problem with custom bond options

2020-07-23 Thread Jiří Sléžka
On 7/23/20 12:35 PM, Ales Musil wrote:
> 
> 
> On Thu, Jul 23, 2020 at 11:50 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> On 7/23/20 11:03 AM, Ales Musil wrote:
> >
> >
> > On Thu, Jul 23, 2020 at 10:35 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>
> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >
> >     Hi,
> >
> >     On 7/23/20 8:38 AM, Ales Musil wrote:
> >     >
> >     >
> >     > On Wed, Jul 22, 2020 at 9:41 PM Jiří Sléžka
> mailto:jiri.sle...@slu.cz>
> >     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
> >     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
> >     >
> >     >     Hello,
> >     >
> >     >
> >     > Hi,
> >     >
> >     >
> >     >     CentOS8, oVirt 4.4.1.10-1.el8
> >     >
> >     >     I am trying to setup active-backup (mode=1) bonding mode
> with
> >     custom
> >     >     properties. I have one 10GE switch, the second is just 1G.
> >     10GE link is
> >     >     the primary one.
> >     >
> >     >     cat /etc/sysconfig/network-scripts/ifcfg-bond0
> >     >
> >     >
> >     > first of all in oVirt 4.4 the network-scripts are not relevant
> >     anymore.
> >     > More relevant is output from 'nmstatectl show'.
> >
> >     thanks, I believed that ifcfg files still describes saved
> interface
> >     configuration (even on nm managed interfaces)...
> >
> >
> > It does but it might not be that detailed as we would have hoped for.
> > Another reason why I said that it is not relevant is of course if
> > someone tries
> > reconfigure the interface through network-scripts.
> 
> well, honestly I did that (modified ifcfg and then use nmcli con
> reload). So right way is using nmcli con modify command?
> 
> 
> Yes or nmstate. Just be aware that anything that you do to interface
> outside of oVirt can have harmful impacts on the host and overall oVirt
> state.
>  
> 
> 
> >     from nmstatectl show I can see that bond0 has specified mac
> address
> >
> >   
>  
> https://paste.slu.cz/?d363cf2c029f6b83#Ew2rCiYyNGrdfffy6bvzSjbb8x4jJsaUdhxkjwThMFka
> >
> >     >     BONDING_OPTS="active_slave=ens5 downdelay=0 miimon=100
> >     >     mode=active-backup primary=ens5 updelay=0"
> >     >     TYPE=Bond
> >     >     BONDING_MASTER=yes
> >     >     PROXY_METHOD=none
> >     >     BROWSER_ONLY=no
> >     >     IPV4_FAILURE_FATAL=no
> >     >     IPV6_DISABLED=yes
> >     >     IPV6INIT=no
> >     >     NAME=bond0
> >     >     UUID=c054364e-47cf-47ee-a7fc-70b37c9977e7
> >     >     DEVICE=bond0
> >     >     ONBOOT=yes
> >     >     MTU=9000
> >     >
> >     >     When I try to add a custom parameter "fail_over_mac=active"
> >     (which I
> >     >     believe could solve my problems with stalled mac
> addresses in
> >     switch's
> >     >     cam table in case of failover) I got...
> >     >
> >     >     "Error while executing action HostSetupNetworks: Unexpected
> >     exception"
> >     >
> >     >     ...in manager. In the engine.log it looks like
> >     >
> >     >     2020-07-22 21:20:35,774+02 WARN
> >     >   
> >   
>   [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> >     >     (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0]
> Unexpected
> >     >     return value: Status [code=-32603, message=Internal JSON-RPC
> >     error:
> >     >     {'reason': 'MAC address cannot be specified in bond
> interface
> >     along with
> >     >     specified bond options'}]
> >     >     2020-07-22 21:20:35,774+02 ERROR
> >     >   
> >   
>   [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> >     >     (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0]

[ovirt-users] Re: problem with custom bond options

2020-07-23 Thread Jiří Sléžka
On 7/23/20 11:03 AM, Ales Musil wrote:
> 
> 
> On Thu, Jul 23, 2020 at 10:35 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi,
> 
> On 7/23/20 8:38 AM, Ales Musil wrote:
> >
> >
> > On Wed, Jul 22, 2020 at 9:41 PM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>
> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >
> >     Hello,
> >
> >
> > Hi,
> >
> >
> >     CentOS8, oVirt 4.4.1.10-1.el8
> >
> >     I am trying to setup active-backup (mode=1) bonding mode with
> custom
> >     properties. I have one 10GE switch, the second is just 1G.
> 10GE link is
> >     the primary one.
> >
> >     cat /etc/sysconfig/network-scripts/ifcfg-bond0
> >
> >
> > first of all in oVirt 4.4 the network-scripts are not relevant
> anymore.
> > More relevant is output from 'nmstatectl show'.
> 
> thanks, I believed that ifcfg files still describes saved interface
> configuration (even on nm managed interfaces)...
> 
> 
> It does but it might not be that detailed as we would have hoped for.
> Another reason why I said that it is not relevant is of course if
> someone tries
> reconfigure the interface through network-scripts.

well, honestly I did that (modified ifcfg and then use nmcli con
reload). So right way is using nmcli con modify command?

> from nmstatectl show I can see that bond0 has specified mac address
> 
> 
> https://paste.slu.cz/?d363cf2c029f6b83#Ew2rCiYyNGrdfffy6bvzSjbb8x4jJsaUdhxkjwThMFka
> 
> >     BONDING_OPTS="active_slave=ens5 downdelay=0 miimon=100
> >     mode=active-backup primary=ens5 updelay=0"
> >     TYPE=Bond
> >     BONDING_MASTER=yes
> >     PROXY_METHOD=none
> >     BROWSER_ONLY=no
> >     IPV4_FAILURE_FATAL=no
> >     IPV6_DISABLED=yes
> >     IPV6INIT=no
> >     NAME=bond0
> >     UUID=c054364e-47cf-47ee-a7fc-70b37c9977e7
> >     DEVICE=bond0
> >     ONBOOT=yes
> >     MTU=9000
> >
> >     When I try to add a custom parameter "fail_over_mac=active"
> (which I
> >     believe could solve my problems with stalled mac addresses in
> switch's
> >     cam table in case of failover) I got...
> >
> >     "Error while executing action HostSetupNetworks: Unexpected
> exception"
> >
> >     ...in manager. In the engine.log it looks like
> >
> >     2020-07-22 21:20:35,774+02 WARN
> >   
>  [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> >     (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Unexpected
> >     return value: Status [code=-32603, message=Internal JSON-RPC
> error:
> >     {'reason': 'MAC address cannot be specified in bond interface
> along with
> >     specified bond options'}]
> >     2020-07-22 21:20:35,774+02 ERROR
> >   
>  [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> >     (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Failed in
> >     'HostSetupNetworksVDS' method
> >     2020-07-22 21:20:35,774+02 WARN
> >   
>  [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> >     (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Unexpected
> >     return value: Status [code=-32603, message=Internal JSON-RPC
> error:
> >     {'reason': 'MAC address cannot be specified in bond interface
> along with
> >     specified bond options'}]
> >     2020-07-22 21:20:35,811+02 ERROR
> >   
>  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
> >     (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] EVENT_ID:
> >     VDS_BROKER_COMMAND_FAILURE(10,802), VDSM ovirt-hci01.mch.local
> command
> >     HostSetupNetworksVDS failed: Internal JSON-RPC error:
> {'reason': 'MAC
> >     address cannot be specified in bond interface along with
> specified bond
> >     options'}
> >
> >
> > Can you please share supervdsm.log from the relevant host?
> 
> here it is
> 
> 
> https://paste.slu.cz/?ef8bd7eeae8eeaed#Ej3MXRufm6Y9qjKCkgCXXieP132kRAiswR17ygxQDhft
> 
> 
> This indeed seems to be a bug. Can you please open BZ
> <https://bugzilla.redhat.com/enter_bug.cgi?product=v

[ovirt-users] Re: problem with custom bond options

2020-07-23 Thread Jiří Sléžka
Hi,

On 7/23/20 8:38 AM, Ales Musil wrote:
> 
> 
> On Wed, Jul 22, 2020 at 9:41 PM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hello,
> 
> 
> Hi,
> 
> 
> CentOS8, oVirt 4.4.1.10-1.el8
> 
> I am trying to setup active-backup (mode=1) bonding mode with custom
> properties. I have one 10GE switch, the second is just 1G. 10GE link is
> the primary one.
> 
> cat /etc/sysconfig/network-scripts/ifcfg-bond0
> 
> 
> first of all in oVirt 4.4 the network-scripts are not relevant anymore.
> More relevant is output from 'nmstatectl show'.

thanks, I believed that ifcfg files still describes saved interface
configuration (even on nm managed interfaces)...

from nmstatectl show I can see that bond0 has specified mac address

https://paste.slu.cz/?d363cf2c029f6b83#Ew2rCiYyNGrdfffy6bvzSjbb8x4jJsaUdhxkjwThMFka

> BONDING_OPTS="active_slave=ens5 downdelay=0 miimon=100
> mode=active-backup primary=ens5 updelay=0"
> TYPE=Bond
> BONDING_MASTER=yes
> PROXY_METHOD=none
> BROWSER_ONLY=no
> IPV4_FAILURE_FATAL=no
> IPV6_DISABLED=yes
> IPV6INIT=no
> NAME=bond0
> UUID=c054364e-47cf-47ee-a7fc-70b37c9977e7
> DEVICE=bond0
> ONBOOT=yes
> MTU=9000
> 
> When I try to add a custom parameter "fail_over_mac=active" (which I
> believe could solve my problems with stalled mac addresses in switch's
> cam table in case of failover) I got...
> 
> "Error while executing action HostSetupNetworks: Unexpected exception"
> 
> ...in manager. In the engine.log it looks like
> 
> 2020-07-22 21:20:35,774+02 WARN
> [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Unexpected
> return value: Status [code=-32603, message=Internal JSON-RPC error:
> {'reason': 'MAC address cannot be specified in bond interface along with
> specified bond options'}]
> 2020-07-22 21:20:35,774+02 ERROR
> [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Failed in
> 'HostSetupNetworksVDS' method
> 2020-07-22 21:20:35,774+02 WARN
> [org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
> (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Unexpected
> return value: Status [code=-32603, message=Internal JSON-RPC error:
> {'reason': 'MAC address cannot be specified in bond interface along with
> specified bond options'}]
> 2020-07-22 21:20:35,811+02 ERROR
> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
> (default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] EVENT_ID:
> VDS_BROKER_COMMAND_FAILURE(10,802), VDSM ovirt-hci01.mch.local command
> HostSetupNetworksVDS failed: Internal JSON-RPC error: {'reason': 'MAC
> address cannot be specified in bond interface along with specified bond
> options'}
> 
> 
> Can you please share supervdsm.log from the relevant host?

here it is

https://paste.slu.cz/?ef8bd7eeae8eeaed#Ej3MXRufm6Y9qjKCkgCXXieP132kRAiswR17ygxQDhft

Cheers,

Jiri


> Could anybody explain me what 'MAC address cannot be specified in bond
> interface along with specified bond options' means? I believe a MAC
> address is not configured in interface configuration.
> 
> Or does it mean 'fail_over_mac=active' is not supported in oVirt?
> 
> Thanks in advance,
> 
> Jiri
> 
> 
> 
> 
> 
> ___
> Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
> To unsubscribe send an email to users-le...@ovirt.org
> <mailto:users-le...@ovirt.org>
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/BUGNSEBD3OBSUPASLJQYYJIF5767XMDE/
> 
> 
> 
> Thank you.
> Regards,
> Ales
> 
> -- 
> 
> Ales Musil
> 
> Software Engineer - RHV Network
> 
> Red Hat EMEA <https://www.redhat.com>
> 
> amu...@redhat.com <mailto:amu...@redhat.com>    IM: amusil
> 
> <https://red.ht/sig>
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/YQ5SW7FIHB57UIDB37WEHEIFKZOXVA6F/


[ovirt-users] problem with custom bond options

2020-07-22 Thread Jiří Sléžka
Hello,

CentOS8, oVirt 4.4.1.10-1.el8

I am trying to setup active-backup (mode=1) bonding mode with custom
properties. I have one 10GE switch, the second is just 1G. 10GE link is
the primary one.

cat /etc/sysconfig/network-scripts/ifcfg-bond0

BONDING_OPTS="active_slave=ens5 downdelay=0 miimon=100
mode=active-backup primary=ens5 updelay=0"
TYPE=Bond
BONDING_MASTER=yes
PROXY_METHOD=none
BROWSER_ONLY=no
IPV4_FAILURE_FATAL=no
IPV6_DISABLED=yes
IPV6INIT=no
NAME=bond0
UUID=c054364e-47cf-47ee-a7fc-70b37c9977e7
DEVICE=bond0
ONBOOT=yes
MTU=9000

When I try to add a custom parameter "fail_over_mac=active" (which I
believe could solve my problems with stalled mac addresses in switch's
cam table in case of failover) I got...

"Error while executing action HostSetupNetworks: Unexpected exception"

...in manager. In the engine.log it looks like

2020-07-22 21:20:35,774+02 WARN
[org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
(default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Unexpected
return value: Status [code=-32603, message=Internal JSON-RPC error:
{'reason': 'MAC address cannot be specified in bond interface along with
specified bond options'}]
2020-07-22 21:20:35,774+02 ERROR
[org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
(default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Failed in
'HostSetupNetworksVDS' method
2020-07-22 21:20:35,774+02 WARN
[org.ovirt.engine.core.vdsbroker.vdsbroker.HostSetupNetworksVDSCommand]
(default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] Unexpected
return value: Status [code=-32603, message=Internal JSON-RPC error:
{'reason': 'MAC address cannot be specified in bond interface along with
specified bond options'}]
2020-07-22 21:20:35,811+02 ERROR
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(default task-8) [da1984f3-f38b-4e0a-ac80-a81e67d73ff0] EVENT_ID:
VDS_BROKER_COMMAND_FAILURE(10,802), VDSM ovirt-hci01.mch.local command
HostSetupNetworksVDS failed: Internal JSON-RPC error: {'reason': 'MAC
address cannot be specified in bond interface along with specified bond
options'}


Could anybody explain me what 'MAC address cannot be specified in bond
interface along with specified bond options' means? I believe a MAC
address is not configured in interface configuration.

Or does it mean 'fail_over_mac=active' is not supported in oVirt?

Thanks in advance,

Jiri







smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/BUGNSEBD3OBSUPASLJQYYJIF5767XMDE/


[ovirt-users] Re: oVirt localization: you can help!

2020-06-22 Thread Jiří Sléžka
On 6/17/20 8:48 AM, Sandro Bonazzola wrote:
> Hi,
> if you have some time, here is a chance for helping oVirt project
> without requiring development skills.
> Help us localize oVirt to your natural language!
> oVirt Engine needs some work for:
> (see 
> https://zanata.phx.ovirt.org/iteration/view/ovirt-engine/ovirt-4.4?dswid=6591 
> )

I would like to, but it looks like I can't login (generate activation
mail) using Fedora or Google account...

An unexpected error has occurred. Please report this problem with
details of what you were attempting.

but I don't have a Jira account to do that...

Cheers,

Jiri

> 
>   * Czech 33.14% Translated
>   * German 98.87% Translated
>   * Italian 80.02% Translated
>   * Korean 99.72% Translated
>   * Portuguese (Brazil) 99.72% Translated
>   * Russian 30.73% Translated
>   * Spanish 98.87% Translated
> 
> ovirt-engine-ui-extensions:
> (see 
> https://zanata.phx.ovirt.org/iteration/view/ovirt-engine-ui-extensions/1.1?dswid=-5868
>  ) 
> 
>   * Czech 22.82%Translated
>   * German 89.9% Translated
>   * Italian 15.62% Translated
>   * Korean 99.17% Translated
>   * Portuguese (Brazil) 89.9% Translated
>   * Spanish 89.9% Translated
> 
> ovirt-web-ui
> (see https://zanata.phx.ovirt.org/iteration/view/ovirt-web-ui/1.6?dswid=-9733 
> )
> 
>   * Czech 24.8% Translated
>   * German 98.33% Translated
>   * Italian 12.85% Translated
>   * Korean 98.51% Translated
>   * Portuguese (Brazil) 98.33% Translated
> 
> If you're trying to help and  you encounter any issue with the
> translation platform let us know and we'll help you solve them.
> 
> Thanks,
> -- 
> 
> Sandro Bonazzola
> 
> MANAGER, SOFTWARE ENGINEERING, EMEA R RHV
> 
> Red Hat EMEA 
> 
> sbona...@redhat.com    
> 
>  
> 
> **
> *Red Hat respects your work life balance. Therefore there is no need to
> answer this email out of your office hours.
> *
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/NYX3AXWEPCNBKKS6O65KNXXAP2UWRWG6/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/CUNTEMHLASGSM7ALUVMCSVU6PIGORQQP/


[ovirt-users] Re: basic infra and glusterfs sizing question

2020-06-04 Thread Jiří Sléžka
On 5/30/20 3:48 PM, Jp wrote:
> I'm running oVirt + Gluster in HCI config and had similar questions
> as you when building it out.

I think it would be nice to have some (best practice) design guides...
but there are so many possibilities how to build a oVirt cluster... This
time I try to build very cheap solution with as much redundancy as is
feasible. But of course what is chep cannot be rock-solid...

>> - single point of failure in this router (not really - just in case
>> oVirt is badly broken and I need to access internal vlans to
>> recover it)
> 
> There is no SPOF if you're doing 3x HCI nodes.  I regularly put 1 of
> my 3 Nodes into Maintenance or shutdown Gluster and have had no
> SPOFs.  Are you only doing a single Node?  If so, the point of
> failure is ... that 1 node :)

you are righ, I ment hypotetical situation with non functional HE vm,
broken gluster etc...

>> * have this router as virtual appliance inside oVirt (something
>> like pfSense for example)
> 
> I'm running pfSense in hardware still (a Netgate ARM device).
> There's plenty of opinions on Reddit, StackOverflow, etc. about
> running any router in VM.  There's several steps you'd need to take
> when I looked into it, and if you setup pfSense's interfaces as
> virtio / vhost I'd imagine you'd bump into limitations b/c those para
> devices weren't intended to do things like hardware offload, advanced
> routing, etc.; so you may have to setup PCI passthru / SR-IOV to get
> all of pfSense's routing capabilities.  So I'm keeping pfSense in
> hardware ... though I've thought of creating a backup pfSense
> instance in VM encase of hardware disaster to keep my Internet up in
> "limp mode" ... but creating a cellular Hotspot is my current backup
> plan :)

thanks for sharing your experience.

I will try to keep my topology as simple as possible in the start.
pfSense appliance is something I can add later.

>> Install all hosts and HE with public addresses
> 
> Why?  The HE is a manager to the cluster and sits on the management
> network (ovirtmgmt), so giving it public IPs would be adding a
> security risk to the setup.  I keep my HE accessible only via local
> VLAN and that's how most folks lock it down.  Are you thinking the HE
> or HCI includes a load balancer?  Eitherway, oVirt doesn't, but
> putting a load balancer in front of VM's and giving it your public IP
> would make more sense for exposing things to the Internet ... but I'm
> assuming too much and don't know what your cluster will be running. 

just for sure I can access it in case of disaster recovery. But it is
overkill and of course security risk. My problem is that I have no other
access to my housing other then through public ips. No problem, I will
add dedicated router which will act as gw for local vlans, NAT and vpn
gw and will keep oVirt hosts inside on private space.

Once more thanks for brainstorming :-)

Cheers,

Jiri


> ___ Users mailing list --
> users@ovirt.org To unsubscribe send an email to
> users-le...@ovirt.org Privacy Statement:
> https://www.ovirt.org/privacy-policy.html oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/ List
> Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/BCV75LWZ6KTBTP23OIEYIQOMH42RDO3I/
>



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/BBRHMZVHJZXTHAXWIKLICMJTS2CCO5KN/


[ovirt-users] Re: basic infra and glusterfs sizing question

2020-06-04 Thread Jiří Sléžka
On 5/29/20 5:27 PM, Jayme wrote:
> Also, I can't think of the limit off the top of my head. I believe it's
> either 75 or 100Gb. If the engine volume is set any lower the
> installation will fail. There is a minimum size requirement.

thanks for reply. Meantime I was looking into RHV 4.4 beta docs and the
limit is mentioned there

Minimum Total - 55 GB

https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.4-beta/html/installing_red_hat_virtualization_as_a_self-hosted_engine_using_the_command_line/rhv_requirements#Storage_Requirements_SHE_cli_deploy

but oVirt HE vm is created with 51GB disk

as Strahil mentioned in other post in this thread cockpit/ansible
deploys engine lv for gluster brick as thick volume. Data and vmstore is
deployed as thin volume.

I will probably use default 100GB thick volume... (and only one other
volume for vms on ssd disk)

Cheers,

Jiri

> 
> On Fri, May 29, 2020 at 12:09 PM Jayme  <mailto:jay...@gmail.com>> wrote:
> 
> Regarding Gluster question. The volumes would be provisioned with
> LVM on the same block device. I believe 100Gb is recommended for the
> engine volume. The other volumes such as data would be created on
> another logical volume and you can use up the rest of the available
> space there. Ex. 100gb engine, 500Gb data and 400Gb vmstore. 
> 
> Data domains are basically the same now, in the past there used to
> be different domain types such as ISO domains which are deprecated.
> You don't really need any more than engine volume and data volume. 
> You could have a volume for storing ISOs if you wanted to. You could
> have a separate volume for OS disks and another volume for data
> disks which would give you more flexibility for backups (so that you
> could backup data disks but not OS for example). 
> 
> On Fri, May 29, 2020 at 10:29 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hello,
> 
> I am just curious if basic gluster HCI layout which is suggested in
> cockpit has some deeper meaning.
> 
> There are suggested 3 volumes
> 
> * engine - it is clear, it is the volume where engine vm is running.
> When this vm is 51GB big how small could this volume be? I have
> 1TB SSD
> storage and I would like utilize it as much as possible. Could I
> create
> this volume as small as this vm is? Is it safe for example for
> future
> upgrades?
> 
> * vmstore - it make sense it is a space for all other vms running in
> oVirt. Right?
> 
> * data - which purpose has this volume? other data like for example
> ISOs? Direct disks?
> 
> Another infra question... or maybe request for comment
> 
> I have small amount of public ipv4 addresses in my housing (but
> I have
> own switches there so I can create vlans and separate internal
> traffic).
> I can access only these public ipv4 addresses directly. I would
> like to
> conserve these addressess as much as possible so what is the best
> approach in your opinion?
> 
> * Install all hosts and HE with management network on private
> addressess
> 
>   * have small router (hw appliance with for example LEDE) which
> will
> utilize one ipv4 address and will do NAT and vpn for accessing my
> internals vlans.
>     + looks like simple approach to me
>     - single point of failure in this router (not really - just
> in case
> oVirt is badly broken and I need to access internal vlans to
> recover it)
> 
>   * have this router as virtual appliance inside oVirt
> (something like
> pfSense for example)
>     + no need hw router
>     + not sure but I could probably configure vrrp redundancy
>     - still single point of failure like in first case
> 
>   * any other approach? Could ovn help here somehow?
> 
> * Install all hosts and HE with public addresses :-)
>   + access to all hosts directly
>   - 3 node HCI cluster uses 4 public ip addressess
> 
> Thanks for your opinions
> 
> Cheers,
> 
> Jiri
> 
> ___
> Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
> To unsubscribe send an email to users-le...@ovirt.org
> <mailto:users-le...@ovirt.org>
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.

[ovirt-users] basic infra and glusterfs sizing question

2020-05-29 Thread Jiří Sléžka
Hello,

I am just curious if basic gluster HCI layout which is suggested in
cockpit has some deeper meaning.

There are suggested 3 volumes

* engine - it is clear, it is the volume where engine vm is running.
When this vm is 51GB big how small could this volume be? I have 1TB SSD
storage and I would like utilize it as much as possible. Could I create
this volume as small as this vm is? Is it safe for example for future
upgrades?

* vmstore - it make sense it is a space for all other vms running in
oVirt. Right?

* data - which purpose has this volume? other data like for example
ISOs? Direct disks?

Another infra question... or maybe request for comment

I have small amount of public ipv4 addresses in my housing (but I have
own switches there so I can create vlans and separate internal traffic).
I can access only these public ipv4 addresses directly. I would like to
conserve these addressess as much as possible so what is the best
approach in your opinion?

* Install all hosts and HE with management network on private addressess

  * have small router (hw appliance with for example LEDE) which will
utilize one ipv4 address and will do NAT and vpn for accessing my
internals vlans.
+ looks like simple approach to me
- single point of failure in this router (not really - just in case
oVirt is badly broken and I need to access internal vlans to recover it)

  * have this router as virtual appliance inside oVirt (something like
pfSense for example)
+ no need hw router
+ not sure but I could probably configure vrrp redundancy
- still single point of failure like in first case

  * any other approach? Could ovn help here somehow?

* Install all hosts and HE with public addresses :-)
  + access to all hosts directly
  - 3 node HCI cluster uses 4 public ip addressess

Thanks for your opinions

Cheers,

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/LIFQQHTFVTS6KICR5MTRPGO5CH7QDLK7/


[ovirt-users] Re: oVirt4.4 HCI single host mortal combat

2020-05-28 Thread Jiří Sléžka
On 5/27/20 11:19 AM, Jiří Sléžka wrote:
> Hi,
> 
> I am still fighting with oVirt 4.4 installation in HCI single host
> configuration. Seems to be hard fighter... ;-)
> 
> It looks like there is no 4.4 HCI single host installation guide so I am
> using compilation of this sources
> 
> * https://www.ovirt.org/download/
> *
> https://www.ovirt.org/documentation/gluster-hyperconverged/chap-Single_node_hyperconverged.html
> 
> I did
> 
> * clean minimal install of CentOS 8.1
> * setup networks (I am using vlans on bond for internal traffic)
> 
> dnf update -y
> dnf install https://resources.ovirt.org/pub/yum-repo/ovirt-release44.rpm
> 
> dnf install cockpit cockpit-ovirt-dashboard vdsm-gluster
> ovirt-engine-appliance glusterfs-server gluster-ansible-roles
> 
> systemctl enable --now cockpit.socket
> firewall-cmd --add-service=cockpit
> firewall-cmd --add-service=cockpit --permanent
> 
> ssh-keygen
> ssh-copy-id root@10.0.4.11
> ssh root@10.0.4.11
> 
> (10.0.4.11 is local address on vlan which I would like use as storage
> network)
> 
> I would like to run gluster-ansible-roles from command line but I am not
> sure how exactly do it right way so I am going the cockpit way and the
> gluster part works like expected.
> 
> Next Boss is HE install
> 
> Round 1, Fight!
> 
> The hosted engine wizzard ends with missing ca-cert.pem (unfortunately I
> close the window and cannot find that log anymore). But it looks to me
> like problem mentioned in
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/4PUIES2JGAFHJY6BJG5MQY4URRE3STKS/
> 
> I have switched to command line...
> 
> Round 2, Fight!
> 
> ovirt-hosted-engine-cleanup
> 
> ovirt-hosted-engine-setup
> 
> [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "The
> host has been set in non_operational status, deployment errors:   code
> 505: Host ovirt-hci01.stud.slu.cz installation failed. Failed to
> configure management network on the host.,code 9000: Failed to
> verify Power Management configuration for Host ovirt-hci01.stud.slu.cz.,
>   fix accordingly and re-deploy."}
> 
> /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20200526210340-h7waks.log
> 
> interesting thing is the vm looks like running in some way
> 
> ps aux | grep kvm
> qemu 26790 58.2  6.6 6927972 3295760 ? Sl   21:28  12:40
> /usr/libexec/qemu-kvm -name guest=HostedEngineLocal,debug-threads=on -S
> -object
> secret,id=masterKey0,format=raw,file=/var/lib/libvirt/qemu/domain-1-HostedEngineLocal/master-key.aes
> -machine pc-q35-rhel8.1.0,accel=kvm,usb=off,dump-guest-core=off -cpu
> Nehalem-IBRS,vme=on,ss=on,x2apic=on,tsc...
> ...
> 
> but
> 
> hosted-engine --vm-status
> It seems like a previous attempt to deploy hosted-engine failed or it's
> still in progress. Please clean it up before trying again
> 
> hosted-engine --check-deployed
> The hosted engine has not been deployed
> 
> ok...
> 
> Round 3, Fight!
> 
> ovirt-hosted-engine-cleanup
> 
> ovirt-hosted-engine-setup --config-append=/root/answers-20200526214934.conf
> 
> [ ERROR ] fatal: [localhost]: FAILED! => {"changed": true, "cmd":
> ["virt-install", "-n", "HostedEngineLocal", "--os-variant", "rhel8.0",
> "--virt-type", "kvm", "--memory", "4096", "--vcpus", "2", "--network",
> "network=default,mac=00:16:3e:7a:ce:77,model=virtio", "--disk",
> "/var/tmp/localvmc7szuw4y/images/6c7c4d4b-9c11-485d-98e0-466a09888515/c16b87ac-f9d4-491d-a972-7dc333a324a0",
> "--import", "--disk",
> "path=/var/tmp/localvmc7szuw4y/seed.iso,device=cdrom",
> "--noautoconsole", "--rng", "/dev/random", "--graphics", "vnc",
> "--video", "vga", "--sound", "none", "--controller", "usb,model=none",
> "--memballoon", "none", "--boot", "hd,menu=off", "--clock",
> "kvmclock_present=yes"], "delta": "0:00:04.419991", "end": "2020-05-26
> 22:27:20.730780", "msg": "non-zero return code", "rc": 1, "start":
> "2020-05-26 22:27:16.310789", "stderr": "ERRORinternal error:
> process exited while connecting to monitor: 2020-05-26T20:27:19.254675Z
> qemu-kvm: -object
> tls-creds-x509,id=vnc-tls-creds0,dir=/etc/pki/vdsm/libvirt-vnc,endpoint=server,verify-peer=no:
> Unable to access credentials /etc/pki/vdsm/lib

[ovirt-users] oVirt4.4 HCI single host mortal combat

2020-05-27 Thread Jiří Sléžka
Hi,

I am still fighting with oVirt 4.4 installation in HCI single host
configuration. Seems to be hard fighter... ;-)

It looks like there is no 4.4 HCI single host installation guide so I am
using compilation of this sources

* https://www.ovirt.org/download/
*
https://www.ovirt.org/documentation/gluster-hyperconverged/chap-Single_node_hyperconverged.html

I did

* clean minimal install of CentOS 8.1
* setup networks (I am using vlans on bond for internal traffic)

dnf update -y
dnf install https://resources.ovirt.org/pub/yum-repo/ovirt-release44.rpm

dnf install cockpit cockpit-ovirt-dashboard vdsm-gluster
ovirt-engine-appliance glusterfs-server gluster-ansible-roles

systemctl enable --now cockpit.socket
firewall-cmd --add-service=cockpit
firewall-cmd --add-service=cockpit --permanent

ssh-keygen
ssh-copy-id root@10.0.4.11
ssh root@10.0.4.11

(10.0.4.11 is local address on vlan which I would like use as storage
network)

I would like to run gluster-ansible-roles from command line but I am not
sure how exactly do it right way so I am going the cockpit way and the
gluster part works like expected.

Next Boss is HE install

Round 1, Fight!

The hosted engine wizzard ends with missing ca-cert.pem (unfortunately I
close the window and cannot find that log anymore). But it looks to me
like problem mentioned in
https://lists.ovirt.org/archives/list/users@ovirt.org/message/4PUIES2JGAFHJY6BJG5MQY4URRE3STKS/

I have switched to command line...

Round 2, Fight!

ovirt-hosted-engine-cleanup

ovirt-hosted-engine-setup

[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "The
host has been set in non_operational status, deployment errors:   code
505: Host ovirt-hci01.stud.slu.cz installation failed. Failed to
configure management network on the host.,code 9000: Failed to
verify Power Management configuration for Host ovirt-hci01.stud.slu.cz.,
  fix accordingly and re-deploy."}

/var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20200526210340-h7waks.log

interesting thing is the vm looks like running in some way

ps aux | grep kvm
qemu 26790 58.2  6.6 6927972 3295760 ? Sl   21:28  12:40
/usr/libexec/qemu-kvm -name guest=HostedEngineLocal,debug-threads=on -S
-object
secret,id=masterKey0,format=raw,file=/var/lib/libvirt/qemu/domain-1-HostedEngineLocal/master-key.aes
-machine pc-q35-rhel8.1.0,accel=kvm,usb=off,dump-guest-core=off -cpu
Nehalem-IBRS,vme=on,ss=on,x2apic=on,tsc...
...

but

hosted-engine --vm-status
It seems like a previous attempt to deploy hosted-engine failed or it's
still in progress. Please clean it up before trying again

hosted-engine --check-deployed
The hosted engine has not been deployed

ok...

Round 3, Fight!

ovirt-hosted-engine-cleanup

ovirt-hosted-engine-setup --config-append=/root/answers-20200526214934.conf

[ ERROR ] fatal: [localhost]: FAILED! => {"changed": true, "cmd":
["virt-install", "-n", "HostedEngineLocal", "--os-variant", "rhel8.0",
"--virt-type", "kvm", "--memory", "4096", "--vcpus", "2", "--network",
"network=default,mac=00:16:3e:7a:ce:77,model=virtio", "--disk",
"/var/tmp/localvmc7szuw4y/images/6c7c4d4b-9c11-485d-98e0-466a09888515/c16b87ac-f9d4-491d-a972-7dc333a324a0",
"--import", "--disk",
"path=/var/tmp/localvmc7szuw4y/seed.iso,device=cdrom",
"--noautoconsole", "--rng", "/dev/random", "--graphics", "vnc",
"--video", "vga", "--sound", "none", "--controller", "usb,model=none",
"--memballoon", "none", "--boot", "hd,menu=off", "--clock",
"kvmclock_present=yes"], "delta": "0:00:04.419991", "end": "2020-05-26
22:27:20.730780", "msg": "non-zero return code", "rc": 1, "start":
"2020-05-26 22:27:16.310789", "stderr": "ERRORinternal error:
process exited while connecting to monitor: 2020-05-26T20:27:19.254675Z
qemu-kvm: -object
tls-creds-x509,id=vnc-tls-creds0,dir=/etc/pki/vdsm/libvirt-vnc,endpoint=server,verify-peer=no:
Unable to access credentials /etc/pki/vdsm/libvirt-vnc/ca-cert.pem: No
such file or directory\nDomain installation does not appear to have been
successful.\nIf it was, you can restart your domain by running:\n  virsh
--connect qemu:///system start HostedEngineLocal\notherwise, please
restart your installation.", "stderr_lines": ["ERRORinternal error:
process exited while connecting to monitor: 2020-05-26T20:27:19.254675Z
qemu-kvm: -object
tls-creds-x509,id=vnc-tls-creds0,dir=/etc/pki/vdsm/libvirt-vnc,endpoint=server,verify-peer=no:
Unable to access credentials /etc/pki/vdsm/libvirt-vnc/ca-cert.pem: No
such file or directory", "Domain installation does not appear to have
been successful.", "If it was, you can restart your domain by running:",
"  virsh --connect qemu:///system start HostedEngineLocal", "otherwise,
please restart your installation."], "stdout": "\nStarting install...",
"stdout_lines": ["", "Starting install..."]}

...and this error I get every round

relevant lines from
/var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20200526223648-dvejcr.log

2020-05-26 22:49:15,303+0200 DEBUG

[ovirt-users] Re: searching in manager

2020-05-25 Thread Jiří Sléžka
Hi Eli,

> Can you please try :
> 
> Storage.name = "RHEV-SlowStorage" or Storage.name = "RHEV-MidStorage"

yep, it works as exceptet. Also

Storage.name != "RHEV-SSDStorage" works well.

I was mistaken by autocomplete function which suggests

Vms: Storage.
Vms: Storage =
Vms: Storage !=

the first suggested option is the right way to follow.

Thanks,

Jiri

> 
> 
> On Fri, May 22, 2020 at 1:34 PM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi,
> 
> oVirt 4.3.9.4-1.el7
> 
> I am trying to search a vms which has storage domain other than
> RHEV-SSDStorage so I entered this search string in Manager in Compute /
> Virtual Machines search form.
> 
> Storage != "RHEV-SSDStorage"
> 
> but it looks like all vms are listed.
> 
> When I try...
> 
> Storage = "RHEV-SSDStorage"
> 
> ...it works like expected and only right vms are listed.
> 
> Also combinations seems does not work.
> 
> Storage = "RHEV-SlowStorage" or Storage = "RHEV-MidStorage"
> 
> ...returns 0 rows.
> 
> Should it work?
> 
> Cheers,
> 
> Jiri
> 
> ___
> Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
> To unsubscribe send an email to users-le...@ovirt.org
> <mailto:users-le...@ovirt.org>
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/RVFGTGGM75U2M4ENW2K4DWFK34BPWJWH/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/HFCD44KWK73VU22GLENLRRQBE3S3ICIP/


[ovirt-users] Re: how to save stateless disk

2020-05-25 Thread Jiří Sléžka
On 5/22/20 1:47 PM, Michal Skrivanek wrote:
> 
> 
>> On 22 May 2020, at 12:52, Jiří Sléžka  wrote:
>>
>> Hi,
>>
>> I have one vm configured as stateless (useful for example for testing
>> ansible deploying). But now I am in the middle of work and we have
>> planned power outage. If I power down this wm now I will lost my work.
> 
> suspend/resume _may_ work. I would suggest to try that first:)
> If it doesn’t work it’s worth a bug

While I try to suspend stateless vm in oVirt 4.3.9.4-1.el7 I get "Cannot
suspend VM, VM is stateless.".

But as Eli Mesika mentioned it should work in latest 4.4.

Cheers,

Jiri

> 
>>
>> It looks like I am unable to create snapshot if vm is in stateless
>> state. Is there any trick how to not loose my stateless state?
>>
>> Is it RFE worth problem? 
>>
>> Cheers,
>>
>> Jiri
>>
>> ___
>> Users mailing list -- users@ovirt.org
>> To unsubscribe send an email to users-le...@ovirt.org
>> Privacy Statement: https://www.ovirt.org/privacy-policy.html
>> oVirt Code of Conduct: 
>> https://www.ovirt.org/community/about/community-guidelines/
>> List Archives: 
>> https://lists.ovirt.org/archives/list/users@ovirt.org/message/BZCJI4U3WQ22AHUU234TSNN5ZXJLXXCP/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/HV3DB7MDZM5UK6OQJO5KKHCR4QQJ2FV6/


[ovirt-users] how to save stateless disk

2020-05-22 Thread Jiří Sléžka
Hi,

I have one vm configured as stateless (useful for example for testing
ansible deploying). But now I am in the middle of work and we have
planned power outage. If I power down this wm now I will lost my work.

It looks like I am unable to create snapshot if vm is in stateless
state. Is there any trick how to not loose my stateless state?

Is it RFE worth problem?

Cheers,

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/BZCJI4U3WQ22AHUU234TSNN5ZXJLXXCP/


[ovirt-users] searching in manager

2020-05-22 Thread Jiří Sléžka
Hi,

oVirt 4.3.9.4-1.el7

I am trying to search a vms which has storage domain other than
RHEV-SSDStorage so I entered this search string in Manager in Compute /
Virtual Machines search form.

Storage != "RHEV-SSDStorage"

but it looks like all vms are listed.

When I try...

Storage = "RHEV-SSDStorage"

...it works like expected and only right vms are listed.

Also combinations seems does not work.

Storage = "RHEV-SlowStorage" or Storage = "RHEV-MidStorage"

...returns 0 rows.

Should it work?

Cheers,

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RVFGTGGM75U2M4ENW2K4DWFK34BPWJWH/


[ovirt-users] locked snapshot after snapshot preview

2020-05-20 Thread Jiří Sléžka
Hi,

Ovirt 4.3.9.4-1.el7, CentOS7...

colleague of me tried yesterday do a snapshot preview on his vm but
operation got stalled.

I can see in Task list in Manager operation "Preview VM Snapshot před AD
test of VM install_W10_LTSC-REK" in state Finalizing (6 hours ago)

In the Events I see

May 19, 2020, 4:00:17 PM
Failed to complete Snapshot-Preview před AD test for VM
install_W10_LTSC-REK.
18acb86e-581c-43f5-bf99-f8d738870976

May 19, 2020, 3:57:40 PM
Failed to run VM install_W10_LTSC-REK due to a failed validation:
[Cannot run VM. The VM is performing an operation on a Snapshot. Please
wait for the operation to finish, and try again.] (User: **@*).
d892b7dd-741b-4c61-ac37-b1077eaeaf79

When I grepping correlation id from failed run from engine log a got

zcat /var/log/ovirt-engine/engine.log-20200520.gz | grep
"d892b7dd-741b-4c61-ac37-b1077eaeaf79"

2020-05-19 15:57:40,435+02 INFO
[org.ovirt.engine.core.bll.RunVmCommand] (default task-210)
[d892b7dd-741b-4c61-ac37-b1077eaeaf79] Lock Acquired to object
'EngineLock:{exclusiveLocks='[2f1fb183-55ea-477c-a75b-625f889e4c79=VM]',
sharedLocks=''}'
2020-05-19 15:57:40,489+02 ERROR
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(default task-210) [d892b7dd-741b-4c61-ac37-b1077eaeaf79] EVENT_ID:
USER_FAILED_RUN_VM(54), Failed to run VM install_W10_LTSC-REK due to a
failed validation: [Cannot run VM. The VM is performing an operation on
a Snapshot. Please wait for the operation to finish, and try again.]
(User: kra0...@cro.slu.cz).
2020-05-19 15:57:40,489+02 WARN
[org.ovirt.engine.core.bll.RunVmCommand] (default task-210)
[d892b7dd-741b-4c61-ac37-b1077eaeaf79] Validation of action 'RunVm'
failed for user kra0...@cro.slu.cz. Reasons:
VAR__ACTION__RUN,VAR__TYPE__VM,ACTION_TYPE_FAILED_VM_IS_DURING_SNAPSHOT
2020-05-19 15:57:40,489+02 INFO
[org.ovirt.engine.core.bll.RunVmCommand] (default task-210)
[d892b7dd-741b-4c61-ac37-b1077eaeaf79] Lock freed to object
'EngineLock:{exclusiveLocks='[2f1fb183-55ea-477c-a75b-625f889e4c79=VM]',
sharedLocks=''}'

greping correlation id on failed snapshot preview shows only INFO
events, not ERRORS nor WARNINGS and it looks to me like everything was
FINISHed

When I try to display locked entities a got

cd /usr/share/ovirt-engine/setup/dbutils
./unlock_entity.sh -q -t all -c

Locked VMs

   vm_name| snapshot_id
--+--
 install_W10_LTSC-REK | 9d53c7b4-8ec5-4d31-8ab1-16bb75ab8f2b


Locked templates

 template_name | disk_id
---+-


Locked disks

vm_id |   disk_id

--+--
 2f1fb183-55ea-477c-a75b-625f889e4c79 | 707db1f6-0859-48ab-b9d0-a97619ed8b0b


Locked snapshots

vm_id | snapshot_id

--+--
 2f1fb183-55ea-477c-a75b-625f889e4c79 | 9d53c7b4-8ec5-4d31-8ab1-16bb75ab8f2b


Illegal images

 vm_name | image_guid
-+

I am about to try unlock that snapshot but for sure...

* how can I be sure that snapshot operations is not still running (it
would be nice to have some sort of progress bar o better reporting what
is happening in Manager...)

* looks it like bug? Should I fill bugreport or publish more logs?

Cheers,

Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/H5SSEVCCZU5WCCQJMB72P5H6K2ZYLCTP/


[ovirt-users] Re: [Feedback needed] oVirt 4.4.0 Test week

2020-05-19 Thread Jiří Sléžka
Hi,

On 5/18/20 6:26 PM, Sandro Bonazzola wrote:
> 
> 
> Il giorno lun 18 mag 2020 alle ore 18:12 Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> ha scritto:
> 
> Hi,
> 
> I am a bit late but today a tried to install single host HCI with
> gluster.
> 
> I am not sure if it is currently supported but
> 
> * I installed
> 
> dnf install
> https://resources.ovirt.org/pub/yum-repo/ovirt-release44-pre.rpm
> dnf module enable -y javapackages-tools pki-deps postgresql:12 389-ds
> 
> 
> these modules are not needed on the host, these are needed only on the
> standalone engine / within engine appliance

ok, thanks for info

> dnf install ovirt-engine-appliance vdsm-gluster
> 
> 
> May I ask which installation guide are you following? I'm pretty sure
> it's outdated and needs a refresh.

I have followed mostly

https://ovirt.org/documentation/gluster-hyperconverged/Gluster_Hyperconverged_Guide.html

but also my own notes...

I know this guide is for 4.3 but I didn't find relevant guide for 4.4.

> * I prepared host by gdeploy (from external Fedora workstation because I
> didn't find gdeploy in any CentOS8 repo)
> 
> 
> gdeploy has been deprecated in favor of gluster-ansible roles about 2
> years ago

ok, I told you I am bit late ;-)

If there is more relevant documentation, please share a link. I would
like help write some guide/blogpost about 4.4. install but probably in
czech language...

Thanks in advance,

Jiri

> * I tried to instal ovvirt via
> 
> ovirt-hosted-engine-setup
> 
> ...but process failed with
> 
> [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg":
> "Unable to start service libvirtd: Job for libvirtd.service failed
> because the control process exited with error code.\nSee \"systemctl
> status libvirtd.service\" and \"journalctl -xe\" for details.\n"}
> [ ERROR ] Failed to execute stage 'Closing up': Failed executing
> ansible-playbook
> 
> In messages log is mentioned
> 
> ...
> May 18 17:04:40 ovirt-hci01 journal[8062]: Cannot read CA certificate
> '/etc/pki/CA/cacert.pem': No such file or directory
> May 18 17:04:40 ovirt-hci01 systemd[1]: libvirtd.service: Main process
> exited, code=exited, status=6/NOTCONFIGURED
> ...
> 
> There is no /etc/pki/CA directory. I think CA should be installed during
> ovirt-hosted-engine-setup, isn't it? Should I do that manually? How?
> 
> Cheers,
> 
> Jiri
> 
> 
> On 5/8/20 1:23 PM, Sandro Bonazzola wrote:
> > Hi,
> > oVirt team is planning to release oVirt 4.4.0 Ga in the next couple of
> > weeks.
> > oVirt 4.4.0 release candidate was released yesterday and we'd like to
> > gather as much feedback as possible.
> > Please join us testing this release candidate next week, starting
> Sunday
> > May 10th 2020 till Friday May 15th 2020!
> > We are going to coordinate the testing effort with a public Trello
> board
> > at https://trello.com/b/5ZNJgPC3/ovirt-440-test-day
> > You'll find instructions on how to use the board there.
> > For joining the board you can use this
> >
> link: 
> https://trello.com/invite/b/5ZNJgPC3/f1b1826ee4902f348c44607765a15099/ovirt-440-test-day
>  
> >
> > If you have not an environment dedicated to testing, remember you can
> > set up a few VMs and test the deployment with nested virtualization
> > using your production environment creating a virtual test environment.
> > In this case please be careful avoiding touching the production
> > environment from the testing one.
> >
> > The oVirt team will monitor the Trello board, the #ovirt IRC
> channel on
> > irc.oftc.net <http://irc.oftc.net> <http://irc.oftc.net> server
> and the users@ovirt.org <mailto:users@ovirt.org>
> > <mailto:users@ovirt.org <mailto:users@ovirt.org>> mailing list to
> assist with the testing and
> > debugging issues.
> > Basic instructions for setting up a minimal system are available in
> > release candidate announce
> >
> at: 
> https://lists.ovirt.org/archives/list/annou...@ovirt.org/message/3QORBKVKTALNJ5SMJHEDO4QJ5YUCULTT/attachment/3/attachment.html
> > Release notes for this release candidate are available
> > here: https://ovirt.org/release/4.4.0/
> >
> > Thanks
> > --
> >
> > Sandro Bonazzola
> >

[ovirt-users] Re: [Feedback needed] oVirt 4.4.0 Test week

2020-05-18 Thread Jiří Sléžka
Hi,

I am a bit late but today a tried to install single host HCI with gluster.

I am not sure if it is currently supported but

* I installed

dnf install https://resources.ovirt.org/pub/yum-repo/ovirt-release44-pre.rpm
dnf module enable -y javapackages-tools pki-deps postgresql:12 389-ds
dnf install ovirt-engine-appliance vdsm-gluster

* I prepared host by gdeploy (from external Fedora workstation because I
didn't find gdeploy in any CentOS8 repo)

* I tried to instal ovvirt via

ovirt-hosted-engine-setup

...but process failed with

[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg":
"Unable to start service libvirtd: Job for libvirtd.service failed
because the control process exited with error code.\nSee \"systemctl
status libvirtd.service\" and \"journalctl -xe\" for details.\n"}
[ ERROR ] Failed to execute stage 'Closing up': Failed executing
ansible-playbook

In messages log is mentioned

...
May 18 17:04:40 ovirt-hci01 journal[8062]: Cannot read CA certificate
'/etc/pki/CA/cacert.pem': No such file or directory
May 18 17:04:40 ovirt-hci01 systemd[1]: libvirtd.service: Main process
exited, code=exited, status=6/NOTCONFIGURED
...

There is no /etc/pki/CA directory. I think CA should be installed during
ovirt-hosted-engine-setup, isn't it? Should I do that manually? How?

Cheers,

Jiri


On 5/8/20 1:23 PM, Sandro Bonazzola wrote:
> Hi,
> oVirt team is planning to release oVirt 4.4.0 Ga in the next couple of
> weeks.
> oVirt 4.4.0 release candidate was released yesterday and we'd like to
> gather as much feedback as possible.
> Please join us testing this release candidate next week, starting Sunday
> May 10th 2020 till Friday May 15th 2020!
> We are going to coordinate the testing effort with a public Trello board
> at https://trello.com/b/5ZNJgPC3/ovirt-440-test-day
> You'll find instructions on how to use the board there.
> For joining the board you can use this
> link: 
> https://trello.com/invite/b/5ZNJgPC3/f1b1826ee4902f348c44607765a15099/ovirt-440-test-day
>  
> 
> If you have not an environment dedicated to testing, remember you can
> set up a few VMs and test the deployment with nested virtualization
> using your production environment creating a virtual test environment.
> In this case please be careful avoiding touching the production
> environment from the testing one.
> 
> The oVirt team will monitor the Trello board, the #ovirt IRC channel on
> irc.oftc.net  server and the users@ovirt.org
>  mailing list to assist with the testing and
> debugging issues.
> Basic instructions for setting up a minimal system are available in
> release candidate announce
> at: 
> https://lists.ovirt.org/archives/list/annou...@ovirt.org/message/3QORBKVKTALNJ5SMJHEDO4QJ5YUCULTT/attachment/3/attachment.html
> Release notes for this release candidate are available
> here: https://ovirt.org/release/4.4.0/
> 
> Thanks
> -- 
> 
> Sandro Bonazzola
> 
> MANAGER, SOFTWARE ENGINEERING, EMEA R RHV
> 
> Red Hat EMEA 
> 
> sbona...@redhat.com    
> 
>  
> |Our code is open_ 
> 
> **
> *Red Hat respects your work life balance. Therefore there is no need to
> answer this email out of your office hours.
> *
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/WCQSRKSTCSHAKPJBKIURQXPCMJDCRV74/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/XQUHTOUUMJGJX2IDX4OIKT6DYC2XUEN3/


[ovirt-users] Re: Multiple NICs on hosted engine, oVirt 4.3

2020-04-05 Thread Jiří Sléžka
Hello Paul,

thanks for reply and for pointing me to RHV documentation. Shame on me,
I forogot it exists and is well maintained and super relevant for oVirt.

But in this case I am a bit lost. I can't find something relevant to my
effort to modify HE vm's config to add the second NIC to it.

Or do you mean I should change the location of vm.conf from /var/run
location to something persistent and then modify this vm.conf as
mentioned in that old post?

For example, now I got this output for conf key

[root@ovirt-hci01 ~]# hosted-engine --get-shared-config conf
--type=he_shared

conf : /var/run/ovirt-hosted-engine-ha/vm.conf, type : he_shared

[root@ovirt-hci01 ~]# hosted-engine --get-shared-config conf --type=he_local

conf : /var/run/ovirt-hosted-engine-ha/vm.conf, type : he_local

Is it valid to change it to (for example)
/etc/ovirt-hosted-engine/vm.conf and modify that file there? Is it safe
for further updates, maintenance, etc.?

Maybe the second question - which process creates vm.conf in /var/run path?

Cheers,

Jiri

On 4/5/20 8:35 AM, Staniforth, Paul wrote:
> Hello Jri,
>   The configurations local and shared are managed by
> hosted-engine command.
> https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.3/html/administration_guide/chap-administering_the_self-hosted_engine
> <https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.3/html/administration_guide/chap-administering_the_self-hosted_engine>
>   
> Chapter 12. Administering the Self-Hosted Engine Red Hat Virtualization
> 4.3 | Red Hat Customer Portal
> <https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.3/html/administration_guide/chap-administering_the_self-hosted_engine>
> The Red Hat Customer Portal delivers the knowledge, expertise, and
> guidance available through your Red Hat subscription.
> access.redhat.com
> 
> see
> "hosted-engine --help"
> "man hosted-engine"



> 
> Regards,
>   Paul S.
> ----
> *From:* Jiří Sléžka 
> *Sent:* 04 April 2020 21:02
> *To:* users@ovirt.org 
> *Subject:* [ovirt-users] Multiple NICs on hosted engine, oVirt 4.3
>  
> Caution External Mail: Do not click any links or open any attachments
> unless you trust the sender and know that the content is safe.
> 
> Hello,
> 
> I would like to add second NIC to hosted engine vm (I need access to
> isolated network with host's power management). I found this solution...
> 
> https://eur02.safelinks.protection.outlook.com/?url=https%3A%2F%2Flists.ovirt.org%2Fpipermail%2Fusers%2F2014-November%2F062610.htmldata=02%7C01%7Cp.staniforth%40leedsbeckett.ac.uk%7C31875f5cdc3143f4425708d7d8d49bd9%7Cd79a81124fbe417aa112cd0fb490d85c%7C0%7C1%7C637216279973033087sdata=2NtDru1nfjgkasnIXPReHEgFKnmjwEj3VBJJVqwv%2Bbg%3Dreserved=0
> 
> ...which seems to work in oVirt 3.5 but in oVirt 4.3 there is no file
> like /etc/ovirt-hosted-engine/vm.conf anymore.
> 
> In the file /etc/ovirt-hosted-engine/hosted-engine.conf is a line
> 
> ...
> conf=/var/run/ovirt-hosted-engine-ha/vm.conf
> ...
> 
> which points to this config but this is a running config...
> 
> Is there a way to add a second NIC to hosted engin vm?
> 
> Cheers,
> 
> Jiri
> 
> 
> To view the terms under which this email is distributed, please go to:-
> http://leedsbeckett.ac.uk/disclaimer/email/
> 



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/46FQFLOI622O3G26YVL3LU4RCDCXZVY5/


[ovirt-users] Multiple NICs on hosted engine, oVirt 4.3

2020-04-04 Thread Jiří Sléžka
Hello,

I would like to add second NIC to hosted engine vm (I need access to
isolated network with host's power management). I found this solution...

https://lists.ovirt.org/pipermail/users/2014-November/062610.html

...which seems to work in oVirt 3.5 but in oVirt 4.3 there is no file
like /etc/ovirt-hosted-engine/vm.conf anymore.

In the file /etc/ovirt-hosted-engine/hosted-engine.conf is a line

...
conf=/var/run/ovirt-hosted-engine-ha/vm.conf
...

which points to this config but this is a running config...

Is there a way to add a second NIC to hosted engin vm?

Cheers,

Jiri




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RJ7C3X63YPIIDZHYHGLUMUGVJXZ4BLJ5/


[ovirt-users] hosts becomes NonResponsive

2019-05-21 Thread Jiří Sléžka
Hi,

time to time one of our four ovirt hosts become NonResponsive.

From engine point of view it looks this way (engine.log)

2019-05-21 13:10:30,261+02 ERROR
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(EE-ManagedThreadFactory-engineScheduled-Thread-95) [] EVENT_ID:
VDS_BROKER_COMMAND_FAILURE(10,802), VDSM ovirt03.net.slu.cz command Get
Host Capabilities failed: Message timeout which can be caused by
communication issues
2019-05-21 13:10:30,261+02 ERROR
[org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring]
(EE-ManagedThreadFactory-engineScheduled-Thread-95) [] Unable to
RefreshCapabilities: VDSNetworkException: VDSGenericException:
VDSNetworkException: Message timeout which can be caused by
communication issues

from host (which is reachable) it looks like (vdsm.log)

2019-05-21 13:10:27,154+0200 INFO  (vmrecovery) [vdsm.api] START
getConnectedStoragePoolsList(options=None) from=internal,
task_id=a1bebf2f-7070-4344-90b7-1d709ba94b5c (api:48)
2019-05-21 13:10:27,154+0200 INFO  (vmrecovery) [vdsm.api] FINISH
getConnectedStoragePoolsList return={'poollist': []} from=internal,
task_id=a1bebf2f-7070-4344-90b7-1d709ba94b5c (api:54)
2019-05-21 13:10:27,155+0200 INFO  (vmrecovery) [vds] recovery: waiting
for storage pool to go up (clientIF:709)
2019-05-21 13:10:31,245+0200 INFO  (jsonrpc/4) [api.host] START
getAllVmStats() from=::1,39144 (api:48)
2019-05-21 13:10:31,247+0200 INFO  (jsonrpc/4) [api.host] FINISH
getAllVmStats return={'status': {'message': 'Done', 'code': 0},
'statsList': (suppressed)} from=::1,39144 (api:54)
2019-05-21 13:10:31,249+0200 INFO  (jsonrpc/4) [jsonrpc.JsonRpcServer]
RPC call Host.getAllVmStats succeeded in 0.00 seconds (__init__:312)


hosts are latest CentOS7 (but old AMD Opteron HW), oVirt is 4.3.3.7-1.el7

I cannot track it down to network layer. We have 4 other RHV hosts on
the same infrastructure and it works well. Some clues what is happening?

Thanks in advance,

Jiri Slezka



smime.p7s
Description: Elektronicky podpis S/MIME
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/C7CNMZR75CRUXD4JPUT2YG5WFKDRPSDI/


[ovirt-users] oVirt 4.3 and AMD Opteron G3 CPU

2019-03-08 Thread Jiří Sléžka
Hi,

I know, old AMD Opterons G3 are unsupported in oVirt 4.3 but anyway, is
there a way to switch cluster to 4.3 mode with these old CPUs? On the
other hand 4.2 mode is no problem, I just would like to test new stuff
(on legacy hw ;-)

Cheers,

Jiri Slezka



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/5LG7ICTTTYDB3SZFYDNQEJ2R4HNL5M7M/


[ovirt-users] Re: stucked snapshot, locked disk

2019-02-18 Thread Jiří Sléžka
Hi,


> 
> On Thu, Feb 14, 2019 at 11:37 AM Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hello,
> 
> we are using ovirt 4.2.8.2-1.el7.
> 
> One our user probably tried to preview taken snapshot but the task is
> stucked and never finished. Also disk is locked.
> 
> Here is engine.log
> 
> https://pastebin.com/izBJ1BUg
> 
> I am not sure what went wrong except one java.lang.NullPointerException
> error... Could it be some bug?
> 
> 
> Indeed, it looks like a bug. Can you open one: 
> https://bugzilla.redhat.com/enter_bug.cgi?product=ovirt-engine
> component = BLL.Virt,  team = Virt
> please attach your logs too.

here it is

https://bugzilla.redhat.com/show_bug.cgi?id=1678234

Cheers,

Jiri

>  
> 
> 
> Also I would like to unlock disk and clean this task. I can see locked
> snapshot and disk
> 
> [root@ovirt dbutils]# ./unlock_entity.sh -q -t snapshot -c
> 
> Locked snapshots
> 
>                 vm_id                 |             snapshot_id
> 
> 
> --+--
>  df27ee13-9961-41d5-a4eb-bbb0aadf9086 |
> 58998020-08cb-4c84-8847-0cb6790724ba
> 
> [root@ovirt dbutils]# ./unlock_entity.sh -q -t disk -c
> 
> Locked disks
> 
>                 vm_id                 |               disk_id
> 
> 
> --+--
>  df27ee13-9961-41d5-a4eb-bbb0aadf9086 |
> f6405448-baf8-4cc2-8ee7-798ca11cad10
> 
> but no runnig tasks
> 
> [root@ovirt dbutils]# ./taskcleaner.sh -o
>  t
> 
> but in ovirt-manager I see this stucked task...
> 
> Preview VM Snapshot predsysprep of VM install_10_64bit_LTSB_UK_ucebny -
> Started: Feb 12, 2019, 11:28:04 AM
> Validating - Completed: Feb 12, 2019, 11:28:04 AM
> Executing - Completed: Feb 12, 2019, 11:28:15 AM
> Creating Volume - Completed: Feb 12, 2019, 11:28:15 AM
> Finalizing - Started: Feb 12, 2019, 11:28:15 AM
> 
> what is best approach to unlock and clean this?
> 
> 
> I'm not sure -- hopefully someone else can assist :)
>  
> 
> 
> Thanks in advance,
> 
> Cheers, Jiri
> 
> ___
> Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
> To unsubscribe send an email to users-le...@ovirt.org
> <mailto:users-le...@ovirt.org>
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/FVJTQ25MGI2PRAXJIEHSLEOL62VRPOT2/
> 
> 
> 
> -- 
> 
> GREG SHEREMETA
> 
> SENIOR SOFTWARE ENGINEER - TEAM LEAD - RHV UX
> 
> Red Hat NA
> 
> <https://www.redhat.com/>
> 
> gsher...@redhat.com <mailto:gsher...@redhat.com>    IRC: gshereme
> 
> <https://red.ht/sig>
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/2NIINLK66IU242NHZ6MBVRF2X6S3D5SC/


[ovirt-users] Re: stucked snapshot, locked disk

2019-02-18 Thread Jiří Sléžka
Hi,

On 2/15/19 10:01 PM, Benny Zlotnik wrote:
> IIUC it shouldn't get to 475 since it would short-circuit if it isn't a disk
> 
> Jiří, can you attach the full engine logs so we can retrace everything
> that happened?

sure,
https://filesender.cesnet.cz/?s=download=a67b3f78-8f65-f559-6a9c-f709fff0a94b

Cheers,

Jiri

> 
> On Fri, Feb 15, 2019 at 10:06 PM Michal Skrivanek
> mailto:michal.skriva...@redhat.com>> wrote:
> 
> Isn’t the code wrong? Seems the function checks for disks or unmanaged
> device, and if it is unmanaged non-disk device it still kind of
> assumes it’s a disk at line 475?
> It’s not common, but any device can be unmanaged
> 
> > On 14 Feb 2019, at 16:45, Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> >
> > Hello,
> >
> > we are using ovirt 4.2.8.2-1.el7.
> >
> > One our user probably tried to preview taken snapshot but the task is
> > stucked and never finished. Also disk is locked.
> >
> > Here is engine.log
> >
> > https://pastebin.com/izBJ1BUg
> >
> > I am not sure what went wrong except one
> java.lang.NullPointerException
> > error... Could it be some bug?
> >
> > Also I would like to unlock disk and clean this task. I can see locked
> > snapshot and disk
> >
> > [root@ovirt dbutils]# ./unlock_entity.sh -q -t snapshot -c
> >
> > Locked snapshots
> >
> >                vm_id                 |             snapshot_id
> >
> >
> 
> --+--
> > df27ee13-9961-41d5-a4eb-bbb0aadf9086 |
> 58998020-08cb-4c84-8847-0cb6790724ba
> >
> > [root@ovirt dbutils]# ./unlock_entity.sh -q -t disk -c
> >
> > Locked disks
> >
> >                vm_id                 |               disk_id
> >
> >
> 
> --+--
> > df27ee13-9961-41d5-a4eb-bbb0aadf9086 |
> f6405448-baf8-4cc2-8ee7-798ca11cad10
> >
> > but no runnig tasks
> >
> > [root@ovirt dbutils]# ./taskcleaner.sh -o
> > t
> >
> > but in ovirt-manager I see this stucked task...
> >
> > Preview VM Snapshot predsysprep of VM
> install_10_64bit_LTSB_UK_ucebny -
> > Started: Feb 12, 2019, 11:28:04 AM
> > Validating - Completed: Feb 12, 2019, 11:28:04 AM
> > Executing - Completed: Feb 12, 2019, 11:28:15 AM
> > Creating Volume - Completed: Feb 12, 2019, 11:28:15 AM
> > Finalizing - Started: Feb 12, 2019, 11:28:15 AM
> >
> > what is best approach to unlock and clean this?
> >
> > Thanks in advance,
> >
> > Cheers, Jiri
> >
> > ___
> > Users mailing list -- users@ovirt.org <mailto:users@ovirt.org>
> > To unsubscribe send an email to users-le...@ovirt.org
> <mailto:users-le...@ovirt.org>
> > Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> > oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> > List Archives:
> 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/FVJTQ25MGI2PRAXJIEHSLEOL62VRPOT2/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/RTHGO2NAVDC7VCPCAPZUAYOJC52GDDWW/


[ovirt-users] stucked snapshot, locked disk

2019-02-14 Thread Jiří Sléžka
Hello,

we are using ovirt 4.2.8.2-1.el7.

One our user probably tried to preview taken snapshot but the task is
stucked and never finished. Also disk is locked.

Here is engine.log

https://pastebin.com/izBJ1BUg

I am not sure what went wrong except one java.lang.NullPointerException
error... Could it be some bug?

Also I would like to unlock disk and clean this task. I can see locked
snapshot and disk

[root@ovirt dbutils]# ./unlock_entity.sh -q -t snapshot -c

Locked snapshots

vm_id | snapshot_id

--+--
 df27ee13-9961-41d5-a4eb-bbb0aadf9086 | 58998020-08cb-4c84-8847-0cb6790724ba

[root@ovirt dbutils]# ./unlock_entity.sh -q -t disk -c

Locked disks

vm_id |   disk_id

--+--
 df27ee13-9961-41d5-a4eb-bbb0aadf9086 | f6405448-baf8-4cc2-8ee7-798ca11cad10

but no runnig tasks

[root@ovirt dbutils]# ./taskcleaner.sh -o
 t

but in ovirt-manager I see this stucked task...

Preview VM Snapshot predsysprep of VM install_10_64bit_LTSB_UK_ucebny -
Started: Feb 12, 2019, 11:28:04 AM
Validating - Completed: Feb 12, 2019, 11:28:04 AM
Executing - Completed: Feb 12, 2019, 11:28:15 AM
Creating Volume - Completed: Feb 12, 2019, 11:28:15 AM
Finalizing - Started: Feb 12, 2019, 11:28:15 AM

what is best approach to unlock and clean this?

Thanks in advance,

Cheers, Jiri



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/FVJTQ25MGI2PRAXJIEHSLEOL62VRPOT2/


[ovirt-users] Re: cannot activate host

2018-08-10 Thread Jiří Sléžka
Well, issue was solved by installing latest async update 4.2.5...
Probably helped this part of upgrade process...

...
[ INFO  ] Cleaning async tasks and compensations
[ INFO  ] Unlocking existing entities
...


Cheers,

Jiri


On 08/10/2018 11:03 AM, Jiří Sléžka wrote:
> On 08/10/2018 10:52 AM, Raz Tamir wrote:
>>
>>
>> On Fri, Aug 10, 2018 at 11:30 AM, Jiří Sléžka > <mailto:jiri.sle...@slu.cz>> wrote:
>>
>> Hi,
>>
>> On 08/09/2018 11:25 PM, Raz Tamir wrote:
>> > You can try using vdsm-client to check if there are running tasks on 
>> the
>> > host using this command:
>> > 
>> >     vdsm-client Host getAllTasksInfo
>>
>> thanks for support, I tried this on affected host
>>
>> [root@blade01 ~]# vdsm-client Host getAllTasksInfo
>> vdsm-client: Command Host.getAllTasksInfo with args {} failed:
>> (code=654, message=Not SPM: ())
>>
>> so I moved into SPM host (blade03 at this time)
>>
>> [root@blade03 ~]# vdsm-client Host getAllTasksInfo
>> {}
>>
>> I tried also
>>
>> [root@blade01 ~]# vdsm-client Host getAllTasks
>> {}
>> [root@blade01 ~]# vdsm-client Host getJobs
>> {}
>>
>> [root@blade03 ~]# vdsm-client Host getAllTasks
>> {}
>> [root@blade03 ~]# vdsm-client Host getJobs
>> {}
>>
>> It looks like there are no tasks/jobs running on hosts. It looks like
>> something is stucked in engine db.
>>
>> At the beginning of this issue, there were some race condition where
>> blade01 had two running tasks/jobs which cannot be finished (network
>> configuration and activating of host). I deleted this jobs manually in
>> db via
>>
>> SELECT * FROM job ORDER BY start_time DESC;
>> SELECT DeleteJob('...job_id...');
>> SELECT DeleteJob('...job_id...');
>>
>> Maybe it was not wise... but what can I do now to solve this? :-)
>>
>> Yes it can be bad to mess with the DB.
>> My best suugestion at the momet is to try and re-add this host.
> 
> I would like to but I cannot even remove this host (Cannot remove Host.
> Related operation is currently in progress. Please try again later.) :-(
> But maybe I can try to add it as brand new one?
> 
> But I am curious what checks do manager and why we cannot see the
> tasks/jobs/locks/anything in db...
> 
> Cheers, Jiri
> 
> 
> 
>>  
>>
>>
>> Cheers, Jiri
>>
>>
>> > 
>> > If there are running tasks, you can stop them:
>> > 
>> >     vdsm-client Task stop taskID=xxx-yyy
>> > 
>> > If there are finished tasks, you can clear them:
>> > 
>> >     vdsm-client Task clear taskID=xxx-yyy
>> > 
>> > 
>> > On Thu, Aug 9, 2018, 16:15 Jiří Sléžka > <mailto:jiri.sle...@slu.cz>
>> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
>> >
>> >     Hello,
>> >
>> >     still no luck with solving this issue.
>> >
>> >     I cannot even remove this host.
>> >
>> >     engine log is now spammed with this messages
>> >
>> >     2018-08-09 15:03:08,410+02 INFO
>> >     [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
>> >     (EE-ManagedThreadFactory-engine-Thread-1057)
>> >     [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock
>> and wait
>> >     lock
>> >   
>>  
>> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
>> >     sharedLocks=''}'
>> >     2018-08-09 15:03:08,446+02 INFO
>> >     [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
>> >     (EE-ManagedThreadFactory-engine-Thread-1057)
>> >     [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock
>> and wait
>> >     lock
>> >   
>>  
>> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
>> >     sharedLocks=''}'
>> >     2018-08-09 15:03:08,451+02 INFO
>> >     [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
>> >     (EE-ManagedThreadFactory-engine-Thread-1057)
>> >     [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock
>> and wait
>> >     lock
>> >   
>>  
>> 'HostEngi

[ovirt-users] Re: cannot activate host

2018-08-10 Thread Jiří Sléžka
On 08/10/2018 10:52 AM, Raz Tamir wrote:
> 
> 
> On Fri, Aug 10, 2018 at 11:30 AM, Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi,
> 
> On 08/09/2018 11:25 PM, Raz Tamir wrote:
> > You can try using vdsm-client to check if there are running tasks on the
> > host using this command:
> > 
> >     vdsm-client Host getAllTasksInfo
> 
> thanks for support, I tried this on affected host
> 
> [root@blade01 ~]# vdsm-client Host getAllTasksInfo
> vdsm-client: Command Host.getAllTasksInfo with args {} failed:
> (code=654, message=Not SPM: ())
> 
> so I moved into SPM host (blade03 at this time)
> 
> [root@blade03 ~]# vdsm-client Host getAllTasksInfo
> {}
> 
> I tried also
> 
> [root@blade01 ~]# vdsm-client Host getAllTasks
> {}
> [root@blade01 ~]# vdsm-client Host getJobs
> {}
> 
> [root@blade03 ~]# vdsm-client Host getAllTasks
> {}
> [root@blade03 ~]# vdsm-client Host getJobs
> {}
> 
> It looks like there are no tasks/jobs running on hosts. It looks like
> something is stucked in engine db.
> 
> At the beginning of this issue, there were some race condition where
> blade01 had two running tasks/jobs which cannot be finished (network
> configuration and activating of host). I deleted this jobs manually in
> db via
> 
> SELECT * FROM job ORDER BY start_time DESC;
> SELECT DeleteJob('...job_id...');
> SELECT DeleteJob('...job_id...');
> 
> Maybe it was not wise... but what can I do now to solve this? :-)
> 
> Yes it can be bad to mess with the DB.
> My best suugestion at the momet is to try and re-add this host.

I would like to but I cannot even remove this host (Cannot remove Host.
Related operation is currently in progress. Please try again later.) :-(
But maybe I can try to add it as brand new one?

But I am curious what checks do manager and why we cannot see the
tasks/jobs/locks/anything in db...

Cheers, Jiri



>  
> 
> 
> Cheers, Jiri
> 
> 
> > 
> > If there are running tasks, you can stop them:
> > 
> >     vdsm-client Task stop taskID=xxx-yyy
> > 
> > If there are finished tasks, you can clear them:
> > 
> >     vdsm-client Task clear taskID=xxx-yyy
> > 
> > 
> > On Thu, Aug 9, 2018, 16:15 Jiří Sléžka  <mailto:jiri.sle...@slu.cz>
> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >
> >     Hello,
> >
> >     still no luck with solving this issue.
> >
> >     I cannot even remove this host.
> >
> >     engine log is now spammed with this messages
> >
> >     2018-08-09 15:03:08,410+02 INFO
> >     [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> >     (EE-ManagedThreadFactory-engine-Thread-1057)
> >     [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock
> and wait
> >     lock
> >   
>  
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> >     sharedLocks=''}'
> >     2018-08-09 15:03:08,446+02 INFO
> >     [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> >     (EE-ManagedThreadFactory-engine-Thread-1057)
> >     [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock
> and wait
> >     lock
> >   
>  
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> >     sharedLocks=''}'
> >     2018-08-09 15:03:08,451+02 INFO
> >     [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> >     (EE-ManagedThreadFactory-engine-Thread-1057)
> >     [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock
> and wait
> >     lock
> >   
>  
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> >     sharedLocks=''}'
> >     2018-08-09 15:03:08,486+02 INFO
> >     [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> >     (EE-ManagedThreadFactory-engine-Thread-1057)
> >     [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock
> and wait
> >     lock
> >   
>      
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> >     sharedLocks=''}'
> >
> >     btw. oVirt 4.2.5.2-1.el7/CentOS7
> >
> >     Any tips how to resolve this and at least remove this host
> from ovirt?
> >
>  

[ovirt-users] Re: cannot activate host

2018-08-10 Thread Jiří Sléžka
Hi,

On 08/09/2018 11:25 PM, Raz Tamir wrote:
> You can try using vdsm-client to check if there are running tasks on the
> host using this command:
> 
> vdsm-client Host getAllTasksInfo

thanks for support, I tried this on affected host

[root@blade01 ~]# vdsm-client Host getAllTasksInfo
vdsm-client: Command Host.getAllTasksInfo with args {} failed:
(code=654, message=Not SPM: ())

so I moved into SPM host (blade03 at this time)

[root@blade03 ~]# vdsm-client Host getAllTasksInfo
{}

I tried also

[root@blade01 ~]# vdsm-client Host getAllTasks
{}
[root@blade01 ~]# vdsm-client Host getJobs
{}

[root@blade03 ~]# vdsm-client Host getAllTasks
{}
[root@blade03 ~]# vdsm-client Host getJobs
{}

It looks like there are no tasks/jobs running on hosts. It looks like
something is stucked in engine db.

At the beginning of this issue, there were some race condition where
blade01 had two running tasks/jobs which cannot be finished (network
configuration and activating of host). I deleted this jobs manually in
db via

SELECT * FROM job ORDER BY start_time DESC;
SELECT DeleteJob('...job_id...');
SELECT DeleteJob('...job_id...');

Maybe it was not wise... but what can I do now to solve this? :-)

Cheers, Jiri


> 
> If there are running tasks, you can stop them:
> 
> vdsm-client Task stop taskID=xxx-yyy
> 
> If there are finished tasks, you can clear them:
> 
> vdsm-client Task clear taskID=xxx-yyy
> 
> 
> On Thu, Aug 9, 2018, 16:15 Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hello,
> 
> still no luck with solving this issue.
> 
> I cannot even remove this host.
> 
> engine log is now spammed with this messages
> 
> 2018-08-09 15:03:08,410+02 INFO
> [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> (EE-ManagedThreadFactory-engine-Thread-1057)
> [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
> lock
> 
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> sharedLocks=''}'
> 2018-08-09 15:03:08,446+02 INFO
> [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> (EE-ManagedThreadFactory-engine-Thread-1057)
> [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
> lock
> 
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> sharedLocks=''}'
> 2018-08-09 15:03:08,451+02 INFO
> [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> (EE-ManagedThreadFactory-engine-Thread-1057)
> [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
> lock
> 
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> sharedLocks=''}'
> 2018-08-09 15:03:08,486+02 INFO
> [org.ovirt.engine.core.bll.lock.InMemoryLockManager]
> (EE-ManagedThreadFactory-engine-Thread-1057)
> [cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
> lock
> 
> 'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
> sharedLocks=''}'
> 
> btw. oVirt 4.2.5.2-1.el7/CentOS7
> 
> Any tips how to resolve this and at least remove this host from ovirt?
> 
> Cheers, Jiri
> 
> 
> On 08/06/2018 12:52 PM, Gobinda Das wrote:
> > Can you please post vdsm log which is inside /var/log/vdsm/vdsm.log ?
> >
> > On Mon, Aug 6, 2018 at 3:51 PM, Jiří Sléžka  <mailto:jiri.sle...@slu.cz>
> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >
> >     Hi,
> >
> >     no one can help?
> >
> >     I still cannot activate this host - error is "Cannot activate
> Host.
> >     Related operation is currently in progress. Please try again
> later."
> >
> >     I believe relevant log entrieas are
> >
> >     2018-08-06 12:15:50,398+02 INFO
> >     [org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
> >     [9387077a-8276-4a3f-a087-584a10a09b08] Failed to Acquire Lock
> to object
> >   
>  'EngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS]',
> >     sharedLocks=''}'
> >     2018-08-06 12:15:50,398+02 WARN
> >     [org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
> >     [9387077a-8276-4a3f-a087-584a10a09b08] Validation of action
> >     'ActivateVds' failed for user ***my_username***. Reasons:
> >   
>  VAR__ACTION__ACTIVATE,VAR__TYPE__HOST,ACTION_TYPE_FAILED_OBJECT_LOCKED
> >
> >
> >     when I tried to "reinstall host" I got

[ovirt-users] Re: cannot activate host

2018-08-09 Thread Jiří Sléžka
Hello,

still no luck with solving this issue.

I cannot even remove this host.

engine log is now spammed with this messages

2018-08-09 15:03:08,410+02 INFO
[org.ovirt.engine.core.bll.lock.InMemoryLockManager]
(EE-ManagedThreadFactory-engine-Thread-1057)
[cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
lock
'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
sharedLocks=''}'
2018-08-09 15:03:08,446+02 INFO
[org.ovirt.engine.core.bll.lock.InMemoryLockManager]
(EE-ManagedThreadFactory-engine-Thread-1057)
[cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
lock
'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
sharedLocks=''}'
2018-08-09 15:03:08,451+02 INFO
[org.ovirt.engine.core.bll.lock.InMemoryLockManager]
(EE-ManagedThreadFactory-engine-Thread-1057)
[cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
lock
'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
sharedLocks=''}'
2018-08-09 15:03:08,486+02 INFO
[org.ovirt.engine.core.bll.lock.InMemoryLockManager]
(EE-ManagedThreadFactory-engine-Thread-1057)
[cb8aa091-70ce-419a-b45b-3ffad6f2529b] Failed to acquire lock and wait
lock
'HostEngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS_INIT]',
sharedLocks=''}'

btw. oVirt 4.2.5.2-1.el7/CentOS7

Any tips how to resolve this and at least remove this host from ovirt?

Cheers, Jiri


On 08/06/2018 12:52 PM, Gobinda Das wrote:
> Can you please post vdsm log which is inside /var/log/vdsm/vdsm.log ?
> 
> On Mon, Aug 6, 2018 at 3:51 PM, Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi,
> 
> no one can help?
> 
> I still cannot activate this host - error is "Cannot activate Host.
> Related operation is currently in progress. Please try again later."
> 
> I believe relevant log entrieas are
> 
> 2018-08-06 12:15:50,398+02 INFO
> [org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
> [9387077a-8276-4a3f-a087-584a10a09b08] Failed to Acquire Lock to object
> 'EngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS]',
> sharedLocks=''}'
> 2018-08-06 12:15:50,398+02 WARN
> [org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
> [9387077a-8276-4a3f-a087-584a10a09b08] Validation of action
> 'ActivateVds' failed for user ***my_username***. Reasons:
> VAR__ACTION__ACTIVATE,VAR__TYPE__HOST,ACTION_TYPE_FAILED_OBJECT_LOCKED
> 
> 
> when I tried to "reinstall host" I got entry in Events "Failed to update
> Host blade01" and correlation id. Relevant log entries are
> 
> 
> cat /var/log/ovirt-engine/engine.log | grep
> d2b332da-9050-419c-8188-8fdda5e9d807
> 
> 2018-08-06 12:08:57,451+02 INFO
> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand] (default
> task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Running command:
> InstallVdsCommand internal: false. Entities affected :  ID:
> 786646cd-c9ef-49a8-8aea-56e858dcf202 Type: VDSAction group
> EDIT_HOST_CONFIGURATION with role type ADMIN
> 2018-08-06 12:08:57,476+02 INFO
> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
> (default task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Failed to
> Acquire Lock to object
> 'EngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS]',
> sharedLocks=''}'
> 2018-08-06 12:08:57,476+02 WARN
> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
> (default task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Validation of
> action 'InstallVdsInternal' failed for user ***my_username***. Reasons:
> ACTION_TYPE_FAILED_OBJECT_LOCKED
> 2018-08-06 12:08:57,479+02 ERROR
> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand] (default
> task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Installation/upgrade of
> Host '786646cd-c9ef-49a8-8aea-56e858dcf202', 'blade01' failed: Cannot
> ${action} ${type}. Related operation is currently in progress. Please
> try again later.
> 2018-08-06 12:08:57,482+02 INFO
> [org.ovirt.engine.core.bll.CommandCompensator] (default task-425)
> [d2b332da-9050-419c-8188-8fdda5e9d807] Command
> [id=b74adfbb-c043-4fa9-a163-0a6ea7fba9dc]: Compensating
> DELETED_OR_UPDATED_ENTITY of
> org.ovirt.engine.core.common.businessentities.VdsStatic; snapshot:
> id=786646cd-c9ef-49a8-8aea-56e858dcf202.
> 2018-08-06 12:08:57,486+02 ERROR
> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand] (default
> task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Transaction rolled-back
> for command 'org.ovirt.engine.core.bll.hostdeploy.InstallVdsComm

[ovirt-users] Re: cannot activate host

2018-08-06 Thread Jiří Sléžka
Hello,

On 08/06/2018 12:52 PM, Gobinda Das wrote:
> Can you please post vdsm log which is inside /var/log/vdsm/vdsm.log ?

I suppose you are interested in vdsm log of affected host blade01

It looks like nothing special is there...


2018-08-06 12:08:37,736+0200 INFO  (periodic/0) [vdsm.api] START
repoStats(domains=()) from=internal, tas
k_id=e2c4f6da-8179-4022-a204-9e79f83223d2 (api:46)
2018-08-06 12:08:37,737+0200 INFO  (periodic/0) [vdsm.api] FINISH
repoStats return={} from=internal, task
_id=e2c4f6da-8179-4022-a204-9e79f83223d2 (api:52)
2018-08-06 12:08:37,737+0200 INFO  (periodic/0) [vdsm.api] START
multipath_health() from=internal, task_i
d=c5a1406e-1afd-458e-a97d-4942f86b7be0 (api:46)
2018-08-06 12:08:37,737+0200 INFO  (periodic/0) [vdsm.api] FINISH
multipath_health return={} from=interna
l, task_id=c5a1406e-1afd-458e-a97d-4942f86b7be0 (api:52)
2018-08-06 12:08:37,841+0200 INFO  (jsonrpc/2) [api.host] START
getAllVmStats() from=::1,43898 (api:46)
2018-08-06 12:08:37,842+0200 INFO  (jsonrpc/2) [api.host] FINISH
getAllVmStats return={'status': {'messag
e': 'Done', 'code': 0}, 'statsList': (suppressed)} from=::1,43898 (api:52)
2018-08-06 12:08:37,842+0200 INFO  (jsonrpc/2) [jsonrpc.JsonRpcServer]
RPC call Host.getAllVmStats succee
ded in 0.00 seconds (__init__:573)
2018-08-06 12:08:52,761+0200 INFO  (periodic/3) [vdsm.api] START
repoStats(domains=()) from=internal, tas
k_id=6a0a0ecd-941c-4f21-99b2-ea8bd140df0e (api:46)
2018-08-06 12:08:52,761+0200 INFO  (periodic/3) [vdsm.api] FINISH
repoStats return={} from=internal, task
_id=6a0a0ecd-941c-4f21-99b2-ea8bd140df0e (api:52)
2018-08-06 12:08:52,761+0200 INFO  (periodic/3) [vdsm.api] START
multipath_health() from=internal, task_i
d=2243d5f5-2ca8-434d-953d-33778318d79d (api:46)
2018-08-06 12:08:52,761+0200 INFO  (periodic/3) [vdsm.api] FINISH
multipath_health return={} from=interna
l, task_id=2243d5f5-2ca8-434d-953d-33778318d79d (api:52)
2018-08-06 12:08:52,852+0200 INFO  (jsonrpc/3) [api.host] START
getAllVmStats() from=::1,43898 (api:46)
2018-08-06 12:08:52,853+0200 INFO  (jsonrpc/3) [api.host] FINISH
getAllVmStats return={'status': {'messag
e': 'Done', 'code': 0}, 'statsList': (suppressed)} from=::1,43898 (api:52)
2018-08-06 12:08:52,853+0200 INFO  (jsonrpc/3) [jsonrpc.JsonRpcServer]
RPC call Host.getAllVmStats succee
ded in 0.00 seconds (__init__:573)
2018-08-06 12:09:07,784+0200 INFO  (periodic/0) [vdsm.api] START
repoStats(domains=()) from=internal, tas
k_id=05bc82c8-bca4-4ee3-81a1-d9f3f0b142e9 (api:46)
2018-08-06 12:09:07,785+0200 INFO  (periodic/0) [vdsm.api] FINISH
repoStats return={} from=internal, task
_id=05bc82c8-bca4-4ee3-81a1-d9f3f0b142e9 (api:52)
2018-08-06 12:09:07,785+0200 INFO  (periodic/0) [vdsm.api] START
multipath_health() from=internal, task_i
d=602cf720-762e-4f05-9ef7-3b9741377136 (api:46)
2018-08-06 12:09:07,785+0200 INFO  (periodic/0) [vdsm.api] FINISH
multipath_health return={} from=interna
l, task_id=602cf720-762e-4f05-9ef7-3b9741377136 (api:52)
2018-08-06 12:09:07,871+0200 INFO  (jsonrpc/4) [api.host] START
getAllVmStats() from=::1,43898 (api:46)
2018-08-06 12:09:07,872+0200 INFO  (jsonrpc/4) [api.host] FINISH
getAllVmStats return={'status': {'messag
e': 'Done', 'code': 0}, 'statsList': (suppressed)} from=::1,43898 (api:52)
2018-08-06 12:09:07,872+0200 INFO  (jsonrpc/4) [jsonrpc.JsonRpcServer]
RPC call Host.getAllVmStats succee
ded in 0.00 seconds (__init__:573)

Cheers,

Jiri




> 
> On Mon, Aug 6, 2018 at 3:51 PM, Jiří Sléžka  <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi,
> 
> no one can help?
> 
> I still cannot activate this host - error is "Cannot activate Host.
> Related operation is currently in progress. Please try again later."
> 
> I believe relevant log entrieas are
> 
> 2018-08-06 12:15:50,398+02 INFO
> [org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
> [9387077a-8276-4a3f-a087-584a10a09b08] Failed to Acquire Lock to object
> 'EngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS]',
> sharedLocks=''}'
> 2018-08-06 12:15:50,398+02 WARN
> [org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
> [9387077a-8276-4a3f-a087-584a10a09b08] Validation of action
> 'ActivateVds' failed for user ***my_username***. Reasons:
> VAR__ACTION__ACTIVATE,VAR__TYPE__HOST,ACTION_TYPE_FAILED_OBJECT_LOCKED
> 
> 
> when I tried to "reinstall host" I got entry in Events "Failed to update
> Host blade01" and correlation id. Relevant log entries are
> 
> 
> cat /var/log/ovirt-engine/engine.log | grep
> d2b332da-9050-419c-8188-8fdda5e9d807
> 
> 2018-08-06 12:08:57,451+02 INFO
> [org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand] (default
> task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Running command:
> InstallVdsCommand internal: false. Entities affecte

[ovirt-users] Re: cannot activate host

2018-08-06 Thread Jiří Sléžka
Hi,

no one can help?

I still cannot activate this host - error is "Cannot activate Host.
Related operation is currently in progress. Please try again later."

I believe relevant log entrieas are

2018-08-06 12:15:50,398+02 INFO
[org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
[9387077a-8276-4a3f-a087-584a10a09b08] Failed to Acquire Lock to object
'EngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS]',
sharedLocks=''}'
2018-08-06 12:15:50,398+02 WARN
[org.ovirt.engine.core.bll.ActivateVdsCommand] (default task-426)
[9387077a-8276-4a3f-a087-584a10a09b08] Validation of action
'ActivateVds' failed for user ***my_username***. Reasons:
VAR__ACTION__ACTIVATE,VAR__TYPE__HOST,ACTION_TYPE_FAILED_OBJECT_LOCKED


when I tried to "reinstall host" I got entry in Events "Failed to update
Host blade01" and correlation id. Relevant log entries are


cat /var/log/ovirt-engine/engine.log | grep
d2b332da-9050-419c-8188-8fdda5e9d807

2018-08-06 12:08:57,451+02 INFO
[org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand] (default
task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Running command:
InstallVdsCommand internal: false. Entities affected :  ID:
786646cd-c9ef-49a8-8aea-56e858dcf202 Type: VDSAction group
EDIT_HOST_CONFIGURATION with role type ADMIN
2018-08-06 12:08:57,476+02 INFO
[org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
(default task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Failed to
Acquire Lock to object
'EngineLock:{exclusiveLocks='[786646cd-c9ef-49a8-8aea-56e858dcf202=VDS]',
sharedLocks=''}'
2018-08-06 12:08:57,476+02 WARN
[org.ovirt.engine.core.bll.hostdeploy.InstallVdsInternalCommand]
(default task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Validation of
action 'InstallVdsInternal' failed for user ***my_username***. Reasons:
ACTION_TYPE_FAILED_OBJECT_LOCKED
2018-08-06 12:08:57,479+02 ERROR
[org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand] (default
task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Installation/upgrade of
Host '786646cd-c9ef-49a8-8aea-56e858dcf202', 'blade01' failed: Cannot
${action} ${type}. Related operation is currently in progress. Please
try again later.
2018-08-06 12:08:57,482+02 INFO
[org.ovirt.engine.core.bll.CommandCompensator] (default task-425)
[d2b332da-9050-419c-8188-8fdda5e9d807] Command
[id=b74adfbb-c043-4fa9-a163-0a6ea7fba9dc]: Compensating
DELETED_OR_UPDATED_ENTITY of
org.ovirt.engine.core.common.businessentities.VdsStatic; snapshot:
id=786646cd-c9ef-49a8-8aea-56e858dcf202.
2018-08-06 12:08:57,486+02 ERROR
[org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand] (default
task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] Transaction rolled-back
for command 'org.ovirt.engine.core.bll.hostdeploy.InstallVdsCommand'.
2018-08-06 12:08:57,491+02 ERROR
[org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
(default task-425) [d2b332da-9050-419c-8188-8fdda5e9d807] EVENT_ID:
USER_FAILED_UPDATE_VDS(105), Failed to update Host blade01 (User:
***my_username***).

How can I unlock this entries?

Thanks in advance,

Jiri Slezka



On 08/03/2018 02:07 PM, Jiří Sléžka wrote:
> Hi,
> 
> I have one interesting issue. I was asked by colleague to repair his
> cluster which is part of our oVirt installation. He tried to upgrade his
> host to gain 4.2 compatibility level but it fails in some cases. Never
> mind, I checked it, repaired, installed upgrades and activate his hosts
> with one exception.
> 
> One of hosts was stucked in finishing two tasks. One was changing
> network configuration, the second was activating of this host. I have to
> manually clear this tasks in database by
> 
> SELECT * FROM job ORDER BY start_time DESC;
> SELECT DeleteJob('...job_id...');
> SELECT DeleteJob('...job_id...');
> 
> Now this tasks are gone but still cannot activate this host.
> 
> 
> Error while executing action:
> 
> blade01:
> 
> Cannot activate Host. Related operation is currently in progress.
> Please try again later.
> 
> Host is updatet (latest 4.2 repo), rebooted, looks fine...
> 
> In tasks in UI are no tasks found (the same in db)...
> 
> What can I do now?
> 
> Cheers, Jiri
> 
> 
> 
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/C2TWKXGOEESXCDJUZ7IBKYDE2APFN4ZB/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-p

[ovirt-users] cannot activate host

2018-08-03 Thread Jiří Sléžka
Hi,

I have one interesting issue. I was asked by colleague to repair his
cluster which is part of our oVirt installation. He tried to upgrade his
host to gain 4.2 compatibility level but it fails in some cases. Never
mind, I checked it, repaired, installed upgrades and activate his hosts
with one exception.

One of hosts was stucked in finishing two tasks. One was changing
network configuration, the second was activating of this host. I have to
manually clear this tasks in database by

SELECT * FROM job ORDER BY start_time DESC;
SELECT DeleteJob('...job_id...');
SELECT DeleteJob('...job_id...');

Now this tasks are gone but still cannot activate this host.


Error while executing action:

blade01:

Cannot activate Host. Related operation is currently in progress.
Please try again later.

Host is updatet (latest 4.2 repo), rebooted, looks fine...

In tasks in UI are no tasks found (the same in db)...

What can I do now?

Cheers, Jiri




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/C2TWKXGOEESXCDJUZ7IBKYDE2APFN4ZB/


[ovirt-users] Re: sun.security.validator

2018-07-10 Thread Jiří Sléžka
Hello,

> I'm running Version 4.2.3.8-1.el7, and after reboot the engine machine
> no longer could login into administration portal with this error:
> 
> sun.security.validator.ValidatorException: PKIX path validation faile
> java.security.cert.CertPathValidatorException: validity check failed
> 
> I'm using a self signed cert.

In other words, you are using your custom cert, right?

look into

/etc/httpd/conf.d/ssl.conf

and check

SSLCertificateFile
SSLCertificateKeyFile
SSLCACertificateFile

settings. They are probably overwriten by 4.2.3 upgrade process

it was solved in

https://bugzilla.redhat.com/show_bug.cgi?id=1576377

Cheers,

Jiri


> 
> Any idea?
> 
> Thanks
> 
> -- 
> 
> Jose Ferradeira
> http://www.logicworks.pt
> 
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/RHAODTFY66DYI4DHS73PEIRNRO3P7Q5I/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/BGWNGZ3BYSM7PGQRNKGSMURTCL3EPUJ6/


[ovirt-users] Re: RHEL5 guests frequently hang when migrating host.

2018-06-27 Thread Jiří Sléžka
Hi,

I had similiar experience with some vms (really small amount of them).
It hangs, cpu was 100%, console inresponsible and powering that vm off
and on solves problem. But one of this vms was CentOS6, not 5 (in the
fact it was our mirror.slu.cz which mirrors also ovirt project so sorry
for unplanned outages ;-)

I am not sure if it is tied to migrations, I had this experience even
without migrating this vm (as long as it was not migrated automatically)

This didn't happen after I upgraded hosts to CentOS7.5.1804 (I am always
using the latest oVirt in time here)

Cpu familly is AMD Opteron(tm) Processor 6172 (yes, it is a little bit
older cluster :-)

Cheers,

Jiri Slezka


On 06/27/2018 08:59 AM, Eduardo Mayoral wrote:
> Hi,
> 
>     I am experiencing that my RHEL5 guests frequently "hang" when
> migrating host. Console is blank, CPU after migration is 100% and as far
> as oVirt is concerned, the VM is OK.
> 
>     oVirt is 4.2.3.8-1.el7, on CentOS 7. Hosts are CentOS 7 as well.
> Cluster is in "Intel Westmere family" CPU type.
> 
>     Guest is RHEL5, fully patched, kernel 4.2.3.8-1.el7 with
> ovirt-guest-agent installed from EPEL.
> 
>     I do not see anything out of place in the ovirt-engine and vdsm
> logs, and the guest logs are simply not there, the stop right before the
> migration,as if the machine had "frozen".
> 
>     Powering the VM off and starting it starts the VM correctly. This
> does not happen 100% of the time. If I try to migrate the VM when it is
> freshly started the migration is faster (maybe 5 seconds), and the guest
> OS does not hang.
> 
>     Anybody else experiencing something similar? Maybe something
> (timeouts?) that I should tune on the guest OS for RHEL5?
> 
>     Thanks!
> 
> --
> 
> Eduardo Mayoral.
> 
> 
> 
> 
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/42D5IQBRX7ABZFAXA3O4SHABRPLDCMZA/
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/JDNYDKUGP3LJQY4DMBEY3PIPOZFFPOII/


[ovirt-users] Re: oVirt 4.2 and CLI options

2018-06-06 Thread Jiří Sléžka
Hi

On 06/06/2018 11:33 AM, Torsten Stolpmann wrote:
> On 02.06.2018 08:52, Yaniv Kaul wrote:
>>
>>
>> On Thu, May 31, 2018, 10:08 PM Simon Coter > > wrote:
>>
>>     Hi,
>>
>>     what is the best choice for a CLI interface vs oVirt 4.2 ?
>>
>>
>> While I recommend looking at Ansible,
>> https://github.com/fbacchella/ovirtcmd is also an interesting option.
>>
> 
> I found ovirt-shell an indispensable tool during troubleshooting when
> ssh is the last usable option, due to its interactive nature in relation
> to ansible.
> 
> I feel sad to see it vanish in 4.3 without any equivalent replacement.

the same for me. I will miss ovirt-shell. It was really strong tool
(though I would prefer stronger tab completion and some polishing)

is there really no replacement?

> Will have a look at ovirtcmd though. Are there plans to make this
> available via yum in the future?

will check ovirtcmd too

Cheers, Jiri


> 
> 
> Torsten
> 
> 
>>     I've looked for it and I saw that ovirt-shell is already deprecated.
>>
>>
>> Correct.
>> Y.
>>
>>     Thanks
>>
>>     Simon
>>     ___
>>     Users mailing list -- users@ovirt.org 
>>     To unsubscribe send an email to users-le...@ovirt.org
>>     
>>     Privacy Statement: https://www.ovirt.org/site/privacy-policy/
>>     oVirt Code of Conduct:
>>     https://www.ovirt.org/community/about/community-guidelines/
>>     List Archives:
>>    
>> https://lists.ovirt.org/archives/list/users@ovirt.org/message/IUSAU6HO776435P7DZ36CJ2AVHZLDGDW/
>>
>>
>>
>>
>> ___
>> Users mailing list -- users@ovirt.org
>> To unsubscribe send an email to users-le...@ovirt.org
>> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
>> oVirt Code of Conduct:
>> https://www.ovirt.org/community/about/community-guidelines/
>> List Archives:
>> https://lists.ovirt.org/archives/list/users@ovirt.org/message/L2UZCZGCD7U75UOB6D35XYWXD4V3JTB4/
>>
>>
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/site/privacy-policy/
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/2NTSXTADGO62VGYX6KZKMJ6E37MDYDCS/
> 



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/site/privacy-policy/
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/ETITLVDF6OF663SO2ZIRFNZFM24VCZH5/


[ovirt-users] Re: sun.security.validator.ValidatorException after update to 4.2.3

2018-05-08 Thread Jiří Sléžka
Hi,

solution was obvious. Upgrade process modified apache's ssl.conf and
reverted my customization.

for example - my custom cert...

SSLCertificateFile /etc/pki/tls/certs/ovirt.crt.pem

...was replaced by this

SSLCertificateFile /etc/pki/ovirt-engine/certs/apache.cer

the same for SSLCertificateKeyFile and SSLCACertificateFile

After reverting this changes everything works as usual but it makes me
unsure if I have my 3rd party certificate configured the right way...

Cheers,

Jiri


On 05/07/2018 05:41 PM, Jiří Sléžka wrote:
> Hi,
> 
> after upgrade ovirt from 4.2.2 to 4.2.3.5-1.el7.centos I cannot login
> into admin portal because
> 
> sun.security.validator.ValidatorException: PKIX path building failed:
> sun.security.provider.certpath.SunCertPathBuilderException: unable to
> find valid certification path to requested target
> 
> I am using custom 3rd party certificate
> 
> Any hints how to resolve this issue?
> 
> Thanks in advance,
> 
> Jiri Slezka
> 
> 
> 
> 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
> 



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org


[ovirt-users] sun.security.validator.ValidatorException after update to 4.2.3

2018-05-07 Thread Jiří Sléžka
Hi,

after upgrade ovirt from 4.2.2 to 4.2.3.5-1.el7.centos I cannot login
into admin portal because

sun.security.validator.ValidatorException: PKIX path building failed:
sun.security.provider.certpath.SunCertPathBuilderException: unable to
find valid certification path to requested target

I am using custom 3rd party certificate

Any hints how to resolve this issue?

Thanks in advance,

Jiri Slezka




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] problem importing ova vm

2018-02-22 Thread Jiří Sléžka
On 02/22/2018 01:58 PM, Richard W.M. Jones wrote:
> On Thu, Feb 22, 2018 at 01:27:18PM +0100, Jiří Sléžka wrote:
>> libvirt needs authentication to connect to libvirt URI qemu:///system
>> (see also: http://libvirt.org/auth.html http://libvirt.org/uri.html)
> 
> You can set the backend to direct to avoid needing libvirt:
> 
>   export LIBGUESTFS_BACKEND=direct
> 
> Alternately you can fiddle with the libvirt polkit configuration
> to permit access:

thanks, here is full output

http://mirror.slu.cz/tmp/libguestfs-test-tool.txt

Jiri

> 
>   https://libvirt.org/aclpolkit.html
> 
> Rich.
> 




smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] problem importing ova vm

2018-02-21 Thread Jiří Sléžka
On 02/21/2018 05:35 PM, Arik Hadas wrote:
> 
> 
> On Wed, Feb 21, 2018 at 6:03 PM, Jiří Sléžka <jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz>> wrote:
> 
>     On 02/21/2018 03:43 PM, Jiří Sléžka wrote:
> > On 02/20/2018 11:09 PM, Arik Hadas wrote:
> >>
>     >>
> >> On Tue, Feb 20, 2018 at 6:37 PM, Jiří Sléžka <jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz>
> >> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >>
> >>     On 02/20/2018 03:48 PM, Arik Hadas wrote:
> >>     >
> >>     >
> >>     > On Tue, Feb 20, 2018 at 3:49 PM, Jiří Sléžka
> <jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
> >>     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
> >>     >
> >>     >     Hi Arik,
> >>     >
> >>     >     On 02/20/2018 01:22 PM, Arik Hadas wrote:
> >>     >     >
> >>     >     >
> >>     >     > On Tue, Feb 20, 2018 at 2:03 PM, Jiří Sléžka
> <jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>
> >>     >     > <mailto:jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz> <mailto:jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz>>
> >>     <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>>> wrote:
> >>     >     >
> >>     >     >     Hi,
> >>     >     >
> >>     >     >
> >>     >     > Hi Jiří,
> >>     >     >  
> >>     >     >
> >>     >     >
> >>     >     >     I would like to try import some ova files into
> our oVirt
> >>     instance [1]
> >>     >     >     [2] but I facing problems.
> >>     >     >
> >>     >     >     I have downloaded all ova images into one of hosts
> >>     (ovirt01) into
> >>     >     >     direcory /ova
> >>     >     >
> >>     >     >     ll /ova/
> >>     >     >     total 6532872
> >>     >     >     -rw-r--r--. 1 vdsm kvm 1160387072 Feb 16 16:21
> >>     HAAS-hpcowrie.ovf
> >>     >     >     -rw-r--r--. 1 vdsm kvm 785984 Feb 16 16:22
> >>     HAAS-hpdio.ova
> >>     >     >     -rw-r--r--. 1 vdsm kvm  846736896 Feb 16 16:22
> >>     HAAS-hpjdwpd.ova
> >>     >     >     -rw-r--r--. 1 vdsm kvm  891043328 Feb 16 16:23
> >>     HAAS-hptelnetd.ova
> >>     >     >     -rw-r--r--. 1 vdsm kvm  908222464 Feb 16 16:23
> >>     HAAS-hpuchotcp.ova
> >>     >     >     -rw-r--r--. 1 vdsm kvm  880643072 Feb 16 16:24
> >>     HAAS-hpuchoudp.ova
> >>     >     >     -rw-r--r--. 1 vdsm kvm  890833920 Feb 16 16:24
> >>     HAAS-hpuchoweb.ova
> >>     >     >
> >>     >     >     Then I tried to import them - from host ovirt01 and
> >>     directory /ova but
> >>     >     >     spinner spins infinitly and nothing is happen.
> >>     >     >
> >>     >     >
> >>     >     > And does it work when you provide a path to the
> actual ova
> >>     file, i.e.,
> >>     >     > /ova/HAAS-hpdio.ova, rather than to the directory?
> >>     >
> >>     >     this time it ends with "Failed to load VM configuration
> from
> >>     OVA file:
> >>     >     /ova/HAAS-hpdio.ova" error. 
> >>     >
> >>     >
> >>     > Note that the logic that is applied on a specified folder
> is "try
> >>     > fetching an 'ova folder' out of the destination folder"
> rather than
> >>     > "list all

Re: [ovirt-users] problem importing ova vm

2018-02-21 Thread Jiří Sléžka
On 02/21/2018 03:43 PM, Jiří Sléžka wrote:
> On 02/20/2018 11:09 PM, Arik Hadas wrote:
>>
>>
>> On Tue, Feb 20, 2018 at 6:37 PM, Jiří Sléžka <jiri.sle...@slu.cz
>> <mailto:jiri.sle...@slu.cz>> wrote:
>>
>> On 02/20/2018 03:48 PM, Arik Hadas wrote:
>> >
>> >
>> > On Tue, Feb 20, 2018 at 3:49 PM, Jiří Sléžka <jiri.sle...@slu.cz 
>> <mailto:jiri.sle...@slu.cz>
>> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
>> >
>> >     Hi Arik,
>>     >
>> >     On 02/20/2018 01:22 PM, Arik Hadas wrote:
>> >     >
>> >     >
>> >     > On Tue, Feb 20, 2018 at 2:03 PM, Jiří Sléžka <jiri.sle...@slu.cz 
>> <mailto:jiri.sle...@slu.cz>
>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
>> >     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
>> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
>> >     >
>> >     >     Hi,
>> >     >
>> >     >
>> >     > Hi Jiří,
>> >     >  
>> >     >
>> >     >
>> >     >     I would like to try import some ova files into our oVirt
>> instance [1]
>> >     >     [2] but I facing problems.
>> >     >
>> >     >     I have downloaded all ova images into one of hosts
>> (ovirt01) into
>> >     >     direcory /ova
>> >     >
>> >     >     ll /ova/
>> >     >     total 6532872
>> >     >     -rw-r--r--. 1 vdsm kvm 1160387072 Feb 16 16:21
>> HAAS-hpcowrie.ovf
>> >     >     -rw-r--r--. 1 vdsm kvm 785984 Feb 16 16:22
>> HAAS-hpdio.ova
>> >     >     -rw-r--r--. 1 vdsm kvm  846736896 Feb 16 16:22
>> HAAS-hpjdwpd.ova
>> >     >     -rw-r--r--. 1 vdsm kvm  891043328 Feb 16 16:23
>> HAAS-hptelnetd.ova
>> >     >     -rw-r--r--. 1 vdsm kvm  908222464 Feb 16 16:23
>> HAAS-hpuchotcp.ova
>> >     >     -rw-r--r--. 1 vdsm kvm  880643072 Feb 16 16:24
>> HAAS-hpuchoudp.ova
>> >     >     -rw-r--r--. 1 vdsm kvm  890833920 Feb 16 16:24
>> HAAS-hpuchoweb.ova
>> >     >
>> >     >     Then I tried to import them - from host ovirt01 and
>> directory /ova but
>> >     >     spinner spins infinitly and nothing is happen.
>> >     >
>> >     >
>> >     > And does it work when you provide a path to the actual ova
>> file, i.e.,
>> >     > /ova/HAAS-hpdio.ova, rather than to the directory?
>> >
>> >     this time it ends with "Failed to load VM configuration from
>> OVA file:
>> >     /ova/HAAS-hpdio.ova" error. 
>> >
>> >
>> > Note that the logic that is applied on a specified folder is "try
>> > fetching an 'ova folder' out of the destination folder" rather than
>> > "list all the ova files inside the specified folder". It seems
>> that you
>> > expected the former output since there are no disks in that
>> folder, right?
>>
>> yes, It would be more user friendly to list all ova files and then
>> select which one to import (like listing all vms in vmware import)
>>
>> Maybe description of path field in manager should be "Path to ova file"
>> instead of "Path" :-)
>>
>>
>> Sorry, I obviously meant 'latter' rather than 'former' before..
>> Yeah, I agree that would be better, at least until listing the OVA files
>> in the folder is implemented (that was the original plan, btw) - could
>> you please file a bug?
> 
> yes, sure
> 
> 
>> >     >     I cannot see anything relevant in vdsm log of host ovirt01.
>> >     >
>> >     >     In the engine.log of our standalone ovirt manager is just 
>> this
>> >     >     relevant line
>> >     >
>> >     >     2018-02-20 12:35:04,289+01 INFO
>> >     >     [org.ovirt.engine.core.common.utils.ansible.AnsibleExecutor] 
>> (default
>> >     >     task-31) [458990a7-b054-491a-904e-5c4fe44892c4] Executing 
>> Ansible
>> >     >     command: ANSIBLE_STDOUT_CALLBACK=ovaqueryp

Re: [ovirt-users] problem importing ova vm

2018-02-21 Thread Jiří Sléžka
On 02/20/2018 11:09 PM, Arik Hadas wrote:
> 
> 
> On Tue, Feb 20, 2018 at 6:37 PM, Jiří Sléžka <jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz>> wrote:
> 
> On 02/20/2018 03:48 PM, Arik Hadas wrote:
> >
> >
> > On Tue, Feb 20, 2018 at 3:49 PM, Jiří Sléžka <jiri.sle...@slu.cz 
> <mailto:jiri.sle...@slu.cz>
> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >
> >     Hi Arik,
> >
> >     On 02/20/2018 01:22 PM, Arik Hadas wrote:
> >     >
> >     >
> >     > On Tue, Feb 20, 2018 at 2:03 PM, Jiří Sléžka <jiri.sle...@slu.cz 
> <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>
> >     > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>
> <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>>> wrote:
> >     >
> >     >     Hi,
> >     >
> >     >
> >     > Hi Jiří,
> >     >  
> >     >
> >     >
> >     >     I would like to try import some ova files into our oVirt
> instance [1]
> >     >     [2] but I facing problems.
> >     >
> >     >     I have downloaded all ova images into one of hosts
> (ovirt01) into
> >     >     direcory /ova
> >     >
> >     >     ll /ova/
> >     >     total 6532872
> >     >     -rw-r--r--. 1 vdsm kvm 1160387072 Feb 16 16:21
> HAAS-hpcowrie.ovf
> >     >     -rw-r--r--. 1 vdsm kvm 785984 Feb 16 16:22
> HAAS-hpdio.ova
> >     >     -rw-r--r--. 1 vdsm kvm  846736896 Feb 16 16:22
> HAAS-hpjdwpd.ova
> >     >     -rw-r--r--. 1 vdsm kvm  891043328 Feb 16 16:23
> HAAS-hptelnetd.ova
> >     >     -rw-r--r--. 1 vdsm kvm  908222464 Feb 16 16:23
> HAAS-hpuchotcp.ova
> >     >     -rw-r--r--. 1 vdsm kvm  880643072 Feb 16 16:24
> HAAS-hpuchoudp.ova
> >     >     -rw-r--r--. 1 vdsm kvm  890833920 Feb 16 16:24
> HAAS-hpuchoweb.ova
> >     >
> >     >     Then I tried to import them - from host ovirt01 and
> directory /ova but
> >     >     spinner spins infinitly and nothing is happen.
> >     >
> >     >
> >     > And does it work when you provide a path to the actual ova
> file, i.e.,
> >     > /ova/HAAS-hpdio.ova, rather than to the directory?
> >
> >     this time it ends with "Failed to load VM configuration from
> OVA file:
> >     /ova/HAAS-hpdio.ova" error. 
> >
> >
> > Note that the logic that is applied on a specified folder is "try
> > fetching an 'ova folder' out of the destination folder" rather than
> > "list all the ova files inside the specified folder". It seems
> that you
> > expected the former output since there are no disks in that
> folder, right?
> 
> yes, It would be more user friendly to list all ova files and then
> select which one to import (like listing all vms in vmware import)
> 
> Maybe description of path field in manager should be "Path to ova file"
> instead of "Path" :-)
> 
> 
> Sorry, I obviously meant 'latter' rather than 'former' before..
> Yeah, I agree that would be better, at least until listing the OVA files
> in the folder is implemented (that was the original plan, btw) - could
> you please file a bug?

yes, sure


> >     >     I cannot see anything relevant in vdsm log of host ovirt01.
> >     >
> >     >     In the engine.log of our standalone ovirt manager is just this
> >     >     relevant line
> >     >
> >     >     2018-02-20 12:35:04,289+01 INFO
> >     >     [org.ovirt.engine.core.common.utils.ansible.AnsibleExecutor] 
> (default
> >     >     task-31) [458990a7-b054-491a-904e-5c4fe44892c4] Executing 
> Ansible
> >     >     command: ANSIBLE_STDOUT_CALLBACK=ovaqueryplugin
> >     >     [/usr/bin/ansible-playbook,
> >     >     --private-key=/etc/pki/ovirt-engine/keys/engine_id_rsa,
> >     >     --inventory=/tmp/ansible-inventory8237874608161160784,
> >     >     --extra-vars=ovirt_query_ova_path=/ova,
> >     >     /usr/share/ovirt-engine/playbooks/ovirt-ova-query.yml] 
> [Logfile:
> >     >     
> /var/log/ovirt-engine/ova/ovirt-query-ova-ansib

Re: [ovirt-users] problem importing ova vm

2018-02-20 Thread Jiří Sléžka
On 02/20/2018 03:48 PM, Arik Hadas wrote:
> 
> 
> On Tue, Feb 20, 2018 at 3:49 PM, Jiří Sléžka <jiri.sle...@slu.cz
> <mailto:jiri.sle...@slu.cz>> wrote:
> 
> Hi Arik,
> 
> On 02/20/2018 01:22 PM, Arik Hadas wrote:
> >
>     >
> > On Tue, Feb 20, 2018 at 2:03 PM, Jiří Sléžka <jiri.sle...@slu.cz 
> <mailto:jiri.sle...@slu.cz>
> > <mailto:jiri.sle...@slu.cz <mailto:jiri.sle...@slu.cz>>> wrote:
> >
> >     Hi,
> >
> >
> > Hi Jiří,
> >  
> >
> >
> >     I would like to try import some ova files into our oVirt instance 
> [1]
> >     [2] but I facing problems.
> >
> >     I have downloaded all ova images into one of hosts (ovirt01) into
> >     direcory /ova
> >
> >     ll /ova/
> >     total 6532872
> >     -rw-r--r--. 1 vdsm kvm 1160387072 Feb 16 16:21 HAAS-hpcowrie.ovf
> >     -rw-r--r--. 1 vdsm kvm 785984 Feb 16 16:22 HAAS-hpdio.ova
> >     -rw-r--r--. 1 vdsm kvm  846736896 Feb 16 16:22 HAAS-hpjdwpd.ova
> >     -rw-r--r--. 1 vdsm kvm  891043328 Feb 16 16:23 HAAS-hptelnetd.ova
> >     -rw-r--r--. 1 vdsm kvm  908222464 Feb 16 16:23 HAAS-hpuchotcp.ova
> >     -rw-r--r--. 1 vdsm kvm  880643072 Feb 16 16:24 HAAS-hpuchoudp.ova
> >     -rw-r--r--. 1 vdsm kvm  890833920 Feb 16 16:24 HAAS-hpuchoweb.ova
> >
> >     Then I tried to import them - from host ovirt01 and directory /ova 
> but
> >     spinner spins infinitly and nothing is happen.
> >
> >
> > And does it work when you provide a path to the actual ova file, i.e.,
> > /ova/HAAS-hpdio.ova, rather than to the directory?
> 
> this time it ends with "Failed to load VM configuration from OVA file:
> /ova/HAAS-hpdio.ova" error. 
> 
> 
> Note that the logic that is applied on a specified folder is "try
> fetching an 'ova folder' out of the destination folder" rather than
> "list all the ova files inside the specified folder". It seems that you
> expected the former output since there are no disks in that folder, right?

yes, It would be more user friendly to list all ova files and then
select which one to import (like listing all vms in vmware import)

Maybe description of path field in manager should be "Path to ova file"
instead of "Path" :-)

> >     I cannot see anything relevant in vdsm log of host ovirt01.
> >
> >     In the engine.log of our standalone ovirt manager is just this
> >     relevant line
> >
> >     2018-02-20 12:35:04,289+01 INFO
> >     [org.ovirt.engine.core.common.utils.ansible.AnsibleExecutor] 
> (default
> >     task-31) [458990a7-b054-491a-904e-5c4fe44892c4] Executing Ansible
> >     command: ANSIBLE_STDOUT_CALLBACK=ovaqueryplugin
> >     [/usr/bin/ansible-playbook,
> >     --private-key=/etc/pki/ovirt-engine/keys/engine_id_rsa,
> >     --inventory=/tmp/ansible-inventory8237874608161160784,
> >     --extra-vars=ovirt_query_ova_path=/ova,
> >     /usr/share/ovirt-engine/playbooks/ovirt-ova-query.yml] [Logfile:
> >     
> /var/log/ovirt-engine/ova/ovirt-query-ova-ansible-20180220123504-ovirt01.net
> <http://ovirt-query-ova-ansible-20180220123504-ovirt01.net>
> >     <http://20180220123504-ovirt01.net
> <http://20180220123504-ovirt01.net>>.slu.cz.log]
> >
> >     also there are two ansible processes which are still running
> (and makes
> >     heavy load on system (load 9+ and growing, it looks like it
> eats all the
> >     memory and system starts swapping))
> >
> >     ovirt    32087  3.3  0.0 332252  5980 ?        Sl   12:35   0:41
> >     /usr/bin/python2 /usr/bin/ansible-playbook
> >     --private-key=/etc/pki/ovirt-engine/keys/engine_id_rsa
> >     --inventory=/tmp/ansible-inventory8237874608161160784
> >     --extra-vars=ovirt_query_ova_path=/ova
> >     /usr/share/ovirt-engine/playbooks/ovirt-ova-query.yml
> >     ovirt    32099 57.5 78.9 15972880 11215312 ?   R    12:35  11:52
> >     /usr/bin/python2 /usr/bin/ansible-playbook
> >     --private-key=/etc/pki/ovirt-engine/keys/engine_id_rsa
> >     --inventory=/tmp/ansible-inventory8237874608161160784
> >     --extra-vars=ovirt_query_ova_path=/ova
> >     /usr/share/ovirt-engine/playbooks/ovirt-ova-query.yml
> >
> >     playbook looks like
> >
> >     - hosts: all
>

[ovirt-users] problem importing ova vm

2018-02-20 Thread Jiří Sléžka
Hi,

I would like to try import some ova files into our oVirt instance [1]
[2] but I facing problems.

I have downloaded all ova images into one of hosts (ovirt01) into
direcory /ova

ll /ova/
total 6532872
-rw-r--r--. 1 vdsm kvm 1160387072 Feb 16 16:21 HAAS-hpcowrie.ovf
-rw-r--r--. 1 vdsm kvm 785984 Feb 16 16:22 HAAS-hpdio.ova
-rw-r--r--. 1 vdsm kvm  846736896 Feb 16 16:22 HAAS-hpjdwpd.ova
-rw-r--r--. 1 vdsm kvm  891043328 Feb 16 16:23 HAAS-hptelnetd.ova
-rw-r--r--. 1 vdsm kvm  908222464 Feb 16 16:23 HAAS-hpuchotcp.ova
-rw-r--r--. 1 vdsm kvm  880643072 Feb 16 16:24 HAAS-hpuchoudp.ova
-rw-r--r--. 1 vdsm kvm  890833920 Feb 16 16:24 HAAS-hpuchoweb.ova

Then I tried to import them - from host ovirt01 and directory /ova but
spinner spins infinitly and nothing is happen.

I cannot see anything relevant in vdsm log of host ovirt01.

In the engine.log of our standalone ovirt manager is just this relevant line

2018-02-20 12:35:04,289+01 INFO
[org.ovirt.engine.core.common.utils.ansible.AnsibleExecutor] (default
task-31) [458990a7-b054-491a-904e-5c4fe44892c4] Executing Ansible
command: ANSIBLE_STDOUT_CALLBACK=ovaqueryplugin
[/usr/bin/ansible-playbook,
--private-key=/etc/pki/ovirt-engine/keys/engine_id_rsa,
--inventory=/tmp/ansible-inventory8237874608161160784,
--extra-vars=ovirt_query_ova_path=/ova,
/usr/share/ovirt-engine/playbooks/ovirt-ova-query.yml] [Logfile:
/var/log/ovirt-engine/ova/ovirt-query-ova-ansible-20180220123504-ovirt01.net.slu.cz.log]

also there are two ansible processes which are still running (and makes
heavy load on system (load 9+ and growing, it looks like it eats all the
memory and system starts swapping))

ovirt32087  3.3  0.0 332252  5980 ?Sl   12:35   0:41
/usr/bin/python2 /usr/bin/ansible-playbook
--private-key=/etc/pki/ovirt-engine/keys/engine_id_rsa
--inventory=/tmp/ansible-inventory8237874608161160784
--extra-vars=ovirt_query_ova_path=/ova
/usr/share/ovirt-engine/playbooks/ovirt-ova-query.yml
ovirt32099 57.5 78.9 15972880 11215312 ?   R12:35  11:52
/usr/bin/python2 /usr/bin/ansible-playbook
--private-key=/etc/pki/ovirt-engine/keys/engine_id_rsa
--inventory=/tmp/ansible-inventory8237874608161160784
--extra-vars=ovirt_query_ova_path=/ova
/usr/share/ovirt-engine/playbooks/ovirt-ova-query.yml

playbook looks like

- hosts: all
  remote_user: root
  gather_facts: no

  roles:
- ovirt-ova-query

and it looks like it only runs query_ova.py but on all hosts?

How does this work? ...or should it work?

I am using latest 4.2.1.7-1.el7.centos version

Cheers,
Jiri Slezka


[1] https://haas.cesnet.cz/#!index.md - Cesnet HAAS
[2] https://haas.cesnet.cz/downloads/release-01/ - Image repository



smime.p7s
Description: S/MIME Cryptographic Signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


  1   2   >