date:20161124

Ok thanks

On 24 November 2016 at 10:00, Ala Hino  wrote:

> It is a known issue and Maor Lipchuk (mlipchuk) is working on a fix.
>
> On Thu, Nov 24, 2016 at 11:57 AM, Maton, Brett 
> wrote:
>
>> If I try to migrate a disk of a running VM to another storage domain it
>> fails with the following message:
>>
>> Operation Cancelled
>>
>> Error while executing action: User is not logged in.
>>
>>
>> Migrating disks of stopped VM's continues to work.
>>
>> Probably a bug ?
>>
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/users
>>
>>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] vm nic 10g

2016-11-24 Thread Gianluca Cecchi

On Thu, Nov 24, 2016 at 9:24 AM, qinglong.d...@horebdata.cn <
qinglong.d...@horebdata.cn> wrote:

> Hi, all:
> I have an ovirt 4.0.5 environment. There is a 10g nic in the
> host. I have created a vm which uses the 10g nic. The type of the nic is
> virtio. Then I got a 1g virtual nic in the vm. Now I want a 10g virtual nic
> in the vm. Is it possible? Anyone can help? Thanks!
>
>
>
Did you make any performance test to evaluate real performance?
What is the qemu-kvm command line generated by your oVirt?

I see in my 4.0.5 environment with 1Gb/snic/vnic, it is something like this:

 -netdev tap,fd=24,id=hostnet0,vhost=on,vhostfd=28 -device
virtio-net-pci,netdev=hostnet0,id=net0,mac=00:1a:4a:16:01:58,bus=pci.0,addr=0x3

According to this link below, this syntax should give you 10Gb/s
performance I don't have a 10Gb/s to verify though at my side... ;-)

http://www.linux-kvm.org/page/10G_NIC_performance:_VFIO_vs_virtio

Gianluca
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

[ovirt-users] ovirt 4.0.6 - live disk migration fails

If I try to migrate a disk of a running VM to another storage domain it
fails with the following message:

Operation Cancelled

Error while executing action: User is not logged in.


Migrating disks of stopped VM's continues to work.

Probably a bug ?
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

[ovirt-users] [ANN] oVirt 4.0.6 Second Release Candidate is now available

2016-11-24 Thread Sandro Bonazzola

The oVirt Project is pleased to announce the availability of oVirt 4.0.6
second release candidate for testing, as of November 24th, 2016.

This release is available now for:
* Fedora 23 (tech preview)
* Red Hat Enterprise Linux 7.2 or later
* CentOS Linux (or similar) 7.2 or later

This release supports Hypervisor Hosts running:
* Red Hat Enterprise Linux 7.2 or later
* CentOS Linux (or similar) 7.2 or later
* Fedora 23 (tech preview)
* oVirt Next Generation Node 4.0

This is pre-release software. Please take a look at our community page[1]
to know how to ask questions and interact with developers and users.
All issues or bugs should be reported via oVirt Bugzilla[2].
This pre-release should not to be used in production.

This update is the second release candidate of the sixth in a series of
stabilization updates to the 4.0 series.
4.0.6 brings 2 enhancements and 43 bugfixes, including 16 high or urgent
severity fixes, on top of oVirt 4.0 series
See the release notes [3] for installation / upgrade instructions and a
list of new features and bugs fixed.

Notes:
* A new oVirt Live ISO is available. [4]
* A new oVirt Next Generation Node will be available soon [4]
* A new oVirt Engine Appliance is available for Red Hat Enterprise Linux
and CentOS Linux (or similar)
* Mirrors[5] might need up to one day to synchronize.

Additional Resources:
* Read more about the oVirt 4.0.6 release highlights:
http://www.ovirt.org/release/4.0.6/
* Get more oVirt Project updates on Twitter: https://twitter.com/ovirt
* Check out the latest project news on the oVirt blog:
http://www.ovirt.org/blog/

[1] https://www.ovirt.org/community/
[2] https://bugzilla.redhat.com/enter_bug.cgi?classification=oVirt
[3] http://www.ovirt.org/release/4.0.6/
[4] http://resources.ovirt.org/pub/ovirt-4.0-pre/iso/
[5] http://www.ovirt.org/Repository_mirrors#Current_mirrors

-- 
Sandro Bonazzola
Better technology. Faster innovation. Powered by community collaboration.
See how it works at redhat.com
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] Recommended Ovirt Implementation on Active-Active Datacenters (Site 1 and Site2) - Same Cluster

2016-11-24 Thread Ondra Machacek

There are examples of how to work with affinity labels in Python SDK
version 4, if you are interested:

-
https://github.com/oVirt/ovirt-engine-sdk/blob/master/sdk/examples/add_affinity_label.py

-
https://github.com/oVirt/ovirt-engine-sdk/blob/master/sdk/examples/assign_affinity_label_to_vm.py

-
https://github.com/oVirt/ovirt-engine-sdk/blob/master/sdk/examples/list_affinity_labels.py

On 11/24/2016 07:28 PM, Rogério Ceni Coelho wrote:

Thanks Roy. I will try.

Em qui, 24 de nov de 2016 às 13:01, Roy Golan > escreveu:

Affinity labels [1] will allow you to label the hosts and vms to
site1 and site2 and that should be it.

- create label per site
- add the redpective label to each vm and host

Unfortunately there is no UI for that but with SDK or rest it's easy

[1] https://www.ovirt.org/blog/2016/07/affinity-labels/

On Nov 24, 2016 3:12 PM, "Rogério Ceni Coelho"
>
wrote:

Hi Ovirt Jedi´s !!!

First of all, congrats about the product !!! I love Ovirt !!!

I am using Ovirt 4.0.4 with 10 hosts and 58 virtual machines on
two Active-Active Datacenters using two EMC Vplex + two EMC
VNX5500 + eight Dell Blades + 8 Dell PowerEdge M610 and two M620
Servers.

Half servers are on Site 1 and Half servers on Site 2. The same
with VMs. All Sites work as one and have redundant network,
storage, power, etc etc etc ...

I want to know what is the best way to set that VM number 1 runs
on Site 1 and VM number 2 runs on Site 2 ?

On Vmware 5.1 we use DRS Group Manager and on Hyper-V we use
Custom Properties on hosts and on VMs. What we use on oVirt
without segregate on two different Datacenters or two different
clusters ?

Thanks in advance.

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] vm.conf on one of the node is missing

On Thu, Nov 24, 2016 at 3:35 PM, Simone Tiraboschi 
wrote:

>
>
> On Thu, Nov 24, 2016 at 3:23 PM, knarra  wrote:
>
>> On 11/24/2016 07:47 PM, Simone Tiraboschi wrote:
>>
>>
>>
>> On Thu, Nov 24, 2016 at 3:06 PM, knarra  wrote:
>>
>>> On 11/24/2016 07:27 PM, Simone Tiraboschi wrote:
>>>
>>>
>>>
>>> On Thu, Nov 24, 2016 at 2:39 PM, knarra  wrote:
>>>
 On 11/24/2016 06:56 PM, Simone Tiraboschi wrote:



 On Thu, Nov 24, 2016 at 2:08 PM, knarra  wrote:

> On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:
>
>
>
> On Thu, Nov 24, 2016 at 1:26 PM, knarra  wrote:
>
>> Hi,
>>
>> I have three nodes with glusterfs as storage domain. For some
>> reason i see that vm.conf from /var/run/ovirt-hosted-engine-ha is
>> missing and due to this on one of my host i see that Hosted Engine HA : 
>> Not
>> Active. Once i copy the file from some other node and restart
>> ovirt-ha-broker and ovirt-ha-agent services everything works fine. But 
>> then
>> this happens again. Can some please help me identify why this happens.
>> Below is the log i see in ovirt-ha-agent.logs.
>>
>>
>> https://paste.fedoraproject.org/489120/79990345/
>>
>>
> Once the engine correctly imported the hosted-engine storage domain, a
> couple of OVF_STORE volumes will appear there.
> Every modification to the engine VM configuration will be written by
> the engine into that OVF_STORE, so all the ovirt-ha-agent running on the
> hosted-engine hosts will be able to re-start the engine VM with a coherent
> configuration.
>
> Till the engine imports the hosted-engine storage domain,
> ovirt-ha-agent will fall back to the initial vm.conf.
>
> In you case the OVF_STORE volume is there,
> but the agent fails extracting the engine VM configuration:
> MainThread::INFO::2016-11-24 17:55:04,914::ovf_store::112::
> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
> Extracting Engine VM OVF from the OVF_STORE
> MainThread::INFO::2016-11-24 17:55:04,919::ovf_store::119::
> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
> OVF_STORE volume path: /rhev/data-center/mnt/glusterSD/10.70.36.79:
> _engine/27f054c3-c245-4039-b42a-c28b37043016/i
> mages/fdf49778-9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-47f
> 8-ac2e-4c85fc3f8699
> MainThread::ERROR::2016-11-24 17:55:04,928::ovf_store::124::
> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
> Unable to extract HEVM OVF
>
> So it tries to rollback to the initial vm.conf, but also that one
> seams to miss some values and so the agent is failing:
> MainThread::ERROR::2016-11-24 17:55:04,974::agent::205::ovir
> t_hosted_engine_ha.agent.agent.Agent::(_run_agent) Error:
> ''Configuration value not found: 
> file=/var/run/ovirt-hosted-engine-ha/vm.conf,
> key=memSize'' - trying to restart agent
>
> Both of the issue seams storage related, could yuo please share your
> gluster logs?
>
>
>>
>>
>> Thanks
>>
>> kasturi
>>
>>
> Hi Simone,
>
> Below [1] is the link for the sosreports on the first two hosts.
> The third host has some issue. Once it is up will give the sosreport from
> there as well.
>

 And the host where you see the initial issue was the third one?

 It is on the first host.




>>> It seams that host1 is failing reading from the the hosted-engine
>>> storage domain:
>>>
>>> [2016-11-24 12:33:43.678467] W [MSGID: 114031]
>>> [client-rpc-fops.c:2938:client3_3_lookup_cbk] 0-engine-client-2: remote
>>> operation failed. Path: / (----0001)
>>> [Transport endpoint is not connected]
>>> [2016-11-24 12:33:43.678747] E [rpc-clnt.c:365:saved_frames_unwind]
>>> (--> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642]
>>> (--> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e]
>>> (--> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e]
>>> (--> /lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4]
>>> (--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] )
>>> 0-engine-client-2: forced unwinding frame type(GlusterFS 3.3)
>>> op(LOOKUP(27)) called at 2016-11-24 12:33:07.495178 (xid=0x82a1c)
>>> [2016-11-24 12:33:43.678982] E [rpc-clnt.c:365:saved_frames_unwind]
>>> (--> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642]
>>> (--> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e]
>>> (--> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e]
>>> (--> /lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4]
>>> (--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0]

[ovirt-users] 4.0.5 create vm from template on cinder/ceph backed storage

2016-11-24 Thread Jordan Conway

I've got a problem where since upgrading to 4.0.5 I am still unable to
create a vm from a template.
The issue now is that it seems to be stuck in a loop trying to and failing
to create the vm, which is making my logs explode with the following

https://paste.fedoraproject.org/489161/14799981/

And in the engine UI "Failed to complete VM fromtemplate-1 creation."
thousands of times.

The issue started I believe with this sql error

https://paste.fedoraproject.org/489183/00010131/

Any insight on how I can stop the persistent error would be appreciated, as
it is also preventng me from running engine-setup due to running jobs, even
though vdsclient shows nothing.

vdsClient -s 0 getAllTasksStatuses
{'status': {'message': 'OK', 'code': 0}, 'allTasksStatus': {}}


Thank you,
Jordan Conway
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

[ovirt-users] Timout with large image uploads in ovirt 4.0.5

2016-11-24 Thread Claude Durocher


When trying to upload large kvm images (more than 50 GB) with the upload 
option, we get timeouts. In the ovirt-imageio-proxy/image-proxy.log we have:

ERROR 2016-11-24 16:18:51,211 session:293:root:(_decode_ovirt_ticket) Failed to 
verify proxy ticket: Ticket life time expired

We can recover and restart the upload but it is annoying to constantly watch 
the process. Is there a timeout parameter we can change in the imageio-proxy?
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] can not use iscsi storage type on ovirtandGlusterfshyper-converged environment

2016-11-24 Thread 胡茂荣

I find this problem more info : 
  I use flashcache  on ovirt hosts , have dm device in /dev/mapper/ :


[root@horeba init.d]# dmsetup info -C （and I set them in multipath 
blacklist ）
Name Maj Min Stat Open Targ Event  UUID
flash_sdb253   0 L--w01  0 
flash_sdf253   2 L--w01  0 
flash_sdd253   1 L--w01  0   
[root@horeba init.d]# multipath -l  
[root@horeba init.d]# 
  
[root@horeba init.d]# ll /dev/mapper/
total 0
crw--- 1 root root 10, 236 Nov 25 10:09 control
lrwxrwxrwx 1 root root   7 Nov 25 12:51 flash_sdb -> ../dm-0
lrwxrwxrwx 1 root root   7 Nov 25 12:51 flash_sdd -> ../dm-1
lrwxrwxrwx 1 root root   7 Nov 25 12:51 flash_sdf -> ../dm-2


  on this condition , ovirt UI add  iscsi type storage  will fail .


   If I delete the flashcache device  /dev/mapper/flash_*  , ovirt UI  add 
iscsi type storage  have no problem .  
  
   I need flashcache for using ssd cache on my environment ,  how can I use 
iscsi type storage on this environment , please help me , Thanks！
   
 
-- Original --
From:  "Sahina Bose";
Date:  Thu, Nov 3, 2016 05:54 PM
To:  "胡茂荣"; 
Cc:  "Maor Lipchuk"; "Jeff Nelson"; 
"users"; 
Subject:  Re: [ovirt-users] can not use iscsi storage type on 
ovirtandGlusterfshyper-converged environment

 
A wild guess, not sure if it is related - can you check if multipathd service 
is enabled. If you set up your oVirt-Gluster hyperconverged environment via 
gdeploy, multipathd service is disabled and the /etc/multipath.conf is edited 
to blacklist all devices - this was to fix 
https://bugzilla.redhat.com/show_bug.cgi?id=1303940


Since you mention you're unable to add iscsi storage only in this environment, 
thought it's worth checking.


On Thu, Nov 3, 2016 at 6:40 AM, 胡茂荣  wrote:


 my   environment rpm  are :
 [root@horeba ~]# rpm -q vdsm
vdsm-4.18.13-1.el7.centos.x86_64


[root@horeba ~]# rpm -aq | grep ovirt
ovirt-vmconsole-host-1.0.4-1.el7.centos.noarch
ovirt-imageio-common-0.4.0-1.el7.noarch
ovirt-hosted-engine-setup-2.0.2.2-1.el7.centos.noarch
ovirt-imageio-daemon-0.4.0-1.el7.noarch
ovirt-engine-appliance-4.0-20160928.1.el7.centos.noarch
ovirt-vmconsole-1.0.4-1.el7.centos.noarch
ovirt-host-deploy-1.5.2-1.el7.centos.noarch
ovirt-hosted-engine-ha-2.0.4-1.el7.centos.noarch
ovirt-release40-4.0.4-1.noarch
ovirt-setup-lib-1.0.2-1.el7.centos.noarch
ovirt-engine-sdk-python-3.6.9.1-1.el7.centos.noarch



  and I test if not on ' ovirt and Glusterfs hyper-converged environment ' ,  
ovirt WEB UI  add iscsi storage  work ok .


 
-- Original --
From:  "Maor Lipchuk";
Date:  Wed, Nov 2, 2016 07:37 PM
To:  "胡茂荣"; 
Cc:  "users"; "Jeff Nelson"; "Nir 
Soffer"; 
Subject:  Re: [ovirt-users] can not use iscsi storage type on ovirtandGlusterfs 
hyper-converged environment



 
Thanks for the logs,

What kind of VDSM version are you using? 
"rpm -q vdsm"
There seems to be a similar issue which was reported recently in the VDSM area
 (see https://bugzilla.redhat.com/show_bug.cgi?id=1197292)
It should be fixed in later versions of VDSM vdsm-4.16.12-2.el7ev.x86_64
Adding also Nir and Jeff to the thread, if they have any insights


Regards,
Maor


On Wed, Nov 2, 2016 at 4:11 AM, 胡茂荣  wrote:


 Hi Maor:
  vdsm/supervdsm/engine log on attachment .  I mkfs.xfs the lun block 
device and mount to /mnt , dd write  ,dmesg not report error ,dd result is ok :

/dev/sdi  50G   33M   50G   1% /mnt 


[root@horebc mnt]# for i in `seq 3`; do dd if=/dev/zero of=./file   bs=1G 
count=1 oflag=direct ; done
1+0 records in
1+0 records out
1073741824 bytes (1.1 GB) copied, 13.3232 s, 80.6 MB/s
1+0 records in
1+0 records out
1073741824 bytes (1.1 GB) copied, 9.89988 s, 108 MB/s
1+0 records in
1+0 records out
1073741824 bytes (1.1 GB) copied, 14.0143 s, 76.6 MB/s 
 
   my envirnment  have three  network segments (hosts have 3 network segments ) 
:
   engine  and glusterfs mount : 192.168.11.X/24   
glusterfs brick : 192.168.10.x/24
iscsi : 192.168.1.0/24  


and I add 192.168.1.0/24 to engine vm ,  ovirt web UI report the same error 
.



 humaorong 
  2016-11-2


-- Original --
From:  "Maor Lipchuk";
Date:  Tue, Nov 1, 2016 08:14 PM
To:  "胡茂荣"; 
Cc:  "users"; 
Subject:  Re: [ovirt-users] can not use iscsi storage type on ovirt 
andGlusterfs hyper-converged environment

 

Hi 胡茂荣

Can u please also add the VDSM and engine logs.
If you try discover and

Re: [ovirt-users] expired cert for aaa

2016-11-24 Thread Yedidyah Bar David

On Thu, Nov 24, 2016 at 12:47 PM, cmc  wrote:
> Hi Yedidyah,
>
> Attached are the setup logs, sorry for the delay. I checked all the backup
> certs, and the expiry dates were either in 2021 or 2026.

Sorry, no idea.

This means that all certs generated by engine-setup were ok.

Not sure what caused this message. If it happens again, please
check the certificate's details, who issued/signed it etc.

Best,

>
> Regards,
>
> Cam
>
> On Tue, Nov 8, 2016 at 7:25 AM, Yedidyah Bar David  wrote:
>>
>> On Mon, Nov 7, 2016 at 9:15 PM, cmc  wrote:
>> > To reply to my own email:
>> >
>> > This is now fixed.
>> >
>> > I originally ran these steps for the upgrade:
>> >
>> > # yum install
>> > http://resources.ovirt.org/pub/yum-repo/ovirt-release40.rpm
>> > # yum update "ovirt-engine-setup*"
>> > # engine-setup
>> >
>> > There were no errors reported during the process. I could login as the
>> > internal user without any errors. It was just using an external
>> > provider,
>> > which made me think it was an aaa issue, so I looked
>> > at the certificate exported from AD which had an expiry of 2063.
>> >
>> > I tried running engine-setup again, and this fixed the issue. I have no
>> > idea
>> > what happened along the way, I will check the logs. I notice it reports:
>> >
>> > [ INFO  ] Upgrading CA
>>
>> engine-setup always emits this message. You might find more details in the
>> setup logs regarding what it actually did.
>>
>> >
>> > so it looks like it creates a cert. Why it would have created one with
>> > such
>> > a short expiry date is a mystery to me.
>> >
>> > Hope this helps anyone who might come across this issue
>>
>> Thanks for the report!
>>
>> Can you please share both setup logs? Thanks.
>>
>> Also, most files should be backed up by engine-setup prior to being
>> changed/removed. So you can check the backups. E.g.:
>>
>> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem.20160120160548 -noout
>> -enddate
>> notAfter=May 22 07:32:23 2025 GMT
>> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem -noout -enddate
>> notAfter=Mar  6 09:46:44 2026 GMT
>>
>> Or,
>>
>> find /etc/pki/ovirt-engine -name "*.cer*" -o -name "*.pem*" | while
>> read file; do echo $file $(openssl x509 -in $file -noout -enddate);
>> done
>>
>> Best,
>> --
>> Didi
>
>



-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] vm.conf on one of the node is missing

On Thu, Nov 24, 2016 at 1:26 PM, knarra  wrote:

> Hi,
>
> I have three nodes with glusterfs as storage domain. For some reason i
> see that vm.conf from /var/run/ovirt-hosted-engine-ha is missing and due
> to this on one of my host i see that Hosted Engine HA : Not Active. Once i
> copy the file from some other node and restart ovirt-ha-broker and
> ovirt-ha-agent services everything works fine. But then this happens again.
> Can some please help me identify why this happens. Below is the log i see
> in ovirt-ha-agent.logs.
>
>
> https://paste.fedoraproject.org/489120/79990345/
>
>
Once the engine correctly imported the hosted-engine storage domain, a
couple of OVF_STORE volumes will appear there.
Every modification to the engine VM configuration will be written by the
engine into that OVF_STORE, so all the ovirt-ha-agent running on the
hosted-engine hosts will be able to re-start the engine VM with a coherent
configuration.

Till the engine imports the hosted-engine storage domain, ovirt-ha-agent
will fall back to the initial vm.conf.

In you case the OVF_STORE volume is there,
but the agent fails extracting the engine VM configuration:
MainThread::INFO::2016-11-24
17:55:04,914::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
Extracting Engine VM OVF from the OVF_STORE
MainThread::INFO::2016-11-24
17:55:04,919::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
OVF_STORE volume path: /rhev/data-center/mnt/glusterSD/10.70.36.79:
_engine/27f054c3-c245-4039-b42a-c28b37043016/images/fdf49778-9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-47f8-ac2e-4c85fc3f8699
MainThread::ERROR::2016-11-24
17:55:04,928::ovf_store::124::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
Unable to extract HEVM OVF

So it tries to rollback to the initial vm.conf, but also that one seams to
miss some values and so the agent is failing:
MainThread::ERROR::2016-11-24
17:55:04,974::agent::205::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Error: ''Configuration value not found:
file=/var/run/ovirt-hosted-engine-ha/vm.conf, key=memSize'' - trying to
restart agent

Both of the issue seams storage related, could yuo please share your
gluster logs?


>
>
> Thanks
>
> kasturi
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] expired cert for aaa

2016-11-24 Thread cmc

I ran engine-setup again, but the issue was still present. However, I found
that by using a different browser (Firefox instead of Chrome), I did not
get the error. I cleared the cookies in Chrome and the issue no longer
occured. So it may well be a browser issue.

Thanks,

C

On Thu, Nov 24, 2016 at 11:22 AM, cmc  wrote:

> Interestingly, I just got this same error again after I upgraded (I
> upgraded from 4.0.4 to 4.0.5 to fix the 'internal server error' bug that
> was fixed in 4.0.5)
>
> server_error: The connection reader was unable to successfully complete
> TLS negotiation: javax.net.ssl.SSLHandshakeException: 
> java.security.cert.CertificateExpiredException:
> NotAfter: Fri Nov 04 00:19:18 GMT 2016 caused by 
> java.security.cert.CertificateExpiredException:
> NotAfter: Fri Nov 04 00:19:18 GMT 2016
>
> Shall I send the logs?
>
> On Thu, Nov 24, 2016 at 10:55 AM, Yedidyah Bar David 
> wrote:
>
>> On Thu, Nov 24, 2016 at 12:47 PM, cmc  wrote:
>> > Hi Yedidyah,
>> >
>> > Attached are the setup logs, sorry for the delay. I checked all the
>> backup
>> > certs, and the expiry dates were either in 2021 or 2026.
>>
>> Sorry, no idea.
>>
>> This means that all certs generated by engine-setup were ok.
>>
>> Not sure what caused this message. If it happens again, please
>> check the certificate's details, who issued/signed it etc.
>>
>> Best,
>>
>> >
>> > Regards,
>> >
>> > Cam
>> >
>> > On Tue, Nov 8, 2016 at 7:25 AM, Yedidyah Bar David 
>> wrote:
>> >>
>> >> On Mon, Nov 7, 2016 at 9:15 PM, cmc  wrote:
>> >> > To reply to my own email:
>> >> >
>> >> > This is now fixed.
>> >> >
>> >> > I originally ran these steps for the upgrade:
>> >> >
>> >> > # yum install
>> >> > http://resources.ovirt.org/pub/yum-repo/ovirt-release40.rpm
>> >> > # yum update "ovirt-engine-setup*"
>> >> > # engine-setup
>> >> >
>> >> > There were no errors reported during the process. I could login as
>> the
>> >> > internal user without any errors. It was just using an external
>> >> > provider,
>> >> > which made me think it was an aaa issue, so I looked
>> >> > at the certificate exported from AD which had an expiry of 2063.
>> >> >
>> >> > I tried running engine-setup again, and this fixed the issue. I have
>> no
>> >> > idea
>> >> > what happened along the way, I will check the logs. I notice it
>> reports:
>> >> >
>> >> > [ INFO  ] Upgrading CA
>> >>
>> >> engine-setup always emits this message. You might find more details in
>> the
>> >> setup logs regarding what it actually did.
>> >>
>> >> >
>> >> > so it looks like it creates a cert. Why it would have created one
>> with
>> >> > such
>> >> > a short expiry date is a mystery to me.
>> >> >
>> >> > Hope this helps anyone who might come across this issue
>> >>
>> >> Thanks for the report!
>> >>
>> >> Can you please share both setup logs? Thanks.
>> >>
>> >> Also, most files should be backed up by engine-setup prior to being
>> >> changed/removed. So you can check the backups. E.g.:
>> >>
>> >> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem.20160120160548 -noout
>> >> -enddate
>> >> notAfter=May 22 07:32:23 2025 GMT
>> >> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem -noout -enddate
>> >> notAfter=Mar  6 09:46:44 2026 GMT
>> >>
>> >> Or,
>> >>
>> >> find /etc/pki/ovirt-engine -name "*.cer*" -o -name "*.pem*" | while
>> >> read file; do echo $file $(openssl x509 -in $file -noout -enddate);
>> >> done
>> >>
>> >> Best,
>> >> --
>> >> Didi
>> >
>> >
>>
>>
>>
>> --
>> Didi
>>
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] How to notify cluster nodes after "engine-config --set IPTablesConfigSiteCustom..." ?

2016-11-24 Thread Yedidyah Bar David

On Thu, Nov 24, 2016 at 1:10 PM,   wrote:
> Thank you Didi.
>
> The proposed method works.
> I described my experience here:
> https://blog.it-kb.ru/2016/11/24/extension-of-iptables-add-custom-rules-on-the-ovirt-4-0-hosts/

Thanks for this post, and the report! (although I can't read Russian).

Best,

>
> 23.11.2016, 16:12, "Yedidyah Bar David" :
>> On Wed, Nov 23, 2016 at 1:54 PM,  wrote:
>>>  "As I wrote there, you can also do this manually"
>>>
>>>  How?
>>
>> I am not sure I understand the question.
>>
>> The same way you configure iptables on non-oVirt-hosts machines.
>>
>> If you mean "How to imitate the way the engine does this during
>> host deploy", then I don't know - you can check engine sources
>> for that. I am guessing that you can get the values of IPTablesConfig
>> and IPTablesConfigSiteCustom with engine-config, replace inside the
>> latter "@CUSTOM_RULES@" with the contents of the former, then copy
>> the result to the host and load it with iptables-restore (and/or
>> copy to /etc/sysconfig/iptables and restart iptables service).
>>
>>>  23.11.2016, 14:23, "Yedidyah Bar David" :
  On Wed, Nov 23, 2016 at 12:51 PM,  wrote:
>   Hi Didi!
>
>   https://www.mail-archive.com/users@ovirt.org/msg37193.html
>
>   "Move to maintenance and reinstall" to add the iptables rules ?
>
>   Are you serious?
>
>   There is no other way (without reinstalling the hosts) ?

  AFAIK, using ovirt-host-deploy, no.

  I am not aware of an engine API or vdsm verb to do this, but these are
  not my main area of expertise.

  As I wrote there, you can also do this manually.

  The oVirt engine is not a replacement for configuration management
  systems. If you have complex needs, might as well uncheck this
  checkbox and use other means.

  Best,

>   23.11.2016, 13:07, "Yedidyah Bar David" :
>>   On Wed, Nov 23, 2016 at 12:02 PM,  wrote:
>>>Hmm. I just rebooted the host, but the iptables rules have not been 
>>> updated :(
>>>
>>>On Engine server my custom iptables rules are visible:
>>>
>>># engine-config --get IPTablesConfigSiteCustom
>>>
>>>IPTablesConfigSiteCustom:
>>>-A INPUT -p tcp --dport 2301 -j ACCEPT -m comment --comment 'HPE 
>>> System Management Homepage'
>>>-A INPUT -p tcp --dport 2381 -j ACCEPT -m comment --comment 'HPE 
>>> System Management Homepage (Secure port)'
>>> version: general
>>>
>>>How to update the configuration on the hosts ?
>>>
>>>23.11.2016, 11:30, "aleksey.maksi...@it-kb.ru" 
>>> :
Hello oVirt guru`s !

oVirt Engine Version: 4.0.5.5-1.el7.centos

I updated the configuration of the firewall on the Engine server 
 with "engine-config --set IPTablesConfigSiteCustom...".
How to notify cluster nodes (all virtualization hosts) about the 
 changes without reboot?
>>
>>   Please check the other thread here "[ovirt-users] Hook to add firewall
>>   rules". Thanks.
>>
>>>___
>>>Users mailing list
>>>Users@ovirt.org
>>>http://lists.ovirt.org/mailman/listinfo/users
>>
>>   --
>>   Didi

  --
  Didi
>>
>> --
>> Didi



-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] expired cert for aaa

2016-11-24 Thread cmc

Interestingly, I just got this same error again after I upgraded (I
upgraded from 4.0.4 to 4.0.5 to fix the 'internal server error' bug that
was fixed in 4.0.5)

server_error: The connection reader was unable to successfully complete TLS
negotiation: javax.net.ssl.SSLHandshakeException:
java.security.cert.CertificateExpiredException: NotAfter: Fri Nov 04
00:19:18 GMT 2016 caused by java.security.cert.CertificateExpiredException:
NotAfter: Fri Nov 04 00:19:18 GMT 2016

Shall I send the logs?

On Thu, Nov 24, 2016 at 10:55 AM, Yedidyah Bar David 
wrote:

> On Thu, Nov 24, 2016 at 12:47 PM, cmc  wrote:
> > Hi Yedidyah,
> >
> > Attached are the setup logs, sorry for the delay. I checked all the
> backup
> > certs, and the expiry dates were either in 2021 or 2026.
>
> Sorry, no idea.
>
> This means that all certs generated by engine-setup were ok.
>
> Not sure what caused this message. If it happens again, please
> check the certificate's details, who issued/signed it etc.
>
> Best,
>
> >
> > Regards,
> >
> > Cam
> >
> > On Tue, Nov 8, 2016 at 7:25 AM, Yedidyah Bar David 
> wrote:
> >>
> >> On Mon, Nov 7, 2016 at 9:15 PM, cmc  wrote:
> >> > To reply to my own email:
> >> >
> >> > This is now fixed.
> >> >
> >> > I originally ran these steps for the upgrade:
> >> >
> >> > # yum install
> >> > http://resources.ovirt.org/pub/yum-repo/ovirt-release40.rpm
> >> > # yum update "ovirt-engine-setup*"
> >> > # engine-setup
> >> >
> >> > There were no errors reported during the process. I could login as the
> >> > internal user without any errors. It was just using an external
> >> > provider,
> >> > which made me think it was an aaa issue, so I looked
> >> > at the certificate exported from AD which had an expiry of 2063.
> >> >
> >> > I tried running engine-setup again, and this fixed the issue. I have
> no
> >> > idea
> >> > what happened along the way, I will check the logs. I notice it
> reports:
> >> >
> >> > [ INFO  ] Upgrading CA
> >>
> >> engine-setup always emits this message. You might find more details in
> the
> >> setup logs regarding what it actually did.
> >>
> >> >
> >> > so it looks like it creates a cert. Why it would have created one with
> >> > such
> >> > a short expiry date is a mystery to me.
> >> >
> >> > Hope this helps anyone who might come across this issue
> >>
> >> Thanks for the report!
> >>
> >> Can you please share both setup logs? Thanks.
> >>
> >> Also, most files should be backed up by engine-setup prior to being
> >> changed/removed. So you can check the backups. E.g.:
> >>
> >> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem.20160120160548 -noout
> >> -enddate
> >> notAfter=May 22 07:32:23 2025 GMT
> >> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem -noout -enddate
> >> notAfter=Mar  6 09:46:44 2026 GMT
> >>
> >> Or,
> >>
> >> find /etc/pki/ovirt-engine -name "*.cer*" -o -name "*.pem*" | while
> >> read file; do echo $file $(openssl x509 -in $file -noout -enddate);
> >> done
> >>
> >> Best,
> >> --
> >> Didi
> >
> >
>
>
>
> --
> Didi
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] How to notify cluster nodes after "engine-config --set IPTablesConfigSiteCustom..." ?

2016-11-24 Thread aleksey . maksimov

Thank you Didi. 

The proposed method works.
I described my experience here:
https://blog.it-kb.ru/2016/11/24/extension-of-iptables-add-custom-rules-on-the-ovirt-4-0-hosts/

23.11.2016, 16:12, "Yedidyah Bar David" :
> On Wed, Nov 23, 2016 at 1:54 PM,  wrote:
>>  "As I wrote there, you can also do this manually"
>>
>>  How?
>
> I am not sure I understand the question.
>
> The same way you configure iptables on non-oVirt-hosts machines.
>
> If you mean "How to imitate the way the engine does this during
> host deploy", then I don't know - you can check engine sources
> for that. I am guessing that you can get the values of IPTablesConfig
> and IPTablesConfigSiteCustom with engine-config, replace inside the
> latter "@CUSTOM_RULES@" with the contents of the former, then copy
> the result to the host and load it with iptables-restore (and/or
> copy to /etc/sysconfig/iptables and restart iptables service).
>
>>  23.11.2016, 14:23, "Yedidyah Bar David" :
>>>  On Wed, Nov 23, 2016 at 12:51 PM,  wrote:
   Hi Didi!

   https://www.mail-archive.com/users@ovirt.org/msg37193.html

   "Move to maintenance and reinstall" to add the iptables rules ?

   Are you serious?

   There is no other way (without reinstalling the hosts) ?
>>>
>>>  AFAIK, using ovirt-host-deploy, no.
>>>
>>>  I am not aware of an engine API or vdsm verb to do this, but these are
>>>  not my main area of expertise.
>>>
>>>  As I wrote there, you can also do this manually.
>>>
>>>  The oVirt engine is not a replacement for configuration management
>>>  systems. If you have complex needs, might as well uncheck this
>>>  checkbox and use other means.
>>>
>>>  Best,
>>>
   23.11.2016, 13:07, "Yedidyah Bar David" :
>   On Wed, Nov 23, 2016 at 12:02 PM,  wrote:
>>    Hmm. I just rebooted the host, but the iptables rules have not been 
>> updated :(
>>
>>    On Engine server my custom iptables rules are visible:
>>
>>    # engine-config --get IPTablesConfigSiteCustom
>>
>>    IPTablesConfigSiteCustom:
>>    -A INPUT -p tcp --dport 2301 -j ACCEPT -m comment --comment 'HPE 
>> System Management Homepage'
>>    -A INPUT -p tcp --dport 2381 -j ACCEPT -m comment --comment 'HPE 
>> System Management Homepage (Secure port)'
>> version: general
>>
>>    How to update the configuration on the hosts ?
>>
>>    23.11.2016, 11:30, "aleksey.maksi...@it-kb.ru" 
>> :
>>>    Hello oVirt guru`s !
>>>
>>>    oVirt Engine Version: 4.0.5.5-1.el7.centos
>>>
>>>    I updated the configuration of the firewall on the Engine server 
>>> with "engine-config --set IPTablesConfigSiteCustom...".
>>>    How to notify cluster nodes (all virtualization hosts) about the 
>>> changes without reboot?
>
>   Please check the other thread here "[ovirt-users] Hook to add firewall
>   rules". Thanks.
>
>>    ___
>>    Users mailing list
>>    Users@ovirt.org
>>    http://lists.ovirt.org/mailman/listinfo/users
>
>   --
>   Didi
>>>
>>>  --
>>>  Didi
>
> --
> Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

[ovirt-users] vm.conf on one of the node is missing


Hi,

I have three nodes with glusterfs as storage domain. For some 
reason i see that vm.conf from /var/run/ovirt-hosted-engine-ha is 
missing and due to this on one of my host i see that Hosted Engine HA : 
Not Active. Once i copy the file from some other node and restart 
ovirt-ha-broker and ovirt-ha-agent services everything works fine. But 
then this happens again. Can some please help me identify why this 
happens. Below is the log i see in ovirt-ha-agent.logs.



https://paste.fedoraproject.org/489120/79990345/


Thanks

kasturi

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] expired cert for aaa

2016-11-24 Thread Yedidyah Bar David

On Thu, Nov 24, 2016 at 1:58 PM, cmc  wrote:
> I ran engine-setup again, but the issue was still present. However, I found
> that by using a different browser (Firefox instead of Chrome), I did not get
> the error. I cleared the cookies in Chrome and the issue no longer occured.
> So it may well be a browser issue.

Thanks for the report. Adding Alexander in case he wishes to
check/note something.

Best,

>
> Thanks,
>
> C
>
> On Thu, Nov 24, 2016 at 11:22 AM, cmc  wrote:
>>
>> Interestingly, I just got this same error again after I upgraded (I
>> upgraded from 4.0.4 to 4.0.5 to fix the 'internal server error' bug that was
>> fixed in 4.0.5)
>>
>> server_error: The connection reader was unable to successfully complete
>> TLS negotiation: javax.net.ssl.SSLHandshakeException:
>> java.security.cert.CertificateExpiredException: NotAfter: Fri Nov 04
>> 00:19:18 GMT 2016 caused by java.security.cert.CertificateExpiredException:
>> NotAfter: Fri Nov 04 00:19:18 GMT 2016
>>
>> Shall I send the logs?
>>
>> On Thu, Nov 24, 2016 at 10:55 AM, Yedidyah Bar David 
>> wrote:
>>>
>>> On Thu, Nov 24, 2016 at 12:47 PM, cmc  wrote:
>>> > Hi Yedidyah,
>>> >
>>> > Attached are the setup logs, sorry for the delay. I checked all the
>>> > backup
>>> > certs, and the expiry dates were either in 2021 or 2026.
>>>
>>> Sorry, no idea.
>>>
>>> This means that all certs generated by engine-setup were ok.
>>>
>>> Not sure what caused this message. If it happens again, please
>>> check the certificate's details, who issued/signed it etc.
>>>
>>> Best,
>>>
>>> >
>>> > Regards,
>>> >
>>> > Cam
>>> >
>>> > On Tue, Nov 8, 2016 at 7:25 AM, Yedidyah Bar David 
>>> > wrote:
>>> >>
>>> >> On Mon, Nov 7, 2016 at 9:15 PM, cmc  wrote:
>>> >> > To reply to my own email:
>>> >> >
>>> >> > This is now fixed.
>>> >> >
>>> >> > I originally ran these steps for the upgrade:
>>> >> >
>>> >> > # yum install
>>> >> > http://resources.ovirt.org/pub/yum-repo/ovirt-release40.rpm
>>> >> > # yum update "ovirt-engine-setup*"
>>> >> > # engine-setup
>>> >> >
>>> >> > There were no errors reported during the process. I could login as
>>> >> > the
>>> >> > internal user without any errors. It was just using an external
>>> >> > provider,
>>> >> > which made me think it was an aaa issue, so I looked
>>> >> > at the certificate exported from AD which had an expiry of 2063.
>>> >> >
>>> >> > I tried running engine-setup again, and this fixed the issue. I have
>>> >> > no
>>> >> > idea
>>> >> > what happened along the way, I will check the logs. I notice it
>>> >> > reports:
>>> >> >
>>> >> > [ INFO  ] Upgrading CA
>>> >>
>>> >> engine-setup always emits this message. You might find more details in
>>> >> the
>>> >> setup logs regarding what it actually did.
>>> >>
>>> >> >
>>> >> > so it looks like it creates a cert. Why it would have created one
>>> >> > with
>>> >> > such
>>> >> > a short expiry date is a mystery to me.
>>> >> >
>>> >> > Hope this helps anyone who might come across this issue
>>> >>
>>> >> Thanks for the report!
>>> >>
>>> >> Can you please share both setup logs? Thanks.
>>> >>
>>> >> Also, most files should be backed up by engine-setup prior to being
>>> >> changed/removed. So you can check the backups. E.g.:
>>> >>
>>> >> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem.20160120160548 -noout
>>> >> -enddate
>>> >> notAfter=May 22 07:32:23 2025 GMT
>>> >> # openssl x509 -in /etc/pki/ovirt-engine/ca.pem -noout -enddate
>>> >> notAfter=Mar  6 09:46:44 2026 GMT
>>> >>
>>> >> Or,
>>> >>
>>> >> find /etc/pki/ovirt-engine -name "*.cer*" -o -name "*.pem*" | while
>>> >> read file; do echo $file $(openssl x509 -in $file -noout -enddate);
>>> >> done
>>> >>
>>> >> Best,
>>> >> --
>>> >> Didi
>>> >
>>> >
>>>
>>>
>>>
>>> --
>>> Didi
>>
>>
>



-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] vm.conf on one of the node is missing

On Thu, Nov 24, 2016 at 2:08 PM, knarra  wrote:

> On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:
>
>
>
> On Thu, Nov 24, 2016 at 1:26 PM, knarra  wrote:
>
>> Hi,
>>
>> I have three nodes with glusterfs as storage domain. For some reason
>> i see that vm.conf from /var/run/ovirt-hosted-engine-ha is missing and
>> due to this on one of my host i see that Hosted Engine HA : Not Active.
>> Once i copy the file from some other node and restart ovirt-ha-broker and
>> ovirt-ha-agent services everything works fine. But then this happens again.
>> Can some please help me identify why this happens. Below is the log i see
>> in ovirt-ha-agent.logs.
>>
>>
>> https://paste.fedoraproject.org/489120/79990345/
>>
>>
> Once the engine correctly imported the hosted-engine storage domain, a
> couple of OVF_STORE volumes will appear there.
> Every modification to the engine VM configuration will be written by the
> engine into that OVF_STORE, so all the ovirt-ha-agent running on the
> hosted-engine hosts will be able to re-start the engine VM with a coherent
> configuration.
>
> Till the engine imports the hosted-engine storage domain, ovirt-ha-agent
> will fall back to the initial vm.conf.
>
> In you case the OVF_STORE volume is there,
> but the agent fails extracting the engine VM configuration:
> MainThread::INFO::2016-11-24 17:55:04,914::ovf_store::112::
> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
> Extracting Engine VM OVF from the OVF_STORE
> MainThread::INFO::2016-11-24 17:55:04,919::ovf_store::119::
> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
> OVF_STORE volume path: /rhev/data-center/mnt/glusterSD/10.70.36.79:
> _engine/27f054c3-c245-4039-b42a-c28b37043016/images/fdf49778-
> 9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-47f8-ac2e-4c85fc3f8699
> MainThread::ERROR::2016-11-24 17:55:04,928::ovf_store::124::
> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
> Unable to extract HEVM OVF
>
> So it tries to rollback to the initial vm.conf, but also that one seams to
> miss some values and so the agent is failing:
> MainThread::ERROR::2016-11-24 17:55:04,974::agent::205::
> ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent) Error:
> ''Configuration value not found: file=/var/run/ovirt-hosted-engine-ha/vm.conf,
> key=memSize'' - trying to restart agent
>
> Both of the issue seams storage related, could yuo please share your
> gluster logs?
>
>
>>
>>
>> Thanks
>>
>> kasturi
>>
>>
> Hi Simone,
>
> Below [1] is the link for the sosreports on the first two hosts. The
> third host has some issue. Once it is up will give the sosreport from there
> as well.
>

And the host where you see the initial issue was the third one?


> [1] http://rhsqe-repo.lab.eng.blr.redhat.com/sosreports/HC/vm_conf/
>
> Thanks
>
> kasturi
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] vm.conf on one of the node is missing

On Thu, Nov 24, 2016 at 2:39 PM, knarra  wrote:

> On 11/24/2016 06:56 PM, Simone Tiraboschi wrote:
>
>
>
> On Thu, Nov 24, 2016 at 2:08 PM, knarra  wrote:
>
>> On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:
>>
>>
>>
>> On Thu, Nov 24, 2016 at 1:26 PM, knarra  wrote:
>>
>>> Hi,
>>>
>>> I have three nodes with glusterfs as storage domain. For some reason
>>> i see that vm.conf from /var/run/ovirt-hosted-engine-ha is missing and
>>> due to this on one of my host i see that Hosted Engine HA : Not Active.
>>> Once i copy the file from some other node and restart ovirt-ha-broker and
>>> ovirt-ha-agent services everything works fine. But then this happens again.
>>> Can some please help me identify why this happens. Below is the log i see
>>> in ovirt-ha-agent.logs.
>>>
>>>
>>> https://paste.fedoraproject.org/489120/79990345/
>>>
>>>
>> Once the engine correctly imported the hosted-engine storage domain, a
>> couple of OVF_STORE volumes will appear there.
>> Every modification to the engine VM configuration will be written by the
>> engine into that OVF_STORE, so all the ovirt-ha-agent running on the
>> hosted-engine hosts will be able to re-start the engine VM with a coherent
>> configuration.
>>
>> Till the engine imports the hosted-engine storage domain, ovirt-ha-agent
>> will fall back to the initial vm.conf.
>>
>> In you case the OVF_STORE volume is there,
>> but the agent fails extracting the engine VM configuration:
>> MainThread::INFO::2016-11-24 17:55:04,914::ovf_store::112::
>> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>> Extracting Engine VM OVF from the OVF_STORE
>> MainThread::INFO::2016-11-24 17:55:04,919::ovf_store::119::
>> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>> OVF_STORE volume path: /rhev/data-center/mnt/glusterSD/10.70.36.79:
>> _engine/27f054c3-c245-4039-b42a-c28b37043016/images/fdf49778-9a06-49c6-
>> bf7a-a0f12425911c/8c954add-6bcf-47f8-ac2e-4c85fc3f8699
>> MainThread::ERROR::2016-11-24 17:55:04,928::ovf_store::124::
>> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>> Unable to extract HEVM OVF
>>
>> So it tries to rollback to the initial vm.conf, but also that one seams
>> to miss some values and so the agent is failing:
>> MainThread::ERROR::2016-11-24 17:55:04,974::agent::205::ovir
>> t_hosted_engine_ha.agent.agent.Agent::(_run_agent) Error:
>> ''Configuration value not found: 
>> file=/var/run/ovirt-hosted-engine-ha/vm.conf,
>> key=memSize'' - trying to restart agent
>>
>> Both of the issue seams storage related, could yuo please share your
>> gluster logs?
>>
>>
>>>
>>>
>>> Thanks
>>>
>>> kasturi
>>>
>>>
>> Hi Simone,
>>
>> Below [1] is the link for the sosreports on the first two hosts. The
>> third host has some issue. Once it is up will give the sosreport from there
>> as well.
>>
>
> And the host where you see the initial issue was the third one?
>
> It is on the first host.
>
>
>
>
It seams that host1 is failing reading from the the hosted-engine storage
domain:

[2016-11-24 12:33:43.678467] W [MSGID: 114031]
[client-rpc-fops.c:2938:client3_3_lookup_cbk] 0-engine-client-2: remote
operation failed. Path: / (----0001) [Transport
endpoint is not connected]
[2016-11-24 12:33:43.678747] E [rpc-clnt.c:365:saved_frames_unwind] (-->
/lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] (-->
/lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] (-->
/lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] (-->
/lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4] (-->
/lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] )
0-engine-client-2: forced unwinding frame type(GlusterFS 3.3)
op(LOOKUP(27)) called at 2016-11-24 12:33:07.495178 (xid=0x82a1c)
[2016-11-24 12:33:43.678982] E [rpc-clnt.c:365:saved_frames_unwind] (-->
/lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] (-->
/lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] (-->
/lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] (-->
/lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4] (-->
/lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] )
0-engine-client-2: forced unwinding frame type(GlusterFS 3.3)
op(LOOKUP(27)) called at 2016-11-24 12:33:08.770637 (xid=0x82a1d)
[2016-11-24 12:33:43.679001] W [MSGID: 114031]
[client-rpc-fops.c:2938:client3_3_lookup_cbk] 0-engine-client-2: remote
operation failed. Path:
/27f054c3-c245-4039-b42a-c28b37043016/images/39960f40-4aae-4714-ba73-1637785fae7c/38fa3519-f21e-4671-8c69-d1497ff8a490
(1090c25b-9c90-434e-a133-faf9647cc992) [Transport endpoint is not connected]
[2016-11-24 12:33:43.679303] E [rpc-clnt.c:365:saved_frames_unwind] (-->
/lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] (-->
/lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] (-->

Re: [ovirt-users] ovirt 4.0.6 - live disk migration fails

2016-11-24 Thread Ala Hino

Thanks!
Please add Vdsm and Engine logs.

On Nov 24, 2016 4:03 PM, "Maton, Brett"  wrote:

> Sure when I get 5 minutes :)
>
> Which logs would you be interested in ?
>
> On 24 November 2016 at 13:33, Ala Hino  wrote:
>
>> Hi Brett,
>>
>> I apologize but I confused this with a different issue in this area.
>> Would appreciate if you could provide logs. If possible and actually might
>> be simpler to open a bug and the logs there.
>>
>> Thanks!
>>
>> On Thu, Nov 24, 2016 at 12:13 PM, Maton, Brett 
>> wrote:
>>
>>> Ok thanks
>>>
>>> On 24 November 2016 at 10:00, Ala Hino  wrote:
>>>
 It is a known issue and Maor Lipchuk (mlipchuk) is working on a fix.

 On Thu, Nov 24, 2016 at 11:57 AM, Maton, Brett <
 mat...@ltresources.co.uk> wrote:

> If I try to migrate a disk of a running VM to another storage domain
> it fails with the following message:
>
> Operation Cancelled
>
> Error while executing action: User is not logged in.
>
>
> Migrating disks of stopped VM's continues to work.
>
> Probably a bug ?
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>

>>>
>>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] vm.conf on one of the node is missing


On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 1:26 PM, knarra > wrote:


Hi,

I have three nodes with glusterfs as storage domain. For some
reason i see that vm.conf from /var/run/ovirt-hosted-engine-ha is
missing and due to this on one of my host i see that Hosted Engine
HA : Not Active. Once i copy the file from some other node and
restart ovirt-ha-broker and ovirt-ha-agent services everything
works fine. But then this happens again. Can some please help me
identify why this happens. Below is the log i see in
ovirt-ha-agent.logs.


https://paste.fedoraproject.org/489120/79990345/



Once the engine correctly imported the hosted-engine storage domain, a 
couple of OVF_STORE volumes will appear there.
Every modification to the engine VM configuration will be written by 
the engine into that OVF_STORE, so all the ovirt-ha-agent running on 
the hosted-engine hosts will be able to re-start the engine VM with a 
coherent configuration.


Till the engine imports the hosted-engine storage domain, 
ovirt-ha-agent will fall back to the initial vm.conf.


In you case the OVF_STORE volume is there,
but the agent fails extracting the engine VM configuration:
MainThread::INFO::2016-11-24 
17:55:04,914::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF) 
Extracting Engine VM OVF from the OVF_STORE
MainThread::INFO::2016-11-24 
17:55:04,919::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF) 
OVF_STORE volume path: 
/rhev/data-center/mnt/glusterSD/10.70.36.79:_engine/27f054c3-c245-4039-b42a-c28b37043016/images/fdf49778-9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-47f8-ac2e-4c85fc3f8699
MainThread::ERROR::2016-11-24 
17:55:04,928::ovf_store::124::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF) 
Unable to extract HEVM OVF


So it tries to rollback to the initial vm.conf, but also that one 
seams to miss some values and so the agent is failing:
MainThread::ERROR::2016-11-24 
17:55:04,974::agent::205::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent) 
Error: ''Configuration value not found: 
file=/var/run/ovirt-hosted-engine-ha/vm.conf, key=memSize'' - trying 
to restart agent


Both of the issue seams storage related, could yuo please share your 
gluster logs?



Thanks

kasturi



Hi Simone,

Below [1] is the link for the sosreports on the first two hosts. 
The third host has some issue. Once it is up will give the sosreport 
from there as well.


[1] http://rhsqe-repo.lab.eng.blr.redhat.com/sosreports/HC/vm_conf/

Thanks

kasturi

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] ovirt 4.0.6 - live disk migration fails

2016-11-24 Thread Ala Hino

Hi Brett,

I apologize but I confused this with a different issue in this area. Would
appreciate if you could provide logs. If possible and actually might be
simpler to open a bug and the logs there.

Thanks!

On Thu, Nov 24, 2016 at 12:13 PM, Maton, Brett 
wrote:

> Ok thanks
>
> On 24 November 2016 at 10:00, Ala Hino  wrote:
>
>> It is a known issue and Maor Lipchuk (mlipchuk) is working on a fix.
>>
>> On Thu, Nov 24, 2016 at 11:57 AM, Maton, Brett 
>> wrote:
>>
>>> If I try to migrate a disk of a running VM to another storage domain it
>>> fails with the following message:
>>>
>>> Operation Cancelled
>>>
>>> Error while executing action: User is not logged in.
>>>
>>>
>>> Migrating disks of stopped VM's continues to work.
>>>
>>> Probably a bug ?
>>>
>>> ___
>>> Users mailing list
>>> Users@ovirt.org
>>> http://lists.ovirt.org/mailman/listinfo/users
>>>
>>>
>>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

[ovirt-users] Recommended Ovirt Implementation on Active-Active Datacenters (Site 1 and Site2) - Same Cluster

2016-11-24 Thread Rogério Ceni Coelho

Hi Ovirt Jedi´s !!!

First of all, congrats about the product !!! I love Ovirt !!!

I am using Ovirt 4.0.4 with 10 hosts and 58 virtual machines on two
Active-Active Datacenters using two EMC Vplex + two EMC VNX5500 + eight
Dell Blades + 8 Dell PowerEdge M610 and two M620 Servers.

Half servers are on Site 1 and Half servers on Site 2. The same with VMs.
All Sites work as one and have redundant network, storage, power, etc etc
etc ...

I want to know what is the best way to set that VM number 1 runs on Site 1
and VM number 2 runs on Site 2 ?

On Vmware 5.1 we use DRS Group Manager and on Hyper-V we use Custom
Properties on hosts and on VMs. What we use on oVirt without segregate on
two different Datacenters or two different clusters ?

Thanks in advance.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] ovirt 4.0.6 - live disk migration fails

Sure when I get 5 minutes :)

Which logs would you be interested in ?

On 24 November 2016 at 13:33, Ala Hino  wrote:

> Hi Brett,
>
> I apologize but I confused this with a different issue in this area. Would
> appreciate if you could provide logs. If possible and actually might be
> simpler to open a bug and the logs there.
>
> Thanks!
>
> On Thu, Nov 24, 2016 at 12:13 PM, Maton, Brett 
> wrote:
>
>> Ok thanks
>>
>> On 24 November 2016 at 10:00, Ala Hino  wrote:
>>
>>> It is a known issue and Maor Lipchuk (mlipchuk) is working on a fix.
>>>
>>> On Thu, Nov 24, 2016 at 11:57 AM, Maton, Brett >> > wrote:
>>>
 If I try to migrate a disk of a running VM to another storage domain it
 fails with the following message:

 Operation Cancelled

 Error while executing action: User is not logged in.


 Migrating disks of stopped VM's continues to work.

 Probably a bug ?

 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users


>>>
>>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] vm.conf on one of the node is missing


On 11/24/2016 07:27 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 2:39 PM, knarra > wrote:


On 11/24/2016 06:56 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 2:08 PM, knarra > wrote:

On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 1:26 PM, knarra > wrote:

Hi,

I have three nodes with glusterfs as storage domain.
For some reason i see that vm.conf from
/var/run/ovirt-hosted-engine-ha is missing and due to
this on one of my host i see that Hosted Engine HA : Not
Active. Once i copy the file from some other node and
restart ovirt-ha-broker and ovirt-ha-agent services
everything works fine. But then this happens again. Can
some please help me identify why this happens. Below is
the log i see in ovirt-ha-agent.logs.


https://paste.fedoraproject.org/489120/79990345/



Once the engine correctly imported the hosted-engine storage
domain, a couple of OVF_STORE volumes will appear there.
Every modification to the engine VM configuration will be
written by the engine into that OVF_STORE, so all the
ovirt-ha-agent running on the hosted-engine hosts will be
able to re-start the engine VM with a coherent configuration.

Till the engine imports the hosted-engine storage domain,
ovirt-ha-agent will fall back to the initial vm.conf.

In you case the OVF_STORE volume is there,
but the agent fails extracting the engine VM configuration:
MainThread::INFO::2016-11-24

17:55:04,914::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
Extracting Engine VM OVF from the OVF_STORE
MainThread::INFO::2016-11-24

17:55:04,919::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
OVF_STORE volume path:

/rhev/data-center/mnt/glusterSD/10.70.36.79:_engine/27f054c3-c245-4039-b42a-c28b37043016/images/fdf49778-9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-47f8-ac2e-4c85fc3f8699
MainThread::ERROR::2016-11-24

17:55:04,928::ovf_store::124::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
Unable to extract HEVM OVF

So it tries to rollback to the initial vm.conf, but also
that one seams to miss some values and so the agent is failing:
MainThread::ERROR::2016-11-24

17:55:04,974::agent::205::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Error: ''Configuration value not found:
file=/var/run/ovirt-hosted-engine-ha/vm.conf, key=memSize''
- trying to restart agent

Both of the issue seams storage related, could yuo please
share your gluster logs?


Thanks

kasturi



Hi Simone,

Below [1] is the link for the sosreports on the first two
hosts. The third host has some issue. Once it is up will give
the sosreport from there as well.


And the host where you see the initial issue was the third one?

It is on the first host.


It seams that host1 is failing reading from the the hosted-engine 
storage domain:


[2016-11-24 12:33:43.678467] W [MSGID: 114031] 
[client-rpc-fops.c:2938:client3_3_lookup_cbk] 0-engine-client-2: 
remote operation failed. Path: / 
(----0001) [Transport endpoint is not 
connected]
[2016-11-24 12:33:43.678747] E [rpc-clnt.c:365:saved_frames_unwind] 
(--> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] 
(--> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] 
(--> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] 
(--> 
/lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4] 
(--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] ) 
0-engine-client-2: forced unwinding frame type(GlusterFS 3.3) 
op(LOOKUP(27)) called at 2016-11-24 12:33:07.495178 (xid=0x82a1c)
[2016-11-24 12:33:43.678982] E [rpc-clnt.c:365:saved_frames_unwind] 
(--> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] 
(--> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] 
(--> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] 
(--> 
/lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4] 
(--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] ) 
0-engine-client-2: forced unwinding frame type(GlusterFS 3.3) 
op(LOOKUP(27)) called at 2016-11-24 12:33:08.770637 (xid=0x82a1d)
[2016-11-24 12:33:43.679001] W [MSGID: 114031] 
[client-rpc-fops.c:2938:client3_3_lookup_cbk] 0-engine-client-2: 
remote operation

Re: [ovirt-users] vm.conf on one of the node is missing

On Thu, Nov 24, 2016 at 3:06 PM, knarra  wrote:

> On 11/24/2016 07:27 PM, Simone Tiraboschi wrote:
>
>
>
> On Thu, Nov 24, 2016 at 2:39 PM, knarra  wrote:
>
>> On 11/24/2016 06:56 PM, Simone Tiraboschi wrote:
>>
>>
>>
>> On Thu, Nov 24, 2016 at 2:08 PM, knarra  wrote:
>>
>>> On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:
>>>
>>>
>>>
>>> On Thu, Nov 24, 2016 at 1:26 PM, knarra  wrote:
>>>
 Hi,

 I have three nodes with glusterfs as storage domain. For some
 reason i see that vm.conf from /var/run/ovirt-hosted-engine-ha is
 missing and due to this on one of my host i see that Hosted Engine HA : Not
 Active. Once i copy the file from some other node and restart
 ovirt-ha-broker and ovirt-ha-agent services everything works fine. But then
 this happens again. Can some please help me identify why this happens.
 Below is the log i see in ovirt-ha-agent.logs.


 https://paste.fedoraproject.org/489120/79990345/


>>> Once the engine correctly imported the hosted-engine storage domain, a
>>> couple of OVF_STORE volumes will appear there.
>>> Every modification to the engine VM configuration will be written by the
>>> engine into that OVF_STORE, so all the ovirt-ha-agent running on the
>>> hosted-engine hosts will be able to re-start the engine VM with a coherent
>>> configuration.
>>>
>>> Till the engine imports the hosted-engine storage domain, ovirt-ha-agent
>>> will fall back to the initial vm.conf.
>>>
>>> In you case the OVF_STORE volume is there,
>>> but the agent fails extracting the engine VM configuration:
>>> MainThread::INFO::2016-11-24 17:55:04,914::ovf_store::112::
>>> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>>> Extracting Engine VM OVF from the OVF_STORE
>>> MainThread::INFO::2016-11-24 17:55:04,919::ovf_store::119::
>>> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>>> OVF_STORE volume path: /rhev/data-center/mnt/glusterSD/10.70.36.79:
>>> _engine/27f054c3-c245-4039-b42a-c28b37043016/i
>>> mages/fdf49778-9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-
>>> 47f8-ac2e-4c85fc3f8699
>>> MainThread::ERROR::2016-11-24 17:55:04,928::ovf_store::124::
>>> ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>>> Unable to extract HEVM OVF
>>>
>>> So it tries to rollback to the initial vm.conf, but also that one seams
>>> to miss some values and so the agent is failing:
>>> MainThread::ERROR::2016-11-24 17:55:04,974::agent::205::ovir
>>> t_hosted_engine_ha.agent.agent.Agent::(_run_agent) Error:
>>> ''Configuration value not found: 
>>> file=/var/run/ovirt-hosted-engine-ha/vm.conf,
>>> key=memSize'' - trying to restart agent
>>>
>>> Both of the issue seams storage related, could yuo please share your
>>> gluster logs?
>>>
>>>


 Thanks

 kasturi


>>> Hi Simone,
>>>
>>> Below [1] is the link for the sosreports on the first two hosts. The
>>> third host has some issue. Once it is up will give the sosreport from there
>>> as well.
>>>
>>
>> And the host where you see the initial issue was the third one?
>>
>> It is on the first host.
>>
>>
>>
>>
> It seams that host1 is failing reading from the the hosted-engine storage
> domain:
>
> [2016-11-24 12:33:43.678467] W [MSGID: 114031] 
> [client-rpc-fops.c:2938:client3_3_lookup_cbk]
> 0-engine-client-2: remote operation failed. Path: /
> (----0001) [Transport endpoint is not
> connected]
> [2016-11-24 12:33:43.678747] E [rpc-clnt.c:365:saved_frames_unwind] (-->
> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] (-->
> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] (-->
> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] (-->
> /lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4]
> (--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] )
> 0-engine-client-2: forced unwinding frame type(GlusterFS 3.3)
> op(LOOKUP(27)) called at 2016-11-24 12:33:07.495178 (xid=0x82a1c)
> [2016-11-24 12:33:43.678982] E [rpc-clnt.c:365:saved_frames_unwind] (-->
> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] (-->
> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] (-->
> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] (-->
> /lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4]
> (--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] )
> 0-engine-client-2: forced unwinding frame type(GlusterFS 3.3)
> op(LOOKUP(27)) called at 2016-11-24 12:33:08.770637 (xid=0x82a1d)
> [2016-11-24 12:33:43.679001] W [MSGID: 114031] 
> [client-rpc-fops.c:2938:client3_3_lookup_cbk]
> 0-engine-client-2: remote operation failed. Path: /27f054c3-c245-4039-b42a-
> c28b37043016/images/39960f40-4aae-4714-ba73-1637785fae7c/
> 38fa3519-f21e-4671-8c69-d1497ff8a490 (1090c25b-9c90-434e-a133-faf9647cc992)
>

Re: [ovirt-users] vm.conf on one of the node is missing


On 11/24/2016 07:47 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 3:06 PM, knarra > wrote:


On 11/24/2016 07:27 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 2:39 PM, knarra > wrote:

On 11/24/2016 06:56 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 2:08 PM, knarra > wrote:

On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:



On Thu, Nov 24, 2016 at 1:26 PM, knarra
> wrote:

Hi,

I have three nodes with glusterfs as storage
domain. For some reason i see that vm.conf from
/var/run/ovirt-hosted-engine-ha is missing and due
to this on one of my host i see that Hosted Engine
HA : Not Active. Once i copy the file from some
other node and restart ovirt-ha-broker and
ovirt-ha-agent services everything works fine. But
then this happens again. Can some please help me
identify why this happens. Below is the log i see
in ovirt-ha-agent.logs.


https://paste.fedoraproject.org/489120/79990345/



Once the engine correctly imported the hosted-engine
storage domain, a couple of OVF_STORE volumes will
appear there.
Every modification to the engine VM configuration will
be written by the engine into that OVF_STORE, so all
the ovirt-ha-agent running on the hosted-engine hosts
will be able to re-start the engine VM with a coherent
configuration.

Till the engine imports the hosted-engine storage
domain, ovirt-ha-agent will fall back to the initial
vm.conf.

In you case the OVF_STORE volume is there,
but the agent fails extracting the engine VM configuration:
MainThread::INFO::2016-11-24

17:55:04,914::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
Extracting Engine VM OVF from the OVF_STORE
MainThread::INFO::2016-11-24

17:55:04,919::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
OVF_STORE volume path:

/rhev/data-center/mnt/glusterSD/10.70.36.79:_engine/27f054c3-c245-4039-b42a-c28b37043016/images/fdf49778-9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-47f8-ac2e-4c85fc3f8699
MainThread::ERROR::2016-11-24

17:55:04,928::ovf_store::124::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
Unable to extract HEVM OVF

So it tries to rollback to the initial vm.conf, but
also that one seams to miss some values and so the
agent is failing:
MainThread::ERROR::2016-11-24

17:55:04,974::agent::205::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
Error: ''Configuration value not found:
file=/var/run/ovirt-hosted-engine-ha/vm.conf,
key=memSize'' - trying to restart agent

Both of the issue seams storage related, could yuo
please share your gluster logs?


Thanks

kasturi



Hi Simone,

Below [1] is the link for the sosreports on the
first two hosts. The third host has some issue. Once it
is up will give the sosreport from there as well.


And the host where you see the initial issue was the third one?

It is on the first host.


It seams that host1 is failing reading from the the hosted-engine
storage domain:

[2016-11-24 12:33:43.678467] W [MSGID: 114031]
[client-rpc-fops.c:2938:client3_3_lookup_cbk] 0-engine-client-2:
remote operation failed. Path: /
(----0001) [Transport endpoint is not
connected]
[2016-11-24 12:33:43.678747] E
[rpc-clnt.c:365:saved_frames_unwind] (-->
/lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642]
(-->
/lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e]
(-->
/lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e]
(-->
/lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4]
(--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0]
) 0-engine-client-2: forced unwinding frame type(GlusterFS
3.3) op(LOOKUP(27)) called at 2016-11-24 12:33:07.495178
(xid=0x82a1c)
[2016-11-24 12:33:43.678982] E
[rpc-clnt.c:365:saved_frames_unwind] (-->
/lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642]
(-->

Re: [ovirt-users] vm.conf on one of the node is missing

On Thu, Nov 24, 2016 at 3:23 PM, knarra  wrote:

> On 11/24/2016 07:47 PM, Simone Tiraboschi wrote:
>
>
>
> On Thu, Nov 24, 2016 at 3:06 PM, knarra  wrote:
>
>> On 11/24/2016 07:27 PM, Simone Tiraboschi wrote:
>>
>>
>>
>> On Thu, Nov 24, 2016 at 2:39 PM, knarra  wrote:
>>
>>> On 11/24/2016 06:56 PM, Simone Tiraboschi wrote:
>>>
>>>
>>>
>>> On Thu, Nov 24, 2016 at 2:08 PM, knarra  wrote:
>>>
 On 11/24/2016 06:15 PM, Simone Tiraboschi wrote:



 On Thu, Nov 24, 2016 at 1:26 PM, knarra  wrote:

> Hi,
>
> I have three nodes with glusterfs as storage domain. For some
> reason i see that vm.conf from /var/run/ovirt-hosted-engine-ha is
> missing and due to this on one of my host i see that Hosted Engine HA : 
> Not
> Active. Once i copy the file from some other node and restart
> ovirt-ha-broker and ovirt-ha-agent services everything works fine. But 
> then
> this happens again. Can some please help me identify why this happens.
> Below is the log i see in ovirt-ha-agent.logs.
>
>
> https://paste.fedoraproject.org/489120/79990345/
>
>
 Once the engine correctly imported the hosted-engine storage domain, a
 couple of OVF_STORE volumes will appear there.
 Every modification to the engine VM configuration will be written by
 the engine into that OVF_STORE, so all the ovirt-ha-agent running on the
 hosted-engine hosts will be able to re-start the engine VM with a coherent
 configuration.

 Till the engine imports the hosted-engine storage domain,
 ovirt-ha-agent will fall back to the initial vm.conf.

 In you case the OVF_STORE volume is there,
 but the agent fails extracting the engine VM configuration:
 MainThread::INFO::2016-11-24 17:55:04,914::ovf_store::112::
 ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
 Extracting Engine VM OVF from the OVF_STORE
 MainThread::INFO::2016-11-24 17:55:04,919::ovf_store::119::
 ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
 OVF_STORE volume path: /rhev/data-center/mnt/glusterSD/10.70.36.79:
 _engine/27f054c3-c245-4039-b42a-c28b37043016/i
 mages/fdf49778-9a06-49c6-bf7a-a0f12425911c/8c954add-6bcf-47f
 8-ac2e-4c85fc3f8699
 MainThread::ERROR::2016-11-24 17:55:04,928::ovf_store::124::
 ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
 Unable to extract HEVM OVF

 So it tries to rollback to the initial vm.conf, but also that one seams
 to miss some values and so the agent is failing:
 MainThread::ERROR::2016-11-24 17:55:04,974::agent::205::ovir
 t_hosted_engine_ha.agent.agent.Agent::(_run_agent) Error:
 ''Configuration value not found: 
 file=/var/run/ovirt-hosted-engine-ha/vm.conf,
 key=memSize'' - trying to restart agent

 Both of the issue seams storage related, could yuo please share your
 gluster logs?


>
>
> Thanks
>
> kasturi
>
>
 Hi Simone,

 Below [1] is the link for the sosreports on the first two hosts.
 The third host has some issue. Once it is up will give the sosreport from
 there as well.

>>>
>>> And the host where you see the initial issue was the third one?
>>>
>>> It is on the first host.
>>>
>>>
>>>
>>>
>> It seams that host1 is failing reading from the the hosted-engine storage
>> domain:
>>
>> [2016-11-24 12:33:43.678467] W [MSGID: 114031]
>> [client-rpc-fops.c:2938:client3_3_lookup_cbk] 0-engine-client-2: remote
>> operation failed. Path: / (----0001)
>> [Transport endpoint is not connected]
>> [2016-11-24 12:33:43.678747] E [rpc-clnt.c:365:saved_frames_unwind] (-->
>> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] (-->
>> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] (-->
>> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] (-->
>> /lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4]
>> (--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] )
>> 0-engine-client-2: forced unwinding frame type(GlusterFS 3.3)
>> op(LOOKUP(27)) called at 2016-11-24 12:33:07.495178 (xid=0x82a1c)
>> [2016-11-24 12:33:43.678982] E [rpc-clnt.c:365:saved_frames_unwind] (-->
>> /lib64/libglusterfs.so.0(_gf_log_callingfn+0x192)[0x7f077eba1642] (-->
>> /lib64/libgfrpc.so.0(saved_frames_unwind+0x1de)[0x7f077e96775e] (-->
>> /lib64/libgfrpc.so.0(saved_frames_destroy+0xe)[0x7f077e96786e] (-->
>> /lib64/libgfrpc.so.0(rpc_clnt_connection_cleanup+0x84)[0x7f077e968fc4]
>> (--> /lib64/libgfrpc.so.0(rpc_clnt_notify+0x120)[0x7f077e9698a0] )
>> 0-engine-client-2: forced unwinding frame type(GlusterFS 3.3)
>> op(LOOKUP(27)) called at 2016-11-24 12:33:08.770637 (xid=0x82a1d)
>> [2016-11-24 12:33:43.679001] W [MSGID: 114031]
>>

Re: [ovirt-users] ovirt 4.0.6 - live disk migration fails

2016-11-24 Thread Eyal Edri

Hi,
Can you tell which repos you're using? ( I'm guessing you're using the
snapshot nightly repos )
You can try using the new 'experimental' repos which gives you verified
RPMs which passed basic sanity testing:

http://resources.ovirt.org/repos/ovirt/experimental/master/latest.tested/

On Thu, Nov 24, 2016 at 11:57 AM, Maton, Brett 
wrote:

> If I try to migrate a disk of a running VM to another storage domain it
> fails with the following message:
>
> Operation Cancelled
>
> Error while executing action: User is not logged in.
>
>
> Migrating disks of stopped VM's continues to work.
>
> Probably a bug ?
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>


-- 
Eyal Edri
Associate Manager
RHV DevOps
EMEA ENG Virtualization R
Red Hat Israel

phone: +972-9-7692018
irc: eedri (on #tlv #rhev-dev #rhev-integ)
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] ovirt 4.0.6 - live disk migration fails

I'm using the repos from the ovirt-release40-pre-4.0.6-1.noarch rpm

On 24 November 2016 at 15:04, Eyal Edri  wrote:

> Hi,
> Can you tell which repos you're using? ( I'm guessing you're using the
> snapshot nightly repos )
> You can try using the new 'experimental' repos which gives you verified
> RPMs which passed basic sanity testing:
>
> http://resources.ovirt.org/repos/ovirt/experimental/master/latest.tested/
>
> On Thu, Nov 24, 2016 at 11:57 AM, Maton, Brett 
> wrote:
>
>> If I try to migrate a disk of a running VM to another storage domain it
>> fails with the following message:
>>
>> Operation Cancelled
>>
>> Error while executing action: User is not logged in.
>>
>>
>> Migrating disks of stopped VM's continues to work.
>>
>> Probably a bug ?
>>
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/users
>>
>>
>
>
> --
> Eyal Edri
> Associate Manager
> RHV DevOps
> EMEA ENG Virtualization R
> Red Hat Israel
>
> phone: +972-9-7692018
> irc: eedri (on #tlv #rhev-dev #rhev-integ)
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Re: [ovirt-users] ovirt 4.0.6 - live disk migration fails