[ovirt-users] Setting up oVirt for the first time

2015-12-01 Thread Gervais de Montbrun
Hi All,

I've done a lot of reading and lots of comparison of different hypervisors and 
tools and have decided that oVirt would be the best option. My initial use case 
is a not too new server that I will setup to run multiple development 
environments for the devs here, but I wanted something that will scale out to 
production as the vm infrastructure, hardware, etc. grows here.

I'll soon have a second server to run my vm's on and want to setup a 
self-hosted engine. I'm trying to find the most recent version of a how-to on 
the same. I found a presentation showing off how much easier it is to do this 
in oVirt 3.6 but can't find the correct docs. I seem to always end up with 3.5 
or older versions. Can someone point me at a how-to of how to best achieve 
this. 

Also, while I have you all here... :-)
Is it possible to setup the hypervisor hosts themselves as NFS servers to 
create Storage (I realize that this will play havoc with the HA). We do have an 
NFS server that we will be upgrading to add storage and faster drives, but I 
was thinking that I may be able to use the internal storage of the hypervisors 
themselves as a short term stopgap and then migrate vm's to the upgraded NFS 
server later. Will that even work, or will it break somehow?

Any advice for a new install would be welcome.

Thank you 

Cheers,
Gervais

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Setting up oVirt for the first time

2015-12-01 Thread Gervais de Montbrun
Simone,

Thanks so much for the quick reply with the link. Much obliged.

How far off is hyper-convergence? If it is close, I could possibly wait.

Cheers,
Gervais

> On Dec 1, 2015, at 6:47 PM, Simone Tiraboschi <stira...@redhat.com> wrote:
> 
> 
> 
> On Tue, Dec 1, 2015 at 10:59 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hi All,
> 
> I've done a lot of reading and lots of comparison of different hypervisors 
> and tools and have decided that oVirt would be the best option. My initial 
> use case is a not too new server that I will setup to run multiple 
> development environments for the devs here, but I wanted something that will 
> scale out to production as the vm infrastructure, hardware, etc. grows here.
> 
> I'll soon have a second server to run my vm's on and want to setup a 
> self-hosted engine. I'm trying to find the most recent version of a how-to on 
> the same. I found a presentation showing off how much easier it is to do this 
> in oVirt 3.6 but can't find the correct docs. I seem to always end up with 
> 3.5 or older versions. Can someone point me at a how-to of how to best 
> achieve this.
> 
> http://www.ovirt.org/Features/HEApplianceFlow 
> <http://www.ovirt.org/Features/HEApplianceFlow>
>  
>  
> Also, while I have you all here... :-)
> Is it possible to setup the hypervisor hosts themselves as NFS servers to 
> create Storage (I realize that this will play havoc with the HA). We do have 
> an NFS server that we will be upgrading to add storage and faster drives, but 
> I was thinking that I may be able to use the internal storage of the 
> hypervisors themselves as a short term stopgap and then migrate vm's to the 
> upgraded NFS server later. Will that even work, or will it break somehow?
> 
> What you are asking for is generally called hyper-convergence. We tried to 
> have it for 3.6 with glusterfs on each node but it wasn't valuated stable 
> enough to  be released. We are still working on that for the next release.
>  
> Any advice for a new install would be welcome.
> 
> Thank you 
> 
> Cheers,
> Gervais
> 
> 
> ___
> Users mailing list
> Users@ovirt.org <mailto:Users@ovirt.org>
> http://lists.ovirt.org/mailman/listinfo/users 
> <http://lists.ovirt.org/mailman/listinfo/users>
> 
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Updated Invitation: [Deep dive] Host Network QoS - oVirt 3.6 @ Thu Nov 26, 2015 5pm - 6pm (ibar...@redhat.com)

2015-11-27 Thread Gervais de Montbrun
Did this event date change? I tried to participate and it never seemed to 
start. If it was recorded, I'd love the link.

Cheers,
Gervais

> On Nov 24, 2015, at 9:08 AM, ibar...@redhat.com wrote:
> 
> This event has been changed.
> more details »
> [Deep dive] Host Network QoS - oVirt 3.6
> Hangouts on air: https://plus.google.com/events/c3la9vdse911atq991qflogtq0g
> you tube link: https://plus.google.com/events/c3la9vdse911atq991qflogtq0g
> When
> Changed: Thu Nov 26, 2015 5pm – 6pm Jerusalem
> Calendar
> ibar...@redhat.com
> Who
> • 
> ibar...@redhat.com - organizer
> • 
> users@ovirt.org
> Going?   Yes - Maybe - Nomore options »
> Invitation from Google Calendar
> 
> You are receiving this courtesy email at the account users@ovirt.org because 
> you are an attendee of this event.
> 
> To stop receiving future updates for this event, decline this event. 
> Alternatively you can sign up for a Google account at 
> https://www.google.com/calendar/ and control your notification settings for 
> your entire calendar.
> 
> Forwarding this invitation could allow any recipient to modify your RSVP 
> response. Learn More.
> 
>  Attachment.ics>___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Adding another host to my cluster

2016-05-11 Thread Gervais de Montbrun
Hi Folks,

I hate to reply to my own message, but I'm really hoping someone can help me 
with my issue
http://lists.ovirt.org/pipermail/users/2016-May/039690.html 
<http://lists.ovirt.org/pipermail/users/2016-May/039690.html>

Does anyone have a suggestion for me? If there is any more information that I 
can provide that would help you to help me, please advise.

Cheers,
Gervais



> On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> 
> Hi All,
> 
> I'm trying to add a third host into my oVirt cluster. I have hosted engine 
> setup on the first two. It's failing to finish the hosted-engine --deploy on 
> this third host. I wiped the server and did a CentOS 7 minimum install and 
> ran it again to have a clean machine.
> 
> My setup:
> CentOS 7 clean install
> yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm 
> <http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm>
> yum install -y ovirt-hosted-engine-setup
> yum upgrade -y && reboot
> systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl 
> disable firewalld ; systemctl stop firewalld
> hosted-engine --deploy
> 
> hosted-engine --deploy always throws an error:
> [ ERROR ] The VDSM host was found in a failed state. Please check engine and 
> bootstrap installation logs.
> [ ERROR ] Unable to add Cultivar2 to the manager
> and then echo's
> [ INFO  ] Waiting for VDSM hardware info
> ...
> [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 
> seconds
> [ INFO  ] Stage: Clean up
> [ INFO  ] Generating answer file 
> '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf'
> [ INFO  ] Stage: Pre-termination
> [ INFO  ] Stage: Termination
> [ ERROR ] Hosted Engine deployment failed: this system is not reliable, 
> please check the issue, fix and redeploy
>  Log file is located at 
> /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log
> 
> Full output of hosted-engine --deploy included in the attached zip file.
> I've also included vdsm.log (There is more than one tries worth of tries in 
> there).
> You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log 
> listed above.
> 
> This is my "test" setup. Cultivar0 is my first host and my nfs server for 
> storage. I have two hosts in the setup already and everything is working 
> fine. The host does show up in the oVirt admin, but shows "Installed Failed"
> 
> 
> Trying to reinstall from within the interface just fails again.
> 
> The ovirt bridge interface is not configured and there are no config files in 
> /etc/sysconfi/network-scripts related to ovirt.
> 
> OS:
> [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release 
> CentOS Linux release 7.2.1511 (Core) 
> 
> [root@cultivar2 ovirt-hosted-engine-setup]# uname -a
> Linux cultivar2.grove.silverorange.com 
> <http://cultivar2.grove.silverorange.com/> 3.10.0-327.13.1.el7.x86_64 #1 SMP 
> Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
> 
> Versions:
> [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt
> libgovirt-0.3.3-1.el7_2.1.x86_64
> ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch
> ovirt-host-deploy-1.4.1-1.el7.centos.noarch
> ovirt-vmconsole-1.0.0-1.el7.centos.noarch
> ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch
> ovirt-release36-007-1.noarch
> ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch
> ovirt-setup-lib-1.0.1-1.el7.centos.noarch
> ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch
> [root@cultivar2 ovirt-hosted-engine-setup]# 
> [root@cultivar2 ovirt-hosted-engine-setup]# 
> [root@cultivar2 ovirt-hosted-engine-setup]# 
> [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt
> libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64
> virt-viewer-2.0-6.el7.x86_64
> libgovirt-0.3.3-1.el7_2.1.x86_64
> libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64
> ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch
> fence-virt-0.3.2-2.el7.x86_64
> virt-what-1.13-6.el7.x86_64
> libvirt-python-1.2.17-2.el7.x86_64
> libvirt-daemon-1.2.17-13.el7_2.4.x86_64
> libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64
> libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64
> libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64
> libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64
> libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64
> ovirt-host-deploy-1.4.1-1.el7.centos.noarch
> virt-v2v-1.28.1-1.55.el7.centos.2.x86_64
> ovirt-vmconsole-1.0.0-1.el7.centos.noarch
> ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch
> libvirt-client-1.2.17-13.el7_2.4.x86_64
> li

Re: [ovirt-users] Adding another host to my cluster

2016-05-12 Thread Gervais de Montbrun
Hi Wee
(and others)

Thanks for the reply. I tried what you suggested, but I am in the exact same 
state. :-(

I don't want to completely remove my hosted engine setup as it is working on 
the two other hosts in my cluster. I did not run the rm -rf stes listed here 
(https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install
 
<https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>)
 that would wipe my hosted_engine nfs mount. If you know that this is 100% 
necessary, please let me know.

I did:
hosted-engine --clean-metadata --force-cleanup --host-id=3
run the bash script to remove all of the ovirt packages and config files
reinstalled ovirt-hosted-engine-setup
ran "hosted-engine --deploy"

I'm back exactly where I started. Is there a way to run just the network 
configuration part of the deploy? 

Since the last attempt, I did upgrade my hosted engine and my cluster is now 
running oVirt 3.6.5.

Cheers,
Gervais



> On May 12, 2016, at 11:50 AM, Wee Sritippho <we...@forest.go.th> wrote:
> 
> Hi,
> 
> I used to have a similar problem where one of my host can't be deployed due 
> to the absence of ovirtmgmt bridge. Simone said it's a bug ( 
> https://bugzilla.redhat.com/1323465 <https://bugzilla.redhat.com/1323465> ) 
> which would be fixed in 3.6.6.
> 
> This is what I've done to solve it:
> 
> 1. In the web UI, set the failed host to maintenance.
> 2. Remove it.
> 3. In that host, run a script from 
> https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install
>  
> <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>
> 4. Install ovirt-hosted-engine-setup again.
> 5. Redeploy again.
> 
> Hope that helps
> 
> On 11 พฤษภาคม 2016 22 นาฬิกา 48 นาที 58 วินาที GMT+07:00, Gervais de Montbrun 
> <gerv...@demontbrun.com> wrote:
> Hi Folks,
> 
> I hate to reply to my own message, but I'm really hoping someone can help me 
> with my issue
> http://lists.ovirt.org/pipermail/users/2016-May/039690.html 
> <http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
> 
> Does anyone have a suggestion for me? If there is any more information that I 
> can provide that would help you to help me, please advise.
> 
> Cheers,
> Gervais
> 
> 
> 
>> On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gerv...@demontbrun.com 
>> <mailto:gerv...@demontbrun.com>> wrote:
>> 
>> Hi All,
>> 
>> I'm trying to add a third host into my oVirt cluster. I have hosted engine 
>> setup on the first two. It's failing to finish the hosted-engine --deploy on 
>> this third host. I wiped the server and did a CentOS 7 minimum install and 
>> ran it again to have a clean machine.
>> 
>> My setup:
>> CentOS 7 clean install
>> yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm 
>> <http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm>
>> yum install -y ovirt-hosted-engine-setup
>> yum upgrade -y && reboot
>> systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl 
>> disable firewalld ; systemctl stop firewalld
>> hosted-engine --deploy
>> 
>> hosted-engine --deploy always throws an error:
>> [ ERROR ] The VDSM host was found in a failed state. Please check engine and 
>> bootstrap installation logs.
>> [ ERROR ] Unable to add Cultivar2 to the manager
>> and then echo's
>> [ INFO  ] Waiting for VDSM hardware info
>> ...
>> [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 
>> 120 seconds
>> [ INFO  ] Stage: Clean up
>> [ INFO  ] Generating answer file 
>> '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf'
>> [ INFO  ] Stage: Pre-termination
>> [ INFO  ] Stage: Termination
>> [ ERROR ] Hosted Engine deployment failed: this system is not reliable, 
>> please check the issue, fix and redeploy
>>  Log file is located at 
>> /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log
>> 
>> Full output of hosted-engine --deploy included in the attached zip file.
>> I've also included vdsm.log (There is more than one tries worth of tries in 
>> there).
>> You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log 
>> listed above.
>> 
>> This is my "test" setup. Cultivar0 is my first host and my nfs server for 
>> storage. I have two hosts in the setup already and everything is working 
>> fine. The host does show up in the oVirt admin, but shows "Installed Failed"
>> 
>> 
>> Trying 

Re: [ovirt-users] Adding another host to my cluster

2016-05-16 Thread Gervais de Montbrun
Hi Roy,

I've added /var/log/ovirt-engine/engine.log to my DropBox: 
https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm?dl=0 
<https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm?dl=0>

When I look in the engine.log, I see the error "unable to load number from 
serial.txt"
I did a find for serial.txt and the only file that comes up is: 
/etc/pki/ovirt-engine/serial.txt 
(Makes sense that this is the file it wants to read)
When I open /etc/pki/ovirt-engine/serial.txt, it is an empty file. 
[root@cultivar ovirt-engine]# cat /etc/pki/ovirt-engine/serial.txt
[root@cultivar ovirt-engine]# 

What should be in serial.txt?

I don't understand where to find the other log that you mention in your email 
below. I don't see any paths listed in the UI when I click on the failed event 
for certificate enrollment.

Cheers,
Gervais



> On May 15, 2016, at 5:10 PM, Roy Golan <rgo...@redhat.com 
> <mailto:rgo...@redhat.com>> wrote:
> 
> 
> 
> On Sun, May 15, 2016 at 1:32 AM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hey Folks,
> 
> I'm still stuck here... I believe the issue may be because the Enrolling 
> certificates stage is failing...
> 
> Any idea's how to sort this out?
> 
> I see:
>   Installing Host Cultivar2. Enrolling certificate
>   Failed to install Host Cultivar2. Certificate enrollment failed.
>   Host Cultivar2 installation failed. Certificate enrollement failed.
> 
> Any attempt to re-install fails at this step.
> 
> Cheers,
> Gervais
> 
> 
> 
> I also see in the logs that the manager (ovirt-engine) rejected the host 
> addition to the setup. Please add the /var/log/ovirt-engine/engine.log from 
> the engine machine. Also, the engine machine has the install log - open one 
> of the Event logs in the UI that says you host installation failed and you 
> will see it retrieved the install log to a path under your engine. Please 
> include this as well.
>  
> 
>> On May 13, 2016, at 1:36 PM, Gervais de Montbrun <gerv...@demontbrun.com 
>> <mailto:gerv...@demontbrun.com>> wrote:
>> 
>> Hi Nir,
>> 
>> Thank you for you input.
>> 
>> It was suggest that I try starting vdsm from the shell so I could see the 
>> output in an effort to solve my issue: 
>> http://lists.ovirt.org/pipermail/users/2016-May/039690.html 
>> <http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
>> 
>> Cheers,
>> Gervais
>> 
>> 
>> 
>>> On May 13, 2016, at 12:12 PM, Nir Soffer <nsof...@redhat.com 
>>> <mailto:nsof...@redhat.com>> wrote:
>>> 
>>> On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun
>>> <gerv...@demontbrun.com <mailto:gerv...@demontbrun.com>> wrote:
>>>> Hi Charles,
>>>> 
>>>> I think the problem I am having is due to the setup failing and not
>>>> something in vdsm configs as I have never gotten this server to start up
>>>> properly and the BRIDGE ethernet interface + ovirt routes are not setup.
>>>> 
>>>> I put the logs here:
>>>> https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0 
>>>> <https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0>
>>>> 
>>>> hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it
>>>> failed
>>>> vdsm.tar.gz # /var/log/vdsm
>>>> 
>>>> Output from running vdsm from the command line:
>>>> 
>>>> [root@cultivar2 log]# su -s /bin/bash vdsm
>>> 
>>> This cannot work unless supervdsmd is running...
>>> 
>>>> [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm
>>>> (PID: 6521) I am the actual vdsm 4.17.26-1.el7
>>>> cultivar2.grove.silverorange.com 
>>>> <http://cultivar2.grove.silverorange.com/> (3.10.0-327.el7.x86_64)
>>>> VDSM will run with cpu affinity: frozenset([1])
>>>> /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None)
>>>> SUCCESS:  = '';  = 0
>>>> Starting scheduler vdsm.Scheduler
>>>> started
>>>> Run and protect:
>>>> registerDomainStateChangeCallback(callbackFunc=>>> 0x381b158>)
>>>> Run and protect: registerDomainStateChangeCallback, Return response: None
>>>> Trying to connect to Super Vdsm
>>>> Preparing MOM interface
>>>> Using named unix socket /var/run/vdsm/mom-vdsm.sock
>>>> Unregistering all secrests
>>>> trying 

Re: [ovirt-users] Adding another host to my cluster

2016-05-16 Thread Gervais de Montbrun
Roy,

Thank you so much for asking for the logs on the engine. The serial.txt bein 
blank was my issue. I found a solution in the following thread online here: 
http://lists.ovirt.org/pipermail/users/2014-November/028870.html
I did the exact same thing that Dinuwan did to solve his issue back in 2014 and 
was able to enrol my new server and install the hosted-engine just fine on it.

I am not certain what caused serial.txt to be empty and am will to share logs 
to anyone from the oVirt team who might want to see them. For my purposes, I'm 
considering this issue solved.

Thanks to everyone who offered advice and suggestions.

Cheers,
Gervais



> On May 16, 2016, at 2:04 PM, Gervais de Montbrun <gerv...@demontbrun.com> 
> wrote:
> 
> Hi Roy,
> 
> I've added /var/log/ovirt-engine/engine.log to my DropBox: 
> https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm?dl=0
>  
> <https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm?dl=0>
> 
> When I look in the engine.log, I see the error "unable to load number from 
> serial.txt"
> I did a find for serial.txt and the only file that comes up is: 
> /etc/pki/ovirt-engine/serial.txt 
> (Makes sense that this is the file it wants to read)
> When I open /etc/pki/ovirt-engine/serial.txt, it is an empty file. 
>   [root@cultivar ovirt-engine]# cat /etc/pki/ovirt-engine/serial.txt
>   [root@cultivar ovirt-engine]# 
> 
> What should be in serial.txt?
> 
> I don't understand where to find the other log that you mention in your email 
> below. I don't see any paths listed in the UI when I click on the failed 
> event for certificate enrollment.
> 
> Cheers,
> Gervais
> 
> 
> 
>> On May 15, 2016, at 5:10 PM, Roy Golan <rgo...@redhat.com 
>> <mailto:rgo...@redhat.com>> wrote:
>> 
>> 
>> 
>> On Sun, May 15, 2016 at 1:32 AM, Gervais de Montbrun <gerv...@demontbrun.com 
>> <mailto:gerv...@demontbrun.com>> wrote:
>> Hey Folks,
>> 
>> I'm still stuck here... I believe the issue may be because the Enrolling 
>> certificates stage is failing...
>> 
>> Any idea's how to sort this out?
>> 
>> I see:
>>  Installing Host Cultivar2. Enrolling certificate
>>  Failed to install Host Cultivar2. Certificate enrollment failed.
>>  Host Cultivar2 installation failed. Certificate enrollement failed.
>> 
>> Any attempt to re-install fails at this step.
>> 
>> Cheers,
>> Gervais
>> 
>> 
>> 
>> I also see in the logs that the manager (ovirt-engine) rejected the host 
>> addition to the setup. Please add the /var/log/ovirt-engine/engine.log from 
>> the engine machine. Also, the engine machine has the install log - open one 
>> of the Event logs in the UI that says you host installation failed and you 
>> will see it retrieved the install log to a path under your engine. Please 
>> include this as well.
>>  
>> 
>>> On May 13, 2016, at 1:36 PM, Gervais de Montbrun <gerv...@demontbrun.com 
>>> <mailto:gerv...@demontbrun.com>> wrote:
>>> 
>>> Hi Nir,
>>> 
>>> Thank you for you input.
>>> 
>>> It was suggest that I try starting vdsm from the shell so I could see the 
>>> output in an effort to solve my issue: 
>>> http://lists.ovirt.org/pipermail/users/2016-May/039690.html 
>>> <http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
>>> 
>>> Cheers,
>>> Gervais
>>> 
>>> 
>>> 
>>>> On May 13, 2016, at 12:12 PM, Nir Soffer <nsof...@redhat.com 
>>>> <mailto:nsof...@redhat.com>> wrote:
>>>> 
>>>> On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun
>>>> <gerv...@demontbrun.com <mailto:gerv...@demontbrun.com>> wrote:
>>>>> Hi Charles,
>>>>> 
>>>>> I think the problem I am having is due to the setup failing and not
>>>>> something in vdsm configs as I have never gotten this server to start up
>>>>> properly and the BRIDGE ethernet interface + ovirt routes are not setup.
>>>>> 
>>>>> I put the logs here:
>>>>> https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0 
>>>>> <https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0>
>>>>> 
>>>>> hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it
>>>>> failed
>>>>> vdsm.tar.gz # /var/log/vdsm
>>>>> 
>>>>> Output from running 

Re: [ovirt-users] Adding another host to my cluster

2016-05-14 Thread Gervais de Montbrun
Hey Folks,

I'm still stuck here... I believe the issue may be because the Enrolling 
certificates stage is failing...

Any idea's how to sort this out?

I see:
Installing Host Cultivar2. Enrolling certificate
Failed to install Host Cultivar2. Certificate enrollment failed.
Host Cultivar2 installation failed. Certificate enrollement failed.

Any attempt to re-install fails at this step.

Cheers,
Gervais



> On May 13, 2016, at 1:36 PM, Gervais de Montbrun <gerv...@demontbrun.com> 
> wrote:
> 
> Hi Nir,
> 
> Thank you for you input.
> 
> It was suggest that I try starting vdsm from the shell so I could see the 
> output in an effort to solve my issue: 
> http://lists.ovirt.org/pipermail/users/2016-May/039690.html 
> <http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
> 
> Cheers,
> Gervais
> 
> 
> 
>> On May 13, 2016, at 12:12 PM, Nir Soffer <nsof...@redhat.com 
>> <mailto:nsof...@redhat.com>> wrote:
>> 
>> On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun
>> <gerv...@demontbrun.com <mailto:gerv...@demontbrun.com>> wrote:
>>> Hi Charles,
>>> 
>>> I think the problem I am having is due to the setup failing and not
>>> something in vdsm configs as I have never gotten this server to start up
>>> properly and the BRIDGE ethernet interface + ovirt routes are not setup.
>>> 
>>> I put the logs here:
>>> https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0 
>>> <https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0>
>>> 
>>> hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it
>>> failed
>>> vdsm.tar.gz # /var/log/vdsm
>>> 
>>> Output from running vdsm from the command line:
>>> 
>>> [root@cultivar2 log]# su -s /bin/bash vdsm
>> 
>> This cannot work unless supervdsmd is running...
>> 
>>> [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm
>>> (PID: 6521) I am the actual vdsm 4.17.26-1.el7
>>> cultivar2.grove.silverorange.com <http://cultivar2.grove.silverorange.com/> 
>>> (3.10.0-327.el7.x86_64)
>>> VDSM will run with cpu affinity: frozenset([1])
>>> /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None)
>>> SUCCESS:  = '';  = 0
>>> Starting scheduler vdsm.Scheduler
>>> started
>>> Run and protect:
>>> registerDomainStateChangeCallback(callbackFunc=>> 0x381b158>)
>>> Run and protect: registerDomainStateChangeCallback, Return response: None
>>> Trying to connect to Super Vdsm
>>> Preparing MOM interface
>>> Using named unix socket /var/run/vdsm/mom-vdsm.sock
>>> Unregistering all secrests
>>> trying to connect libvirt
>>> recovery: started
>>> Setting channels' timeout to 30 seconds.
>>> Starting VM channels listener thread.
>>> Listening at 0.0.0.0:54321 <http://0.0.0.0:54321/>
>>> Adding detector 
>>> recovery: completed in 0s
>>> Adding detector 
>>> Starting executor
>>> Starting worker jsonrpc.Executor/0
>>> Worker started
>>> Starting worker jsonrpc.Executor/1
>>> Worker started
>>> Starting worker jsonrpc.Executor/2
>>> Worker started
>>> Starting worker jsonrpc.Executor/3
>>> Worker started
>>> Starting worker jsonrpc.Executor/4
>>> Worker started
>>> Starting worker jsonrpc.Executor/5
>>> Worker started
>>> Starting worker jsonrpc.Executor/6
>>> Worker started
>>> Starting worker jsonrpc.Executor/7
>>> Worker started
>>> XMLRPC server running
>>> Starting executor
>>> Starting worker periodic/0
>>> Worker started
>>> Starting worker periodic/1
>>> Worker started
>>> Starting worker periodic/2
>>> Worker started
>>> Starting worker periodic/3
>>> Worker started
>>> trying to connect libvirt
>>> Panic: Connect to supervdsm service failed: [Errno 2] No such file or
>>> directory
>>> Traceback (most recent call last):
>>> File "/usr/share/vdsm/supervdsm.py", line 78, in _connect
>>>  utils.retry(self._manager.connect, Exception, timeout=60, tries=3)
>>> File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry
>>>  return func()
>>> File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in
>>> connect
>>>  conn = Client(self._address, authkey=self._authkey)
>>> File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in
>>> Client
>>>  c = SocketClient(address)
>>> File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in
>>> SocketClient
>>>  s.connect(address)
>>> File "/usr/lib64/python2.7/socket.py", line 224, in meth
>>>  return getattr(self._sock,name)(*args)
>>> error: [Errno 2] No such file or directory
>> 
>> Vdsm tries to connect to supervdsmd on startup, and if it is not running
>> it will fail.
>> 
>> You can do:
>> 
>> systemctl start supervdsmd
>> 
>> And they you can run vdsmd from the shell.
>> 
>> But why do you need to run vdsm from the shell?
>> 
>> Nir
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Adding another host to my cluster

2016-05-13 Thread Gervais de Montbrun
Hi Charles,

I think the problem I am having is due to the setup failing and not
something in vdsm configs as I have never gotten this server to start up
properly and the BRIDGE ethernet interface + ovirt routes are not setup.

I put the logs here:
https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0

hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it
failed
vdsm.tar.gz # /var/log/vdsm

Output from running vdsm from the command line:

[root@cultivar2 log]# su -s /bin/bash vdsm
[vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm
(PID: 6521) I am the actual vdsm 4.17.26-1.el7
cultivar2.grove.silverorange.com (3.10.0-327.el7.x86_64)
VDSM will run with cpu affinity: frozenset([1])
/usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None)
SUCCESS:  = '';  = 0
Starting scheduler vdsm.Scheduler
started
Run and protect:
registerDomainStateChangeCallback(callbackFunc=)
Run and protect: registerDomainStateChangeCallback, Return response: None
Trying to connect to Super Vdsm
Preparing MOM interface
Using named unix socket /var/run/vdsm/mom-vdsm.sock
Unregistering all secrests
trying to connect libvirt
recovery: started
Setting channels' timeout to 30 seconds.
Starting VM channels listener thread.
Listening at 0.0.0.0:54321
Adding detector 
recovery: completed in 0s
Adding detector 
Starting executor
Starting worker jsonrpc.Executor/0
Worker started
Starting worker jsonrpc.Executor/1
Worker started
Starting worker jsonrpc.Executor/2
Worker started
Starting worker jsonrpc.Executor/3
Worker started
Starting worker jsonrpc.Executor/4
Worker started
Starting worker jsonrpc.Executor/5
Worker started
Starting worker jsonrpc.Executor/6
Worker started
Starting worker jsonrpc.Executor/7
Worker started
XMLRPC server running
Starting executor
Starting worker periodic/0
Worker started
Starting worker periodic/1
Worker started
Starting worker periodic/2
Worker started
Starting worker periodic/3
Worker started
trying to connect libvirt
Panic: Connect to supervdsm service failed: [Errno 2] No such file or
directory
Traceback (most recent call last):
  File "/usr/share/vdsm/supervdsm.py", line 78, in _connect
utils.retry(self._manager.connect, Exception, timeout=60, tries=3)
  File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry
return func()
  File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in
connect
conn = Client(self._address, authkey=self._authkey)
  File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in
Client
c = SocketClient(address)
  File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in
SocketClient
s.connect(address)
  File "/usr/lib64/python2.7/socket.py", line 224, in meth
return getattr(self._sock,name)(*args)
error: [Errno 2] No such file or directory
Killed


Thanks for the help. It's really appreciated.

Cheers,
Gervais

On Fri, May 13, 2016 at 12:55 AM, Charles Tassell <ctass...@gmail.com>
wrote:

> Hi Gervais,
>
>   Hmm, can you tar up the logfiles (/var/log/vdsm/* on the host you are
> installing on) and put them somewhere to look at?  Also, I found that
> starting VDSM from the command line is useful as it sometimes spits out
> error messages that don't show up in the logs.  I think the command I used
> was:
> su -s /bin/bash vdsm
> python /usr/share/vdsm/vdsm
>
> My problem was that I customized the logging settings in /etc/vdsm/*conf
> to try and tone down the debugging stuff and had a syntax error.
>
>
> On 16-05-12 10:24 PM, Gervais de Montbrun wrote:
>
> Hi Charles,
>
> Thanks for the suggestion.
>
> I cleaned up again using the bash script from the
> recoving-from-failed-install link below, then reinstalled (yum install
> ovirt-hosted-engine-setup).
>
> I enabled NetworkManager and firewalld as you suggested. The install stops
> very early on with an error:
> [ ERROR ] Failed to execute stage 'Programs detection': hosted-engine
> cannot be deployed while NetworkManager is running, please stop and disable
> it before proceeding
>
> I disabled and stopped NetworkManager and tried again. Same result. :(
>
> Any more guesses?
>
> Cheers,
> Gervais
>
>
>
> On May 12, 2016, at 9:08 PM, Charles Tassell <ctass...@gmail.com> wrote:
>
> Hey Gervais,
>
> Try enabling NetworkManager and firewalld before doing the hosted-engine
> --deploy.  I have run into problems with oVirt trying to perform tasks on
> hosts where firewalld is disabled, so maybe you are running into a similar
> problem.  Also, I think the setup script will disable NetworkManager if it
> needs to.  I know I didn't manually disable it on any of the boxes I
> installed on.
>
> On 16-05-12 04:49 PM, users-requ...@ovirt.org wrote:
>
> Message: 1
> Date: Thu, 12 May 2016 14:22:

Re: [ovirt-users] Adding another host to my cluster

2016-05-12 Thread Gervais de Montbrun
Hi Charles,

Thanks for the suggestion.

I cleaned up again using the bash script from the
recoving-from-failed-install link below, then reinstalled (yum install
ovirt-hosted-engine-setup).

I enabled NetworkManager and firewalld as you suggested. The install stops
very early on with an error:
[ ERROR ] Failed to execute stage 'Programs detection': hosted-engine
cannot be deployed while NetworkManager is running, please stop and disable
it before proceeding

I disabled and stopped NetworkManager and tried again. Same result. :(

Any more guesses?

Cheers,
Gervais



On May 12, 2016, at 9:08 PM, Charles Tassell <ctass...@gmail.com> wrote:

Hey Gervais,

Try enabling NetworkManager and firewalld before doing the hosted-engine
--deploy.  I have run into problems with oVirt trying to perform tasks on
hosts where firewalld is disabled, so maybe you are running into a similar
problem.  Also, I think the setup script will disable NetworkManager if it
needs to.  I know I didn't manually disable it on any of the boxes I
installed on.

On 16-05-12 04:49 PM, users-requ...@ovirt.org wrote:

Message: 1
Date: Thu, 12 May 2016 14:22:12 -0300
From: Gervais de Montbrun <gerv...@demontbrun.com>
To: Wee Sritippho <we...@forest.go.th>
Cc: users <users@ovirt.org>
Subject: Re: [ovirt-users] Adding another host to my cluster
Message-ID: <28b7fc74-5c52-4f60-b9f3-39a36621a...@demontbrun.com>
Content-Type: text/plain; charset="utf-8"

Hi Wee
(and others)

Thanks for the reply. I tried what you suggested, but I am in the exact
same state. :-(

I don't want to completely remove my hosted engine setup as it is working
on the two other hosts in my cluster. I did not run the rm -rf stes listed
here (
https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install
<
https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>)
that would wipe my hosted_engine nfs mount. If you know that this is 100%
necessary, please let me know.

I did:
hosted-engine --clean-metadata --force-cleanup --host-id=3
run the bash script to remove all of the ovirt packages and config files
reinstalled ovirt-hosted-engine-setup
ran "hosted-engine --deploy"

I'm back exactly where I started. Is there a way to run just the network
configuration part of the deploy?

Since the last attempt, I did upgrade my hosted engine and my cluster is
now running oVirt 3.6.5.

Cheers,
Gervais



On May 12, 2016, at 11:50 AM, Wee Sritippho <we...@forest.go.th> wrote:

Hi,

I used to have a similar problem where one of my host can't be deployed due
to the absence of ovirtmgmt bridge. Simone said it's a bug (
https://bugzilla.redhat.com/1323465 <https://bugzilla.redhat.com/1323465> )
which would be fixed in 3.6.6.

This is what I've done to solve it:

1. In the web UI, set the failed host to maintenance.
2. Remove it.
3. In that host, run a script from
https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install
<
https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install
>
4. Install ovirt-hosted-engine-setup again.
5. Redeploy again.

Hope that helps

On 11 ??? 2016 22 ?? 48  58 ?? GMT+07:00, Gervais de
Montbrun <gerv...@demontbrun.com> wrote:
Hi Folks,

I hate to reply to my own message, but I'm really hoping someone can help
me with my issue
http://lists.ovirt.org/pipermail/users/2016-May/039690.html <
http://lists.ovirt.org/pipermail/users/2016-May/039690.html>

Does anyone have a suggestion for me? If there is any more information that
I can provide that would help you to help me, please advise.

Cheers,
Gervais



On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gerv...@demontbrun.com
<mailto:gerv...@demontbrun.com>> wrote:

Hi All,

I'm trying to add a third host into my oVirt cluster. I have hosted engine
setup on the first two. It's failing to finish the hosted-engine --deploy
on this third host. I wiped the server and did a CentOS 7 minimum install
and ran it again to have a clean machine.

My setup:
CentOS 7 clean install
yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm <
http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm>
yum install -y ovirt-hosted-engine-setup
yum upgrade -y && reboot
systemctl disable NetworkManager ; systemctl stop NetworkManager ;
systemctl disable firewalld ; systemctl stop firewalld
hosted-engine --deploy

hosted-engine --deploy always throws an error:
[ ERROR ] The VDSM host was found in a failed state. Please check engine
and bootstrap installation logs.
[ ERROR ] Unable to add Cultivar2 to the manager
and then echo's
[ INFO  ] Waiting for VDSM hardware info
...
[ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within
120 seconds
[ INFO  ] Stage: Clean up
[ INFO  ] Generating answer file
'/var/lib/ovirt-hosted-engine-setup/answers/answers-201605091311

Re: [ovirt-users] Adding another host to my cluster

2016-05-13 Thread Gervais de Montbrun
Hi Nir,

Thank you for you input.

It was suggest that I try starting vdsm from the shell so I could see the
output in an effort to solve my issue:
http://lists.ovirt.org/pipermail/users/2016-May/039690.html

Cheers,
Gervais



On May 13, 2016, at 12:12 PM, Nir Soffer <nsof...@redhat.com> wrote:

On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun
<gerv...@demontbrun.com> wrote:

Hi Charles,

I think the problem I am having is due to the setup failing and not
something in vdsm configs as I have never gotten this server to start up
properly and the BRIDGE ethernet interface + ovirt routes are not setup.

I put the logs here:
https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0

hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it
failed
vdsm.tar.gz # /var/log/vdsm

Output from running vdsm from the command line:

[root@cultivar2 log]# su -s /bin/bash vdsm


This cannot work unless supervdsmd is running...

[vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm
(PID: 6521) I am the actual vdsm 4.17.26-1.el7
cultivar2.grove.silverorange.com (3.10.0-327.el7.x86_64)
VDSM will run with cpu affinity: frozenset([1])
/usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None)
SUCCESS:  = '';  = 0
Starting scheduler vdsm.Scheduler
started
Run and protect:
registerDomainStateChangeCallback(callbackFunc=)
Run and protect: registerDomainStateChangeCallback, Return response: None
Trying to connect to Super Vdsm
Preparing MOM interface
Using named unix socket /var/run/vdsm/mom-vdsm.sock
Unregistering all secrests
trying to connect libvirt
recovery: started
Setting channels' timeout to 30 seconds.
Starting VM channels listener thread.
Listening at 0.0.0.0:54321
Adding detector 
recovery: completed in 0s
Adding detector 
Starting executor
Starting worker jsonrpc.Executor/0
Worker started
Starting worker jsonrpc.Executor/1
Worker started
Starting worker jsonrpc.Executor/2
Worker started
Starting worker jsonrpc.Executor/3
Worker started
Starting worker jsonrpc.Executor/4
Worker started
Starting worker jsonrpc.Executor/5
Worker started
Starting worker jsonrpc.Executor/6
Worker started
Starting worker jsonrpc.Executor/7
Worker started
XMLRPC server running
Starting executor
Starting worker periodic/0
Worker started
Starting worker periodic/1
Worker started
Starting worker periodic/2
Worker started
Starting worker periodic/3
Worker started
trying to connect libvirt
Panic: Connect to supervdsm service failed: [Errno 2] No such file or
directory
Traceback (most recent call last):
File "/usr/share/vdsm/supervdsm.py", line 78, in _connect
 utils.retry(self._manager.connect, Exception, timeout=60, tries=3)
File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry
 return func()
File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in
connect
 conn = Client(self._address, authkey=self._authkey)
File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in
Client
 c = SocketClient(address)
File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in
SocketClient
 s.connect(address)
File "/usr/lib64/python2.7/socket.py", line 224, in meth
 return getattr(self._sock,name)(*args)
error: [Errno 2] No such file or directory


Vdsm tries to connect to supervdsmd on startup, and if it is not running
it will fail.

You can do:

systemctl start supervdsmd

And they you can run vdsmd from the shell.

But why do you need to run vdsm from the shell?

Nir
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] stuck host in hosted engine migration 3.6->4.0

2016-07-21 Thread Gervais de Montbrun
Hi Oved,

Thanks for the suggestion.

I tried setting "management_ip = 0.0.0.0" but same result.
BTW, management_ip='0.0.0.0' (as suggested in the post) doesn't work for me. 
vdsmd wouldn't start.

Cheers,
Gervais



> On Jul 20, 2016, at 10:50 AM, Oved Ourfali <oourf...@redhat.com> wrote:
> 
> Also, this thread seems similar.
> Also talking about IPV4/IPV6 issue.
> Does it help?
> 
> [1] http://lists.ovirt.org/pipermail/users/2016-June/040602.html 
> <http://lists.ovirt.org/pipermail/users/2016-June/040602.html>
> 
> On Wed, Jul 20, 2016 at 4:43 PM, Martin Perina <mper...@redhat.com 
> <mailto:mper...@redhat.com>> wrote:
> Hi,
> 
> could you please create a bug and attach engine host logs (all from
> /var/log/ovirt-engine) and VDSM logs (from /var/log/vdsm)?
> 
> Thanks
> 
> Martin Perina
> 
> 
> On Wed, Jul 20, 2016 at 1:50 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>
> > wrote:
> 
> > Hi Qiong,
> >
> > I am experiencing the exact same issue. All four of my hosts are throwing
> > the same error to the vdsm.log If you find a solution, please let me know
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] API endpoint?

2016-07-18 Thread Gervais de Montbrun
Hi Folks,

Has the api endpoint moved?

I am having issues with my nagios check and also with ovirt-shell. Both are 
trying to git the api endpoint at https://myhostedengine.mydomain/api 
 and both are returning a 404 error.

Cheers,
Gervais



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] stuck host in hosted engine migration 3.6->4.0

2016-07-20 Thread Gervais de Montbrun
Hi Qiong,

I am experiencing the exact same issue. All four of my hosts are throwing the 
same error to the vdsm.log If you find a solution, please let me know. I will 
do the same, of course.

Cheers,
Gervais



> On Jul 19, 2016, at 6:40 AM, qiong...@telekom.de wrote:
> 
> Hi there, I have another issue with ovirt 4.0 after upgrading my hosted 
> engine using the recommended method 
> athttp://www.ovirt.org/documenta…/migration-engine-3.6-to-4.0/ 
> 
> Everything works well, except for the fact that my hosted engine host is kind 
> of broken in the engine. It doesnt show up and neither does the hosted engine 
> storage domain. The hosted engine host from 3.6 is still there, but non 
> responsive.
> 
> VDSM.log on the hosted engine host gives me lots of errors:
> 
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:12,236::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:13,468::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:13,860::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:20,767::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:20,944::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> Reactor thread::ERROR::2016-07-19 
> 09:56:23,666::m2cutils::301::ProtocolDetector.SSLHandshakeDispatcher::(handle_read)
>  Error during handshake: unexpected eof
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:25,532::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:29,936::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> Reactor thread::ERROR::2016-07-19 
> 09:56:35,640::m2cutils::301::ProtocolDetector.SSLHandshakeDispatcher::(handle_read)
>  Error during handshake: unexpected eof
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:48,277::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:49,459::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:49,991::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:56,862::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:56:57,049::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> JsonRpc (StompReactor)::ERROR::2016-07-19 
> 09:57:01,611::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
> reading data: (104, 'Connection reset by peer')
> 
> Does anyone have an idea how to clean this up? Thanks a lot!
> 
> 
> Cheers & Thanks
> 
> Qiong Wu
> 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] stuck host in hosted engine migration 3.6->4.0

2016-07-22 Thread Gervais de Montbrun
Hi Simone,

I did have the issue you link to below when doing a `hosted-engine --deploy` on 
this server when I was setting it up to run 3.6. I've commented on the bug with 
my experiences. I did get the host working in 3.6 and there were no errors, but 
this one has cropped up since upgrading to 4.0.1.

I did not have the same issue on all of my hosts, but the error I am 
experiencing now:
JsonRpc (StompReactor)::ERROR::2016-07-22 
09:59:56,062::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof
JsonRpc (StompReactor)::ERROR::2016-07-22 
10:00:11,240::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof
JsonRpc (StompReactor)::ERROR::2016-07-22 
10:00:21,158::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof
JsonRpc (StompReactor)::ERROR::2016-07-22 
10:00:21,441::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof
JsonRpc (StompReactor)::ERROR::2016-07-22 
10:00:26,717::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof
JsonRpc (StompReactor)::ERROR::2016-07-22 
10:00:31,856::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof
JsonRpc (StompReactor)::ERROR::2016-07-22 
10:00:36,982::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof
JsonRpc (StompReactor)::ERROR::2016-07-22 
10:00:52,180::betterAsyncore::113::vds.dispatcher::(recv) SSL error during 
reading data: unexpected eof

is happening on all of them.
:-(

Cheers,
Gervais



> On Jul 22, 2016, at 5:35 AM, Simone Tiraboschi <stira...@redhat.com> wrote:
> 
> On Thu, Jul 21, 2016 at 8:08 PM, Gervais de Montbrun
> <gerv...@demontbrun.com> wrote:
>> Hi Martin
>> 
>> Logs are attached.
>> 
>> Thank you for any help you can offer.
>> :-)
>> 
>> Cheers,
>> Gervais
> 
> see also this one: https://bugzilla.redhat.com/show_bug.cgi?id=1358530
> 
> the results are pretty similar.
> 
>> On Jul 21, 2016, at 10:20 AM, Martin Perina <mper...@redhat.com> wrote:
>> 
>> So could you please share logs?
>> 
>> Thanks
>> 
>> Martin
>> 
>> On Thu, Jul 21, 2016 at 3:17 PM, Gervais de Montbrun
>> <gerv...@demontbrun.com> wrote:
>>> 
>>> Hi Oved,
>>> 
>>> Thanks for the suggestion.
>>> 
>>> I tried setting "management_ip = 0.0.0.0" but same result.
>>> BTW, management_ip='0.0.0.0' (as suggested in the post) doesn't work for
>>> me. vdsmd wouldn't start.
>>> 
>>> Cheers,
>>> Gervais
>>> 
>>> 
>>> 
>>> On Jul 20, 2016, at 10:50 AM, Oved Ourfali <oourf...@redhat.com> wrote:
>>> 
>>> Also, this thread seems similar.
>>> Also talking about IPV4/IPV6 issue.
>>> Does it help?
>>> 
>>> [1] http://lists.ovirt.org/pipermail/users/2016-June/040602.html
>>> 
>>> On Wed, Jul 20, 2016 at 4:43 PM, Martin Perina <mper...@redhat.com> wrote:
>>>> 
>>>> Hi,
>>>> 
>>>> could you please create a bug and attach engine host logs (all from
>>>> /var/log/ovirt-engine) and VDSM logs (from /var/log/vdsm)?
>>>> 
>>>> Thanks
>>>> 
>>>> Martin Perina
>>>> 
>>>> 
>>>> On Wed, Jul 20, 2016 at 1:50 PM, Gervais de Montbrun
>>>> <gerv...@demontbrun.com
>>>>> wrote:
>>>> 
>>>>> Hi Qiong,
>>>>> 
>>>>> I am experiencing the exact same issue. All four of my hosts are
>>>>> throwing
>>>>> the same error to the vdsm.log If you find a solution, please let me
>>>>> know
>>> 
>>> 
>>> 
>> 
>> 
>> 
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/users
>> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Cluster shows N/A in Dashboard

2016-09-06 Thread Gervais de Montbrun
Thanks!

It looks like something is broken, so I am glad that this is normal.

Cheers,
Gervais



> On Sep 6, 2016, at 3:03 PM, Alexander Wels <aw...@redhat.com> wrote:
> 
> On Tuesday, September 6, 2016 3:00:37 PM EDT Gervais de Montbrun wrote:
>> Hey Folks,
>> 
>> Anyone know why my cluster might be showing N/A in the Dashboard?
>> 
>> 
>> feedback-on-oVirt-engine-4.0.3-1.el7.centos
>> 
>> Cheers,
>> Gervais
> 
> Because clusters don't have a 'status' to display. We wanted to display the 
> count but the widgets also want to display a status. Since there is no status 
> the widget says N/A
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Cluster shows N/A in Dashboard

2016-09-06 Thread Gervais de Montbrun
Hey Folks,

Anyone know why my cluster might be showing N/A in the Dashboard?


feedback-on-oVirt-engine-4.0.3-1.el7.centos

Cheers,
Gervais



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Hung task finalizing live migration

2016-09-10 Thread Gervais de Montbrun
Hi Maton,

I have seen tasks in a weird state on my cluster also. I've had a vm get 
"stuck" during a migration where it says "migrating to" in the web GUI, but it 
has finished migrating hours ago... If I click "Cancel Migraton" the gui tells 
me that it is not migrating, but I can't do any action on the vm because I am 
then told that the vm can't be acted upon while it is migrating. I also try to 
kill the task, but there are none listed

What has worked for me has been to put my hosted-engine in global maintenance 
mode, then ssh into the hosted engine and run the "engine-setup" command. I am 
not saying the is the best course of action, but when the engine comes back 
online the task is cleared.

Cheers,
Gervais



> On Sep 10, 2016, at 11:06 AM, Maton, Brett  wrote:
> 
> Anyone know how to fix this broken task ?
> 
> It's persisted through a reboot of all hosts and the engine, something needs 
> deleting from the database to clear the task and release the locked disk
> 
> On 8 September 2016 at 13:25, Maton, Brett  > wrote:
> Thanks for the pointer Mikhail, however I don't get any tasks listed with 
> that command:
> 
> vdsClient -s 0 getAllTasksStatuses
> 
> /usr/share/vdsm/vdsClient.py:33: DeprecationWarning: vdscli uses xmlrpc. 
> since ovirt 3.6 xmlrpc is deprecated, please use vdsm.jsonrpcvdscli
>   from vdsm import utils, vdscli, constants
> 
> {'status': {'message': 'OK', 'code': 0}, 'allTasksStatus': {}}
> 
> 
> On 8 September 2016 at 09:51, Краснобаев Михаил  > wrote:
> Hi,
>  
> There is a way to cancel a running task  -  look here 
> http://lists.ovirt.org/pipermail/users/2014-November/028946.html 
> 
> I was able to stop snapshot deletion this way.
>  
> Best, Mikhail.
>  
> 08.09.2016, 08:14, "Maton, Brett"  >:
>> Any suggestions ?
>> 
>> THe task has been hung for 5 days now, I can't start the machine or destroy 
>> it.
>> 
>> 
>> On 7 September 2016 at 06:49, Maton, Brett > > wrote:
>> Sorry just hit reply
>> 
>> I'm seeing these errors in the logs which look related to the problem:
>> 
>> 
>> 2016-09-07 06:46:35,123 ERROR 
>> [org.ovirt.engine.core.bll.tasks.CommandCallbacksPoller] 
>> (DefaultQuartzScheduler6) [19c58c0d] Failed invoking callback end method 
>> 'onFailed' for command '07608003-ca05-4e2e-b917-85ce525c011b' with exception 
>> 'null', the callback is marked for end method retries
>> 2016-09-07 06:46:45,184 ERROR [org.ovirt.engine.core.bll.Com 
>> mandsFactory] 
>> (DefaultQuartzScheduler7) [19c58c0d] Error in invocating CTOR of command 
>> 'LiveMigrateDisk': null
>> 2016-09-07 06:46:45,185 ERROR 
>> [org.ovirt.engine.core.bll.tasks.CommandCallbacksPoller] 
>> (DefaultQuartzScheduler7) [19c58c0d] Failed invoking callback end method 
>> 'onFailed' for command '07608003-ca05-4e2e-b917-85ce525c011b' with exception 
>> 'null', the callback is marked for end method retries
>> 
>> On 5 September 2016 at 06:46, Nir Soffer > > wrote:
>> Hi Maton,
>> 
>> Please reply to the list, not to me directly.
>> 
>> Ala, can you look at this? is this a known issue?
>> 
>> Thanks,
>> Nir
>> 
>> On Mon, Sep 5, 2016 at 8:43 AM, Maton, Brett > > wrote:
>> > Log files as requested
>> >
>> > https://ufile.io/4fc35  vdsm log
>> > https://ufile.io/e9836  engine 03-Sep
>> > https://ufile.io/15f37  engine 04-Sep
>> >
>> > vdsm log stops on the 01-Sep...
>> >
>> > Couple of entries from the event log:
>> >
>> > Sep 3, 2016 7:31:07 PMSnapshot 'Auto-generated for Live Storage
>> > Migration' deletion for VM 'lv01' has been completed.
>> > Sep 3, 2016 6:46:46 PMSnapshot 'Auto-generated for Live Storage
>> > Migration' deletion for VM 'lv01' was initiated by SYSTEM
>> >
>> > And the related tasks
>> >
>> > Removing Snapshot Auto-generated for Live Storage Migration of VM lv01
>> > Sep 3, 2016 6:46:44 PMN/A29f45ca9
>> > ValidatingSep 3, 2016 6:46:44 PMuntilSep 3, 2016 6:46:44 PM
>> > ExecutingSep 3, 2016 6:46:44 PMuntilSep 3, 2016 7:31:06 PM
>> >
>> > FinalizingSep 3, 2016 7:31:06 PMN/A
>> >
>> >
>> >
>> > On 4 September 2016 at 14:27, Nir Soffer > > > wrote:
>> >>
>> >> On Sun, Sep 4, 2016 at 12:40 PM, Maton, Brett > >> >
>> >> wrote:
>> >>>
>> >>> How do I fix / kill a hung vdsm task?
>> >>>
>> >>> It seems to have completed the task but is stuck finalising.
>> >>>
>> >>> Removing Snapshot Auto-generated for Live 

Re: [ovirt-users] Hung task finalizing live migration

2016-09-10 Thread Gervais de Montbrun
YAY!! Glad it worked for you.

:-)

Cheers,
Gervais



> On Sep 10, 2016, at 12:05 PM, Maton, Brett <mat...@ltresources.co.uk> wrote:
> 
> Way-hey! finally the task has gone and I can do 'stuff' with that VM again.
> 
> Thanks Gervais, you're a star
> 
> On 10 September 2016 at 15:40, Maton, Brett <mat...@ltresources.co.uk 
> <mailto:mat...@ltresources.co.uk>> wrote:
> Thanks Gervais I'll give that a go
> 
> On 10 September 2016 at 15:39, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hi Maton,
> 
> I have seen tasks in a weird state on my cluster also. I've had a vm get 
> "stuck" during a migration where it says "migrating to" in the web GUI, but 
> it has finished migrating hours ago... If I click "Cancel Migraton" the gui 
> tells me that it is not migrating, but I can't do any action on the vm 
> because I am then told that the vm can't be acted upon while it is migrating. 
> I also try to kill the task, but there are none listed
> 
> What has worked for me has been to put my hosted-engine in global maintenance 
> mode, then ssh into the hosted engine and run the "engine-setup" command. I 
> am not saying the is the best course of action, but when the engine comes 
> back online the task is cleared.
> 
> Cheers,
> Gervais
> 
> 
> 
>> On Sep 10, 2016, at 11:06 AM, Maton, Brett <mat...@ltresources.co.uk 
>> <mailto:mat...@ltresources.co.uk>> wrote:
>> 
>> Anyone know how to fix this broken task ?
>> 
>> It's persisted through a reboot of all hosts and the engine, something needs 
>> deleting from the database to clear the task and release the locked disk
>> 
>> On 8 September 2016 at 13:25, Maton, Brett <mat...@ltresources.co.uk 
>> <mailto:mat...@ltresources.co.uk>> wrote:
>> Thanks for the pointer Mikhail, however I don't get any tasks listed with 
>> that command:
>> 
>> vdsClient -s 0 getAllTasksStatuses
>> 
>> /usr/share/vdsm/vdsClient.py:33: DeprecationWarning: vdscli uses xmlrpc. 
>> since ovirt 3.6 xmlrpc is deprecated, please use vdsm.jsonrpcvdscli
>>   from vdsm import utils, vdscli, constants
>> 
>> {'status': {'message': 'OK', 'code': 0}, 'allTasksStatus': {}}
>> 
>> 
>> On 8 September 2016 at 09:51, Краснобаев Михаил <mi...@ya.ru 
>> <mailto:mi...@ya.ru>> wrote:
>> Hi,
>>  
>> There is a way to cancel a running task  -  look here 
>> http://lists.ovirt.org/pipermail/users/2014-November/028946.html 
>> <http://lists.ovirt.org/pipermail/users/2014-November/028946.html>
>> I was able to stop snapshot deletion this way.
>>  
>> Best, Mikhail.
>>  
>> 08.09.2016, 08:14, "Maton, Brett" <mat...@ltresources.co.uk 
>> <mailto:mat...@ltresources.co.uk>>:
>>> Any suggestions ?
>>> 
>>> THe task has been hung for 5 days now, I can't start the machine or destroy 
>>> it.
>>> 
>>> 
>>> On 7 September 2016 at 06:49, Maton, Brett <mat...@ltresources.co.uk 
>>> <mailto:mat...@ltresources.co.uk>> wrote:
>>> Sorry just hit reply
>>> 
>>> I'm seeing these errors in the logs which look related to the problem:
>>> 
>>> 
>>> 2016-09-07 06:46:35,123 ERROR 
>>> [org.ovirt.engine.core.bll.tasks.CommandCallbacksPoller] 
>>> (DefaultQuartzScheduler6) [19c58c0d] Failed invoking callback end method 
>>> 'onFailed' for command '07608003-ca05-4e2e-b917-85ce525c011b' with 
>>> exception 'null', the callback is marked for end method retries
>>> 2016-09-07 06:46:45,184 ERROR [org.ovirt.engine.core.bll.Com 
>>> <http://org.ovirt.engine.core.bll.com/>mandsFactory] 
>>> (DefaultQuartzScheduler7) [19c58c0d] Error in invocating CTOR of command 
>>> 'LiveMigrateDisk': null
>>> 2016-09-07 06:46:45,185 ERROR 
>>> [org.ovirt.engine.core.bll.tasks.CommandCallbacksPoller] 
>>> (DefaultQuartzScheduler7) [19c58c0d] Failed invoking callback end method 
>>> 'onFailed' for command '07608003-ca05-4e2e-b917-85ce525c011b' with 
>>> exception 'null', the callback is marked for end method retries
>>> 
>>> On 5 September 2016 at 06:46, Nir Soffer <nsof...@redhat.com 
>>> <mailto:nsof...@redhat.com>> wrote:
>>> Hi Maton,
>>> 
>>> Please reply to the list, not to me directly.
>>> 
>>> Ala, can you look at this? is this a known issue?
>>> 
>>> Thanks,
>>> Nir
>>> 
>>> On Mon, Sep 5, 2016 at 

Re: [ovirt-users] oVirt 4.0.x - hosted-engine was not starting properly

2016-09-29 Thread Gervais de Montbrun
Hi Martin,

The entropy was super low. Somewhere around 140. I installed and configured 
haveged.service to start at bootup, reverted my apache changes... After a 
reboot, my systemctl status still says that there are 7 services queued (note 
that I erroneously said degraded in my previous email - the services are, in 
fact, queued), but the oVirt GUI comes up almost immediately and everything 
seems to be great.

Thank you for the tip. You solved my issue.

Cheers,
Gervais



> On Sep 29, 2016, at 7:47 AM, Martin Perina <mper...@redhat.com> wrote:
> 
> Hi,
> 
> please take a look at my inline comments:
> 
> On Tue, Sep 27, 2016 at 7:23 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hey All,
> 
> Since updating to 4.0.x of oVirt, I have had an issue with my hosted engine. 
> After a some poking around, I think I have figured out my issue and thought I 
> would share to see what others think.
> The issue has existed with 4.0, 4.0.1, 4.0.2, 4.0.3, and still exists in 
> 4.0.4.
> 
> Description:
> When my hosted engine starts it reports that it is in a degraded state with 7 
> or 8 services still not started when I run systemctl status. It takes about 6 
> or 7 minutes to eventually start all the services and come online. If I don't 
> set my cluster to Global-Maintenance mode it eventually thinks that my 
> hosted-engine needs to be rebooted and restarts it before it can start 
> everything.
> 
> ​Could you please share with us logs gathered by ovirt-log-collector?
> 
> It's just a guess but could you please take a look if you HE VM has enough 
> entropy?
> 
>   cat /proc/sys/kernel/random/entropy_avail
> 
> If the value is low (below or around 200),  you really need to install and 
> configure some entropy generator such as haveged
> 
> 
> Solution:
> I realized that Apache was the culprit and found that the proxy to the 
> ovirt-engine in /etc/httpd/conf.d/z-ovirt-engine-proxy.conf has a super long 
> timeout with many retries. I changed the settings and now everything works 
> for me.
> 
> -> Before change:
>  ^/(ovirt-engine($|/)|api($|/)|RHEVManagerWeb/|OvirtEngineWeb/|ca.crt$|engine.ssh.key.txt$|rhevm.ssh.key.txt$)>
> ProxyPassMatch ajp://127.0.0.1:8702 <> timeout=3600 retry=5
> 
> 
> AddOutputFilterByType DEFLATE text/javascript text/css text/html 
> text/xml text/json application/xml application/json application/x-yaml
> 
> 
> 
> -> After change:
> 
> ProxyPassMatch ajp://127.0.0.1:8702 <> timeout=5 retry=2
> 
> 
> AddOutputFilterByType DEFLATE text/javascript text/css text/html 
> text/xml text/json application/xml application/json application/x-yaml
> 
> 
> 
> ​This one is correct for 4.0​​, not sure why it was not updated during 
> upgrade from 3.6. @Simone?
> ​ 
> 
> If I read the timeout settings correctly, it will wait 60 minutes with 5 
> retries. 5 hours is way too long for my little server to hold onto all those 
> apache processes.
> The change I made allows for there to be an error, and also releases apache's 
> hold on the process. Once everything is ready, apache is ready to serve 
> requests and everything/everyone is happy. Before making the change, I just 
> get a whitescreen in my browser and then nothing works until I restart Apache 
> (or I end up in an endless loop of ovirt-ha services restarting my 
> hosted-engine.
> 
> ​Well, if you have an issue with too many apache processes waiting for engine 
> to respond, then there's some issue in engine. As I wrote above please share 
> the logs with us and check entropy.
> 
> Thanks
> 
> Martin Perina
> ​ 
> 
> I noticed that this setting reverts to the original setting, so oVirt must be 
> writing this file. Perhaps these number can be changed in oVirt? If not, I 
> will just setup and ansible play to revert the settings with working values 
> and restart apache on my engine.
> :-)
> 
> Cheers,
> Gervais
> 
> 
> 
> 
> ___
> Users mailing list
> Users@ovirt.org <mailto:Users@ovirt.org>
> http://lists.ovirt.org/mailman/listinfo/users 
> <http://lists.ovirt.org/mailman/listinfo/users>
> 
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt 4.0.x - hosted-engine was not starting properly

2016-09-29 Thread Gervais de Montbrun
Hi Simone,

Thanks for the info. I'll look at the solution that you suggested.

Cheers,
Gervais



> On Sep 29, 2016, at 10:01 AM, Simone Tiraboschi <stira...@redhat.com> wrote:
> 
> 
> 
> On Thu, Sep 29, 2016 at 2:51 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hi Martin,
> 
> The entropy was super low. Somewhere around 140. I installed and configured 
> haveged.service to start at bootup, reverted my apache changes... After a 
> reboot, my systemctl status still says that there are 7 services queued (note 
> that I erroneously said degraded in my previous email - the services are, in 
> fact, queued), but the oVirt GUI comes up almost immediately and everything 
> seems to be great.
> 
> 
> Take care that using havaged on a VM should not be considered a good source 
> of entropy and the oVirt PKi is managed by the engine.
> http://security.stackexchange.com/questions/34523/is-it-appropriate-to-use-haveged-as-a-source-of-entropy-on-virtual-machines
>  
> <http://security.stackexchange.com/questions/34523/is-it-appropriate-to-use-haveged-as-a-source-of-entropy-on-virtual-machines>
> 
> A better approach is the virtio-rng paravirtualised rng driver as for patch 
> https://gerrit.ovirt.org/#/c/62334/ <https://gerrit.ovirt.org/#/c/62334/>
> 
>  
> Thank you for the tip. You solved my issue.
> 
> Cheers,
> Gervais
> 
> 
> 
>> On Sep 29, 2016, at 7:47 AM, Martin Perina <mper...@redhat.com 
>> <mailto:mper...@redhat.com>> wrote:
>> 
>> Hi,
>> 
>> please take a look at my inline comments:
>> 
>> On Tue, Sep 27, 2016 at 7:23 PM, Gervais de Montbrun <gerv...@demontbrun.com 
>> <mailto:gerv...@demontbrun.com>> wrote:
>> Hey All,
>> 
>> Since updating to 4.0.x of oVirt, I have had an issue with my hosted engine. 
>> After a some poking around, I think I have figured out my issue and thought 
>> I would share to see what others think.
>> The issue has existed with 4.0, 4.0.1, 4.0.2, 4.0.3, and still exists in 
>> 4.0.4.
>> 
>> Description:
>> When my hosted engine starts it reports that it is in a degraded state with 
>> 7 or 8 services still not started when I run systemctl status. It takes 
>> about 6 or 7 minutes to eventually start all the services and come online. 
>> If I don't set my cluster to Global-Maintenance mode it eventually thinks 
>> that my hosted-engine needs to be rebooted and restarts it before it can 
>> start everything.
>> 
>> ​Could you please share with us logs gathered by ovirt-log-collector?
>> 
>> It's just a guess but could you please take a look if you HE VM has enough 
>> entropy?
>> 
>>   cat /proc/sys/kernel/random/entropy_avail
>> 
>> If the value is low (below or around 200),  you really need to install and 
>> configure some entropy generator such as haveged
>> 
>> 
>> Solution:
>> I realized that Apache was the culprit and found that the proxy to the 
>> ovirt-engine in /etc/httpd/conf.d/z-ovirt-engine-proxy.conf has a super long 
>> timeout with many retries. I changed the settings and now everything works 
>> for me.
>> 
>> -> Before change:
>> > ^/(ovirt-engine($|/)|api($|/)|RHEVManagerWeb/|OvirtEngineWeb/|ca.crt$|engine.ssh.key.txt$|rhevm.ssh.key.txt$)>
>> ProxyPassMatch ajp://127.0.0.1:8702 <> timeout=3600 retry=5
>> 
>> 
>> AddOutputFilterByType DEFLATE text/javascript text/css text/html 
>> text/xml text/json application/xml application/json application/x-yaml
>> 
>> 
>> 
>> -> After change:
>> 
>> ProxyPassMatch ajp://127.0.0.1:8702 <> timeout=5 retry=2
>> 
>> 
>> AddOutputFilterByType DEFLATE text/javascript text/css text/html 
>> text/xml text/json application/xml application/json application/x-yaml
>> 
>> 
>> 
>> ​This one is correct for 4.0​​, not sure why it was not updated during 
>> upgrade from 3.6. @Simone?
>> ​ 
>> 
>> If I read the timeout settings correctly, it will wait 60 minutes with 5 
>> retries. 5 hours is way too long for my little server to hold onto all those 
>> apache processes.
>> The change I made allows for there to be an error, and also releases 
>> apache's hold on the process. Once everything is ready, apache is ready to 
>> serve requests and everything/everyone is happy. Before making the change, I 
>> just get a whitescreen in my browser and then nothing works until I restart 
>>

Re: [ovirt-users] oVirt 4.0.x - hosted-engine was not starting properly

2016-09-29 Thread Gervais de Montbrun
Hi Simone,

Yes... I guess it was not clear in my original email. I changed the numbers 
myself to lower the timeout and retries. With them set as they were set by 
ovirt (timeout=3600 retry=5) things were not working for me. 

Cheers,
Gervais



> On Sep 29, 2016, at 10:04 AM, Simone Tiraboschi <stira...@redhat.com> wrote:
> 
> 
> 
> On Thu, Sep 29, 2016 at 12:47 PM, Martin Perina <mper...@redhat.com 
> <mailto:mper...@redhat.com>> wrote:
> Hi,
> 
> please take a look at my inline comments:
> 
> On Tue, Sep 27, 2016 at 7:23 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hey All,
> 
> Since updating to 4.0.x of oVirt, I have had an issue with my hosted engine. 
> After a some poking around, I think I have figured out my issue and thought I 
> would share to see what others think.
> The issue has existed with 4.0, 4.0.1, 4.0.2, 4.0.3, and still exists in 
> 4.0.4.
> 
> Description:
> When my hosted engine starts it reports that it is in a degraded state with 7 
> or 8 services still not started when I run systemctl status. It takes about 6 
> or 7 minutes to eventually start all the services and come online. If I don't 
> set my cluster to Global-Maintenance mode it eventually thinks that my 
> hosted-engine needs to be rebooted and restarts it before it can start 
> everything.
> 
> ​Could you please share with us logs gathered by ovirt-log-collector?
> 
> It's just a guess but could you please take a look if you HE VM has enough 
> entropy?
> 
>   cat /proc/sys/kernel/random/entropy_avail
> 
> If the value is low (below or around 200),  you really need to install and 
> configure some entropy generator such as haveged
> 
> 
> Solution:
> I realized that Apache was the culprit and found that the proxy to the 
> ovirt-engine in /etc/httpd/conf.d/z-ovirt-engine-proxy.conf has a super long 
> timeout with many retries. I changed the settings and now everything works 
> for me.
> 
> -> Before change:
>  ^/(ovirt-engine($|/)|api($|/)|RHEVManagerWeb/|OvirtEngineWeb/|ca.crt$|engine.ssh.key.txt$|rhevm.ssh.key.txt$)>
> ProxyPassMatch ajp://127.0.0.1:8702 <> timeout=3600 retry=5
> 
> 
> AddOutputFilterByType DEFLATE text/javascript text/css text/html 
> text/xml text/json application/xml application/json application/x-yaml
> 
> 
> 
> -> After change:
> 
> ProxyPassMatch ajp://127.0.0.1:8702 <> timeout=5 retry=2
> 
> 
> AddOutputFilterByType DEFLATE text/javascript text/css text/html 
> text/xml text/json application/xml application/json application/x-yaml
> 
> 
> 
> ​This one is correct for 4.0​​, not sure why it was not updated during 
> upgrade from 3.6. @Simone?
> ​
> 
> Honestly it's
> 
> ProxyPassMatch ajp://127.0.0.1:8702 <http://127.0.0.1:8702/> 
> timeout=3600 retry=5
> 
> 
> AddOutputFilterByType DEFLATE text/javascript text/css text/html 
> text/xml text/json application/xml application/json application/x-yaml
> 
> 
> also on a fresh 4.0 engine from our latest engine-appliance.
>  
> 
> If I read the timeout settings correctly, it will wait 60 minutes with 5 
> retries. 5 hours is way too long for my little server to hold onto all those 
> apache processes.
> The change I made allows for there to be an error, and also releases apache's 
> hold on the process. Once everything is ready, apache is ready to serve 
> requests and everything/everyone is happy. Before making the change, I just 
> get a whitescreen in my browser and then nothing works until I restart Apache 
> (or I end up in an endless loop of ovirt-ha services restarting my 
> hosted-engine.
> 
> ​Well, if you have an issue with too many apache processes waiting for engine 
> to respond, then there's some issue in engine. As I wrote above please share 
> the logs with us and check entropy.
> 
> Thanks
> 
> Martin Perina
> ​ 
> 
> I noticed that this setting reverts to the original setting, so oVirt must be 
> writing this file. Perhaps these number can be changed in oVirt? If not, I 
> will just setup and ansible play to revert the settings with working values 
> and restart apache on my engine.
> :-)
> 
> Cheers,
> Gervais
> 
> 
> 
> 
> ___
> Users mailing list
> Users@ovirt.org <mailto:Users@ovirt.org>
> http://lists.ovirt.org/mailman/listinfo/users 
> <http://lists.ovirt.org/mailman/listinfo/users>
> 
> 
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Failed Import - Hung finalizing

2016-09-20 Thread Gervais de Montbrun
Hi Brett,

You may be able to clear the task using the vdsClient:

On the SPM host, you can run vdsClient to query and make changes to the cluster 
To cleanup stuck task, for example

[root@oVirt1 ~]
# vdsClient -s 0 getAllTasksStatuses
{'status': {'message': 'OK', 'code': 0}, 'allTasksStatus': {}}
 

[root@oVirt1 ~]# vdsClient -s 0 clearTask 

In my experience, the stuck task is never listed and therefore not available to 
be removed with vdsClient. What has worked for me in the past is to ssh into 
the engine and re-running `engine-setup` If you are running a hosted engine 
make sure that you put it in global maintenance mode first.

Cheers,
Gervais



> On Sep 20, 2016, at 7:10 AM, Maton, Brett  wrote:
> 
> Hi,
> 
>   I had a network outage while I was import VM backups.
>   The tasks show 'Copying Image - Failed', and are now stuck 'finalizing'.
> 
>   The VM's I was trying to import show status Image Locked in the web-ui.
> 
>   What o I need to do to cancel the tasks and start again ?
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] global vs local maintenance with single host

2016-09-28 Thread Gervais de Montbrun
Hi Gianluca,

Instead of editing the system's built in systemd configuration, you can do the 
following...

Create a file called /etc/systemd/system/ovirt-ha-broker.service

# My custom ovirt-ha-broker.service config that ensures NFS starts before 
ovirt-ha-broker.service
# thanks Gervais for this tip!  :-)

.include /usr/lib/systemd/system/ovirt-ha-broker.service

[Unit]
After=nfs-server.service

Then disable and enable ovirt-ha-broker.service (systemctl disable 
ovirt-ha-broker.service ; systemctl enable ovirt-ha-broker.service) and you 
should see that it is using your customized systemd unit definition. You can 
see that systemd is using your file by running systemctl status 
ovirt-ha-broker.service. You'll see something like "Loaded: loaded 
(/etc/systemd/system/ovirt-ha-broker.service;" in the output.

Your file will survive updates and therefore always wait for nfs to start prior 
to starting. You can do the same for your other customizations.

Cheers,
Gervais



> On Sep 28, 2016, at 1:31 PM, Gianluca Cecchi  
> wrote:
> 
> On Sun, Sep 4, 2016 at 10:54 AM, Yedidyah Bar David  > wrote:
> On Sat, Sep 3, 2016 at 1:18 PM, Gianluca Cecchi
> > wrote:
> > Hello,
> > how do the two modes apply in case of single host?
> > During an upgrade phase, after having upgraded the self hosted engine and
> > leaving global maintenance and having checked all is ok, what is the correct
> > mode then to put host if I want finally to update it too?
> 
> The docs say to put hosts to maintenance from the engine before upgrading 
> them.
> 
> This is (also) so that VMs on them are migrated away to other hosts.
> 
> With a single host, you have no other hosts to migrate VMs to.
> 
> So you should do something like this:
> 
> 1. Set global maintenance (because you are going to take down the
> engine and its vm)
> 2. Shutdown all other VMs
> 3. Shutdown engine vm from itself
> At this point, you should be able to simply stop HA services. But it
> might be cleaner to first set local maintenance. Not sure but perhaps
> this might be required for vdsm. So:
> 4. Set local maintenance
> 5. Stop HA services. If setting local maintenance didn't work, perhaps
> better stop also vdsm services. This stop should obviously happen
> automatically by yum/rpm, but perhaps better do this manually to see
> that it worked.
> 6. yum (or dnf) update stuff.
> 7. Start HA services
> 8. Check status. I think you'll see that both local and global maint
> are still set.
> 9. Set maintenance to none
> 10. Check status again - I think that after some time HA will decide
> to start engine vm and should succeed.
> 11. Start all other VMs.
> 
> Didn't try this myself.
> 
> Best,
> --
> Didi
> 
> Hello Didi,
> I would like to leverage the update I have to do on 2 small different lab 
> environments to crosscheck the steps suggested.
> They are both single host environments with self hosted engine.
> One is 4.0.2 and the other is 4.0.3. Both on CentoS 7.2
> I plan to migrate to the just released 4.0.4
> 
> One note: in both environments the storage is NFS and is provided by the host 
> itself, so a corner case (for all hosted_storage domain, main data domain and 
> iso storage domain).
> I customized the init scripts, basically for start phase of the server and to 
> keep in count of the NFS service, but probably something has to be done for 
> stop too?
> 
> 1) In /usr/lib/systemd/system/ovirt-ha-broker.service
> 
> added in section [Unit]
> 
> After=nfs-server.service
> 
> The file is overwritten at update so one has to keep in mind this
> 
> 2) also in vdsmd.service changed 
> from:
> After=multipathd.service libvirtd.service iscsid.service rpcbind.service \
>   supervdsmd.service sanlock.service vdsm-network.service
> 
> to:
> After=multipathd.service libvirtd.service iscsid.service rpcbind.service \
>   supervdsmd.service sanlock.service vdsm-network.service \
>   nfs-server.service
> 
> Do you think any order setup I have to put in place related to NFS service 
> and oVirt services stop?
> 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] global vs local maintenance with single host

2016-10-03 Thread Gervais de Montbrun
Hi Gianluca,

I forgot to mention that you need to ensure that systemd knows that the new 
file exists. You should likely run `systemctl daemon-reload` after 
creating/modifying your custom systemd files. You can see that the After 
directive is combined from both files. Check it out by running `systemctl show 
vdsmd.service | grep After`

It makes sense to make further changes to ensure that NFS stops last, but I 
haven't looked into that yet.
:-)

Cheers,
Gervais



> On Oct 3, 2016, at 7:22 AM, Gianluca Cecchi <gianluca.cec...@gmail.com> wrote:
> 
> 
> Il 28/Set/2016 21:09, "Gervais de Montbrun" <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> ha scritto:
> >
> > Hi Gianluca,
> >
> > Instead of editing the system's built in systemd configuration, you can do 
> > the following...
> >
> > Create a file called /etc/systemd/system/ovirt-ha-broker.service
> >
> >> # My custom ovirt-ha-broker.service config that ensures NFS starts before 
> >> ovirt-ha-broker.service
> >> # thanks Gervais for this tip!  :-)
> >>
> >> .include /usr/lib/systemd/system/ovirt-ha-broker.service
> >>
> >> [Unit]
> >> After=nfs-server.service
> >
> >
> > Then disable and enable ovirt-ha-broker.service (systemctl disable 
> > ovirt-ha-broker.service ; systemctl enable ovirt-ha-broker.service) and you 
> > should see that it is using your customized systemd unit definition. You 
> > can see that systemd is using your file by running systemctl status 
> > ovirt-ha-broker.service. You'll see something like "Loaded: loaded 
> > (/etc/systemd/system/ovirt-ha-broker.service;" in the output.
> >
> > Your file will survive updates and therefore always wait for nfs to start 
> > prior to starting. You can do the same for your other customizations.
> >
> > Cheers,
> > Gervais
> >
> >
> >
> >> On Sep 28, 2016, at 1:31 PM, Gianluca Cecchi <gianluca.cec...@gmail.com 
> >> <mailto:gianluca.cec...@gmail.com>> wrote:
> >>
> >> On Sun, Sep 4, 2016 at 10:54 AM, Yedidyah Bar David <d...@redhat.com 
> >> <mailto:d...@redhat.com>> wrote:
> >>>
> >>> On Sat, Sep 3, 2016 at 1:18 PM, Gianluca Cecchi
> >>> <gianluca.cec...@gmail.com <mailto:gianluca.cec...@gmail.com>> wrote:
> >>> > Hello,
> >>> > how do the two modes apply in case of single host?
> >>> > During an upgrade phase, after having upgraded the self hosted engine 
> >>> > and
> >>> > leaving global maintenance and having checked all is ok, what is the 
> >>> > correct
> >>> > mode then to put host if I want finally to update it too?
> >>>
> >>> The docs say to put hosts to maintenance from the engine before upgrading 
> >>> them.
> >>>
> >>> This is (also) so that VMs on them are migrated away to other hosts.
> >>>
> >>> With a single host, you have no other hosts to migrate VMs to.
> >>>
> >>> So you should do something like this:
> >>>
> >>> 1. Set global maintenance (because you are going to take down the
> >>> engine and its vm)
> >>> 2. Shutdown all other VMs
> >>> 3. Shutdown engine vm from itself
> >>> At this point, you should be able to simply stop HA services. But it
> >>> might be cleaner to first set local maintenance. Not sure but perhaps
> >>> this might be required for vdsm. So:
> >>> 4. Set local maintenance
> >>> 5. Stop HA services. If setting local maintenance didn't work, perhaps
> >>> better stop also vdsm services. This stop should obviously happen
> >>> automatically by yum/rpm, but perhaps better do this manually to see
> >>> that it worked.
> >>> 6. yum (or dnf) update stuff.
> >>> 7. Start HA services
> >>> 8. Check status. I think you'll see that both local and global maint
> >>> are still set.
> >>> 9. Set maintenance to none
> >>> 10. Check status again - I think that after some time HA will decide
> >>> to start engine vm and should succeed.
> >>> 11. Start all other VMs.
> >>>
> >>> Didn't try this myself.
> >>>
> >>> Best,
> >>> --
> >>> Didi
> >>
> >>
> >> Hello Didi,
> >> I would like to leverage the update I have to do on 2 small different lab 
> >> environments to crosscheck the steps suggest

[ovirt-users] oVirt 4.0.x - hosted-engine was not starting properly

2016-09-27 Thread Gervais de Montbrun
Hey All,

Since updating to 4.0.x of oVirt, I have had an issue with my hosted engine. 
After a some poking around, I think I have figured out my issue and thought I 
would share to see what others think.
The issue has existed with 4.0, 4.0.1, 4.0.2, 4.0.3, and still exists in 4.0.4.

Description:
When my hosted engine starts it reports that it is in a degraded state with 7 
or 8 services still not started when I run systemctl status. It takes about 6 
or 7 minutes to eventually start all the services and come online. If I don't 
set my cluster to Global-Maintenance mode it eventually thinks that my 
hosted-engine needs to be rebooted and restarts it before it can start 
everything.

Solution:
I realized that Apache was the culprit and found that the proxy to the 
ovirt-engine in /etc/httpd/conf.d/z-ovirt-engine-proxy.conf has a super long 
timeout with many retries. I changed the settings and now everything works for 
me.

-> Before change:

ProxyPassMatch ajp://127.0.0.1:8702 timeout=3600 retry=5


AddOutputFilterByType DEFLATE text/javascript text/css text/html 
text/xml text/json application/xml application/json application/x-yaml



-> After change:

ProxyPassMatch ajp://127.0.0.1:8702 timeout=5 retry=2


AddOutputFilterByType DEFLATE text/javascript text/css text/html 
text/xml text/json application/xml application/json application/x-yaml



If I read the timeout settings correctly, it will wait 60 minutes with 5 
retries. 5 hours is way too long for my little server to hold onto all those 
apache processes. The change I made allows for there to be an error, and also 
releases apache's hold on the process. Once everything is ready, apache is 
ready to serve requests and everything/everyone is happy. Before making the 
change, I just get a whitescreen in my browser and then nothing works until I 
restart Apache (or I end up in an endless loop of ovirt-ha services restarting 
my hosted-engine.

I noticed that this setting reverts to the original setting, so oVirt must be 
writing this file. Perhaps these number can be changed in oVirt? If not, I will 
just setup and ansible play to revert the settings with working values and 
restart apache on my engine.
:-)

Cheers,
Gervais



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Hosted Engine won't deploy

2016-12-15 Thread Gervais de Montbrun
Hi Kasturi,

They were imported automatically. I see the hosted_engine domain and the 
hosted_engine vm in my list of vm's. 

Cheers,
Gervais



> On Dec 15, 2016, at 3:06 AM, knarra <kna...@redhat.com> wrote:
> 
> On 12/15/2016 03:35 AM, Gervais de Montbrun wrote:
>> Hi all,
>> 
>> I had to reinstall one of my hosts today and I noticed an issue. The error 
>> message was:
>> 
>> Ovirt2:
>> Cannot edit Host. You are using an unmanaged hosted engine VM. Please 
>> upgrade the cluster level to 3.6 and wait for the hosted engine storage 
>> domain to be properly imported.
>> I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data 
>> Center say that they are running in 4.0 compatibility mode, so I don't 
>> understand this error. I did get the host setup by running `hosted-engine 
>> --deploy` and walking through the command line options. Alarmingly, I was 
>> warned that this is deprecated and will not be possible in oVirt 4.1. 
>> 
>> Any suggestions as to what I should do to sort out my issue?
>> 
>> Cheers,
>> Gervais
> Hi Gervais,
> 
> Have you imported hosted_storage into your environment. I have hit this 
> issue when i did not have hosted_storage domain and hosted_engine vm imported 
> into my setup. 
> 
> Thanks
> kasturi
>> 
>> 
>> 
>> 
>> 
>> ___
>> Users mailing list
>> Users@ovirt.org <mailto:Users@ovirt.org>
>> http://lists.phx.ovirt.org/mailman/listinfo/users 
>> <http://lists.phx.ovirt.org/mailman/listinfo/users>
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Hosted Engine won't deploy

2016-12-15 Thread Gervais de Montbrun
Hi Martin,

I do see the hosted_engine storage domain. Should it be listed as (Master)? If 
not, how can I force my "proper" Data domain to take over as master?


I also see my hosted engine showing up:


At some point I renamed it to match the name I am using for it, but now when I 
try to change settings on it, I get an error:



It's great that there will be GUI ability to setup a hosted engine. That's not 
great for me if they are not working -- which seems to be the case right now 
:-( and doubly so if the ability to make it work goes away on the command line.

I truly appreciate the help and hope there are more good suggestions coming my 
way.

Cheers,
Gervais



> On Dec 15, 2016, at 4:30 AM, Martin Sivak <msi...@redhat.com> wrote:
> 
>> I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data
>> Center say that they are running in 4.0 compatibility mode, so I don't
>> understand this error.
> 
> Do you see the hosted engine storage domain and the hosted engine VM
> in the webadmin? Both should be imported automatically on 3.6+
> compatibility level when a master storage domain is added to the
> system.
> 
>> Alarmingly, I was
>> warned that this is deprecated and will not be possible in oVirt 4.1.
> 
> We have a nice UI that allows to control the hosted engine deployment
> to additional hosts directly from the webadmin. You will be able to
> add a hosted engine capable host by just marking it as such in the Add
> host dialog.
> 
> --
> Martin Sivak
> SLA / oVirt
> 
> On Wed, Dec 14, 2016 at 11:05 PM, Gervais de Montbrun
> <gerv...@demontbrun.com> wrote:
>> Hi all,
>> 
>> I had to reinstall one of my hosts today and I noticed an issue. The error
>> message was:
>> 
>> Ovirt2:
>> 
>> Cannot edit Host. You are using an unmanaged hosted engine VM. Please
>> upgrade the cluster level to 3.6 and wait for the hosted engine storage
>> domain to be properly imported.
>> 
>> I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data
>> Center say that they are running in 4.0 compatibility mode, so I don't
>> understand this error. I did get the host setup by running `hosted-engine
>> --deploy` and walking through the command line options. Alarmingly, I was
>> warned that this is deprecated and will not be possible in oVirt 4.1.
>> 
>> Any suggestions as to what I should do to sort out my issue?
>> 
>> Cheers,
>> Gervais
>> 
>> 
>> 
>> 
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.phx.ovirt.org/mailman/listinfo/users
>> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Hosted Engine won't deploy

2016-12-14 Thread Gervais de Montbrun
Hi all,

I had to reinstall one of my hosts today and I noticed an issue. The error 
message was:

Ovirt2:
Cannot edit Host. You are using an unmanaged hosted engine VM. Please upgrade 
the cluster level to 3.6 and wait for the hosted engine storage domain to be 
properly imported.
I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data Center 
say that they are running in 4.0 compatibility mode, so I don't understand this 
error. I did get the host setup by running `hosted-engine --deploy` and walking 
through the command line options. Alarmingly, I was warned that this is 
deprecated and will not be possible in oVirt 4.1. 

Any suggestions as to what I should do to sort out my issue?

Cheers,
Gervais



___
Users mailing list
Users@ovirt.org
http://lists.phx.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Hosted Engine won't deploy

2016-12-19 Thread Gervais de Montbrun
Hi All,

Any ideas what is going on with this and thoughts on what I can do to correct 
this situation?

All help is appreciated!

Cheers,
Gervais



> On Dec 15, 2016, at 11:36 AM, Martin Sivak <msi...@redhat.com> wrote:
> 
> Hi,
> 
> Thanks for the info. The hosted engine domain should not be the master one 
> indeed. I will add some people that know the storage aspect better to the 
> thread to figure out how this can be solved.
> 
> Simone, Nir: Is there a way to force another domain to take over the master 
> role? I think the current situation is not how it should work, the hosted 
> engine storage should have never gotten the master storage duties.
> 
> Martin
> 
> On Thu, Dec 15, 2016 at 4:18 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hi Martin,
> 
> I do see the hosted_engine storage domain. Should it be listed as (Master)? 
> If not, how can I force my "proper" Data domain to take over as master?
> 
> 
> I also see my hosted engine showing up:
> 
> 
> At some point I renamed it to match the name I am using for it, but now when 
> I try to change settings on it, I get an error:
> 
> 
> 
> It's great that there will be GUI ability to setup a hosted engine. That's 
> not great for me if they are not working -- which seems to be the case right 
> now :-( and doubly so if the ability to make it work goes away on the command 
> line.
> 
> I truly appreciate the help and hope there are more good suggestions coming 
> my way.
> 
> Cheers,
> Gervais
> 
> 
> 
>> On Dec 15, 2016, at 4:30 AM, Martin Sivak <msi...@redhat.com 
>> <mailto:msi...@redhat.com>> wrote:
>> 
>>> I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data
>>> Center say that they are running in 4.0 compatibility mode, so I don't
>>> understand this error.
>> 
>> Do you see the hosted engine storage domain and the hosted engine VM
>> in the webadmin? Both should be imported automatically on 3.6+
>> compatibility level when a master storage domain is added to the
>> system.
>> 
>>> Alarmingly, I was
>>> warned that this is deprecated and will not be possible in oVirt 4.1.
>> 
>> We have a nice UI that allows to control the hosted engine deployment
>> to additional hosts directly from the webadmin. You will be able to
>> add a hosted engine capable host by just marking it as such in the Add
>> host dialog.
>> 
>> --
>> Martin Sivak
>> SLA / oVirt
>> 
>> On Wed, Dec 14, 2016 at 11:05 PM, Gervais de Montbrun
>> <gerv...@demontbrun.com <mailto:gerv...@demontbrun.com>> wrote:
>>> Hi all,
>>> 
>>> I had to reinstall one of my hosts today and I noticed an issue. The error
>>> message was:
>>> 
>>> Ovirt2:
>>> 
>>> Cannot edit Host. You are using an unmanaged hosted engine VM. Please
>>> upgrade the cluster level to 3.6 and wait for the hosted engine storage
>>> domain to be properly imported.
>>> 
>>> I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data
>>> Center say that they are running in 4.0 compatibility mode, so I don't
>>> understand this error. I did get the host setup by running `hosted-engine
>>> --deploy` and walking through the command line options. Alarmingly, I was
>>> warned that this is deprecated and will not be possible in oVirt 4.1.
>>> 
>>> Any suggestions as to what I should do to sort out my issue?
>>> 
>>> Cheers,
>>> Gervais
>>> 
>>> 
>>> 
>>> 
>>> ___
>>> Users mailing list
>>> Users@ovirt.org <mailto:Users@ovirt.org>
>>> http://lists.phx.ovirt.org/mailman/listinfo/users 
>>> <http://lists.phx.ovirt.org/mailman/listinfo/users>
>>> 
> 
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Hosted Engine won't deploy

2016-12-19 Thread Gervais de Montbrun
done
https://bugzilla.redhat.com/show_bug.cgi?id=1406169

Cheers,
Gervais



> On Dec 19, 2016, at 4:07 PM, Nir Soffer <nsof...@redhat.com> wrote:
> 
> On Thu, Dec 15, 2016 at 5:36 PM, Martin Sivak <msi...@redhat.com 
> <mailto:msi...@redhat.com>> wrote:
> Hi,
> 
> Thanks for the info. The hosted engine domain should not be the master one 
> indeed. I will add some people that know the storage aspect better to the 
> thread to figure out how this can be solved.
> 
> Simone, Nir: Is there a way to force another domain to take over the master 
> role? I think the current situation is not how it should work, the hosted 
> engine storage should have never gotten the master storage duties.
> 
> Yes, this is a bad bug on engine side, similar to the bug that let engine 
> deactivate
> this domain.
> 
> Please open urgent bug for this.
>  
> 
> Martin
> 
> On Thu, Dec 15, 2016 at 4:18 PM, Gervais de Montbrun <gerv...@demontbrun.com 
> <mailto:gerv...@demontbrun.com>> wrote:
> Hi Martin,
> 
> I do see the hosted_engine storage domain. Should it be listed as (Master)? 
> If not, how can I force my "proper" Data domain to take over as master?
> 
> 
> I also see my hosted engine showing up:
> 
> 
> At some point I renamed it to match the name I am using for it, but now when 
> I try to change settings on it, I get an error:
> 
> 
> 
> It's great that there will be GUI ability to setup a hosted engine. That's 
> not great for me if they are not working -- which seems to be the case right 
> now :-( and doubly so if the ability to make it work goes away on the command 
> line.
> 
> I truly appreciate the help and hope there are more good suggestions coming 
> my way.
> 
> Cheers,
> Gervais
> 
> 
> 
>> On Dec 15, 2016, at 4:30 AM, Martin Sivak <msi...@redhat.com 
>> <mailto:msi...@redhat.com>> wrote:
>> 
>>> I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data
>>> Center say that they are running in 4.0 compatibility mode, so I don't
>>> understand this error.
>> 
>> Do you see the hosted engine storage domain and the hosted engine VM
>> in the webadmin? Both should be imported automatically on 3.6+
>> compatibility level when a master storage domain is added to the
>> system.
>> 
>>> Alarmingly, I was
>>> warned that this is deprecated and will not be possible in oVirt 4.1.
>> 
>> We have a nice UI that allows to control the hosted engine deployment
>> to additional hosts directly from the webadmin. You will be able to
>> add a hosted engine capable host by just marking it as such in the Add
>> host dialog.
>> 
>> --
>> Martin Sivak
>> SLA / oVirt
>> 
>> On Wed, Dec 14, 2016 at 11:05 PM, Gervais de Montbrun
>> <gerv...@demontbrun.com <mailto:gerv...@demontbrun.com>> wrote:
>>> Hi all,
>>> 
>>> I had to reinstall one of my hosts today and I noticed an issue. The error
>>> message was:
>>> 
>>> Ovirt2:
>>> 
>>> Cannot edit Host. You are using an unmanaged hosted engine VM. Please
>>> upgrade the cluster level to 3.6 and wait for the hosted engine storage
>>> domain to be properly imported.
>>> 
>>> I am running oVirt 4.0.5 and have a hosted engine and Cluster and Data
>>> Center say that they are running in 4.0 compatibility mode, so I don't
>>> understand this error. I did get the host setup by running `hosted-engine
>>> --deploy` and walking through the command line options. Alarmingly, I was
>>> warned that this is deprecated and will not be possible in oVirt 4.1.
>>> 
>>> Any suggestions as to what I should do to sort out my issue?
>>> 
>>> Cheers,
>>> Gervais
>>> 
>>> 
>>> 
>>> 
>>> ___
>>> Users mailing list
>>> Users@ovirt.org <mailto:Users@ovirt.org>
>>> http://lists.phx.ovirt.org/mailman/listinfo/users 
>>> <http://lists.phx.ovirt.org/mailman/listinfo/users>
>>> 
> 
> 
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Re: global vs local maintenance with single host

2019-05-15 Thread Gervais de Montbrun
Hi Gianluca,

I forgot to mention that you need to ensure that systemd knows that the new 
file exists. You should likely run `systemctl daemon-reload` after 
creating/modifying your custom systemd files. You can see that the After 
directive is combined from both files. Check it out by running `systemctl show 
vdsmd.service | grep After`

It makes sense to make further changes to ensure that NFS stops last, but I 
haven't looked into that yet.
:-)

Cheers,
Gervais



> On Oct 3, 2016, at 7:22 AM, Gianluca Cecchi  wrote:
> 
> 
> Il 28/Set/2016 21:09, "Gervais de Montbrun"  <mailto:gerv...@demontbrun.com>> ha scritto:
> >
> > Hi Gianluca,
> >
> > Instead of editing the system's built in systemd configuration, you can do 
> > the following...
> >
> > Create a file called /etc/systemd/system/ovirt-ha-broker.service
> >
> >> # My custom ovirt-ha-broker.service config that ensures NFS starts before 
> >> ovirt-ha-broker.service
> >> # thanks Gervais for this tip!  :-)
> >>
> >> .include /usr/lib/systemd/system/ovirt-ha-broker.service
> >>
> >> [Unit]
> >> After=nfs-server.service
> >
> >
> > Then disable and enable ovirt-ha-broker.service (systemctl disable 
> > ovirt-ha-broker.service ; systemctl enable ovirt-ha-broker.service) and you 
> > should see that it is using your customized systemd unit definition. You 
> > can see that systemd is using your file by running systemctl status 
> > ovirt-ha-broker.service. You'll see something like "Loaded: loaded 
> > (/etc/systemd/system/ovirt-ha-broker.service;" in the output.
> >
> > Your file will survive updates and therefore always wait for nfs to start 
> > prior to starting. You can do the same for your other customizations.
> >
> > Cheers,
> > Gervais
> >
> >
> >
> >> On Sep 28, 2016, at 1:31 PM, Gianluca Cecchi  >> <mailto:gianluca.cec...@gmail.com>> wrote:
> >>
> >> On Sun, Sep 4, 2016 at 10:54 AM, Yedidyah Bar David  >> <mailto:d...@redhat.com>> wrote:
> >>>
> >>> On Sat, Sep 3, 2016 at 1:18 PM, Gianluca Cecchi
> >>> mailto:gianluca.cec...@gmail.com>> wrote:
> >>> > Hello,
> >>> > how do the two modes apply in case of single host?
> >>> > During an upgrade phase, after having upgraded the self hosted engine 
> >>> > and
> >>> > leaving global maintenance and having checked all is ok, what is the 
> >>> > correct
> >>> > mode then to put host if I want finally to update it too?
> >>>
> >>> The docs say to put hosts to maintenance from the engine before upgrading 
> >>> them.
> >>>
> >>> This is (also) so that VMs on them are migrated away to other hosts.
> >>>
> >>> With a single host, you have no other hosts to migrate VMs to.
> >>>
> >>> So you should do something like this:
> >>>
> >>> 1. Set global maintenance (because you are going to take down the
> >>> engine and its vm)
> >>> 2. Shutdown all other VMs
> >>> 3. Shutdown engine vm from itself
> >>> At this point, you should be able to simply stop HA services. But it
> >>> might be cleaner to first set local maintenance. Not sure but perhaps
> >>> this might be required for vdsm. So:
> >>> 4. Set local maintenance
> >>> 5. Stop HA services. If setting local maintenance didn't work, perhaps
> >>> better stop also vdsm services. This stop should obviously happen
> >>> automatically by yum/rpm, but perhaps better do this manually to see
> >>> that it worked.
> >>> 6. yum (or dnf) update stuff.
> >>> 7. Start HA services
> >>> 8. Check status. I think you'll see that both local and global maint
> >>> are still set.
> >>> 9. Set maintenance to none
> >>> 10. Check status again - I think that after some time HA will decide
> >>> to start engine vm and should succeed.
> >>> 11. Start all other VMs.
> >>>
> >>> Didn't try this myself.
> >>>
> >>> Best,
> >>> --
> >>> Didi
> >>
> >>
> >> Hello Didi,
> >> I would like to leverage the update I have to do on 2 small different lab 
> >> environments to crosscheck the steps suggested.
> >> They are both single host environments with self hosted engine.
> >> One is 4.0.2 and the other is 4.0.3. Both on CentoS 7.2
&g

[ovirt-users] Re: Ovirt and Vagrant

2021-01-05 Thread Gervais de Montbrun
Thanks for the feedback. Are you using ansible to launch the vm from the 
template, or to provision the template once it is up?

We have 15+ developers bringing up vm's for a variety of different environments 
(like 80) for different, custom configurations of client environments. Vagrant 
is really just to stand up (suspend, destroy) the vms and then puppet runs on 
them to apply custom configuration.

I noticed Terraform support. I suspect that it would be a ton of work for us to 
switch to it.

Cheers,
Gervais



> On Jan 5, 2021, at 12:33 AM, Strahil Nikolov via Users  
> wrote:
> 
> 
>> I wonder what other folks are using or if someone has any suggestions to 
>> offer.
> 
> I'm using Ansible do deploy some stuff from templates.
> I think that terraform is also used with oVirt, so you can give it a try.
> 
> Best Regards,
> Strahil Nikolov
> ___
> Users mailing list -- users@ovirt.org
> To unsubscribe send an email to users-le...@ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct: 
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives: 
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/MUM7MMOZTU54HSGAEOME7PDW4FMA7QQW/

___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/WSBBX7K6MUWA44KOR35FHFDI2PL6OM3Q/


[ovirt-users] Re: Ovirt and Vagrant

2021-01-06 Thread Gervais de Montbrun
I saw that yesterday!

Thanks for the help guys. I'm not a Ruby programmer, so have been poking at 
this on and off for weeks. It looks like the two most recent PR's may have 
solved my two most apparent problems with the plugin.

I want to test more functionality with vagrant (like suspend, halt, resume, 
etc), but 爛we are fully on track again.

Cheers,
Gervais



> On Jan 6, 2021, at 8:18 AM, Staniforth, Paul 
>  wrote:
> 
> Hello Gervais,
>   update from our developers
> 
> I've just checked the email chain, and it looks like it has since been fixed 
> up stream (https://github.com/myoung34/vagrant-ovirt4/pull/121 
> <https://eur02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fmyoung34%2Fvagrant-ovirt4%2Fpull%2F121=04%7C01%7CP.Staniforth%40leedsbeckett.ac.uk%7C990aea02d5ce4f8d3bc308d8b2354768%7Cd79a81124fbe417aa112cd0fb490d85c%7C0%7C0%7C637455289187570654%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000=HTXfDoq87MPBV1MHujlUjdSLVqkFoRSmZMaHVvFqgAU%3D=0>)
>  but perhaps not included in any gem release yet...
> 
> Regards,
> Paul S.
> From: Staniforth, Paul  <mailto:p.stanifo...@leedsbeckett.ac.uk>>
> Sent: 06 January 2021 10:59
> To: Gervais de Montbrun  <mailto:gerv...@demontbrun.com>>; users@ovirt.org <mailto:users@ovirt.org> 
> mailto:users@ovirt.org>>
> Subject: Re: [ovirt-users] Ovirt and Vagrant
>  
> Hello Gervais,
>  I have asked our developers who use the plugin and 
> here is the response.
> Regards, 
>  
> Paul S.
> 
> Hi Paul,
> 
> We have had to maintain our own changes to the oVirt Vagrant plugin. Probably 
> we need to start committing to and publishing our own fork.
> 
> Theres an obtuse and broken long one liner that get's the IP addresses, which 
> has always been a bit problematic, and stopped working altogether with the 
> newest release of oVirt we are using.
> 
> Anyway, here are the changes we made to get the IP addresses working with the 
> new versions of oVirt:
> 
> line 58 wait_till_up.rb
>   ip_addr = nil
>   vm_nics_service = server.nics_service
>   nics = vm_nics_service.list
>   nics.each { |nic|
> puts "Waiting on #{nic.mac.address}"
> vm_nics_service.nic_service(nic.id 
> <http://nic.id/>).reported_devices_service.list.each { |device|
>   device.ips&.each {|ip|
> puts "ip: #{ip.to_s}"
> if ip && ip.version == 'v4' && ip.address
>   ip_addr = ip.address
>   break
> end
>   }
> }
>   }
> 
> Also same in read_ssh.rb:
>  ip_addr = nil
>   vm_nics_service = server.nics_service
>   nics = vm_nics_service.list
>   nics.each { |nic|
> vm_nics_service.nic_service(nic.id 
> <http://nic.id/>).reported_devices_service.list.each { |device|
>   device.ips&.each {|ip|
> if ip && ip.version == 'v4' && ip.address
>   ip_addr = ip.address
>   break
> end
>   }
> }
>   }
> Sorry it's not the best way to share code, but feel free to pass this along.
> 
> Cheers,
> Cliffe.
> 
> 
> .
> 
> From: Gervais de Montbrun  <mailto:gerv...@demontbrun.com>>
> Sent: 04 January 2021 18:14
> To: users@ovirt.org <mailto:users@ovirt.org>  <mailto:users@ovirt.org>>
> Subject: [ovirt-users] Ovirt and Vagrant
>  
> Caution External Mail: Do not click any links or open any attachments unless 
> you trust the sender and know that the content is safe.
> Anyone using vagrant with oVirt 4.4?
> 
> I am trying to get the vagrant plugin working with oVirt, but I am 
> experiencing issues. https://github.com/myoung34/vagrant-ovirt4/issues/120 
> <https://eur02.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fmyoung34%2Fvagrant-ovirt4%2Fissues%2F120=04%7C01%7Cp.staniforth%40leedsbeckett.ac.uk%7C2489f8033a8f4f960d4e08d8b0e08d88%7Cd79a81124fbe417aa112cd0fb490d85c%7C0%7C0%7C637453825790913752%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000=zfmBH3M%2BqyVRMPsoXPSSwMONjDS64Mz6nOpFdjiENkc%3D=0>
> I wonder what other folks are using or if someone has any suggestions to 
> offer.
> 
> We are trying to switch from vSphere to oVirt for internal, developer vm's at 
> my office and we have a successful workflow using

[ovirt-users] Ovirt and Vagrant

2021-01-04 Thread Gervais de Montbrun
Anyone using vagrant with oVirt 4.4?

I am trying to get the vagrant plugin working with oVirt, but I am experiencing 
issues. https://github.com/myoung34/vagrant-ovirt4/issues/120 

I wonder what other folks are using or if someone has any suggestions to offer.

We are trying to switch from vSphere to oVirt for internal, developer vm's at 
my office and we have a successful workflow using vagrant-vsphere plugin. 
Switching to oVirt should have been a simple step, but the issue I an having is 
a blocker.

I built a single hyperconverged server to test with. Everything seems to work 
fine when I bring up vm's manually in my "cluster." Networking, console, etc 
all work fine. Ovirt-guest-additions work fine and display the vm's ip in the 
gui.

When I try with the vagrant plugin, there seems to be a communication issue. 
Even though I can see the vm reporting its IP address fine in the ovirt web 
gui, it seems like the vagrant plugin is unable to do the same. The maintainer 
of the code says that he no longer runs oVirt which is why I am reaching out 
here to see if anyone has a suggestion or even an alternative to suggest.

Cheers,
Gervais



___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/PZXIAAFXHJ42FAOA6F5ME25BD5W7DRVF/


[ovirt-users] Building new cluster today - OS recommendations

2021-02-16 Thread Gervais de Montbrun
Hi All,

I've been testing with a single server "cluster" running oVirt w/Gluster and am 
about to build my in house cluster using three Dell servers. I was planing on 
putting CentOS 8 on these servers and building oVirt on top. With the recent 
changes to CentOS changing to CentOS Stream, I have concerns. 

I'm tempted to build my cluster with CentOS 8 (as this is what I tested on) and 
assume that there will be some official word from oVirt as to how to proceed 
come end of year. Any suggestions from folks here? Any one know if there is 
anything out from oVirt crew already around this?

Cheers,
Gervais


___
Users mailing list -- users@ovirt.org
To unsubscribe send an email to users-le...@ovirt.org
Privacy Statement: https://www.ovirt.org/privacy-policy.html
oVirt Code of Conduct: 
https://www.ovirt.org/community/about/community-guidelines/
List Archives: 
https://lists.ovirt.org/archives/list/users@ovirt.org/message/6ZPWKPOKCC4MAVNHYJVSGN657BEANAIV/