Re: [ovirt-users] [ANN] oVirt 4.2.0 First Beta Release is now available for testing

2017-11-01 Thread Robert Story
On Tue 2017-10-31 19:57:32+0200 Oved wrote:
> As mentioned earlier, this is one motivation but not the only one.
> You see right click less and less in web applications, as it isn't
> considered a good user experience. This is also the patternfly
> guideline (patternfly is a framework we heavily use throughout the
> application).

Their user guideline is probably based on UI for the masses. I'd argue
that oVirt, particularly the admin portal, is for a much more
technical audience. I think right-click should stay for admin portal.

Users are more likely to be less technical. I'd care much less if
everything in the user portal had its own button or was in a menu list. 


-- 
Robert Story <http://www.isi.edu/~rstory>
USC Information Sciences Institute <http://www.isi.edu/>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Can't connect to my VMs' consoles using any method

2017-01-10 Thread Robert Story
On Tue, 10 Jan 2017 12:50:12 -0500 summit wrote:
SH> I have tried the SPICE HTML5 client, and I get an error: "WebSocket
SH> error: Can't connect to websocket on URL [snip] [object Event]" So I
SH> tried giving noVNC a go, but noDice: "Server disconnected (code: 1006)"
SH> 
SH> What I found on those errors is that maybe it's a cert issue, even
SH> though I've allowed the certs in both Firefox and Chromium browsers and
SH> it's worked in neither.
SH> 
SH> So I tried doing the native client, but remote-viewer gives me "Could
SH> not connect to [IP address]: No route to host"

Unless you've set up the engine as a proxy, wherever you're connecting from
must be able to reach the network that the hosts are on.

When I ran into this, I set my browser to save the .vv files instead of
running the remote-viewer client directly, and then I could look at the
contents of the .vv file to see which IP address it was trying to connect
to. I ended up creating a VPN tunnel from my laptop to a gateway machine
that could reach my internal ovirtmgmt network.

Robert

-- 
Senior Software Engineer @ Parsons


pgp8TG3hSWtVa.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Redirect login requests to FQDN

2017-01-09 Thread Robert Story
On Mon, 9 Jan 2017 19:35:13 + Michael wrote:
MW> After upgrading to ovirt-engine 4 some of our users are receiving a
MW> message as follows.
MW> 
MW> > The client is not authorized to request an authorization. It's required to
MW> > access the system using FQDN.  

I hit this annoying change as well. Create
ovirt-engine/engine.conf.d/99-alternate-engine-fqdns.conf and add e.g.

  SSO_ALTERNATE_ENGINE_FQDNS="ov.example.com"


Robert

-- 
Senior Software Engineer @ Parsons


pgpwMtTEbi3L5.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] unable to start VMs after upgrade

2017-01-08 Thread Robert Story
On Sun, 8 Jan 2017 18:15:27 -0800 Jim wrote:
JK> Just to be clear, the "proper" procedure for rebooting a host in oVirt is
JK> to put it in maintence mode, ssh to the node, issue the reboot, then after
JK> confirming its back up, right click on the node in the web UI and select
JK> "confirm node reboot", then take it out of maintence mode?

I think the 'confirm node reboot' step can be stepped if you put it into
maintenance before rebooting. I think it's needed when a host which was
running VMs gets hung and you need to force a reboot. Confirming the reboot
lets the engine know that those VMs are no longer running on that node. But
it certainly won't do any harm to do it anyways.

Robert

-- 
Senior Software Engineer @ Parsons


pgp2C0Nl6l7FR.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] VM Disks Locked

2017-01-08 Thread Robert Story
On Mon, 9 Jan 2017 00:44:57 +0200 Yaniv wrote:
YD> Please provide logs for the SPM and engine.

UI:

Dec 29, 2016 9:36:08 PM VM tmpl.iip.el7.x2 creation was initiated by 
admin@internal
Dec 29, 2016 9:42:01 PM VM tmpl.iip.el7.x2 creation has been completed.

At 2016-12-29 22:20:05 I tried to run the original VM, and was told that
images were still locked.

Engine log attached. Unfortunately the vdsmd logs don't go back that far..

I ended up solving my issue using unlock_entity.sh [1] to unlock the images.

I just tried to repeat the experiment, and of course it worked fine.

One bug that I found that seemed to be similar was
https://bugzilla.redhat.com/show_bug.cgi?id=1020909. In my case, the VM I
was trying to clone had just been restored from a snapshot, so a race
condition seems believable.


Robert

[1] /usr/share/ovirt-engine/setup/dbutils/unlock_entity.sh

-- 
Senior Software Engineer @ Parsons


engine.log-20170102.gz
Description: application/gzip


pgpDnmezxyDWd.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] unable to start VMs after upgrade

2017-01-08 Thread Robert Story
On Sat, 7 Jan 2017 15:02:10 -0800 Jim wrote:
JK> I went on about the work I came in to do, and tried to start up a VM.  It
JK> appeared to start, but it never booted.  It did  raise the CPU usage for
JK> that VM, but console was all black, no resize or anything.  Tried several
JK> settings.  This was on a VM I had just powered down.  I noticed it was
JK> starting the VM on engine3, so I did a runonce specifying the vm start on
JK> engine2.  Booted up just fine.  After booting, I could migrate to engine3,
JK> and all was good.
JK> 
JK> What happened?  I get no error messages, starting any vm on engine3, start
JK> paused, attaching display, then running it, I always get the same thing:
JK> blank console, about 50% cpu usage reported by the web interface, no
JK> response on any network, and by all signs available to me, no actual
JK> booting (reminds me of a PC that doesn't POST).  Simply changing the engine
JK> it starts on to one that has not been upgraded fixes the problem.

I had this issue too, except I had 100% cpu usage reported on the web
interface. have you rebooted the troublesome host since it was upgraded? I
think that was what solved it for me.


Robert

-- 
Senior Software Engineer @ Parsons


pgp_Okx6PlKtJ.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] VM Disks Locked

2017-01-05 Thread Robert Story
On Thu, 5 Jan 2017 14:35:33 + Clint wrote:
CB> I'm having trouble with a VM showing that the disk is locked after
CB> trying to clone that VM to a new VM. The source machine was powered off
CB> when the clone was initiated. The source and destination disks both
CB> show to be locked and are currently un-usable. This situation was
CB> brought on around December 27 2016 and persists.

I have the exact same issue, except I'm running 4.0.5.


Robert

-- 
Senior Software Engineer @ Parsons


pgpC5cfj5BGP0.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 4.0 web UI Session expired please try again

2017-01-04 Thread Robert Story
On Wed, 4 Jan 2017 16:17:09 -0500 Ravi wrote:
RN> A redirect to the login page from error page would be a more reasonable
RN> solution IMO.

That would still mean that I have to type in my login credential twice,
which is what I'm trying to avoid.


Robert

-- 
Senior Software Engineer @ Parsons


pgprRY0PYn5pI.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 4.0 web UI Session expired please try again

2017-01-04 Thread Robert Story
On Wed, 4 Jan 2017 14:40:06 -0500 Ravi wrote:
RN> With SSO the client sends the client secret to SSO which is stored in the
RN> session. Now when the clients session expires all the information including
RN> the client secret is lost when the session is purged by the application
RN> server.

Is the session expiration time configurable?

RN> 1. login to webadmin
RN> 2. Leave the session until session time out on engine and user is
RN> redirected to login page (the client id and secret are sent)
RN> 3. If user tries to login now everything will be fine but if user leaves
RN> and the session expires the session is purged, client secret is lost
RN> 4. User enters user name password on the screen after coming back. The
RN> login form does not have a session associated with it so the client and
RN> secret are not found and SSO needs to report that the session has expired
RN> and redirect user to welcome page.

So in step 4, can't it just start a new session instead of going to an
expiration page? Or show the page for a few seconds and then start a new
session? 

Or in step 2, set a refresh on the login page that still has a session so
that when the session expires it will redirect to a login screen that will
start a new session?



Robert

-- 
Senior Software Engineer @ Parsons


pgpTUCYHmj98k.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] 4.0 web UI Session expired please try again

2017-01-04 Thread Robert Story
Since I upgrade to 4.0, I get this annoying message when I try to log in
again after I've been away for a while. On 3.6 the ui would go to a login
screen after some period of inactivity, and I could log right back in. With
4.0, logging in after inactivity goes to a page with this message, and I
have to click to get a login page and then log in again. This is very
annoying. Is there a way to revert to the old behavior?


Robert

-- 
Senior Software Engineer @ Parsons


pgpGUaMXB9UrN.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] recovering from failed 3.6 -> 4.0 engine upgrade

2017-01-04 Thread Robert Story
On Wed, 4 Jan 2017 16:43:07 +0100 Simone wrote:
ST> engine-setup should be completely transactional and it should rollback by
ST> itself on failures; could you please share engine-setup logs from your
ST> failed attempt?

See  https://bugzilla.redhat.com/show_bug.cgi?id=1408493


Robert

-- 
Senior Software Engineer @ Parsons


pgpkxQbCrMXpq.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] recovering from failed 3.6 -> 4.0 engine upgrade

2017-01-04 Thread Robert Story
On Fri, 23 Dec 2016 13:58:09 -0500 Robert wrote:
RS> I'm trying to upgrade to 4.0 from 3.6. I dutify checked that my clusters
RS> were set to 3.6 compatibility and started the engine upgrade. unfortunately
RS> I forgot to check the unused Default cluser. :-/

So it turns out that it wasn't the cluster compatibility that I missed, but
that the data center compatibility level needed to be updated to 3.6 as
well. I just didn't read the error message closely enough.

RS> So the upgrade failed, but the 4.0 rpms were installed, so the engine won't
RS> start. How can I fix the compat level (or delete Default cluster)?

For the record, I ended up using 'yum history' to roll back the updated
packages, and was then able to start the engine, change the datacenter to
3.6, and re-run setup successfully.



Robert

-- 
Senior Software Engineer @ Parsons


pgpTZQeWlIgMj.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Current status of 4.0.6 | EL7.3?

2017-01-01 Thread Robert Story
On Sun, 1 Jan 2017 20:27:21 +0100 Michal wrote:
MS> >  Or will qemu-kvm-common-ev-2.6.0 get
MS> > released in the ovirt-release40 repo sometime soon?  (I'm glad I haven't
MS> > updated yet!)  
MS> 
MS> It wouldn't let you upgrade the host due to the dependency so
MS> hopefully nothing would break, but indeed 7.3 needs libvirt-2.0 and
MS> qemu-kvm-ev-1.6

That's a typo, right? I htink you meant qemu-kvm-ev-2.6.

Robert

-- 
Senior Software Engineer @ Parsons


pgpnnoNXZZGcT.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Current status of 4.0.6 | EL7.3?

2017-01-01 Thread Robert Story
On Fri, 30 Dec 2016 21:29:17 -0500 Derek wrote:
DA> Is this the official response from the ovirt team, to use the
DA> centos-release-qemu-ev repo?  Or will qemu-kvm-common-ev-2.6.0 get
DA> released in the ovirt-release40 repo sometime soon?  (I'm glad I haven't
DA> updated yet!)

I read it here:

 From: Sandro Bonazzola 
 To: users 
 Subject: Re: [ovirt-users] [HEADS UP] CentOS 7.3 is rolling out, need 
qemu-kvm-ev 2.6
 Date: Tue, 13 Dec 2016 08:43:15 +0100
 On Tue, 13 Dec 2016 08:43:15 +0100 Sandro wrote:
SB> On Mon, Dec 12, 2016 at 6:38 PM, Chris Adams  wrote:
SB> 
SB> > Once upon a time, Sandro Bonazzola  said:  
SB> > > In terms of ovirt repositories, qemu-kvm-ev 2.6 is available right now 
in
SB> > > ovirt-master-snapshot-static, ovirt-4.0-snapshot-static, and  
SB> > ovirt-4.0-pre  
SB> > > (contains 4.0.6 RC4 rpms going to be announced in a few minutes.)  
SB> >
SB> > Will qemu-kvm-ev 2.6 be added to any of the oVirt repos for prior
SB> > versions (such as 3.5 or 3.6)?
SB> 
SB> You can enable CentOS Virt SIG repo by running "yum install
SB> centos-release-qemu-ev" on your CentOS 7 systems.
SB> and you'll have updated qemu-kvm-ev.




Robert

-- 
Senior Software Engineer @ Parsons


pgpR7pWadnHU0.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Current status of 4.0.6 | EL7.3?

2016-12-29 Thread Robert Story
On Thu, 29 Dec 2016 15:32:07 -0500 Derek wrote:
DA> Hi,
DA> 
DA> What is the current status of Ovirt 4.0.6 and EL7.3?  From previous
DA> threads it seemed to me that there was a potential compatibility issue
DA> with the 7.3 kernel and an updated version of vdsm or qemu?  I just want
DA> to ensure any potential issues have been cleared up before I upgrade my
DA> systems.
DA> 
DA> Thanks,
DA> 
DA> -derek
DA> 

I think you need to enable CentOS Virt SIG repo to get the latest qemu-kvm:

# yum list qemu-kvm-common\*
qemu-kvm-common.x86_64   10:1.5.3-126.el7   base

# yum install http://resources.ovirt.org/pub/yum-repo/ovirt-release40.rpm
# yum -C list qemu-kvm-common\*
qemu-kvm-common.x86_64 10:1.5.3-126.el7base 
qemu-kvm-common-ev.x86_64  10:2.3.0-31.el7.16.1ovirt-4.0

# yum install centos-release-qemu-ev
# yum list qemu-kvm-common\*
qemu-kvm-common.x86_64   10:1.5.3-126.el7 base
qemu-kvm-common-ev.x86_6410:2.6.0-27.1.el7centos-qemu-ev

That worked for me earlier this week.


Robert

-- 
Senior Software Engineer @ Parsons


pgp2DWsMRsLUl.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt 3.6.7 upgrade to 4.0.5 and CentOS 7.3

2016-12-29 Thread Robert Story
On Wed, 28 Dec 2016 09:39:16 +0200 Yaniv wrote:
YK> On Dec 27, 2016 10:09 PM, "Robert Story"  wrote:
YK> >Just as a counterpoint, I had 2 issues, but both were related to changes
YK> >I'd made..
YK> >
YK> >- I have ipv6 disabled. with CentOS 7.3 this can cause rpcbind to choke,
YK> >  causing cascading dependency failures. I had to manually tweak systemd
YK> >  config.
YK> 
YK> Did you open a bug about it, on rpcbind?

There was an existing bug. Unfortunately it's another case of people just
accepting that systemd can break what ever it wants and folks just have to
'get over it'.

  https://bugzilla.redhat.com/show_bug.cgi?id=1402961

YK> - I had tweaked vdsm.conf for debugging a long time ago, so the new version
YK>   installed it's conf file as vsdm.conf.rpmnew. New vdsm uses a different
YK>   logger, and wouldn't start with the old logger config. No error message,
YK>   0 byte log file. Copying rpmnew conf file over existing log files fixed
YK>   this one.
YK> 
YK> That is somewhat expected.

It is a little distressing that it fails with no indication of why it's
failing. It would be nice if something caught stderr to be able to log it.


I also forgot about my 3rd issue - the GUI 'broke' because I had a CNAME
for ovirt.local to ovirt-engine.local. Luckily I don't have a lot of users
to retrain to use a different domain name.


Robert

-- 
Senior Software Engineer @ Parsons


pgpO_2JrcMf1O.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt 3.6.7 upgrade to 4.0.5 and CentOS 7.3

2016-12-27 Thread Robert Story
On Tue, 27 Dec 2016 20:04:23 +0100 ov...@timmi.org wrote:
OO> Hi all,
OO> 
OO> please note that I have upgrade my oVirt 3.6.5 environment to 4.0.5 
OO> without problems.
OO> I saw some network messages in my evens but I will post them later in a 
OO> separate thread.
OO> 
OO> Really great work how the upgrades are working.

Just as a counterpoint, I had 2 issues, but both were related to changes
I'd made..

- I have ipv6 disabled. with CentOS 7.3 this can cause rpcbind to choke,
  causing cascading dependency failures. I had to manually tweak systemd
  config.

- I had tweaked vdsm.conf for debugging a long time ago, so the new version
  installed it's conf file as vsdm.conf.rpmnew. New vdsm uses a different
  logger, and wouldn't start with the old logger config. No error message,
  0 byte log file. Copying rpmnew conf file over existing log files fixed
  this one.


Robert

-- 
Senior Software Engineer @ Parsons


pgpPwIMDU1e6O.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] recovering from failed 3.6 -> 4.0 engine upgrade

2016-12-23 Thread Robert Story
Hello,

I'm trying to upgrade to 4.0 from 3.6. I dutify checked that my clusters
were set to 3.6 compatibility and started the engine upgrade. unfortunately
I forgot to check the unused Default cluser. :-/

So the upgrade failed, but the 4.0 rpms were installed, so the engine won't
start. How can I fix the compat level (or delete Default cluster)?

I found this message
 that suggests
that the engine should be startable if only the update rpms were installed,
but part of the install procedure updates all the rpms, including the
engine.

I created this bug:

  https://bugzilla.redhat.com/show_bug.cgi?id=1408493


Any help on how to proceed appreciated, either here or on IRC..



Robert

-- 
Senior Software Engineer @ Parsons


pgpzKfcZ3Mkd0.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Hook to add firewall rules

2016-11-22 Thread Robert Story
On Tue, 22 Nov 2016 10:56:50 +0200 Yedidyah wrote:
YBD> On Mon, Nov 21, 2016 at 9:45 PM, Claude Durocher
YBD>  wrote:
YBD> > Ok, i've configured my custom iptable rules with "engine-config --get
YBD> > IPTablesConfigSiteCustom" on the engine. Now, how do I apply this on 
already
YBD> > deployed nodes?  
YBD> 
YBD> Move to maintenance, reinstall?
YBD> 
YBD> I do not think there is another way. But I also do not think oVirt
YBD> will overwrite your conf by any other process, so you can also simply
YBD> do this manually. Didn't try this myself.

I seem to recall the engine-config option being added because engine would
overwrite iptables config on every upgrade.


Robert

-- 
Senior Software Engineer @ Parsons


pgpG5EOUzg3nX.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Move a VM between 2 setups

2016-11-15 Thread Robert Story
On Mon, 14 Nov 2016 19:42:46 + Daniel wrote:
BD> An “export domain” is made for just this purpose. Create a NFS (version 3) 
share and make it accessible to the hypervisors for each engine. (It should be 
a dedicated NFS share, not used for anything else.) As I recall it should be 
owned by vdsm:vdsm (36:36). In one of the engines (doesn’t matter which), in 
the web admin page go to Storage and add a new NFS based export domain, using 
the NFS share you created. Once it’s activated, test it out; try right-clicking 
on a VM to “export” it.
BD> 
BD> Note that there can only be one engine connected to a given export domain 
at any one time. When you’re done testing the export domain on the first 
engine, you need to put it into “maintenance” and ultimately “detach” it.
BD> 
BD> Then go to the other engine, and this time under Storage instead of “new 
domain” click “import domain” and enter the same NFS share information. It 
should recognize that you already have an export domain setup under that NFS 
share.  Attach and activate it, and under Storage /   
/ VM Import, try importing the VM you had previously exported.

What I did in the past was to create a nfs share for each engine in
subdirectories on the same filesystem (eg /srv/ovirt/export/beta,
/srv/ovirt/export/production). After exporting from one, log in the the nfs
machine and move (or cp -al) the exported domain from one to the other.
It's pretty much instantaneous since it's the same file system. Saves the
hassle of the disconnect/reconnect.


Robert

-- 
Senior Software Engineer @ Parsons


pgpXcCviDGxLk.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Time synchronization in guest OS for Hosted Engine VM (for Kerberos)

2016-09-28 Thread Robert Story
On Tue, 27 Sep 2016 13:22:29 -0400 (EDT) Michal wrote:
MS> > On 27 Sep 2016, at 19:12, aleksey.maksi...@it-kb.ru wrote:
MS> > 
MS> > I'm afraid that in the future OS time may get out of sync because of 
kvm-clock
MS> > And as a result Kerberos may stop working
MS> > I hope I explained clearly  
MS> 
MS> Sorry, not really. You said you set up ntpd/chrony correctly. So how can 
the time get out of sync? Why do you think it can be because of kvmclock 
anyway? Do you refer to some specific bug?

I'd guess that it's a misunderstanding of what kvmclock is. Someone
guessing based on the name might think that it keeps the vm time in sync
with the host. Which might lead one to think it would conflict with ntp
(two different things trying to manage time).

If you know that kvmclock is essentially just a way to monitor the passage
of time (tick-tock-tick-tock) using the host's timer, then it makes sense
that you need also need ntp to tweak the current time to adjust for the
minor drift inherit in any clock.


Robert

-- 
Senior Software Engineer @ Parsons


pgpFr50IROpBx.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Solved: Re: Failed to connect Host to the Storage Domains hosted_storage.

2016-07-22 Thread Robert Story
On Fri, 22 Jul 2016 18:21:15 +0200 Simone wrote:
ST> On Fri, Jul 22, 2016 at 4:48 PM, Ralf Schenk  wrote:
ST> 
ST> > Hello,
ST> >
ST> > I also see from the logs that all your Storage-Domains that work are
ST> > mounted as nfsVersion='V4' but ovirt-nfs.netsec:/ovirt/hosted-engine is
ST> > mounted as nfsVersion='null'.
ST> >  
ST> 
ST> Hi Robert,
ST> unfortunately Ralf is right: I reproduced the issue.
ST> 
ST> The auto-import procedure for the hosted-engine storage domain ignores the
ST> nfsVersion parameter and so we don't have a value for that in the engine DB.
ST> On hosted-engine hosts, the agent mounts the hosted-engine storage domain
ST> before the engine and so everything is fine since the agent knows that it's
ST> nfsv4.
ST> 
ST> The issue comes with the hosts of that datacenter not involved in
ST> hosted-engine: in this case the engine simply tries to mount without
ST> the nfsVersion parameter and so, if the NFS server cannot be access over
ST> nfsv3, the mount could fail and the host will be declared as not operation.
ST> 
ST> I opened a ticket to track it:
ST> https://bugzilla.redhat.com/show_bug.cgi?id=1359265
ST> 
ST> If you need a quick fix you can:
ST> - fix the configuration of your storage server to allow it to be accessed
ST> also over nfsv3
ST> - edit the configuration of the storage connection in the engine DB on the
ST> engine VM to add the missing parameter. Something like:
ST>  # sudo -u postgresl psql
ST>  \c engine;
ST>  select * from storage_server_connections;
ST>  UPDATE storage_server_connections SET nfs_version = '4' WHERE connection =
ST> 'ovirt-nfs.netsec:/ovirt/hosted-engine';
ST>  commit;
ST>  select * from storage_server_connections;

Thanks for that workaround. I've added it to the bugzilla. The easy manual
workaround I tried was:

# mkdir /rhev/data-center/mnt/ovirt-nfs.localdomain:_ovirt_hosted-engine

# /usr/bin/mount -t nfs -o soft,nosharecache,timeo=600,retrans=6,nfsvers=4 \
  ovirt-nfs.localdomain:/ovirt/hosted-engine \
  /rhev/data-center/mnt/ovirt-nfs.netsec:_ovirt_hosted-engine

which got the hosts operational again.

Thanks for all the help!


Robert

-- 
Senior Software Engineer @ Parsons


pgp0ZZWfGusFe.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Failed to connect Host to the Storage Domains hosted_storage.

2016-07-22 Thread Robert Story
Hello,

I'm in the process of upgrading from 3.5.x to 3.6.x. My hosted engine and
hosts in the primary cluster are all upgraded and appear to be running fine.

I have a second cluster of 2 machines which are just regular hosts, without
the hosted-engine. Both have been marked non-operational, with the
following messages logged about every 5 minutes:


Failed to connect Host perses to Storage Pool Default

Host perses cannot access the Storage Domain(s) hosted_storage attached to the 
Data Center Default. Setting Host state to Non-Operational.

Host perses reports about one of the Active Storage Domains as Problematic.

Failed to connect Host perses to Storage Servers

Failed to connect Host perses to the Storage Domains hosted_storage.


I could see the normal storage/iso/export domains mounted on the host, and
the VMs running on the host are fine.

I shut down the VMs on one host, put it in maintenance mode, installed 3.6
repo and ran yum update. All went well, but when I activated the host, same
deal.

I've attached the engine log snippet for the activation attempt.

Robert

-- 
Senior Software Engineer @ Parsons


engine.log-0722
Description: Binary data


pgphIjB2LDVYl.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Solved: Re: 3.5 to 3.6 upgrade stuck

2016-07-21 Thread Robert Story
On Thu, 21 Jul 2016 16:04:41 -0400 Robert wrote:
RS> 
Thread-1::config::278::ovirt_hosted_engine_ha.broker.notifications.Notifications.config
RS>  ::(refresh_local_conf_file) local conf file was correctly written
RS> 
RS> And then  nothing. It just hangs. Nothing more is logged Thread-1.

So I started digging around the the python source, starting from
refresh_local_conf_file. I ended up in ./broker/notifications.py, in
send_email. I added some logging:

def send_email(cfg, email_body):
"""Send email."""

logger = logging.getLogger("%s.Notifications" % __name__)

try:
logger.debug(" setting up smtp 1")
server = smtplib.SMTP(cfg["smtp-server"], port=cfg["smtp-port"])
logger.debug(" setting up smtp 2")
...

Now the final messages are:

Thread-1::DEBUG::2016-07-21 21:35:05,280::config::278::
  ovirt_hosted_engine_ha.broker.notifications.Notifications.config::
  (refresh_local_conf_file) local conf file was correctly written
Thread-1::DEBUG::2016-07-21 21:35:05,282::notifications::27::
  ovirt_hosted_engine_ha.broker.notifications.Notifications::
  (send_email)  setting up smtp 1


So the culprit is:

server = smtplib.SMTP(cfg["smtp-server"], port=cfg["smtp-port"])

Note that this does actually send the email - 2 minutes later.

So I tried:

  $ telnet localhost 25
  Trying ::1...

which hung, and a little bell went off in my brain...

After changing /etc/hosts from:

127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6

to

127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost6 localhost6.localdomain6

localhost resolves to 127.0.0.1, the delay is gone, and everything is fine.

I don't want to update /etc/hosts on each host. Is there somewhere I can
edit the broker config for mail?


Robert

-- 
Senior Software Engineer @ Parsons


pgpdH5_HtiAx1.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5 to 3.6 upgrade stuck

2016-07-21 Thread Robert Story
On Thu, 21 Jul 2016 14:43:50 -0400 Robert wrote:
RS> So after some debugging with Simone on irc, we've determined that the issue
RS> is the agent timing out trying to communicate with the broker. The problem
RS> is that we have no idea why.

So more detail attached. The agent is sending:

   
MainThread::hosted_engine::436::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine
  ::(start_monitoring) Processing engine state 
 
MainThread::brokerlink::111::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink
  ::(notify) Trying: notify time=1469129518.85 type=state_transition 
detail=StartState-ReinitializeFSM hostname='poseidon.netsec' 
MainThread::brokerlink::273::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink
  ::(_communicate) Sending request: notify time=1469129518.85 
type=state_transition detail=StartState-ReinitializeFSM 
hostname='poseidon.netsec'

Which the broker sees:

Thread-1::util::69::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler
 ::(socket_readline) socket_readline in blocking mode
Thread-1::listener::163::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler
 ::(handle) Input: notify time=1469129518.85 type=state_transition 
detail=StartState-ReinitializeFSM hostname='poseidon.netsec'

It then refreshes the local config file:

Thread-1::config::251::ovirt_hosted_engine_ha.broker.notifications.Notifications.config
 ::(refresh_local_conf_file) Reading 'broker.conf' from 
'/rhev/data-center/mnt/ovirt-nfs.netsec:_ovirt_hosted-engine/2daba0ab-2b3d-4026-bcfc-1cd071c30038/images/a04a45b9-e780-4104-ad4b-d5901a5490c4/34a7

Which succeeds:

Thread-1::config::271::ovirt_hosted_engine_ha.broker.notifications.Notifications.config
 ::(refresh_local_conf_file) Writing to 
'/var/lib/ovirt-hosted-engine-ha/broker.conf'
Thread-1::config::278::ovirt_hosted_engine_ha.broker.notifications.Notifications.config
 ::(refresh_local_conf_file) local conf file was correctly written

And then  nothing. It just hangs. Nothing more is logged Thread-1.



Robert

-- 
Senior Software Engineer @ Parsons



Robert

-- 
Senior Software Engineer @ Parsons

MainThread::DEBUG::2016-07-21 15:31:58,847::hosted_engine::436::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(start_monitoring) Processing engine state 
MainThread::INFO::2016-07-21 15:31:58,847::brokerlink::111::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(notify) Trying: notify time=1469129518.85 type=state_transition detail=StartState-ReinitializeFSM hostname='poseidon.netsec'
MainThread::DEBUG::2016-07-21 15:31:58,847::brokerlink::273::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(_communicate) Sending request: notify time=1469129518.85 type=state_transition detail=StartState-ReinitializeFSM hostname='poseidon.netsec'
MainThread::DEBUG::2016-07-21 15:31:58,848::util::77::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(socket_readline) socket_readline with 30.0 seconds timeout
MainThread::DEBUG::2016-07-21 15:32:28,866::util::88::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(socket_readline) Connection timeout while reading from socket
MainThread::ERROR::2016-07-21 15:32:28,867::brokerlink::279::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(_communicate) Connection closed: Connection timed out
MainThread::DEBUG::2016-07-21 15:32:28,867::brokerlink::86::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(disconnect) Closing connection to ha-broker
MainThread::ERROR::2016-07-21 15:32:28,867::agent::205::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent) Error: 'Failed to start monitor state_transition, options {'hostname': 'poseidon.netsec'}: Connection timed out' - trying to restart agent


Thread-1::util::69::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler
 ::(socket_readline) socket_readline in blocking mode
Thread-1::listener::163::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler
 ::(handle) Input: notify time=1469129518.85 type=state_transition detail=StartState-ReinitializeFSM hostname='poseidon.netsec'
Thread-1::listener::238::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler
 ::(_dispatch) Request type notify from 139793244509952
Thread-1::notifications::46::ovirt_hosted_engine_ha.broker.notifications.Notifications
 ::(notify) nofity: {'hostname': 'poseidon.netsec', 'type': 'state_transition', 'detail': 'StartState-ReinitializeFSM', 'time': '1469129518.85'}
Thread-1::config::251::ovirt_hosted_engine_ha.broker.notifications.Notifications.config
 ::(refresh_local_conf_file) Reading 'broker.conf' from '/rhev/data-center/mnt/ov

Re: [ovirt-users] 3.5 to 3.6 upgrade stuck

2016-07-21 Thread Robert Story
So after some debugging with Simone on irc, we've determined that the issue
is the agent timing out trying to communicate with the broker. The problem
is that we have no idea why.

Thread-942::INFO::2016-07-21
09:19:51,934::listener::134::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(setup)
Connection established Thread-942::INFO::2016-07-21
09:19:51,936::listener::186::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle)
Connection closed Thread-943::INFO::2016-07-21
09:19:53,099::listener::134::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(setup)
Connection established Thread-943::INFO::2016-07-21
09:19:53,554::listener::186::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle)
Connection closed

Thread-135::DEBUG::2016-07-21
09:47:34,941::util::69::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(socket_readline)
socket_readline in blocking mode Thread-135::DEBUG::2016-07-21
09:47:34,941::util::99::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(socket_readline)
Connection closed while reading from socket


So I tried to reinstall instead:

- host -> maintenance
- host removed from cluster
- yum remove ovirt\*
- yum install ovirt-hosted-engine-setup
- hosted-engine --deploy
  - chose new node id
  - reused same name/hostname

Once the host activated, it went right back to the same state.

I'm open to any suggestions to get me back on track. The engine is at
3.6.7, but functioning hosts are still at 3.5.x. Should I try to upgrade
the engine and a host to 4.0.x? I had planned on having a stable 3.6 system
for a few days before trying to jump to 4.0. Or is there some way to go
back to 3.5?


Robert

-- 
Senior Software Engineer @ Parsons


pgpayyu94ifIL.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] direct mounting of vm disks?

2016-06-27 Thread Robert Story
So as (bad) luck would have it, I needed to restore an older backup that
the one I initially used. This time I decided to dig deeper into this
possibility, and got it working. libguestfs took care of all the dirty work,
even with dealing with snapshots and thin provisioning. Here's what I did:

- yum install libguestfs libguestfs-tools-c
- find the disk id
  - via GUI 'Disks' tab, search for VM in 'attached to' column, copy disk ID
- shut down vm
- on nfs server
  - switch to the directory for the ID found above
- e.g. cd /srv/ovirt/storage/*/images/$ID/
  - find latest modified file: ls -lt
  - mount it: guestmount -a $last_modified_file -m /dev/vg_el7/root /mnt/tmp
  - start restore
- cd /mnt/tmp
- ionice -c 3 tar xvf /tmp/backup.tar

Average load on my NFS server for about 12 VMs is generally less than 1.
During the initial restore over the weekend, run on the VM, restore took 5
hours and the load climbed up to 16. I'm running the new restore now, and
the load is holding steady at just under 2.

Robert


On Sun, 26 Jun 2016 12:27:24 + Scott wrote:
S> Hi Robert,
S> 
S> I've modified disk images for oVirt virtual machines before.  I mostly run
S> servers so they all use preallocated as opposed to thin provisioned disks.
S> I'm not sure if that matters, but it means my VM disk images are raw files
S> (as opposed to qcow).  Therefore, I used something like the following guide
S> to get into the disk image.  These directions don't use kpartx or
S> libguestfs, but I guess those would work too.  I think it goes without
S> saying, your VM should be off before you modify its disk without its
S> knowledge.
S> 
S> 
https://major.io/2010/12/14/mounting-a-raw-partition-file-made-with-dd-or-dd_rescue-in-linux/
S> 
S> For me, the biggest problem I had was the partition in the disk image was
S> really an LVM PV.  And that LVM group had the same volume group name as one
S> on the server I was doing this modification.  I had two volume groups with
S> the same name, which made things a little tricky.  But I'll leave that to
S> you to figure out :)
S> 
S> Scott
S> 
S> On Sat, Jun 25, 2016 at 11:41 PM Robert Story  wrote:
S> 
S> > Hello,
S> >
S> > I'm using oVirt 3.5.x w/nfs for vm file storage. I'm trying to restore a vm
S> > from backup, which entails:
S> >
S> >  - scp backup.tar to vm
S> >  - untar backup on vm
S> >
S> > this means all the data makes 3 trips over the network, each of which
S> > causes a load spike on my nfs server. That nfs load, of course, affects all
S> > other vms.
S> >
S> > what I'd like to be able to do is
S> >
S> >  - scp backup.tar to nfs server
S> >  - stop vm
S> >  - mount vm disks on nfs server
S> >  - untar backup on nfs server (using ionice to minimze load impact)
S> >  - unmount vm disks
S> >  - start vm
S> >
S> > I remember that I used to use kpartx to mount regular KVM disks, so I'm
S> > hoping that it can also be done here. Anyone else tried to make this work?
S> >
S> >
S> > Robert
S> >
S> > --
S> > Senior Software Engineer @ Parsons
S> > ___
S> > Users mailing list
S> > Users@ovirt.org
S> > http://lists.ovirt.org/mailman/listinfo/users
S> >  



Robert

-- 
Senior Software Engineer @ Parsons


pgp2T1dW2bHDe.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] direct mounting of vm disks?

2016-06-25 Thread Robert Story
Hello,

I'm using oVirt 3.5.x w/nfs for vm file storage. I'm trying to restore a vm
from backup, which entails:

 - scp backup.tar to vm
 - untar backup on vm

this means all the data makes 3 trips over the network, each of which
causes a load spike on my nfs server. That nfs load, of course, affects all
other vms.

what I'd like to be able to do is

 - scp backup.tar to nfs server
 - stop vm
 - mount vm disks on nfs server
 - untar backup on nfs server (using ionice to minimze load impact)
 - unmount vm disks
 - start vm

I remember that I used to use kpartx to mount regular KVM disks, so I'm
hoping that it can also be done here. Anyone else tried to make this work?


Robert

-- 
Senior Software Engineer @ Parsons


pgpVh26sSZWLP.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] latest CentOS libvirt updates safe?

2016-06-25 Thread Robert Story
I have oVirt 3.5.x on CentOS 7 hosts. These hosts have updates which
include livbirt:

 libvirt-client   x86_64  1.2.17-13.el7_2.5  updates  4.3 M
 libvirt-daemon   x86_64  1.2.17-13.el7_2.5  updates  585 k
 libvirt-daemon-config-nwfilter   x86_64  1.2.17-13.el7_2.5  updates  122 k
 libvirt-daemon-driver-interface  x86_64  1.2.17-13.el7_2.5  updates  162 k
 libvirt-daemon-driver-networkx86_64  1.2.17-13.el7_2.5  updates  302 k
 libvirt-daemon-driver-nodedevx86_64  1.2.17-13.el7_2.5  updates  161 k
 libvirt-daemon-driver-nwfilter   x86_64  1.2.17-13.el7_2.5  updates  185 k
 libvirt-daemon-driver-qemu   x86_64  1.2.17-13.el7_2.5  updates  571 k
 libvirt-daemon-driver-secret x86_64  1.2.17-13.el7_2.5  updates  155 k
 libvirt-daemon-driver-storagex86_64  1.2.17-13.el7_2.5  updates  328 k
 libvirt-daemon-kvm   x86_64  1.2.17-13.el7_2.5  updates  118 k
 libvirt-lock-sanlock

Is it safe to let yum update these packages while the host has running VMs?
in maintenance mode? or not at all?


Robert

-- 
Senior Software Engineer @ Parsons


pgpkSKIC8GM2W.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Terribly slow web consile post migration of oVirt Engine from one host to another host

2016-06-08 Thread Robert Story
On Wed, 8 Jun 2016 12:15:20 -0400 Brett wrote:
BIH> > I was able to migrate the oVirt Engine from one host to another using 
BIH> > engine-backup utility. On the new host, the data is loaded properly 
BIH> > but the Administration Portal is terribly slow.
BIH> >
BIH> > It take about 2 to 3 minutes to allow me to enter user name & 
BIH> > password.
BIH> [...]
BIH> What version of oVirt are you running.  Before 3.6.5 there was an issue 
BIH> because of the generation of random numbers.  Many hosts and VMs don't 
BIH> have much mouse and keyboard activity so it  takes forever to generate 
BIH> the entropy needed for the connection process.  I was told to check the 
BIH> entropy level with
BIH> 
BIH> cat /proc/sys/kernel/random/entropy_avail
BIH> 
BIH> I installed haveged and that fixed it but I'm now on 3.6.5.

I was having this issue on 3.5.x and thought it was because I didn't have
enough memory. Checked my entropy, and it was less than 200. Installed
haveged on my hosted-engine, and now entropy is over 2000 and it takes less
than 10 seconds to log in. Thank you!


Robert

-- 
Senior Software Engineer @ Parsons


pgpscOnYhcThr.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt 3.5 and SSLv3

2016-04-25 Thread Robert Story
On Sun, 24 Apr 2016 21:37:07 +0200 Piotr wrote:
PK> Looking at the info you pasted I see:
PK> "java.net.NoRouteToHostException: No route to host".
PK> It usually mean that there is/was something wrong with your network.

I saw that too, and tried pings first. Those worked fine, and the
re-install worked right away after I made the java sslv3 change.

I'm going to reinstall and move a host from a different lab. We'll see if I
have the same experience with it...

Robert

PK> On Wed, Apr 20, 2016 at 3:28 PM, Robert Story  wrote:
PK> > On Wed, 20 Apr 2016 08:52:49 -0400 Alexander wrote:  
PK> > AW> On Wednesday, April 20, 2016 08:39:14 AM Robert Story wrote:  
PK> > AW> > Yesterday I had to re-install a host node in my 3.5.6 cluster. 
After a fresh
PK> > AW> > install of CentOS 7.2, attempts to re-install failed, as did 
removing and
PK> > AW> > re-adding the node. Here is a log excerpt from the engine:
PK> > AW> >
PK> > AW> > [...]
PK> > AW> > [org.ovirt.engine.core.vdsbroker.VdsManager]
PK> > AW> > (DefaultQuartzScheduler_Worker-38) Host eclipse is not responding. 
It will
PK> > AW> > stay in Connecting state for a grace period of 120 seconds and 
after that
PK> > AW> > an attempt to fence the host will be issued. 2016-04-19 
18:22:01,938 ERROR
PK> > AW> > [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo]
PK> > AW> > (DefaultQuartzScheduler_Worker-38) Failure to refresh Vds runtime 
info:
PK> > AW> > org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException:
PK> > AW> > java.net.NoRouteToHostException: No route to host at
PK> > AW> > 
org.ovirt.engine.core.vdsbroker.vdsbroker.VdsBrokerCommand.createNetworkExc
PK> > AW> > eption(VdsBrokerCommand.java:126) [vdsbroker.jar:]
PK> > AW> >
PK> > AW> > Luckily seeing SSL+java in the log tickled my memory about java 
disabling
PK> > AW> > SSLv3, and google helped me find this workaround:
PK> > AW> >
PK> > AW> >  - edit /usr/lib/jvm/java/jre/lib/security/java.security
PK> > AW> >  - look for jdk.tls.disabledAlgorithms
PK> > AW> >  - remove SSLv3 from the list
PK> > AW> >  - service ovirt-engine restart
PK> > AW> >
PK> > AW> > Google also tells me that this should be an issue for 3.5, and 
there is a
PK> > AW> > vdsm setting, VdsmSSLProtocol, that can be set to use TLS, but I 
can't find
PK> > AW> > how to change/set it. Anyone know the secret?  
PK> > AW>
PK> > AW> Pretty much everything engine related can be configured with
PK> > AW> engine-config. engine-config -l will give you a list of all the
PK> > AW> options. engine-config -g  will get the current value,
PK> > AW> engine-config -s = will set it. A quick grep indicates 
that
PK> > AW> you are looking for the VdsmSSLProtocol key.  
PK> >
PK> > Hmmm..
PK> >
PK> >   # engine-config -g VdsmSSLProtocol
PK> >   VdsmSSLProtocol: TLSv1 version: general
PK> >
PK> > Looks like it's already set to TLS, making me wonder why I needed to 
remove SSLv3.  I just put it back and restarted the engine, and it seems to be 
communicating with all hosts ok. So maybe it's just some process/code using 
during install that isn't using this setting...


pgpEdK00i1P3K.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt 3.5 and SSLv3

2016-04-20 Thread Robert Story
On Wed, 20 Apr 2016 08:52:49 -0400 Alexander wrote:
AW> On Wednesday, April 20, 2016 08:39:14 AM Robert Story wrote:
AW> > Yesterday I had to re-install a host node in my 3.5.6 cluster. After a 
fresh
AW> > install of CentOS 7.2, attempts to re-install failed, as did removing and
AW> > re-adding the node. Here is a log excerpt from the engine:
AW> > 
AW> > [...]
AW> > [org.ovirt.engine.core.vdsbroker.VdsManager]
AW> > (DefaultQuartzScheduler_Worker-38) Host eclipse is not responding. It will
AW> > stay in Connecting state for a grace period of 120 seconds and after that
AW> > an attempt to fence the host will be issued. 2016-04-19 18:22:01,938 ERROR
AW> > [org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo]
AW> > (DefaultQuartzScheduler_Worker-38) Failure to refresh Vds runtime info:
AW> > org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException:
AW> > java.net.NoRouteToHostException: No route to host at
AW> > 
org.ovirt.engine.core.vdsbroker.vdsbroker.VdsBrokerCommand.createNetworkExc
AW> > eption(VdsBrokerCommand.java:126) [vdsbroker.jar:]
AW> > 
AW> > Luckily seeing SSL+java in the log tickled my memory about java disabling
AW> > SSLv3, and google helped me find this workaround:
AW> > 
AW> >  - edit /usr/lib/jvm/java/jre/lib/security/java.security
AW> >  - look for jdk.tls.disabledAlgorithms
AW> >  - remove SSLv3 from the list
AW> >  - service ovirt-engine restart
AW> > 
AW> > Google also tells me that this should be an issue for 3.5, and there is a
AW> > vdsm setting, VdsmSSLProtocol, that can be set to use TLS, but I can't 
find
AW> > how to change/set it. Anyone know the secret?
AW> 
AW> Pretty much everything engine related can be configured with
AW> engine-config. engine-config -l will give you a list of all the
AW> options. engine-config -g  will get the current value,
AW> engine-config -s = will set it. A quick grep indicates that
AW> you are looking for the VdsmSSLProtocol key.

Hmmm..

  # engine-config -g VdsmSSLProtocol
  VdsmSSLProtocol: TLSv1 version: general

Looks like it's already set to TLS, making me wonder why I needed to remove 
SSLv3.  I just put it back and restarted the engine, and it seems to be 
communicating with all hosts ok. So maybe it's just some process/code using 
during install that isn't using this setting...


Robert

-- 
Senior Software Engineer @ Parsons


pgpgkDJo5spii.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] oVirt 3.5 and SSLv3

2016-04-20 Thread Robert Story
Yesterday I had to re-install a host node in my 3.5.6 cluster. After a fresh 
install of CentOS 7.2, attempts to re-install failed, as did removing and 
re-adding the node. Here is a log excerpt from the engine:


2016-04-19 18:22:01,100 INFO  
[org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) 
Connecting to eclipse.localdomain/10.71.10.249
2016-04-19 18:22:01,116 WARN  
[org.ovirt.vdsm.jsonrpc.client.utils.retry.Retryable] (SSL Stomp Reactor) Retry 
failed
2016-04-19 18:22:01,129 ERROR 
[org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] 
(DefaultQuartzScheduler_Worker-38) Exception during connection
2016-04-19 18:22:01,208 ERROR 
[org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesVDSCommand] 
(DefaultQuartzScheduler_Worker-38) Command GetCapabilitiesVDSCommand(HostName = 
eclipse, HostId = 37a4a1c2-4906-489e-947c-1ef9fb828bc5, 
vds=Host[eclipse,37a4a1c2-4906-489e-947c-1ef9fb828bc5]) execution failed. 
Exception: VDSNetworkException: java.net.NoRouteToHostException: No route to 
host
2016-04-19 18:22:01,209 WARN  [org.ovirt.engine.core.vdsbroker.VdsManager] 
(DefaultQuartzScheduler_Worker-38) Host eclipse is not responding. It will stay 
in Connecting state for a grace period of 120 seconds and after that an attempt 
to fence the host will be issued.
2016-04-19 18:22:01,938 ERROR 
[org.ovirt.engine.core.vdsbroker.VdsUpdateRunTimeInfo] 
(DefaultQuartzScheduler_Worker-38) Failure to refresh Vds runtime info: 
org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException: 
java.net.NoRouteToHostException: No route to host
at 
org.ovirt.engine.core.vdsbroker.vdsbroker.VdsBrokerCommand.createNetworkException(VdsBrokerCommand.java:126)
 [vdsbroker.jar:]


Luckily seeing SSL+java in the log tickled my memory about java disabling 
SSLv3, and google helped me find this workaround:

 - edit /usr/lib/jvm/java/jre/lib/security/java.security
 - look for jdk.tls.disabledAlgorithms
 - remove SSLv3 from the list
 - service ovirt-engine restart

Google also tells me that this should be an issue for 3.5, and there is a
vdsm setting, VdsmSSLProtocol, that can be set to use TLS, but I can't find
how to change/set it. Anyone know the secret?


Robert

-- 
Senior Software Engineer @ Parsons


pgpPaFIlxN6q6.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] glusterfs public key failure for rpm

2016-03-22 Thread Robert Story
On Tue, 22 Mar 2016 17:28:20 +0100 Fabrice wrote:
FB> I tried to add a new host on a RHEL7, but it fails.
FB> 
FB> In the ovirt-host-deploy-20160322171347-XXX-6ba9d4a3.log file, I found:
FB> 
FB> warning: 
/var/cache/yum/x86_64/7/ovirt-3.6-glusterfs-epel/packages/glusterfs-libs-3.7.9-1.el7.x86_64.rpm:
FB> Header V4 RSA/SHA256 Signature, key ID d5dc52dc: NOKEY Retrieving key
FB> from https://download.gluster.org/pub/gluster/glusterfs/LATEST/pub.key

Try this patch, it worked for me:

diff --git a/yum.repos.d/ovirt-3.5-dependencies.repo 
b/yum.repos.d/ovirt-3.5-dependencies.repo
index c1914bb..3ef8a28 100644
--- a/yum.repos.d/ovirt-3.5-dependencies.repo
+++ b/yum.repos.d/ovirt-3.5-dependencies.repo
@@ -14,7 +14,7 @@ 
baseurl=http://download.gluster.org/pub/gluster/glusterfs/LATEST/EPEL.repo/epel-
 enabled=1
 skip_if_unavailable=1
 gpgcheck=1
-gpgkey=https://download.gluster.org/pub/gluster/glusterfs/LATEST/pub.key
+gpgkey=https://download.gluster.org/pub/gluster/glusterfs/LATEST/rsa.pub

 [ovirt-3.5-glusterfs-noarch-epel]
 name=GlusterFS is a clustered file-system capable of scaling to several 
petabytes.
@@ -22,7 +22,7 @@ 
baseurl=http://download.gluster.org/pub/gluster/glusterfs/LATEST/EPEL.repo/epel-
 enabled=1
 skip_if_unavailable=1
 gpgcheck=1
-gpgkey=https://download.gluster.org/pub/gluster/glusterfs/LATEST/pub.key
+gpgkey=https://download.gluster.org/pub/gluster/glusterfs/LATEST/rsa.pub

 [ovirt-3.5-patternfly1-noarch-epel]
 name=Copr repo for patternfly1 owned by patternfly


Robert

-- 
Senior Software Engineer @ Parsons


pgpxmjNIVkriN.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] can't add new host because gluster key file name changed

2016-03-21 Thread Robert Story
On Mon, 21 Mar 2016 12:03:03 -0400 Robert wrote:
RS> Hello,
RS> 
RS> I'm trying to add a host to a 3.5 cluster, and it's failing because
RS> gluster folks apparently created a new key and renamed the file...
RS> 
RS> 
RS> warning: 
/var/cache/yum/x86_64/7/ovirt-3.5-glusterfs-epel/packages/glusterfs-libs-3.7.9-1.el7.x86_64.rpm:
RS> Header V4 RSA/SHA256 Signature, key ID d5dc52dc: NOKEY Retrieving key
RS> from https://download.gluster.org/pub/gluster/glusterfs/LATEST/pub.key
RS> 
RS> see
RS> 
RS>  
https://download.gluster.org/pub/gluster/glusterfs/LATEST/NEW_PUBLIC_KEY.README
RS> 
RS> and
RS> 
RS>  https://download.gluster.org/pub/gluster/glusterfs/LATEST/rsa.pub
RS> 
RS> Is there a workaround?

Of course I figured it out right after pressing send..

Editing ovirt dependencies yum repo conf and updating the key name worked
fine..


Robert

-- 
Senior Software Engineer @ Parsons


pgpDH7a73VKEy.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] can't add new host because gluster key file name changed

2016-03-21 Thread Robert Story
Hello,

I'm trying to add a host to a 3.5 cluster, and it's failing because gluster
folks apparently created a new key and renamed the file...


warning: 
/var/cache/yum/x86_64/7/ovirt-3.5-glusterfs-epel/packages/glusterfs-libs-3.7.9-1.el7.x86_64.rpm:
 Header V4 RSA/SHA256 Signature, key ID d5dc52dc: NOKEY
Retrieving key from 
https://download.gluster.org/pub/gluster/glusterfs/LATEST/pub.key

see

 https://download.gluster.org/pub/gluster/glusterfs/LATEST/NEW_PUBLIC_KEY.README

and

 https://download.gluster.org/pub/gluster/glusterfs/LATEST/rsa.pub

Is there a workaround?

Robert

-- 
Senior Software Engineer @ Parsons


pgpCWBE9sf2QE.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] vdsm using 100% CPU, rapidly filling logs with _handle_event messages

2015-11-27 Thread Robert Story
On Wed, 18 Nov 2015 07:28:23 -0500 Robert wrote:
RS> On Wed, 18 Nov 2015 12:35:17 +0100 Vinzenz wrote:
RS> VF> On 11/12/2015 03:16 PM, Robert Story wrote:
RS> VF> > On Thu, 12 Nov 2015 16:02:59 +0200 Dan wrote:
RS> VF> > DK> On Thu, Nov 12, 2015 at 08:45:43AM -0500, Robert Story wrote:
RS> VF> > DK> > I'm running oVirt 3.5.x with a hosted engine. This morning I
RS> VF> > DK> > noticed that 2 of my 5 hosts were showing 99-100% cpu usage.
RS> VF> > DK> > Logging in to them, vdsmd seemed to be the culprit, and it
RS> VF> > DK> > was filling the log file with these messages:
RS> VF> > DK>
RS> VF> > DK> You're probably seeing
RS> VF> > DK> Bug 1226911 - vmchannel thread consumes 100% of CPU
RS> VF> > DK>
RS> VF> > DK> which was closed due to missing information. Do you have any
RS> VF> > DK> information on when this pops up? Is it reproducible? Would
RS> VF> > DK> you be bale to test a suggested patch
RS> VF> > DK>
RS> VF> > DK> https://gerrit.ovirt.org/#/c/42570/
RS> VF> >
RS> VF> > Hi Dan,
RS> VF> >
RS> VF> > Thanks for the pointers. If it comes up again, I'll try this
RS> VF> > patch and report back on the bug...
RS> VF> >
RS> VF> Out of curiosity, did you happen again to see that happening again?
RS> 
RS> No, I have not.

So naturally it shows up again during a holiday. I came in today to find 1
of my 5 nodes (the SPM and host where hosted engine is running) with two
vdsmd threads chewing up 90-100% of the CPU. I applied the patch from above
and restarted vdsmd. This resulted in another node being selected as the
SPM, and within about 15 minutes, that node had the same issue. Applied the
patch to the new node, and restarted vdsmd, and the SPM went back to the
previous (now patched) node. Hopefully things will stay stable.

I've attached snippets of the logs from the SPM when the problem started,
along with the server/engine log snippets from the hosted engine around the
same time..




Robert

-- 
Senior Software Engineer @ Parsons


vdsm-runaway.tgz
Description: application/compressed-tar


pgpT38OiTML4U.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] vdsm using 100% CPU, rapidly filling logs with _handle_event messages

2015-11-18 Thread Robert Story
On Wed, 18 Nov 2015 12:35:17 +0100 Vinzenz wrote:
VF> On 11/12/2015 03:16 PM, Robert Story wrote:
VF> > On Thu, 12 Nov 2015 16:02:59 +0200 Dan wrote:
VF> > DK> On Thu, Nov 12, 2015 at 08:45:43AM -0500, Robert Story wrote:
VF> > DK> > I'm running oVirt 3.5.x with a hosted engine. This morning I
VF> > DK> > noticed that 2 of my 5 hosts were showing 99-100% cpu usage.
VF> > DK> > Logging in to them, vdsmd seemed to be the culprit, and it was
VF> > DK> > filling the log file with these messages:
VF> > DK>
VF> > DK> You're probably seeing
VF> > DK> Bug 1226911 - vmchannel thread consumes 100% of CPU
VF> > DK>
VF> > DK> which was closed due to missing information. Do you have any
VF> > DK> information on when this pops up? Is it reproducible? Would you
VF> > DK> be bale to test a suggested patch
VF> > DK>
VF> > DK> https://gerrit.ovirt.org/#/c/42570/
VF> >
VF> > Hi Dan,
VF> >
VF> > Thanks for the pointers. If it comes up again, I'll try this patch and
VF> > report back on the bug...
VF> >
VF> Out of curiosity, did you happen again to see that happening again?

No, I have not.

Robert

-- 
Senior Software Engineer @ Parsons


pgpTptOzRh2yy.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] _initialize_sanlock cannot get lock, host already holds lock on a different host id

2015-11-12 Thread Robert Story
On Thu, 12 Nov 2015 12:54:49 -0500 Robert wrote:
RS> On Thu, 12 Nov 2015 15:22:18 +0100 Sandro wrote:
RS> SB> > I'm running oVirt 3.5.x with a hosted engine. On 3 of my 5 nodes,
RS> SB> > ovirt-ha-agent won't start, complaining that
RS> SB> > "(_initialize_sanlock) cannot get lock on host id 5: host already
RS> SB> > holds lock on a different host id."
RS> SB> >
RS> SB> >
RS> SB> It should correctly refuse to start the vm since the lock is already
RS> SB> taken, not sure if the message log is just confusing or a real
RS> SB> issue.
RS> 
RS> Just to clarify, this isn't about a vm. The engine VM is up and I'm not
RS> having issues with any other vms. The problem is with the
RS> ovirt-ha-agent.

Some additional info. I ran 'sanlock client status -D' on 1 working and 1
non-working host.

host 3 (working):
s 
hosted-engine:3:/var/run/vdsm/storage/2daba0ab-2b3d-4026-bcfc-1cd071c30038/04b08c8e-657f-4bac-9ddf-c9c57373409c/2d7f5020-42c1-442d-8237-fba9d6787080:0
list=spaces
space_id=4
io_timeout=10
host_generation=5
renew_fail=0
space_dead=0
killing_pids=0
used_retries=0
external_used=0
used_by_orphans=0
corrupt_result=0
acquire_last_result=1
renewal_last_result=1
acquire_last_attempt=2178388
acquire_last_success=2178528
renewal_last_attempt=3523708
renewal_last_success=3523708

host 5 (not working):
s 
hosted-engine:5:/rhev/data-center/mnt/ovirt-nfs.netsec\:_ovirt_hosted-engine/2daba0ab-2b3d-4026-bcfc-1cd071c30038/images/04b08c8e-657f-4bac-9ddf-c9c57373409c/2d7f5020-42c1-442d-8237-fba9d6787080:0
list=spaces
space_id=2
io_timeout=10
host_generation=17
renew_fail=0
space_dead=0
killing_pids=0
used_retries=0
external_used=0
used_by_orphans=0
corrupt_result=0
acquire_last_result=1
renewal_last_result=1
acquire_last_attempt=101
acquire_last_success=241
renewal_last_attempt=3532404
renewal_last_success=3532404

And running 'sanlock client host_status -s hosted-engine -D' (on either 3
or 5), info for hosts 3 and 5 is:

3 timestamp 3523933
last_check=3523954
last_live=3523954
last_req=0
owner_id=3
owner_generation=5
timestamp=3523933
io_timeout=10
owner_name=53d2cee3-fdd8-4c4c-8265-83328bf729af.eclipse.ne
5 timestamp 3532732
last_check=3523954
last_live=3523954
last_req=0
owner_id=5
owner_generation=17
timestamp=3532732
io_timeout=10
owner_name=2c1ec955-4802-4f89-a824-d8a7470c2c9f.apollo.net

Robert

-- 
Senior Software Engineer @ Parsons


pgpHXOIyXWTti.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] _initialize_sanlock cannot get lock, host already holds lock on a different host id

2015-11-12 Thread Robert Story
On Thu, 12 Nov 2015 15:22:18 +0100 Sandro wrote:
SB> > I'm running oVirt 3.5.x with a hosted engine. On 3 of my 5 nodes,
SB> > ovirt-ha-agent won't start, complaining that "(_initialize_sanlock)
SB> > cannot get lock on host id 5: host already holds lock on a different
SB> > host id."
SB> >
SB> >
SB> It should correctly refuse to start the vm since the lock is already
SB> taken, not sure if the message log is just confusing or a real issue.

Just to clarify, this isn't about a vm. The engine VM is up and I'm not
having issues with any other vms. The problem is with the ovirt-ha-agent.


Robert

-- 
Senior Software Engineer @ Parsons


pgp0BnOIZgMqV.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] vdsm using 100% CPU, rapidly filling logs with _handle_event messages

2015-11-12 Thread Robert Story
On Thu, 12 Nov 2015 16:02:59 +0200 Dan wrote:
DK> On Thu, Nov 12, 2015 at 08:45:43AM -0500, Robert Story wrote:
DK> > I'm running oVirt 3.5.x with a hosted engine. This morning I noticed
DK> > that 2 of my 5 hosts were showing 99-100% cpu usage. Logging in to
DK> > them, vdsmd seemed to be the culprit, and it was filling the log file
DK> > with these messages:
DK> 
DK> You're probably seeing
DK> Bug 1226911 - vmchannel thread consumes 100% of CPU
DK> 
DK> which was closed due to missing information. Do you have any information
DK> on when this pops up? Is it reproducible? Would you be bale to test a
DK> suggested patch
DK> 
DK> https://gerrit.ovirt.org/#/c/42570/

Hi Dan,

Thanks for the pointers. If it comes up again, I'll try this patch and
report back on the bug...

Robert

-- 
Senior Software Engineer @ Parsons


pgprPpkTJVCUB.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] _initialize_sanlock cannot get lock, host already holds lock on a different host id

2015-11-12 Thread Robert Story
I'm running oVirt 3.5.x with a hosted engine. On 3 of my 5 nodes,
ovirt-ha-agent won't start, complaining that "(_initialize_sanlock) cannot
get lock on host id 5: host already holds lock on a different host id."

Running 'grep host_id /etc/ovirt-hosted-engine/hosted-engine.conf' on each
host shows that they all have different unique ids, 1-5.

How can I debug this? Is there a command I can run to see which host holds a
sanlock for each id?


Robert

-- 
Senior Software Engineer @ Parsons


pgpFqaT6fE0IB.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] vdsm using 100% CPU, rapidly filling logs with _handle_event messages

2015-11-12 Thread Robert Story
I'm running oVirt 3.5.x with a hosted engine. This morning I noticed that 2
of my 5 hosts were showing 99-100% cpu usage. Logging in to them, vdsmd
seemed to be the culprit, and it was filling the log file with these
messages:

VM Channels Listener::DEBUG::2015-11-12
08:09:26,292::vmchannels::59::vds::(_handle_event) Received 0011. On fd 
removed by epoll. VM Channels Listener::INFO::2015-11-12 
08:09:26,293::vmchannels::54::vds::(_handle_event) Received 0011 on fileno 
119
VM Channels Listener::DEBUG::2015-11-12 
08:09:26,293::vmchannels::59::vds::(_handle_event) Received 0011. On fd 
removed by epoll.
VM Channels Listener::INFO::2015-11-12 
08:09:26,293::vmchannels::54::vds::(_handle_event) Received 0011 on fileno 
75
VM Channels Listener::DEBUG::2015-11-12 
08:09:26,293::vmchannels::59::vds::(_handle_event) Received 0011. On fd 
removed by epoll.
VM Channels Listener::INFO::2015-11-12 
08:09:26,294::vmchannels::54::vds::(_handle_event) Received 0011 on fileno 
119
VM Channels Listener::DEBUG::2015-11-12 
08:09:26,294::vmchannels::59::vds::(_handle_event) Received 0011. On fd 
removed by epoll.

I googled to see how to change the debug level to turn of DEBUG messages
for vdsm, which referred me to libvirtd.conf, but the debug level there was
not set, which should have meant a log level of 3 (warnings and errors), so
I'm not sure why the log was filling up with DEBUG/INFO messages.

I restarted vdsmd, which resulted in those nodes being marked as
'disconnected', but they did eventually recover and loads went back to
normal.

This may or may not be related to the fact that the 3 hosts where this did
not happen can't seem to keep their ha brokers up. I'll be starting a new
thread on that shortly.


Robert

-- 
Senior Software Engineer @ Parsons


pgpwQ2Ye8tRLu.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] oVirt hosts and yum updates

2015-11-12 Thread Robert Story
I was wondering what other folks do to keep their oVirt hosts up to date,
both with the base OS pacakges and oVirt packages.Is it safe to enable
yum-cron? I'm concerned that it might cause issues if vdsm/ovirt packages
are updated without following the correct procedure, but would like other
OS packages to be kept up to date.

Robert

-- 
Senior Software Engineer @ Parsons


pgpBRUQ9S_wyF.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt host installations failing due to missing GPG key (key location no longer valid)

2015-11-10 Thread Robert Story
On Mon, 9 Nov 2015 16:48:33 -0500 Neal wrote:
NG> I attempted to set up an oVirt host today through the engine UI, and it
NG> kept bombing out on the GPG key, saying it couldn't retrieve it (HTTP
NG> 404). I went to the engine logs and found out that the key that it
NG> can't retrieve is the one for GlusterFS, which lived at
NG> http://download.gluster.org/pub/gluster/glusterfs/LATEST/EPEL.repo/pub.key
NG> according to the logs and
NG> the /etc/yum.repos.d/ovirt-3.6-dependencies.repo file.

I'm seeing this on my oVirt 3.5 hosts as well..


Robert

-- 
Senior Software Engineer @ Parsons


pgp7ArlcfnQ21.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Can ISO Domain be shared across multiple data centers? Ovirt 3.5, 3.6? Does not appear so

2015-11-05 Thread Robert Story
On Wed, 4 Nov 2015 19:16:58 -0500 Itamar wrote:
IH> On 11/02/2015 10:14 AM, Robert Story wrote:
IH> > On Fri, 30 Oct 2015 18:58:33 -0400 Liam wrote:
IH> > LC> Tried setting up NFS ISO domain which works on one host/local
IH> > LC> datacenter. I do not want to have to have multiple copies of iso
IH> > LC> files on all hosts. If I try to import pre-existing ISO domain, I
IH> > LC> either get an error that storage path exists or the dialog just
IH> > LC> does nothing.
IH> >
IH> > I had the same goal, but for multiple independent ovirt installs. My
IH> > work-around is to deal with it outside ovirt.
IH> >
IH> > - create generic nfs export for my iso mirrors
IH> > - create iso export points for each install
IH> > - add iso mount to each install
IH> > - manually (or using puppet, etc) add mirror mount on each host node
IH> > - run a script that creates a symlink in each ovirt iso mirror for
IH> > all the isos in the mirror export
IH> >- 3.5 handles iso images in subdirectories, but 3.4 does not [1],
IH> > so the script creates symlinks for every iso image in the top level
IH> > of the iso export for 3.4, while only needing a symlnk to the distro
IH> > directory fro 3.5.
IH> >
IH> > It's a hack, but it works.
IH> 
IH> This should work without any hacks. An ISO domain can be shared between 
IH> multiple DCs or Engines.
IH> Tal?

When I try and create an iso domain with the same NFS path as another
domain I get 'the selected storage path is not empty[...] Either remove the
existing Storage Domain from this path, or change the storage path.'

Robert

-- 
Senior Software Engineer @ Parsons


pgph5pLEoOccv.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] [RFI] oVirt 3.6 Planning

2015-11-05 Thread Robert Story
On Wed, 4 Nov 2015 19:06:18 -0500 Itamar wrote:
IH> On 09/18/2014 11:35 AM, Robert Story wrote:
IH> > On Fri, 12 Sep 2014 15:22:41 +0300 Itamar wrote:
IH> > IH> With oVirt 3.5 nearing GA, time to ask for "what do you want to
IH> > IH> see in oVirt 3.6"?
IH> >
IH> > I've always thought it would be a good idea to have a 'storage agent'
IH> > to run on storage domains, which could perform some operations more
IH> > optimally that the current system. For example, exporting/importing
IH> > domains currently reads the whole VM image over the network, then
IH> > writes it back. In my case, my storage is all located on the smae NFS
IH> > server, which could simply do an OS copy locally, instead of
IH> > reading/writing gigabytes over the network. I don't know if other
IH> > storage type could perform similar optimizations
IH> 
IH> not in 3.6. doing native storage operations is for sure a top item in 
IH> the storage list going forward.
IH> notice you can do these operations today at disk level and register the 
IH> externally cloned disks via the REST API.

That's good to know. Is there a wiki page/script for how to externally clone
a VM for export?


Robert

-- 
Senior Software Engineer @ Parsons


pgpz3k5lyNFWc.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Must a user see his/her VMs through the web browser?

2015-11-04 Thread Robert Story
On Wed, 4 Nov 2015 17:50:15 +0800 John wrote:
JH> I have installed the oVirt all in one, and I can log in the user portal
JH> through web browser to see user's VMs.
JH> 
JH> I am wondering if there is an client Application that can do the same
JH> thing, like VMware Horizon client has version for Windows, Linux and
JH> IOS, etc.

Haven't heard much about it lately, but you might want to check out this
android app:

  https://github.com/matobet/moVirt/


Robert

-- 
Senior Software Engineer @ Parsons


pgpDGBxICJwO5.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Can ISO Domain be shared across multiple data centers? Ovirt 3.5, 3.6? Does not appear so

2015-11-02 Thread Robert Story
On Fri, 30 Oct 2015 18:58:33 -0400 Liam wrote:
LC> Tried setting up NFS ISO domain which works on one host/local
LC> datacenter. I do not want to have to have multiple copies of iso files
LC> on all hosts. If I try to import pre-existing ISO domain, I either get
LC> an error that storage path exists or the dialog just does nothing.

I had the same goal, but for multiple independent ovirt installs. My
work-around is to deal with it outside ovirt.

- create generic nfs export for my iso mirrors
- create iso export points for each install
- add iso mount to each install
- manually (or using puppet, etc) add mirror mount on each host node
- run a script that creates a symlink in each ovirt iso mirror for all the
  isos in the mirror export
  - 3.5 handles iso images in subdirectories, but 3.4 does not [1], so the
script creates symlinks for every iso image in the top level of the iso
export for 3.4, while only needing a symlnk to the distro directory fro
3.5.

It's a hack, but it works.


Robert

[1] 3.4 will descend to subdirectories to display the list of iso images
when attaching them in the UI, but the host will fail to boot because it
can't find the image unless it's at the top level.

-- 
Senior Software Engineer @ Parsons


pgp738mRAW6I0.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 12:37:44 -0400 Robert wrote:
RS> All 3 hosts that had down ha-agents were down again, so I'm guessing
RS> that's the issue.. 

As an experiment, I migrated the engine VM to one of the hosts with a
working ha-agent process, and I'm no longer getting getting these emails.

So the next question is: how can I fix the ha-agent on my other three
nodes. Seems to be an issue with host_id and sanlock...


Robert

-- 
Senior Software Engineer @ Parsons


pgpfwYnvppcfu.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 16:00:27 +0100 Simone wrote:
ST> And indeed ares was host 1 so when it failed it was correctly trying to
ST> get lock for host 1 but it seams that previously it acquired a lock as
ST> different host.
ST> Could you please check
ST>  grep host_id /etc/ovirt-hosted-engine/hosted-engine.conf
ST> on ares and share vdsm and sanlock logs from that host?

$ for x in ares hera eclipse poseidon apollo; do echo "* $x"; ssh root@$x grep 
host_id /etc/ovirt-hosted-engine/hosted-engine.conf 2>/dev/null; done
* ares
host_id=1
* hera
host_id=2
* eclipse
host_id=3
* poseidon
host_id=4
* apollo
host_id=5

Since I've upgraded, I figured I'd reproduce and send new logs. In that
process, I noticed that the ha-agent was down on 3 hosts, and the 2 other
hosts were the ones generating the messages. So I restarted ha-agent on all
5, disabled global maintenance for 2 minutes, re-enabeled it, then ran a
grep on all the logs on all 5 hosts for those 2 minutes. I'll sent that to
you directly, as it's rather large to be sending to the list..

All 3 hosts that had down ha-agents were down again, so I'm guessing
that's the issue.. 

Robert

-- 
Senior Software Engineer @ Parsons


pgpNJKBZsERuk.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 15:40:23 +0100 Simone wrote:
ST> Here the host IDs seam coherent.
ST> Can you please specify the name of the hosts where you took the logs in
ST> your first log archive (complaining host and engine host) ?

Hmm.. I know the complaining host was posedion, and I'm pretty sure the
engine was running on ares.


Robert

-- 
Senior Software Engineer @ Parsons


pgppMALnQxgnB.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Thu, 29 Oct 2015 14:08:22 +0100 Simone wrote:
ST> it seams that two hosts are fighting fir the same host ID:
ST> 
ST> MainThread::INFO::2015-10-27
ST> 
09:14:56,764::hosted_engine::562::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_sanlock)
ST> Ensuring lease for lockspace hosted-engine, host id 1 is acquired (file:
ST> 
/var/run/vdsm/storage/2daba0ab-2b3d-4026-bcfc-1cd071c30038/04b08c8e-657f-4bac-9ddf-c9c57373409c/2d7f5020-42c1-442d-8237-fba9d6787080)
ST> MainThread::ERROR::2015-10-27
ST> 
09:14:56,766::hosted_engine::578::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_sanlock)
ST> cannot get lock on host id 1: host already holds lock on a different
ST> host id MainThread::ERROR::2015-10-27
ST> 
09:14:56,767::agent::177::ovirt_hosted_engine_ha.agent.agent.Agent::(_run_agent)
ST> Error: '(22, 'Sanlock lockspace add failure', 'Invalid argument')' -
ST> trying to restart agent
ST> 
ST> can you please share the output of: hosted-engine --vm-status

Hi Simone, thanks for taking the time to look at this. Here is the outpu:


# hosted-engine --vm-status

!! Cluster is in GLOBAL MAINTENANCE mode !!

--== Host 1 status ==--
Status up-to-date  : False
Hostname   : ares.netsec
Host ID: 1
Engine status  : unknown stale-data
Score  : 2334
Local maintenance  : False
Host timestamp : 2496391
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=2496391 (Tue Oct 27 07:41:00 2015)
host-id=1
score=2334
maintenance=False
state=EngineUp


--== Host 2 status ==--
Status up-to-date  : False
Hostname   : hera.netsec
Host ID: 2
Engine status  : unknown stale-data
Score  : 1689
Local maintenance  : False
Host timestamp : 2038037
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=2038037 (Mon Oct 26 08:50:13 2015)
host-id=2
score=1689
maintenance=False
state=EngineDown


--== Host 3 status ==--
Status up-to-date  : False
Hostname   : eclipse.netsec
Host ID: 3
Engine status  : unknown stale-data
Score  : 2000
Local maintenance  : False
Host timestamp : 2298393
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=2298393 (Thu Oct 29 09:46:21 2015)
host-id=3
score=2000
maintenance=False
state=GlobalMaintenance


--== Host 4 status ==--
Status up-to-date  : False
Hostname   : poseidon.netsec
Host ID: 4
Engine status  : unknown stale-data
Score  : 2000
Local maintenance  : False
Host timestamp : 123241
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=123241 (Thu Oct 29 09:46:30 2015)
host-id=4
score=2000
maintenance=False
state=GlobalMaintenance


--== Host 5 status ==--
Status up-to-date  : False
Hostname   : apollo.netsec
Host ID: 5
Engine status  : unknown stale-data
Score  : 2000
Local maintenance  : False
Host timestamp : 2028116
Extra metadata (valid at timestamp):
metadata_parse_version=1
metadata_feature_version=1
timestamp=2028116 (Mon Oct 26 04:14:46 2015)
host-id=5
score=2000
maintenance=False
state=EngineDown



Robert

-- 
Senior Software Engineer @ Parsons


pgpCq6EFTWQtZ.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-29 Thread Robert Story
On Tue, 27 Oct 2015 09:45:28 -0400 Robert wrote:
RS> I have oVirt 3.5.4 on CentOS 7.1 hosts, and everyone once in a while
RS> one of my hosts starts sending me the 4 engine status messages above
RS> about every 10-15 minutes.

I upgraded the engine and all hosts to 3.5.5, and then 2 hosts started
sending me 4 emails every 10-15 minutes. Currently I'm running with the
engine in global maintenance to keep my inbox from overflowing with these
messages.

Any suggestions on how to get this under control appreciated...


Robert

-- 
Senior Software Engineer @ Parsons


pgpTv9jzYsN_K.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] repeating EngineUnexpectedlyDown/EngineDown/EngineStart/EngineStarting

2015-10-27 Thread Robert Story
Hi,

I have oVirt 3.5.4 on CentOS 7.1 hosts, and everyone once in a while one of
my hosts starts sending me the 4 engine status messages above about every
10-15 minutes.

It looks like the ha broker on the host currently running is having issues
(already tried restarting it once.

I've attached a tarball with log snippets for the engine, host with active
engine vm running, and the complaining host...

(I'll be hanging out in #ovirt all day too, will respond to questions or
suggestions here or there.)


Robert

-- 
Senior Software Engineer @ Parsons


ovirt-logs.tar.gz
Description: application/gzip


pgpEISI7ZpEIN.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] security alerts from host node

2015-09-28 Thread Robert Story
I'm getting hundreds of email messages from one of my hosts, several per
minute, with a subject of "*** SECURITY information for ov1.example ***

  vdsm : problem with defaults entries ; TTY=unknown ; PWD=/ ;


Any ideas on how I can fix this?


Robert

-- 
Senior Software Engineer @ Parsons


pgp5CwpJrob0R.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] moving storage away from a single point of failure

2015-09-18 Thread Robert Story
Hi,

I'm running oVirt 3.5 in our lab, and currently I'm using NFS to a single
server. I'd like to move away from having a single point of failure.
Watching the mailing list, all the issues with gluster getting out of sync
and replica issues has me nervous about gluster, plus I just have 2
machines with lots of drive bays for storage. I've been reading about GFS2
and DRBD, and wanted opinions on if either is a good/bad idea, or to see if
there are other alternatives.

My oVirt setup is currently 5 nodes and about 25 VMs, might double in size
eventually, but probably won't get much bigger than that.


Thanks,

Robert

-- 
Senior Software Engineer @ Parsons


pgptbaK5zkgz4.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Add RAM to Hosted Engine

2015-08-13 Thread Robert Story
On Thu, 13 Aug 2015 09:06:16 + Simon wrote:
SH> [...] is it possible to retroactively add RAM to the virtual machine?

See this thread:

  http://lists.ovirt.org/pipermail/users/2015-May/033021.html


Robert

-- 
Senior Software Engineer @ Parsons


pgpcBj_rQ3XPM.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Neutron Appliance questions

2015-07-29 Thread Robert Story
On Wed, 29 Jul 2015 00:40:51 -0500 Donny wrote:
DD> Depends on your setup... Are you using gre/vxlan with ovs on the hosts,
DD> or are you using the Linux bridge?

Right now I'm still in the planning stages. I've got the hosted engine up
on one host, but haven't gotten gotten the neutron appliance up yet because
I wasn't expecting it to need its own nic/interface.

Maybe some information about my setup will help.. I've got a lab and a
server room, each with their own public ip space. The server room has
internal and external production systems, and the lab has test some
important-but-not-production systems. Right now all these systems
are connected to both public and private networks through multiple
interfaces or vlans. I'd like to use neutron to eliminate the direct
connection to the public networks and use floating IPs instead. I'd also
like to isolate the internal production from public facing production, and
lab test systems from lab important systems.

I should add that I'm not a sysadmin, but a developer who was nominated to
administer group resources. So I'm learning as I go. I've been running
oVirt 3.4 in the lab for a few months, and now am wanting to get oVirt 3.5
into the server room (and eventually to migrate the lab systems to a second
'data center' of the server room install).

Any thoughts or suggestions welcomed, on or off list.


Robert


pgpVcTBfUspwl.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Neutron Appliance questions

2015-07-28 Thread Robert Story
Hello,

I have an oVirt 3.5.x install on CentOS 7.1, and I'm trying to wrap my head
around how to use the Neutron appliance.

In the video demo, two hosts attach the neutron network to their eth1
interfaces, and their VMs can communicate with each other. But the demo
doesn't go on to show how to connect those VMs to the internet with
floating IPs.

In the OpenStack Neutron videos I've watched on youtube, the common basic
configuration seems to be a single 'network node' that connects to the
public and private networks and routes to the 'compute nodes', which aren't
connected to the public network. With the oVirt/Neutron integration, it
seems like all hosts are both 'network' and 'compute' nodes, meaning that
they need to be connected to the public network. Is that right, or am I
missing some fundamental concept? Is it possible to designate 1 or 2 hosts
of a cluster (with public network access) for the neutron appliance, while
the remaining hosts only have private network access + floating IPs for VMs?


Robert


pgpPN45gusURV.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] how to prepare for an engine re-install

2015-02-18 Thread Robert Story
On Wed, 18 Feb 2015 10:46:06 -0500 (EST) Yedidyah wrote:
YBD> Good luck and best regards,

Thanks... after a backup and tracking down a spare disk for the array, I
went ahead with a reboot. Filesystem came up rw, engine started, and GUI is
working.

Issues still remain, however. While I can ssh to my nodes and VMs, VMs show
as ?/unknown, and nodes show as non-responsive.  I'll be looking into that
tomorrow.


Robert

-- 
Senior Software Engineer @ Parsons


pgpDOVYyWl4CG.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] how to prepare for an engine re-install

2015-02-18 Thread Robert Story
I'm running ovirt (engine 3.4.0-1.el6) on a dedicated engine machine with a
degraded raid array. This has caused the root filesystem to go read-only,
and the engine crashed. I can restart the engine, but after doing so, the
web page just loads a blank page. All nodes and VMs seem to be fine, I just
can't manage them via oVirt.

Because I'm offsite, my recovery option is likely limited to a reinstall of
the OS and ovirt.  I'm hoping for some pointers on what I can do in advance
to minimize the pain of this transition. 

- Am I going to be able to recover without having to shut down VMs,
  exporting and importing?

- what directories should I back up?

- when I reinstall, should I upgrade and install 3.5.1, or first install
  3.4, get it working, and then think about upgrading?

- Is it possible/recommended/crazy to migrate to a hosted engine instead
  of a dedicated machine?

So I want to prepare for the worst before I try a reboot. If I'm lucky, the
reboot will at least come up with a writable filesystem, and I won't need to
re-install. If it doesn't come up, I want to be as ready as possible for
the reinstall.

Any hints and/or tips appreciated..

(I'll also be hanging out in the irc channel if anyone wants to respond in
that venue..)

Robert

-- 
Senior Software Engineer @ Parsons


pgp_edPrRmbZK.pgp
Description: OpenPGP digital signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] about Ovirt - KVM - Ubuntu

2015-01-12 Thread Robert Story
On Thu, 08 Jan 2015 17:12:00 -0600 Carlos wrote:
CL> I have questions about ovirt, my english is not good,I do not know if 
CL> there are some mail list in spanish.

Hola Carlos,

CL> Is posible Ovirt server read guest machines in KVM ubuntu server? or I 
CL> need install Ovirt and migrade one to one machines.

You will need to migrate existing KVM guests to oVirt. There is a tool,
virt-v2v, that can help with the conversion. Unfortunately it's a long,
slow multi-step process. Hopefully google can help find some documentation
or a blog in Spanish for you.


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] gmail bounces [was Re: oVirt Weekly Sync Meeting: Dec. 17, 2014

2014-12-17 Thread Robert Story
On Wed, 17 Dec 2014 10:55:24 -0500 (EST) Brian wrote:
BP>   * other topics There is a known problem with gmail subscribers to the
BP> oVirt Users mailing list getting excessive bounces. Still trying to
BP> puzzle this one out. Anyone with Mailman mojo is more than welcome
BP> to lend a hand.  (bkp, 15:49:27)

I'v heard of this issue on other lists. This link seems to have a pretty
good breakdown of what the issue might be (but unfortunately not a
solution).

http://robbat2.livejournal.com/241253.html


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 答复: 3.5 hosted engine: 2nd host Cannot acquire bridge address

2014-12-09 Thread Robert Story
On Tue, 09 Dec 2014 10:33:51 +0100 Sandro wrote:
SB> > Looking at setup logs, you have:
SB> > OVEHOSTED_NETWORK/bridgeIf=str:'eth1.79'
SB> > [...]
SB> > which seems to fail matching REGEXP:
SB> > 
SB> > _ADDRESS_RE = re.compile(
SB> > flags=re.VERBOSE,
SB> > pattern=r"""
SB> > \s+
SB> > inet
SB> > \s
SB> > (?P\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3})
SB> > .+
SB> > \s+
SB> > (?P\w+)
SB> > $
SB> > """
SB> > )
SB> > 
SB> > I guess the match is failing due to the '.' in 'eth1.79'
SB> 
SB> If you haven't already opened a BZ about it, please open one, thanks

  https://bugzilla.redhat.com/show_bug.cgi?id=1172287


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] [QE][ACTION REQUIRED] oVirt 3.6.0 status

2014-12-03 Thread Robert Story
On Wed, 03 Dec 2014 10:37:19 +0100 Sandro wrote:
SB> Two different proposals have been made about above scheduling [3]:
SB> 1) extend the cycle to 10 months for allowing to include a large
SB> feature set 2) reduce the cycle to less than 6 months and split
SB> features over 3.6 and 3.7

I'd prefer a six-month cycle, so that the smaller features and enhancements
come more quickly.

Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt 3.5 & NAT

2014-11-19 Thread Robert Story
On Wed, 19 Nov 2014 09:03:44 + (GMT) Phil wrote:
PD> With thanks to Antoni we have this working now.  The working hook may
PD> be viewed at http://paste.fedoraproject.org/152070/

I'd hate for this info to get lost, so I added a page in the oVirt wiki. I
haven't actually tried it, but just pulled info from this thread. Can you
review the page and see if it look right to you, or if I missed anything?

http://www.ovirt.org/VDSM-Hooks/network-nat


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] how to 'reset' a failed install?

2014-11-07 Thread Robert Story
I've been doing lots of unsuccessful 3.5 hosted-engine installs in my lab,
where it's easy for me to re-install the OS if I need to start over. Now I
need to try an install in a remote datacenter where I won't be able to
re-install the OS. So I was wondering if there is a way to 'reset' a failed
install so that another install can be attempted...

My thoughts so far are:

- stop vdsm, supervdsm, and libvirt
- use etckeeper to reset everything under /etc
- delete old log files
- delete hosted_engine storage domain on storage (if install got that far)
- restart vdsm, supervdsm, and libvirt

What am I missing? Maybe some remnants in /var (hmm, probably the vdsm
persistent config)? Anything else?


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Complete CentOS 7 environment

2014-11-06 Thread Robert Story
On Thu, 6 Nov 2014 17:47:52 + Usman wrote:
AU> We are upgrading hardware and I'm upgrading/rebuilding our Ovirt
AU> infrastructure. Are CentOS 7 host nodes supported? And can the engine
AU> be installed on CentOS 7? (3.5 repo isn't working for me)

CentOS 7 is supported for host nodes, but not the engine. If you have a
mixed CentOS 6/7 environment, note that migration works from 6->7 hosts, but
not from 7->6.


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5 hosted engine: 2nd host Cannot acquire bridge address

2014-11-05 Thread Robert Story
On Wed, 5 Nov 2014 19:57:07 -0500 Robert wrote:
RS> I've got a hosted engine up and running on a freshly installed 3.5 host
RS> (CentOS 6.6), and I'm tyying to add a second host. The install fails
RS> trying to configure the ovirtmgmt bridge:
RS> 
RS> [ INFO  ] Updating hosted-engine configuration
RS> [ INFO  ] Stage: Transaction commit
RS> [ INFO  ] Stage: Closing up
RS> [ ERROR ] Failed to execute stage 'Closing up': Cannot acquire bridge
RS> address
RS> 
RS> From the setup log:
RS> [snip]

complete logs from the second host at http://futz.org/users/tmp/ovirt7/



Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] 3.5 hosted engine: 2nd host Cannot acquire bridge address

2014-11-05 Thread Robert Story
I've got a hosted engine up and running on a freshly installed 3.5 host
(CentOS 6.6), and I'm tyying to add a second host. The install fails trying
to configure the ovirtmgmt bridge:

[ INFO  ] Updating hosted-engine configuration
[ INFO  ] Stage: Transaction commit
[ INFO  ] Stage: Closing up
[ ERROR ] Failed to execute stage 'Closing up': Cannot acquire bridge address

From the setup log:

2014-11-05 19:35:00 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.engine.add_host add_host._closeup:434 
Updating engine's management network to be vlanned 2014-11-05 19:35:02 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.engine.add_host add_host._closeup:445 
Adding the host to the cluster 2014-11-05 19:35:03 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.engine.add_host 
add_host._getIPAddress:184 Acquiring nic address
2014-11-05 19:35:03 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.engine.add_host plugin.executeRaw:785 
execute: ('/sbin/ip', 'addr', 'show', 'eth1.79'), executable='None', 
cwd='None', env=None
2014-11-05 19:35:03 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.engine.add_host plugin.executeRaw:803 
execute-result: ('/sbin/ip', 'addr', 'show', 'eth1.79'), rc=0
2014-11-05 19:35:03 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.engine.add_host plugin.execute:861 
execute-output: ('/sbin/ip', 'addr', 'show', 'eth1.79') stdout:
4: eth1.79@eth1:  mtu 1500 qdisc noqueue state 
UP
link/ether 00:26:9e:0b:31:e5 brd ff:ff:ff:ff:ff:ff
inet 10.69.79.32/24 brd 10.69.79.255 scope global eth1.79
inet6 fe80::226:9eff:fe0b:31e5/64 scope link
   valid_lft forever preferred_lft forever

2014-11-05 19:35:03 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.engine.add_host plugin.execute:866 
execute-output: ('/sbin/ip', 'addr', 'show', 'eth1.79') stderr:


2014-11-05 19:35:03 DEBUG otopi.context context._executeMethod:152 method 
exception
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/otopi/context.py", line 142, in 
_executeMethod
method['method']()
  File 
"/usr/share/ovirt-hosted-engine-setup/scripts/../plugins/ovirt-hosted-engine-setup/engine/add_host.py",
 line 485, in _closeup
address=self._getIPAddress(),
  File 
"/usr/share/ovirt-hosted-engine-setup/scripts/../plugins/ovirt-hosted-engine-setup/engine/add_host.py",
 line 199, in _getIPAddress
raise RuntimeError(_('Cannot acquire bridge address'))
RuntimeError: Cannot acquire bridge address
2014-11-05 19:35:03 ERROR otopi.context context._executeMethod:161 Failed to 
execute stage 'Closing up': Cannot acquire bridge address

The install on the first host worked fine on a VLAN (eth1.79).

Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5 install issues: The VDSM host was found in a failed state

2014-11-03 Thread Robert Story
On Tue, 28 Oct 2014 13:05:07 -0400 (EDT) Simone wrote:
ST> Thanks for the report Robert,
ST> I think it can be quite related to this one:
ST> https://bugzilla.redhat.com/show_bug.cgi?id=1086032
ST> 
ST> It seams that currently, if we have more than one required networks, we
ST> are not able to add an host. And it's not just on hosted-engine setup:
ST> currently it seams that it doesn't work also starting it from the web
ST> admin gui. I'm working on that.

Hmm.. When you say 'required network', I think 'required' in terms of the
oVirt gui. That wouldn't apply to my case, because ovirt hasn't been
installed yet. The underlying host has multiple networks, but nothing has
been configured in ovirt (unless the install detects and adds them).


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] 3.5 install issues: The VDSM host was found in a failed state

2014-10-26 Thread Robert Story
So I tried install #6 today on another machine. I dedicated eth0 to
ovirmgmt and put the default route and vlans on eth1. The install got to
the point of starting the VM the second time for the engine install (which
completed fine), but then:

[ INFO  ] Engine replied: DB Up!Welcome to Health Status!
  Enter the name of the cluster to which you want to add the host 
(Default) [Default]: 
[ INFO  ] Waiting for the host to become operational in the engine. This may 
take several minutes...
[ ERROR ] The VDSM host was found in a failed state. Please check engine and 
bootstrap installation logs.
[ ERROR ] Unable to add hosted_engine_1 to the manager
  Please shutdown the VM allowing the system to launch it as a 
monitored service.

I shut down the engine VM, but now it seems to be in a reboot loop,
starting and then being shutdown and rebooted, by the HA agent, I assume..

Logs for this install are at http://futz.org/users/tmp/ovirt6/. (The first
hosted-engine was aborted before it started to fix DNS for of engine VM,
but I included it for completeness).


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5 install issues: failed to setup ovirtmgmt

2014-10-25 Thread Robert Story
On Sat, 25 Oct 2014 23:30:41 +0100 Dan wrote:
DK> On Sat, Oct 25, 2014 at 03:18:32PM -0400, Robert Story wrote:
DK> > line 225, in _setupNetworks 'message: "%s"' % (networks, code,
DK> > message)) RuntimeError: Failed to setup networks {'ovirtmgmt':
DK> > {'nic': 'eth0', 'netmask': '255.255.255.0', 'bootproto': 'none',
DK> > 'ipaddr': '10.69.79.31'}}. Error code: "16" message: "Unexpected
DK> > exception"
DK> 
DK> This means that something nasty happened inside Vdsm while attempting to
DK> create the bridge.
DK> 
DK> Can you attach vdsm.log and supervdsmd.log?

Sure...

http://futz.org/users/tmp/ovirt5/supervdsm.log
http://futz.org/users/tmp/ovirt5/vdsm.log


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] 3.5 install issues: failed to setup ovirtmgmt

2014-10-25 Thread Robert Story
I'm on my 4th attempt to get a 3.5 hosted-engine set up on CentOS 6.5. I
seem to hit a different snag each time. The first 3 were with the ovirtmgmt
bridge on a tagged VLAN, so I decided to go back to an untagged interface
and see if I had better luck. But this install failed even earlier, when
creating the management bridge:

2014-10-25 12:37:24 INFO otopi.plugins.ovirt_hosted_engine_setup.network.bridge 
bridge._misc:198 Configuring the management bridge
2014-10-25 12:37:25 DEBUG otopi.context context._executeMethod:152 method 
exception
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/otopi/context.py", line 142, in 
_executeMethod
method['method']()
  File 
"/usr/share/ovirt-hosted-engine-setup/scripts/../plugins/ovirt-hosted-engine-setup/network/bridge.py",
 line 207, in _misc
_setupNetworks(conn, networks, {}, {'connectivityCheck': False})
  File 
"/usr/share/ovirt-hosted-engine-setup/scripts/../plugins/ovirt-hosted-engine-setup/network/bridge.py",
 line 225, in _setupNetworks
'message: "%s"' % (networks, code, message))
RuntimeError: Failed to setup networks {'ovirtmgmt': {'nic': 'eth0', 'netmask': 
'255.255.255.0', 'bootproto': 'none', 'ipaddr': '10.69.79.31'}}. Error code: 
"16" message: "Unexpected exception"
2014-10-25 12:37:25 ERROR otopi.context context._executeMethod:161 Failed to 
execute stage 'Misc configuration': Failed to setup networks {'ovirtmgmt': 
{'nic': 'eth0', 'netmask': '255.255.255.0', 'bootproto': 'none', 'ipaddr': 
'10.69.79.31'}}. Error code: "16" message: "Unexpected exception"
2014-10-25 12:37:25 DEBUG otopi.transaction transaction.abort:131 aborting 
'File transaction for '/etc/ovirt-hosted-engine/firewalld/hosted-console.xml''
2014-10-25 12:37:25 DEBUG otopi.transaction transaction.abort:131 aborting 
'File transaction for '/etc/ovirt-hosted-engine/iptables.example''
2014-10-25 12:37:25 DEBUG otopi.transaction transaction.abort:131 aborting 
'File transaction for '/etc/sysconfig/iptables''
2014-10-25 12:37:25 DEBUG otopi.transaction transaction.abort:131 aborting 
'File transaction for '/etc/ovirt-hosted-engine-ha/broker.conf''

Complete log here:

http://futz.org/users/tmp/ovirt5/ovirt-hosted-engine-setup-20141025123111-g51fy0.log

I thought it might because there was still a tagged vlan on the interface,
so I tried to deploy again after removing everything else from eth0, but
still no luck. that log is at

http://futz.org/users/tmp/ovirt5/ovirt-hosted-engine-setup-20141025150830-s2byre.log

any hints tips appreciated...


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] [SOLVED] Re: Hosted engine on CentOS 6.5

2014-10-23 Thread Robert Story
On Thu, 23 Oct 2014 15:09:08 +0200 Sandro wrote:
SB> > The hosted-engine ask for:
SB> > 
SB> > - Enter 'admin@internal' user password that will be used for
SB> > accessing the Administrator Portal:
SB> > 
SB> > The engine-setup asks for:
SB> > 
SB> > - Engine admin password:
SB> > 
SB> > Since they were worded differently, I assumed they were for different
SB> > things (like engine-seup wanting a password for PostgreSQL 'engine'
SB> > database). Maybe the engine-setup prompt could be changed to "Engine
SB> > admin@internal password" to make it clear that they are the same?
SB> 
SB> Please open a bug about above text alignment, so we can keep track of
SB> it.

Ok, I already added a note to the hosted-engine wiki page. I just created
https://bugzilla.redhat.com/show_bug.cgi?id=1156060

SB> > So, assuming that resetting the password on the engine to match my
SB> > admin@internal password is the fix, the next hurdle is getting
SB> > hosted-engine deploy to run again. Current, it fails with:
SB> > 
SB> > # hosted-engine --deploy
SB> > --config-append=/etc/ovirt-hosted-engine/answers.conf ... [ INFO  ]
SB> > Stage: Environment setup [ ERROR ] The following VMs has been found:
SB> > 5f154b49-a0d0-4e8b-8f5d-efcd83115a95 [ ERROR ] Failed to execute
SB> > stage 'Environment setup': Cannot setup Hosted Engine with other VMs
SB> > running
SB> 
SB> Please run "hosted-engine --vm-poweroff". VDSM keep the vm status for
SB> forensic until the VM is destroyed / powered off.

That worked.

And for the record, everything is working fine after:

- hosted-engine --vm-poweroff

- hosted-engine --deploy \
--config-append=/etc/ovirt-hosted-engine/answers.conf

- answering "No" to "Is this an additional host setup"

- entering the correct engine admin password at the 'Enter 'admin@internal'
  user password" prompt

- Connect via VNC to the VM the first time and select 'boot to local hd'
  from the install screen

- shutdown the VM (previous install had completed setup)

- on hosted-engine deploy, enter "1" to continue

- enter "1" to continue second install (no need to connect via VNC,
  previous install had completed setup)

- finish the install as usual..

Thanks again for the help.

Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Hosted engine on CentOS 6.5

2014-10-23 Thread Robert Story
On Thu, 23 Oct 2014 08:56:33 +0200 Sandro wrote:
SB> status: 401
SB> reason: Unauthorized
SB> detail: HTTP Status 401
SB> 
SB> Are you sure you're using the same password for admin@internal both on
SB> 'hosted-engine --deploy' and on 'engine-setup'? Looks like you've the
SB> same error reported in
SB> https://bugzilla.redhat.com/show_bug.cgi?id=1154362

The hosted-engine ask for:

   - Enter 'admin@internal' user password that will be used for accessing
 the Administrator Portal:

The engine-setup asks for:

   - Engine admin password:

Since they were worded differently, I assumed they were for different
things (like engine-seup wanting a password for PostgreSQL 'engine'
database). Maybe the engine-setup prompt could be changed to "Engine
admin@internal password" to make it clear that they are the same?


So, assuming that resetting the password on the engine to match my
admin@internal password is the fix, the next hurdle is getting 
hosted-engine deploy to run again. Current, it fails with:

# hosted-engine --deploy --config-append=/etc/ovirt-hosted-engine/answers.conf
...
[ INFO  ] Stage: Environment setup
[ ERROR ] The following VMs has been found: 5f154b49-a0d0-4e8b-8f5d-efcd83115a95
[ ERROR ] Failed to execute stage 'Environment setup': Cannot setup Hosted 
Engine with other VMs running

But no VMs are running:

# ps -ef|grep qemu
root  4343 18506  0 08:51 pts/100:00:00 grep qemu
qemu 21192 20271  0 Oct22 ?00:00:00 [supervdsmServer] 

And even in the debug logs it sees the VM is down:

2014-10-23 08:51:29 DEBUG otopi.context context._executeMethod:138 Stage 
late_setup METHOD 
otopi.plugins.ovirt_hosted_engine_setup.vm.configurevm.Plugin._late_setup
2014-10-23 08:51:29 DEBUG 
otopi.plugins.ovirt_hosted_engine_setup.vm.configurevm 
configurevm._late_setup:101 [{'status': 'Down', 'vmId': 
'5f154b49-a0d0-4e8b-8f5d-efcd83115a95'}]
2014-10-23 08:51:29 ERROR 
otopi.plugins.ovirt_hosted_engine_setup.vm.configurevm 
configurevm._late_setup:108 The following VMs has been found: 
5f154b49-a0d0-4e8b-8f5d-efcd83115a95
2014-10-23 08:51:29 DEBUG otopi.context context._executeMethod:152 method 
exception
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/otopi/context.py", line 142, in 
_executeMethod
method['method']()
  File 
"/usr/share/ovirt-hosted-engine-setup/scripts/../plugins/ovirt-hosted-engine-setup/vm/configurevm.py",
 line 112, in _late_setup
_('Cannot setup Hosted Engine with other VMs running')
RuntimeError: Cannot setup Hosted Engine with other VMs running


Is there a way around this, or do I need to delete the VM and reinstall it?


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] node host os: CentOS 6.5 vs 7

2014-10-20 Thread Robert Story
On Mon, 20 Oct 2014 18:16:19 +0200 Matt wrote:
M> Have you testes a host with centos 7 already ? It's not booting my
M> VM's here atm.

Not yet, I'm waiting to see if there is a compelling reason (oVirt wise) to
upgrade to 7 before I try any testing.


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] node host os: CentOS 6.5 vs 7

2014-10-20 Thread Robert Story
My current hosts are 6.5, and I'm not thrilled about the prospect of
upgrading to 7. Is there any big advantage to using 7 for oVirt nodes,
specifically with respect to the newer versions of KVM/libvirt?


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] [RFI] oVirt 3.6 Planning

2014-09-18 Thread Robert Story
On Fri, 12 Sep 2014 15:22:41 +0300 Itamar wrote:
IH> With oVirt 3.5 nearing GA, time to ask for "what do you want to see in 
IH> oVirt 3.6"?

I've always thought it would be a good idea to have a 'storage agent' to
run on storage domains, which could perform some operations more optimally
that the current system. For example, exporting/importing domains currently
reads the whole VM image over the network, then writes it back. In my case,
my storage is all located on the smae NFS server, which could simply do an
OS copy locally, instead of reading/writing gigabytes over the network. I
don't know if other storage type could perform similar optimizations


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] NeutronVirtualAppliance and 3.4?

2014-09-18 Thread Robert Story
Can the Neutron appliance for 3.5 be used with 3.4? Or does it depend on
features only in 3.5?


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Problem with hosted engine setup on VLAN/bond combo

2014-07-10 Thread Robert Story
On Thu, 10 Jul 2014 11:02:45 -0500 Chris wrote:
CA> I am trying to install oVirt with the hosted engine.  The physical
CA> system is CentOS 6.5 x86_64 (with all current updates).  It is connected
CA> to a two-switch stack via bond0 (running LACP), which is a VLAN trunk,
CA> and the management interface is vlan51.
CA> 
CA> This doesn't work with oVirt 3.4.2, but I see that both 3.4.3 and 3.5
CA> have "support engine on bond" and "support engine on vlan" in the
CA> release notes, so I tried again with today's 3.4.3 RC.  I got a
CA> different error from "hosted-engine --deploy":

Have you tried setting up the ovirtmgmt bridge manually? (see
http://www.ovirt.org/Installing_VDSM_from_rpm#Configuring_the_bridge_Interface)

I set up my 3.4.2 ovirtmgmt interface as a bridge to a vlan before
installation, and it worked fine.


Robert

-- 
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Processor Type

2014-03-31 Thread Robert Story
On Wed, 19 Mar 2014 10:13:28 -0400 (EDT) Omer wrote:
OF> cpu name is cluster level, and usually the lowest common denominator of
OF> the hosts in cluster, to make sure migration works in the cluster. 

Is there and documentation on the relationship between cpu types and
compatibility? Right now I have a cluster per cpu type, and it might make
sense to merge some of them if the performance hit was minimal.

Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] tags?

2014-03-07 Thread Robert Story
On Fri, 7 Mar 2014 11:47:11 -0500 (EST) Einav wrote:
EC> actually, what you are asking for should not be limited only to a
EC> sub-set of objects filtered by tag; we should allow batch operations on
EC> any sub- set of objects, filtered by tags, object name, object's
EC> cluster-description and/or cpu-consumption. 

Yes, I agree.

EC> one way of resolving what you are asking for is to change the
EC> sub-tabs to support displaying data and performing actions in the
EC> context of multiple items, rather than a single item. 
EC> [snip]
EC> please feel free to open an RFE on this issue: 
EC> https://bugzilla.redhat.com/enter_bug.cgi?product=oVirt

See https://bugzilla.redhat.com/show_bug.cgi?id=1074026,  "RFE: support bulk
operations in admin portal".

Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] tags?

2014-03-07 Thread Robert Story
On Thu, 6 Mar 2014 21:57:21 -0500 (EST) Andrew wrote:
AC> It's a way of grouping elements together.
AC> Eg. tag all the product VMs / Hosts
AC> Then filter by these tags to perform operations just on those.
AC> 
AC> 
https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Virtualization/3.3/html-single/Administration_Guide/index.html#sect-Tags

Ahh, I keep forgetting that oVirt is basically RHEV... Thanks.

So I see how it helps to find tagged objects, but not any way to perform
bulk operations on them. It would be really nice to be able to find all VMs
tagged as 'webserver' and add/remove a user to all of them in one operation.


Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] tags?

2014-03-06 Thread Robert Story
Hello,

Can someone point me at a guide or explain to me the purpose of tags? I see
that I can assign them to VMs and users, but don't grasp the purpose.


Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] taking advantage of extra disk space on nodes

2014-02-12 Thread Robert Story
So I've got a handful of oVirt nodes now, and most of them have over 100G of
unallocated disk space. I'm trying to figure out how to best take advantage
of that space. I see two possibilities: storage for oVirt, or
backup/archive storage.

I'm currently using a single NFS server for VM/iso/export storage (oVirt
3.3.x on CentOS, if it matters) and don't need to scale to hundreds of
nodes (probably 10-20, max). Is current gluster support stable enough for
production? What are the pros/cons of gluster vs NFS?  I like the idea of
not having a single point of failure but worry about performance.

The other option is to stick with my NFS server and use the extra storage
for archive/backup space. Here I'm looking for reliability (i.e. losing a
server won't cause data loss) over performance. Are there pros/cons to
gluster vs ceph? Will either/both peacefully co-exist with an ovirt node?

All suggestions welcomed. Thanks!


Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] vdsm-python-cpopen is obsoleted by python-cpopen

2014-01-30 Thread Robert Story
On Thu, 30 Jan 2014 15:57:35 -0600 Trey wrote:
TD> Try manually installing python-cpopen on the node before adding node in
TD> the engine.

Yep, that did the trick. Thanks!


Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ovirt-3.3.3 release postponed due to blockers

2014-01-30 Thread Robert Story
Can we revert these packages to previous versions in the 3.3.2 stable repo
so those of us who want/need to install new hosts in our clusters aren't
dead in the water waiting for 3.3.3?

On Thu, 30 Jan 2014 16:26:51 +0100 Sandro wrote:
SB> Il 30/01/2014 16:13, Yaniv Bronheim ha scritto:
SB> > Hey, 
SB> > 
SB> > we found this yum bug and still struggling with more issuing
SB> > according to the relation between those packages 
SB> > 
SB> > if we drop vdsm-python-cpopen the requirement in vdsm takes
SB> > python-cpopen instead. in python-cpopen we have the same code base
SB> > and it provides all vdsm-ptyhon-cpopen provides, so shouldn't be any
SB> > issues with dropping it from the repository
SB> > 
SB> > is it possible to ship 3.3.3 release that way ? we don't need to
SB> > change the requirement in the code, if python-cpopen 1.3-1 is part of
SB> > the release, it will be taken by vdsm spec (tried with vdsm 4.13.3-2
SB> > with the available cpopen 1.3-1)
SB> > 
SB> 
SB> I can't release 3.3.3 that way.
SB> We're keeping rolling releases on stable repository so if I don't
SB> provide 4.13.3-2, previous one will still be available.


Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] vdsm-python-cpopen is obsoleted by python-cpopen

2014-01-30 Thread Robert Story
Hello,

Any status on the pthon-cpopen obsoleted issue? Or any work-arounds? BZ is
here: https://bugzilla.redhat.com/show_bug.cgi?id=1056470. I can't install add 
an el6 host to my cluster.. :-/

--> Processing Dependency: vdsm-python-cpopen for package: 
vdsm-xmlrpc-4.13.3-2.el6.noarch
Package vdsm-python-cpopen is obsoleted by python-cpopen, but obsoleting 
package does not provide for requirements
--> Finished Dependency Resolution
Error: Package: vdsm-xmlrpc-4.13.3-2.el6.noarch (ovirt-3.3.2)
   Requires: vdsm-python-cpopen
   Available: python-cpopen-1.3-1.el6.x86_64 (epel)
   vdsm-python-cpopen = 4.13.2
   Available: vdsm-python-cpopen-4.12.1-2.el6.i686 (ovirt-stable)
   vdsm-python-cpopen = 4.12.1-2.el6
   Available: vdsm-python-cpopen-4.12.1-4.el6.i686 (ovirt-stable)
   vdsm-python-cpopen = 4.12.1-4.el6
   Available: vdsm-python-cpopen-4.13.0-9.el6.i686 (ovirt-stable)
   vdsm-python-cpopen = 4.13.0-9.el6
   Available: vdsm-python-cpopen-4.13.0-11.el6.i686 (ovirt-stable)
   vdsm-python-cpopen = 4.13.0-11.el6
   Available: vdsm-python-cpopen-4.13.2-1.el6.i686 (ovirt-3.3.2)
   vdsm-python-cpopen = 4.13.2-1.el6
   Available: vdsm-python-cpopen-4.13.3-2.el6.i686 (ovirt-3.3.2)
   vdsm-python-cpopen = 4.13.3-2.el6
 You could try using --skip-broken to work around the problem


Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] networking: basic vlan help

2014-01-23 Thread Robert Story
On Thu, 23 Jan 2014 13:33:07 -0500 (EST) Assaf wrote:
AM> Sorry, privY on eth1.

For VM to VM communication that doesn't need to go over the public net..


Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] networking: basic vlan help

2014-01-23 Thread Robert Story
On Thu, 23 Jan 2014 10:59:57 -0500 (EST) Assaf wrote:
AM> If you enable VLAN tagging on the management network, which is
AM> configured on eth0 (Which also provides internet access from my
AM> understanding) then you will connectivity as (I assume) your physical
AM> switches aren't configured for VLANs.

I'm assuming "will connectivity" should have been "will lose connectivity",
which is what I feared. I'm glad I asked!

AM> For an all-in-one, what I would suggest is the following procedure:

Excellent, I'll try that. Thanks!

My next question is for future planning. There is a second interface
(eth1) with a separate physical network which only contains the engine,
nodes and the nfs server. 

 +--+
 | internet |-|---|--|
 +--+ ++  +---+  +---+  < eth0
  | engine |  | node1 |  | node2 |
+-+   ++  +---+  +---+  < eth1
| nfs |---|---|--|
+-+

Can the mgmt network be easily moved to eth1? Then the pubX would be
non-vlan on eth0, and mgmt + privY would be on eth1. If all the eth1
interfaces are connected to a dedicated/isolated switch, does that switch
need to explicitly support vlans, or does it matter?



Robert

--
Senior Software Engineer @ Parsons


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] networking: basic vlan help

2014-01-23 Thread Robert Story
Hello again,

I'm new to VLANs and have a few questions. Right now I just have the mgmt
interface (bridged with eth0) on my all-in-one oVirt test setup. I want to
separate some VMs from the public facing net, which I think means that they
need to be on a different VLAN.  I created two new networks, pubX and
privY, with vlan ids X and Y, but couldn't assign them to eth0 because the
current mgmt network is non-VLAN. I was about to enable VLAN tagging on the
mgmt network, but I wanted to make sure that doing so wouldn't do anything
to eth0 that would disrupt access to it (I only have remote access and don't
want to lock myself out).  Also, if it is safe, does the mgmt vlan tag id
matter? is 0 the right value?

Any/all help, hints, tips or references to examples/links greatly
appreciated.


Robert


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] another networking question: ssh to VM with private dhcp addr

2014-01-23 Thread Robert Story
On Thu, 23 Jan 2014 05:12:31 -0500 (EST) Assaf wrote:
AM> If so, when the VM generates a DHCP request it goes out to the same
AM> physical network that the ovirtmgmt device is on. Meaning, that if the
AM> VM and the management device both generate DHCP requests they should
AM> get responses in the same subnet.

Ah, ok. So oVirt doesn't automatically do DHCP like virt-manager.. Got it.
All the machines in the lab use static IPs, so I wasn't aware that there
was a DHCP server somewhere else handing out private addresses. I just
assumed oVirt did dhcp like virt-manager.

Sorry for the noise!

Robert


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] another networking question: ssh to VM with private dhcp addr

2014-01-22 Thread Robert Story
Hello,

I've got oVirt 3.3.x all-in-one up and running on CentOS 6.3. The IP
address on the ovirtmgmt bridge is static/public. I have not done any
additional network/VLAN configuration.  If I create a VM and give it an
address on the same subnet, I can ssh to it just fine.

If I create a VM and let it use dhcp, it gets a 172.31.x.x address.
Outgoing access works fine, but I can't figure out what to do to allow me
to be able to ssh into those VMs.  With straight KVM the VM private network
was on a virtual interface on the host, making it easy. But I can't figure
out how to do it with oVirt.

Any help greatly appreciated. Thanks in advance.


Robert


signature.asc
Description: PGP signature
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users