Re: [Users] Vm's being paused

2014-01-29 Thread Neil
Sorry, more on this issue, I see my logs are rapidly filling up my
disk space on node02 with this error in /var/log/messages...

Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
vmId=`dfa2cf7c-3f0e-42e3-b495-10ccb3e0c71b`::Stats function failed:
AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
in collect#012statsFunction()#012  File
/usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
line 513, in _highWrite#012self._vm._dom.blockInfo(vmDrive.path,
0)#012  File /usr/share/vdsm/vm.py, line 835, in f#012ret =
attr(*args, **kwargs)#012  File
/usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py, line
76, in wrapper#012ret = f(*args, **kwargs)#012  File
/usr/lib64/python2.6/site-packages/libvirt.py, line 1814, in
blockInfo#012if ret is None: raise libvirtError
('virDomainGetBlockInfo() failed', dom=self)#012libvirtError: invalid
argument: invalid path
/rhev/data-center/mnt/blockSD/0e6991ae-6238-4c61-96d2-ca8fed35161e/images/fac8a3bb-e414-43c0-affc-6e2628757a28/6c3e5ae8-23fc-4196-ba42-778bdc0fbad8
not assigned to domain
Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
vmId=`ac2a3f99-a6db-4cae-955d-efdfb901abb7`::Stats function failed:
AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
in collect#012statsFunction()#012  File
/usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
line 509, in _highWrite#012if not vmDrive.blockDev or
vmDrive.format != 'cow':#012AttributeError: 'Drive' object has no
attribute 'format'

Not sure if this is related at all though?

Thanks.

Regards.

Neil Wilson.

On Wed, Jan 29, 2014 at 9:02 AM, Neil nwilson...@gmail.com wrote:
 Hi Dafna,

 Thanks for clarifying that, I found the migration issue and this was
 resolved once I sorted out the ISO domain problem.

 I'm sorry I don't understand your last question?
  after the engine restart, do you still see a problem with the size
 or did the report of size changed?

 The migration issue was resolved, it's now just trying to track down
 why the two VM's paused on their own, one on the 8th of Jan(I think)
 and one on the 19th of Jan.

 Thank you.


 Regards.

 Neil Wilson.


 On Tue, Jan 28, 2014 at 8:18 PM, Dafna Ron d...@redhat.com wrote:
 yes - engine lost communication with vdsm and it has no way of knowing if
 the host is down or if there was a network issue so a network issue would
 cause the same errors that I see in the logs.

 The error you put on the iso is the reason the vm's have failed migration -
 if a vm is run with a cd and the cd is gone than the vm will not be able to
 be migrated.

 after the engine restart, do you still see a problem with the size or did
 the report of size changed?

 Dafna


 On 01/28/2014 01:02 PM, Neil wrote:

 Hi Dafna,

 Thanks for coming back to me. I'll try answer your queries one by one.

 On Tue, Jan 28, 2014 at 1:38 PM, Dafna Ron d...@redhat.com wrote:

 you had a problem with your storage on the 14th of Jan and one of the
 hosts
 rebooted (if you have the vdsm log from that day than I can see what
 happened on vdsm side)
 in engine, I could see a problem with the export domain and this should
 not
 have cause a reboot.

 1.) I don't unfortunately have logs going back that far. Looking at
 all 3 hosts uptime, the one with the least uptime is 21 days, the
 others are all over 40 days, so there definitely wasn't a host that
 rebooted on the 14th of Jan, would a network issue or Firewall issue
 also cause the error you've seen to look as if a host rebooted? There
 was a bonding mode change on the 14th of January, so perhaps this
 caused the issue?


 Can you tell me if you had a problem with the data
 domain as well or was it just the export domain? were you having any vm's
 exported/imported at that time?
 In any case - this is a bug.

 2.) I think this was the same day that the bonding mode was changed on
 the host while the host was live (by mistake), and had SPM running on
 it. I haven't done any importing or exporting for a few years on this
 oVirt setup.


 As for the vm's - if the vm's are no longer in migrating state than
 please
 restart ovirt-engine service (looks like a cache issue)

 3.) Restarted ovirt-engine, logging now appears to be normal without any
 errors.


 if they are in migrating state - there should have been a timeout a long
 time ago.
 can you please run 'vdsClient -s 0 list table' and 'virsh -r list'  on
 both
 all hosts?

 4.) Ran on all hosts...

 node01.blabla.com
 63da7faa-f92a-4652-90f2-b6660a4fb7b3  11232  adam Up
 502170aa-0fc6-4287-bb08-5844be6e0352  13986  babbage  Up
 ff9036fb-1499-45e4-8cde-e350eee3c489  26733  reports  Up
 2736197b-6dc3-4155-9a29-9306ca64881d  13804  tux  

Re: [Users] two node ovirt cluster with HA

2014-01-29 Thread Eli Mesika


- Original Message -
 From: Andrew Lau and...@andrewklau.com
 To: d...@redhat.com
 Cc: Tareq Alayan tala...@redhat.com, Eli Mesika emes...@redhat.com, 
 Karli Sjöberg karli.sjob...@slu.se,
 users@ovirt.org
 Sent: Tuesday, January 28, 2014 3:12:46 PM
 Subject: Re: [Users] two node ovirt cluster with HA
 
 On Tue, Jan 28, 2014 at 12:02 AM, Dafna Ron d...@redhat.com wrote:
 
  Andrew,
  Once this discussion is finished, and If what you like done is not in the
  current implementation can you please open a bug/feature request for it?
 
 
 Sure - I've opened a RFE here based on the current discussions
 https://bugzilla.redhat.com/show_bug.cgi?id=1058737 but I'm not sure which
 category it should be under.

I had assigned it to infra , thanks 
IMHO we should handle only the first scenario reported in this BZ 

 
 Cheers,
 Andrew.
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Hosted Engine adding host SSL Failure (w/ engine custom cert)

2014-01-29 Thread Andrew Lau
Shame about the way the CA works, may be worth putting a reverse proxy in
front as unsigned SSL can be a deal breaker.

Anyway, my vdsm.log is here http://www.fpaste.org/72643/98338713/

When it's Still waiting for VDSM host to become operational.. there is no
output in vdsm.log

On Wed, Jan 29, 2014 at 6:11 PM, Yedidyah Bar David d...@redhat.com wrote:

  *From: *Yedidyah Bar David d...@redhat.com
 *To: *Andrew Lau and...@andrewklau.com
 *Cc: *users users@ovirt.org
 *Sent: *Wednesday, January 29, 2014 9:05:06 AM
 *Subject: *Re: [Users] Hosted Engine adding host SSL Failure (w/
 enginecustomcert)


 *From: *Andrew Lau and...@andrewklau.com
 *To: *users users@ovirt.org
 *Sent: *Wednesday, January 29, 2014 8:38:33 AM
 *Subject: *[Users] Hosted Engine adding host SSL Failure (w/ engine
 customcert)

 Hi,

 After running through the new patch posted in BZ 1055153 I'm adding a
 second host to the hosted-engine cluster but it seems to fail right before
 the finish:

 [ ERROR ] Failed to execute stage 'Closing up': [ERROR]::oVirt API
 connection failure, [Errno 1] _ssl.c:492: error:14090086:SSL
 routines:SSL3_GET_SERVER_CERTIFICATE:certificate verify failed

 Couple Extra Notes:
 Engine has a custom SSL cert but the CA has been trusted by the new host.
 When I temporarily return the engine's SSL back to the default generated
 one the install will succeed.

 Setup logs: http://www.fpaste.org/72624/13909770/

 What confuses me is:

 curl https://engine.example.net with the custom SSL cert will succeed but
 with the original self-signed gives the expected insecure message. What
 criteria need to be met so the install will pass?


 Seems like a bug (or a missing feature) - hosted-engine only supports the
 self-signed cert. Can you please open a bug for this?

 You might manage to make it work by replacing /etc/pki/ovirt-engine/ca.pem
 with the certificate of your ca, but this will prevent adding hosts
 (because it's needed to create a certificate for them). Perhaps other
 things will break too, I didn't try that.


 On a second thought, I don't think it will work. The engine will still
 sign certs for hosts with its private key, but the hosts will try to verify
 that with the ca.pem you put there and fail.
 --
 Didi


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ovirt-3.3.3 release postponed due to blockers

2014-01-29 Thread Sven Kieske
Hi,

I wanted to try it the other way around, installing vdsm-python-cpopen
and check if it runs without python-cpopen .

But that leads me to a question:

Is there any difference between these packages beside their different
name?

If yes, what is the difference and which package should be installed?
I no, why is there a packet vdsm-python-cpopen ?

Thank you!


Am 28.01.2014 20:14, schrieb Trey Dockendorf:
 I found manually installing python-cpopen before adding a node in the
 engine is a viable workaround. This was on EL6.5
 
 - Trey

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ovirt-3.3.3 release postponed due to blockers

2014-01-29 Thread Sandro Bonazzola
Il 29/01/2014 09:21, Sven Kieske ha scritto:
 Hi,
 
 I wanted to try it the other way around, installing vdsm-python-cpopen
 and check if it runs without python-cpopen .
 
 But that leads me to a question:
 
 Is there any difference between these packages beside their different
 name?
 
 If yes, what is the difference and which package should be installed?
 I no, why is there a packet vdsm-python-cpopen ?

CCing VDSM

 
 Thank you!
 
 
 Am 28.01.2014 20:14, schrieb Trey Dockendorf:
 I found manually installing python-cpopen before adding a node in the
 engine is a viable workaround. This was on EL6.5

 - Trey
 


-- 
Sandro Bonazzola
Better technology. Faster innovation. Powered by community collaboration.
See how it works at redhat.com
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Vm's being paused

2014-01-29 Thread Michal Skrivanek

On Jan 28, 2014, at 19:18 , Dafna Ron d...@redhat.com wrote:

 yes - engine lost communication with vdsm and it has no way of knowing if the 
 host is down or if there was a network issue so a network issue would cause 
 the same errors that I see in the logs.
 
 The error you put on the iso is the reason the vm's have failed migration - 
 if a vm is run with a cd and the cd is gone than the vm will not be able to 
 be migrated.

which, as I learned last week, is not entirely correct. Pure libvirt VM seems 
to work fine…so it must be somewhere something in oVirt:(
looking into it
but just for future reference we want it to work:)

 
 after the engine restart, do you still see a problem with the size or did the 
 report of size changed?
 
 Dafna
 
 On 01/28/2014 01:02 PM, Neil wrote:
 Hi Dafna,
 
 Thanks for coming back to me. I'll try answer your queries one by one.
 
 On Tue, Jan 28, 2014 at 1:38 PM, Dafna Ron d...@redhat.com wrote:
 you had a problem with your storage on the 14th of Jan and one of the hosts
 rebooted (if you have the vdsm log from that day than I can see what
 happened on vdsm side)
 in engine, I could see a problem with the export domain and this should not
 have cause a reboot.
 1.) I don't unfortunately have logs going back that far. Looking at
 all 3 hosts uptime, the one with the least uptime is 21 days, the
 others are all over 40 days, so there definitely wasn't a host that
 rebooted on the 14th of Jan, would a network issue or Firewall issue
 also cause the error you've seen to look as if a host rebooted? There
 was a bonding mode change on the 14th of January, so perhaps this
 caused the issue?
 
 
 Can you tell me if you had a problem with the data
 domain as well or was it just the export domain? were you having any vm's
 exported/imported at that time?
 In any case - this is a bug.
 2.) I think this was the same day that the bonding mode was changed on
 the host while the host was live (by mistake), and had SPM running on
 it. I haven't done any importing or exporting for a few years on this
 oVirt setup.
 
 
 As for the vm's - if the vm's are no longer in migrating state than please
 restart ovirt-engine service (looks like a cache issue)
 3.) Restarted ovirt-engine, logging now appears to be normal without any 
 errors.
 
 
 if they are in migrating state - there should have been a timeout a long
 time ago.
 can you please run 'vdsClient -s 0 list table' and 'virsh -r list'  on both
 all hosts?
 4.) Ran on all hosts...
 
 node01.blabla.com
 63da7faa-f92a-4652-90f2-b6660a4fb7b3  11232  adam Up
 502170aa-0fc6-4287-bb08-5844be6e0352  13986  babbage  Up
 ff9036fb-1499-45e4-8cde-e350eee3c489  26733  reports  Up
 2736197b-6dc3-4155-9a29-9306ca64881d  13804  tux  Up
 0a3af7b2-ea94-42f3-baeb-78b950af4402  25257  Moodle   Up
 
  IdName   State
 
  1 adam   running
  2 reportsrunning
  4 tuxrunning
  6 Moodle running
  7 babbagerunning
 
 node02.blabla.com
 dfa2cf7c-3f0e-42e3-b495-10ccb3e0c71b   2879  spam Up
 23b9212c-1e25-4003-aa18-b1e819bf6bb1  32454  proxy02  Up
 ac2a3f99-a6db-4cae-955d-efdfb901abb7   5605  software Up
 179c293b-e6a3-4ec6-a54c-2f92f875bc5e   8870  zimbra   Up
 
  IdName   State
 
  9 proxy02running
  10spam   running
  12software   running
  13zimbra running
 
 node03.blabla.com
 e42b7ccc-ce04-4308-aeb2-2291399dd3ef  25809  dhcp Up
 16d3f077-b74c-4055-97d0-423da78d8a0c  23939  oliver   Up
 
  IdName   State
 
  13oliver running
  14dhcp   running
 
 
 Last thing is that your ISO domain seems to be having issues as well.
 This should not effect the host status but if any of the vm's were booted
 from an iso or have an iso attached in the boot sequence this will explain
 the migration issue.
 There was an ISO domain issue a while back, but this was corrected
 about 2 weeks ago after iptables re-enabled itself on boot after
 running updates, I've checked now and the ISO domain appears to be
 fine and I can see all the images stored within.
 
 I've stumbled across what appears to be another error and all three
 hosts are showing this over and over in /var/log/messages, and I'm not
 sure if it's related? ...
 
 Jan 28 14:58:59 node01 vdsm vm.Vm ERROR
 vmId=`63da7faa-f92a-4652-90f2-b6660a4fb7b3`::Stats function failed:
 AdvancedStatsFunction _highWrite at 0x2ce0998#012Traceback (most
 

Re: [Users] Adding node to local storage cluster

2014-01-29 Thread Sven Kieske
Hi,

unfortunately this is not possible.

In a local storage datacenter you are bound to exactly
one host in one cluster.

There is done some work to remove all this storage domain
limitations in the future, but afaik you can't mix local
storage with other storage types before oVirt 3.5. or 4.0.

I can post the according bugzilla entries if you would like
to follow the progress.

HTH

Am 29.01.2014 07:30, schrieb Nauman Abbas:
 Hello all
 
 I wanted to ask if there's a way to add a node to a local storage cluster
 i-e adding one more besides the one that's already there?

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Error: The following Network Interfaces were specified more than once:

2014-01-29 Thread Assaf Muller
Please run:
vdsClient -s 0 getVdsCaps

And paste the output.

If this is caused by what I think is causing it, then this bug was fixed
during the 3.4 development cycle and is available in the 3.4 beta release.

(It affects dummy devices and VLANs).

Assaf Muller, Cloud Networking Engineer 
Red Hat 

- Original Message -
From: Alan Murrell li...@murrell.ca
To: users@ovirt.org
Sent: Wednesday, January 29, 2014 10:27:21 AM
Subject: [Users] Error: The following Network Interfaces were specified
more than   once:

I have noticed what seems to me to be a quirk, but wanted to see if  
this is correct behaviour, or perhaps my configuration.

I am running a test install ov oVirt 3.3.2 in an all-in-one install.   
My test box only has one physical NIC, so I am using that for  
management/external network access, and I have create a dummy  
interface called dummy_0 to be used for the VMs.

I created four networks, each assigned their own VLAN (I am wanted to  
simulate a real-world  .  I added the first network (VLAN001) to  
dummy_0 with no problem (I was able to save the configuration).  The  
system created the dummy_0.1.

When I tried to add another network (VLAN002) to dummy_0 and save  
it, I got the following error:

--- START ---
Error while executing action:

local_host:

 Cannot setup Networks. The following Network Interfaces were  
specified more than once: dummy_0.1.
--- END ---

I found that in the Setup Host Network if I dragged VLAN001 off  
dummy_0 to the right under Required, and left VLAN002 on  
dummy_0, I could save it, and VLAN001 appeared to remain active on  
the host.  Indeed, when I went back into Setup Host Network, both  
VLAN001 and VLAN002 were showing up on dummy_0.

However, if I highlight VLAN002 under Networks and click on the  
Hosts tab, the Network Interface is showing a red down arrow,  
indicating it is not up.  The Network Interface for VLAN001 is a  
green up arrow.  I have refreshed several times and at this point in  
time, have left it for almost thirty minutes, but it remains as a red  
down arrow.  The same is true for the other two networks I tried  
adding: I was able to add them as above, but they are also showing a  
red down arrow.

My questions:

   1.) Is doing what I had to do to add additional networks to an  
interface normal, or is it a quirk of using a dummy interface?
   2.) Any ideas why my additional networks are showing a red down  
arrow?  Again, maybe a quirk of using a dummy interface?

I am going to leave it overnight and see what happens (maybe some  
background process needs to run??), but if that does not work, I am  
sure I can scrounge up another NIC in the next couple of days and  
install it into my test server to see if it does the same thing.

Thanks, in advance.

-Alan
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Hosted-Engine startup problem

2014-01-29 Thread Yedidyah Bar David
Hi,

- Original Message -
 From: Sebastian Classen sebastian.clas...@noris.de
 To: users@ovirt.org users@ovirt.org
 Cc: Sebastian Classen sebastian.clas...@noris.de, Frank Wall 
 frank.w...@noris.de
 Sent: Friday, January 24, 2014 10:11:26 AM
 Subject: [Users] Hosted-Engine startup problem
 
 Hi,
 
 we installes ovirt beta with hosted-engine. After Setup was complete the
 engine VM reboots and never came up again. It looks like the host is unable
 to find the VM. As requested in IRC I attached the relevant logs.
 
 Please CC me, as I'm not subscribed.

Sorry for the long delay. It seems to me like a bug in the fix for [1].

I did not try to reproduce your problem yet, and it's probably not very easy to 
do,
or else others would have also reported that. I pushed a fix [2]. If the 
machine is
still in the same state, or you can still reproduce, can you please try the fix?

Thanks a lot, and sorry again for the delay.

[1] https://bugzilla.redhat.com/1055495
[2] http://gerrit.ovirt.org/23823
-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Ovirt Gluster problems

2014-01-29 Thread Sahina Bose


On 01/29/2014 04:15 AM, Steve Dainard wrote:
Not sure if this is exactly your issue, but this post here: 
http://comments.gmane.org/gmane.comp.emulators.ovirt.user/12200 might 
lead you in the right direction.


one note - if you back it up while its attached to an engine, you will
need to edit its meta data file to remove the association to allow the
other engine to connect it to the new pool for restore.



Did this solve your issue?

If not, could you let us know the error messages from the logs? (engine.log)
If you're looking to remove a host from a gluster cluster when there are 
no online hosts, checking the Force option should do this for you.



*Steve Dainard *



On Tue, Jan 28, 2014 at 12:41 PM, Juan Pablo Lorier 
jplor...@gmail.com mailto:jplor...@gmail.com wrote:


Hi,

I had some issues with a gluster cluster and after some time trying to
get the storage domain up or delete it (I opened a BZ about a deadlock
in the process of removing the domain) I gave up and destroyed the DC.
The thing is that I want to add the hosts that where part of the
DC and
now I get that I can't as they have the volume. I try to stop the
volume
but I can't as no host is running in the deleted cluster and for some
reason, ovirt needs that.
I can't delete the hosts either as they have the volume... so  I'm
back
in another chicken and egg problem.
Any hints??

PD: I can't nuke the hole ovirt plataform as I have another DC in
production otherwise I would :-)

Regards,


___
Users mailing list
Users@ovirt.org mailto:Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users




___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Extremely poor disk access speeds in Windows guest

2014-01-29 Thread Ronen Hod

Adding the virtio-scsi developers.
Anyhow, virtio-scsi is newer and less established than viostor (the block 
device), so you might want to try it out.
A disclaimer: There are time and patches gaps between RHEL and other versions.

Ronen.

On 01/28/2014 10:39 PM, Steve Dainard wrote:

I've had a bit of luck here.

Overall IO performance is very poor during Windows updates, but a contributing factor seems 
to be the SCSI Controller device in the guest. This last install I didn't 
install a driver for that device, and my performance is much better. Updates still chug 
along quite slowly, but I seem to have more than the  100KB/s write speeds I was seeing 
previously.

Does anyone know what this device is for? I have the Red Hat VirtIO SCSI 
Controller listed under storage controllers.

*Steve Dainard *
IT Infrastructure Manager
Miovision http://miovision.com/ | /Rethink Traffic/
519-513-2407 ex.250
877-646-8476 (toll-free)

*Blog http://miovision.com/blog | **LinkedIn 
https://www.linkedin.com/company/miovision-technologies  | Twitter 
https://twitter.com/miovision  | Facebook https://www.facebook.com/miovision*

Miovision Technologies Inc. | 148 Manitou Drive, Suite 101, Kitchener, ON, 
Canada | N2C 1L3
This e-mail may contain information that is privileged or confidential. If you 
are not the intended recipient, please delete the e-mail and any attachments 
and notify us immediately.


On Sun, Jan 26, 2014 at 2:33 AM, Itamar Heim ih...@redhat.com 
mailto:ih...@redhat.com wrote:

On 01/26/2014 02:37 AM, Steve Dainard wrote:

Thanks for the responses everyone, really appreciate it.

I've condensed the other questions into this reply.


Steve,
What is the CPU load of the GlusterFS host when comparing the raw
brick test to the gluster mount point test? Give it 30 seconds and
see what top reports. You'll probably have to significantly increase
the count on the test so that it runs that long.

- Nick



Gluster mount point:

*4K* on GLUSTER host
[root@gluster1 rep2]# dd if=/dev/zero of=/mnt/rep2/test1 bs=4k 
count=50
50+0 records in
50+0 records out
204800 tel:204800 tel:204800 tel:204800 bytes 
(2.0 GB) copied, 100.076 s, 20.5 MB/s


Top reported this right away:
PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM  TIME+  COMMAND
  1826 root  20   0  294m  33m 2540 S 27.2  0.4 0:04.31 glusterfs
  2126 root  20   0 1391m  31m 2336 S 22.6  0.4  11:25.48 glusterfsd

Then at about 20+ seconds top reports this:
   PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM  TIME+  COMMAND
  1826 root  20   0  294m  35m 2660 R 141.7  0.5 1:14.94 glusterfs
  2126 root  20   0 1392m  31m 2344 S 33.7  0.4  11:46.56 glusterfsd

*4K* Directly on the brick:
dd if=/dev/zero of=test1 bs=4k count=50
50+0 records in
50+0 records out
204800 tel:204800 tel:204800 tel:204800 bytes 
(2.0 GB) copied, 4.99367 s, 410 MB/s


  7750 root  20   0  102m  648  544 R 50.3  0.0 0:01.52 dd
  7719 root  20   0 000 D  1.0  0.0 0:01.50 flush-253:2

Same test, gluster mount point on OVIRT host:
dd if=/dev/zero of=/mnt/rep2/test1 bs=4k count=50
50+0 records in
50+0 records out
204800 tel:204800 tel:204800 tel:204800 bytes 
(2.0 GB) copied, 42.4518 s, 48.2 MB/s


   PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM  TIME+  COMMAND
  2126 root  20   0 1396m  31m 2360 S 40.5  0.4  13:28.89 glusterfsd


Same test, on OVIRT host but against NFS mount point:
dd if=/dev/zero of=/mnt/rep2-nfs/test1 bs=4k count=50
50+0 records in
50+0 records out
204800 tel:204800 tel:204800 tel:204800 bytes 
(2.0 GB) copied, 18.8911 s, 108 MB/s


PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM  TIME+  COMMAND
  2141 root  20   0  550m 184m 2840 R 84.6  2.3  16:43.10 glusterfs
  2126 root  20   0 1407m  30m 2368 S 49.8  0.4  13:49.07 glusterfsd

   

Re: [Users] about the size of an offline snapshot

2014-01-29 Thread Maor Lipchuk
Hi Sandy,

virtual size is the size of the disk the VM knows, it is actually the
size you chose to create it with.
The true size is the summerise of all the true size which the volumes
related to disk.

So for example if you have one disk of 20G and you occupied 18GB of it.
Then you created a snapshot and you occupied 4GB of it, you might see
that the virtual size will still be 20GB though the true size will be 22 GB

You can also check in the host the commands:
vdsClient 0 getVmStats vmId or vdsClient 0 getAllVmStats.

Regards,
Maor

On 01/29/2014 04:13 AM, Sandy Sun wrote:
 Assign 20G virtio disk for VM, create an offline snapshot, find the true
 size of Vm-disk bigger than the virtual size (assign size)?
  
 I want to know how to compute the true size  of vm-disk. Anbody can tell
 me the answer ? thanks.
 
 
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ***UNCHECKED*** Re: [vdsm] ovirtmgmt vanishes after reboot

2014-01-29 Thread Fabian Deutsch
Am Dienstag, den 28.01.2014, 14:04 -0300 schrieb Francisco Pérez:
 On the node the content of that file is only this:
 
 
 [addresses]
 management_port = 54321
 
 
 [vars]
 ssl = true
 
 
 so i don't see any options wich will modify net config. Yes, on ovirt
 manager i have ticked save network configuration. 

Hey,

as Anotni noted - please open a bug for this. It seems that some
configuration file is not persisted.

 For the record im using oVirt Node Hypervisor release 3.0.1
 (1.0.2.el6)

That Node is very old, better would be the Node used on the testDay, but
I've take it down by now. We'll have an updated Node next week which you
can try.

- fabian

 
 Regards
 
 
 On Tue, Jan 28, 2014 at 1:34 PM, Antoni Segura Puimedon
 asegu...@redhat.com wrote:
 
 
 - Original Message -
  From: Federico Alberto Sayd fs...@uncu.edu.ar
  To: users@ovirt.org
  Sent: Tuesday, January 28, 2014 5:23:13 PM
  Subject: Re: [Users] ***UNCHECKED*** Re: [vdsm] ovirtmgmt
 vanishes after reboot
 
  On 28/01/14 11:27, Francisco Pérez wrote:
 
 
 
  Im setting up two networks. ovirtmgmt and a LAN. Both of
 them are gone after
  reboot on the node, its starts only with eth0 initial
 configuration.
 
  Antoni, here are the outputs you requested:
 
  [root@ovirth1 ~]# /etc/init.d/vdsmd status
  VDS daemon server is running
  [root@ovirth1 ~]# service vdsmd status
  VDS daemon server is running
  [root@ovirth1 ~]# virsh -r net-list
  setlocale: No such file or directory
  Name State Autostart Persistent
  --
  ;vdsmdummy; active no no
 
  [root@ovirth1 ~]# ls /etc/sysconfig/network-scripts/
  ifcfg-eth0 ifdown-eth ifdown-post ifdown-tunnel ifup-eth
 ifup-plip
  ifup-routes init.ipv6-global route-LAN
  ifcfg-lo ifdown-ippp ifdown-ppp ifup ifup-ippp ifup-plusb
 ifup-sit
  net.hotplug route-ovirtmgmt
  ifdown ifdown-ipv6 ifdown-routes ifup-aliases ifup-ipv6
 ifup-post ifup-tunnel
  network-functions rule-LAN
  ifdown-bnep ifdown-isdn ifdown-sit ifup-bnep ifup-isdn
 ifup-ppp ifup-wireless
  network-functions-ipv6 rule-ovirtmgmt
 
 
 Just to maek sure, /etc/vdsm/vdsm.conf does not modify
 net_persistence nor
 net_configurator, does it?
 
 If it does not, and you ticked Save on the setupNetworks
 dialogs, there is
 some issue with the net persistence flow on the ovirt node and
 a bug should be
 opened for it.
 
 
  Dan those command are excecuted where? on the manager or the
 node? Im new at
  this and the logs are a little overwhelming so if you help
 me to interpret
  them i'll apreciate them. I have attached the whole log
 directory
  /var/log/vdsm/
 
  Regards
 
 
 
  On Thu, Jan 23, 2014 at 4:32 PM, Dan Kenigsberg 
 dan...@redhat.com  wrote:
 
 
 
  On Thu, Jan 23, 2014 at 05:12:17PM +0100, Fabian Deutsch
 wrote:
   Am Donnerstag, den 23.01.2014, 12:34 -0300 schrieb
 Francisco Pérez:
Hi. I have configured an ovirt-engine to manage an ovirt
 node
succesfully. I have created VM and all runs well but im
 trying to do
some escenario testing and i have found an issue.
   
   
If somehow the ovirt node is rebooted(Power failure,
 crash, manual
reboot, etc) when its comes up it does not start the
 ovirtmgmt
network. Even if in the ovirt-engine you save network
 configuration or
if you execute persist
on /etc/sysconfig/network-scripts/ifcfg-ovirtmgmt. It
 tells you it's
already been added to the configuration.
   
   
Is this normal behavior after ovirt node's is rebooted?
 Am i doing
something wrong?
  
   Hey Francisco,
  
   that sounds like a bug.
   VDSM Devs, how can we debug why the network is not brought
 up?
 
  Could you share vdsm.log and supervdsm.log? When you set up
 ovirtmgmt,
  as setupNetworks command should be visible there. Later, on
 save, a
  call to setSafeNetworkConfig is sent, which is expected to
 persist your
  network configuration.
 
  Is it visible? Is it successful?
 
  What do you see in the logs post boot? Can you tell whether
  the vdsm-restore-net-config.init server was run? (its job is

Re: [Users] Hosted Engine adding host SSL Failure (w/ engine custom cert)

2014-01-29 Thread Yedidyah Bar David
 From: Andrew Lau and...@andrewklau.com
 To: Yedidyah Bar David d...@redhat.com
 Cc: users users@ovirt.org
 Sent: Wednesday, January 29, 2014 10:17:21 AM
 Subject: Re: [Users] Hosted Engine adding host SSL Failure (w/ engine custom
 cert)

 Shame about the way the CA works, may be worth putting a reverse proxy in
 front as unsigned SSL can be a deal breaker.

Perhaps. Would you like to open a bug? 

 Anyway, my vdsm.log is here http://www.fpaste.org/72643/98338713/

 When it's Still waiting for VDSM host to become operational.. there is no
 output in vdsm.log

Sorry, didn't follow. What this refers to? The ssl issue or something else? 

Thanks! 
-- 
Didi 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] [vdsm] ovirtmgmt vanishes after reboot

2014-01-29 Thread Fabian Deutsch
Am Dienstag, den 28.01.2014, 17:42 -0500 schrieb Antoni Segura Puimedon:
 
 We made some more tests with Francisco on #ovirt today and we saw that
 for some unknown reason, when rebooting the machine the
 ifcfg-YOURNET
 files disappear while the rule-YOURNET and route-YOURNET stay.
 That
 happens even when the both ifcfg, rule and route have the correct
 entry
 in /config/files and are bound to /config/etc/sysconfig/
 
 After reboot it returns to eth0 having the connectivity as it was
 defined
 with the TUI.
 
 For libvirt networks a bit of the same as for ifcfg files. Before
 reboot
 /etc/libvirt/qemu/networks is present in /config/files and findmnt
 reports
 that /etc/libvirt/qemu/networks is a bound mount
 of /config/etc/libvirt/qemu/networks.
 After reboot it is not the case anymore.
 
 @Fabian: Do you think there is something on the reboot that restores
 the
 network conf to the TUI settings?

Hey Antoni,

I vaguely remember that we had a problem with Node removing some ifcfg-*
files. But this should have been solved in the TestDay iso.

What ISO did you use to reproduce this problem?

- fabian


signature.asc
Description: This is a digitally signed message part
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] [vdsm] ovirtmgmt vanishes after reboot

2014-01-29 Thread Antoni Segura Puimedon


- Original Message -
 From: Fabian Deutsch fabi...@redhat.com
 To: Antoni Segura Puimedon asegu...@redhat.com
 Cc: Dan Kenigsberg dan...@redhat.com, Francisco Pérez 
 fpere...@gmail.com, amul...@redhat.com, VDSM Project
 Development vdsm-de...@lists.fedorahosted.org, users@ovirt.org
 Sent: Wednesday, January 29, 2014 10:59:43 AM
 Subject: Re: [vdsm] [Users] ovirtmgmt vanishes after reboot
 
 Am Dienstag, den 28.01.2014, 17:42 -0500 schrieb Antoni Segura Puimedon:
  
  We made some more tests with Francisco on #ovirt today and we saw that
  for some unknown reason, when rebooting the machine the
  ifcfg-YOURNET
  files disappear while the rule-YOURNET and route-YOURNET stay.
  That
  happens even when the both ifcfg, rule and route have the correct
  entry
  in /config/files and are bound to /config/etc/sysconfig/
  
  After reboot it returns to eth0 having the connectivity as it was
  defined
  with the TUI.
  
  For libvirt networks a bit of the same as for ifcfg files. Before
  reboot
  /etc/libvirt/qemu/networks is present in /config/files and findmnt
  reports
  that /etc/libvirt/qemu/networks is a bound mount
  of /config/etc/libvirt/qemu/networks.
  After reboot it is not the case anymore.
  
  @Fabian: Do you think there is something on the reboot that restores
  the
  network conf to the TUI settings?
 
 Hey Antoni,
 
 I vaguely remember that we had a problem with Node removing some ifcfg-*
 files. But this should have been solved in the TestDay iso.
 
 What ISO did you use to reproduce this problem?

We reproduced on Francisco's setup, I think it was oVirt node 3.0.1
 
 - fabian
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] [vdsm] ovirtmgmt vanishes after reboot

2014-01-29 Thread Fabian Deutsch
Am Mittwoch, den 29.01.2014, 05:02 -0500 schrieb Antoni Segura Puimedon:
 - Original Message -
  From: Fabian Deutsch fabi...@redhat.com
  To: Antoni Segura Puimedon asegu...@redhat.com
  Cc: Dan Kenigsberg dan...@redhat.com, Francisco Pérez
 fpere...@gmail.com, amul...@redhat.com, VDSM Project
  Development vdsm-de...@lists.fedorahosted.org, users@ovirt.org
  Sent: Wednesday, January 29, 2014 10:59:43 AM
  Subject: Re: [vdsm] [Users] ovirtmgmt vanishes after reboot
  
  Am Dienstag, den 28.01.2014, 17:42 -0500 schrieb Antoni Segura
 Puimedon:
   
   We made some more tests with Francisco on #ovirt today and we saw
 that
   for some unknown reason, when rebooting the machine the
   ifcfg-YOURNET
   files disappear while the rule-YOURNET and route-YOURNET stay.
   That
   happens even when the both ifcfg, rule and route have the correct
   entry
   in /config/files and are bound to /config/etc/sysconfig/
   
   After reboot it returns to eth0 having the connectivity as it was
   defined
   with the TUI.
   
   For libvirt networks a bit of the same as for ifcfg files. Before
   reboot
   /etc/libvirt/qemu/networks is present in /config/files and findmnt
   reports
   that /etc/libvirt/qemu/networks is a bound mount
   of /config/etc/libvirt/qemu/networks.
   After reboot it is not the case anymore.
   
   @Fabian: Do you think there is something on the reboot that
 restores
   the
   network conf to the TUI settings?
  
  Hey Antoni,
  
  I vaguely remember that we had a problem with Node removing some
 ifcfg-*
  files. But this should have been solved in the TestDay iso.
  
  What ISO did you use to reproduce this problem?
 
 We reproduced on Francisco's setup, I think it was oVirt node 3.0.1

Right.
I believe the bug you are seeing is:
http://gerrit.ovirt.org/#/c/20068/

This has been merged into out stable branch, but the release of a new
oVirt Node ISO for 3.3 is pending because of the vdsm-python-cpopen
dependency problem.

We plan to do a build early next week.

- fabian


signature.asc
Description: This is a digitally signed message part
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Extremely poor disk access speeds in Windows guest

2014-01-29 Thread Vadim Rozenfeld
On Wed, 2014-01-29 at 11:30 +0200, Ronen Hod wrote:
 Adding the virtio-scsi developers.
 Anyhow, virtio-scsi is newer and less established than viostor (the
 block device), so you might want to try it out.

[VR]
Was it SCSI Controller or SCSI pass-through controller?
If it's SCSI Controller then it will be viostor (virtio-blk) device
driver.


 A disclaimer: There are time and patches gaps between RHEL and other
 versions.
 
 Ronen.
 
 On 01/28/2014 10:39 PM, Steve Dainard wrote:
 
  I've had a bit of luck here. 
  
  
  Overall IO performance is very poor during Windows updates, but a
  contributing factor seems to be the SCSI Controller device in the
  guest. This last install I didn't install a driver for that device,

[VR]
Does it mean that your system disk is IDE and the data disk (virtio-blk)
is not accessible? 

   and my performance is much better. Updates still chug along quite
  slowly, but I seem to have more than the  100KB/s write speeds I
  was seeing previously.
  
  
  Does anyone know what this device is for? I have the Red Hat VirtIO
  SCSI Controller listed under storage controllers.

[VR]
It's a virtio-blk device. OS cannot see this volume unless you have
viostor.sys driver installed on it.

  
  Steve Dainard 
  IT Infrastructure Manager
  Miovision | Rethink Traffic
  519-513-2407 ex.250
  877-646-8476 (toll-free)
  
  Blog  |  LinkedIn  |  Twitter  |  Facebook 
  
  Miovision Technologies Inc. | 148 Manitou Drive, Suite 101,
  Kitchener, ON, Canada | N2C 1L3
  This e-mail may contain information that is privileged or
  confidential. If you are not the intended recipient, please delete
  the e-mail and any attachments and notify us immediately.
  
  
  On Sun, Jan 26, 2014 at 2:33 AM, Itamar Heim ih...@redhat.com
  wrote:
  On 01/26/2014 02:37 AM, Steve Dainard wrote:
  
  Thanks for the responses everyone, really appreciate
  it.
  
  I've condensed the other questions into this reply.
  
  
  Steve,
  What is the CPU load of the GlusterFS host when
  comparing the raw
  brick test to the gluster mount point test? Give
  it 30 seconds and
  see what top reports. You’ll probably have to
  significantly increase
  the count on the test so that it runs that long.
  
  - Nick
  
  
  
  Gluster mount point:
  
  *4K* on GLUSTER host
  [root@gluster1 rep2]# dd if=/dev/zero
  of=/mnt/rep2/test1 bs=4k count=50
  50+0 records in
  50+0 records out
  
  204800 tel:204800 bytes (2.0 GB) copied,
  100.076 s, 20.5 MB/s 
  
  
  Top reported this right away:
  PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM
   TIME+  COMMAND
1826 root  20   0  294m  33m 2540 S 27.2  0.4
  0:04.31 glusterfs
2126 root  20   0 1391m  31m 2336 S 22.6  0.4
   11:25.48 glusterfsd
  
  Then at about 20+ seconds top reports this:
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM
 TIME+  COMMAND
1826 root  20   0  294m  35m 2660 R 141.7  0.5
  1:14.94 glusterfs
2126 root  20   0 1392m  31m 2344 S 33.7  0.4
   11:46.56 glusterfsd
  
  *4K* Directly on the brick:
  dd if=/dev/zero of=test1 bs=4k count=50
  50+0 records in
  50+0 records out
  
  204800 tel:204800 bytes (2.0 GB) copied,
  4.99367 s, 410 MB/s 
  
  
7750 root  20   0  102m  648  544 R 50.3  0.0
  0:01.52 dd
7719 root  20   0 000 D  1.0  0.0
  0:01.50 flush-253:2
  
  Same test, gluster mount point on OVIRT host:
  dd if=/dev/zero of=/mnt/rep2/test1 bs=4k
  count=50
  50+0 records in
  50+0 records out
  
  204800 tel:204800 bytes (2.0 GB) copied,
  42.4518 s, 48.2 MB/s 
  
  
 PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEM
 TIME+  COMMAND
2126 root

Re: [Users] Hosted Engine adding host SSL Failure (w/ engine custom cert)

2014-01-29 Thread Andrew Lau
On Wed, Jan 29, 2014 at 8:57 PM, Yedidyah Bar David d...@redhat.com wrote:

  *From: *Andrew Lau and...@andrewklau.com
 *To: *Yedidyah Bar David d...@redhat.com
 *Cc: *users users@ovirt.org
 *Sent: *Wednesday, January 29, 2014 10:17:21 AM

 *Subject: *Re: [Users] Hosted Engine adding host SSL Failure (w/ engine
 custom cert)

 Shame about the way the CA works, may be worth putting a reverse proxy in
 front as unsigned SSL can be a deal breaker.


 Perhaps. Would you like to open a bug?


 Anyway, my vdsm.log is here http://www.fpaste.org/72643/98338713/

 When it's Still waiting for VDSM host to become operational.. there is
 no output in vdsm.log


 Sorry, didn't follow. What this refers to? The ssl issue or something else?


So the output of the install process is like this:

[ INFO  ] Engine replied: DB Up!Welcome to Health Status!
[ ERROR ] Cannot automatically add the host to the Default cluster: Entity
not found: Cluster: name=Default
[ INFO  ] Waiting for the host to become operational in the engine. This
may take several minutes...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ INFO  ] Still waiting for VDSM host to become operational...
[ ERROR ] Timed out while waiting for host to start. Please check the logs.
[ ERROR ] Unable to add HV02 to the manager
[ INFO  ] Enabling and starting HA services
  Hosted Engine successfully set up
[ INFO  ] Stage: Clean up
[ INFO  ] Stage: Pre-termination
[ INFO  ] Stage: Termination


During that whole Still waiting for VDSM host to become operational...
The vdsm.log doesn't report anything at all not until the timeout




 Thanks!
 --
 Didi


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] replace engine hostname /pki

2014-01-29 Thread Sven Kieske
Hi,

I'm just curious, is this wiki page
still correct for 3.3.2 ?
(It mentions 3.1)

http://www.ovirt.org/How_to_change_engine_host_name

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Storage unresponsive after sanlock

2014-01-29 Thread Maor Lipchuk
The VDSM log seems to be from the 26th and from the engine logs it seems
that the incident occurred at the 24th, so I can't really see the what
happened in VDSM that time.

From the engine logs it seems that at around 2014-01-24 16:59 the master
storage domain was in maintenance and then there was an attempt to
activate it, but VDSM threw an exception that it cannot find master
domain with the arguments of
spUUID=5849b030-626e-47cb-ad90-3ce782d831b3,
msdUUID=7c49750d-7eae-4cd2-9b63-1dc71f357b88'

This could be happen from various reasons, for example a failure in
connecting the storage (for example see https://bugzilla.redhat.com/782864)

Since you mentioned that once you have added a second node and it
worked, it seems like to origin of the problem is in the Host it self.

what are the differences between the two hosts (VDSM version, OS version)
Does the first host succeeded to work on other DC?
Have you tried to reinstall it?

Regards,
Maor




On 01/29/2014 02:50 AM, Trey Dockendorf wrote:
 See attached.  The event seems to have begun around 06:00:00 on
 2014-01-26.  I was unable to get the single node cluster back online
 so I provisioned another node to add to the cluster, which became the
 SPM.  Adding the second node worked and I had to power cycle the node
 that hung as sanlock was in a zombie state.  This is my first attempt
 at production use of NFS over RDMA and I'd like to rule out that being
 the cause.  Since the issue I've changed the 'nfs_mount_options' in
 /etc/vdsm/vdsm.conf to 'soft,nosharecache,rdma,port=20049'.  The
 options during the crash were only 'rdma,port=20049'.  I am also
 forcing NFSv3 by setting 'Nfsvers=3' in /etc/nfsmount.conf, which is
 still in place and was in place during the crash.
 
 Thanks
 - Trey
 
 On Tue, Jan 28, 2014 at 2:45 AM, Maor Lipchuk mlipc...@redhat.com wrote:
 Hi Trey,

 Can you please also attach the engine/vdsm logs.

 Thanks,
 Maor

 On 01/27/2014 06:12 PM, Trey Dockendorf wrote:
 I setup my first oVirt instance since 3.0 a few days ago and it went
 very well, and I left the single host cluster running with 1 VM over
 the weekend.  Today I come back and the primary data storage is marked
 as unresponsive.  The logs are full of entries [1] that look very
 similar to a knowledge base article on RHEL's website [2].

 This setup is using NFS over RDMA and so far the ib interfaces report
 no errors (via `ibcheckerrs -v LID 1`).  Based on a doc on ovirt
 site [3] it seems this could be due to response problems.  The storage
 system is a new purchase and not yet in production so if there's any
 advice on how to track down the cause that would be very helpful.
 Please let me know what additional information would be helpful as
 it's been about a year since I've been active in the oVirt community.

 Thanks
 - Trey

 [1]: http://pastebin.com/yRpSLKxJ

 [2]: https://access.redhat.com/site/solutions/400463

 [3]: http://www.ovirt.org/SANLock
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users


 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] replace engine hostname /pki

2014-01-29 Thread Yedidyah Bar David
- Original Message -
 From: Sven Kieske s.kie...@mittwald.de
 To: Users@ovirt.org List Users@ovirt.org
 Sent: Wednesday, January 29, 2014 12:20:58 PM
 Subject: [Users] replace engine hostname /pki
 
 Hi,
 
 I'm just curious, is this wiki page
 still correct for 3.3.2 ?
 (It mentions 3.1)
 
 http://www.ovirt.org/How_to_change_engine_host_name

It was actually replaced with a utility that does that:

http://www.ovirt.org/Changing_Engine_Hostname

You might want to add a link there. I noticed that there are other such
pages and did not bother to fix them all, some in other sites :-(

Best,
-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Hosted Engine adding host SSL Failure (w/ engine custom cert)

2014-01-29 Thread Yedidyah Bar David
 From: Andrew Lau and...@andrewklau.com
 To: Yedidyah Bar David d...@redhat.com
 Cc: users users@ovirt.org
 Sent: Wednesday, January 29, 2014 12:19:56 PM
 Subject: Re: [Users] Hosted Engine adding host SSL Failure (w/ engine custom
 cert)

 On Wed, Jan 29, 2014 at 8:57 PM, Yedidyah Bar David  d...@redhat.com 
 wrote:

   From: Andrew Lau  and...@andrewklau.com 
  
 
   To: Yedidyah Bar David  d...@redhat.com 
  
 
   Cc: users  users@ovirt.org 
  
 
   Sent: Wednesday, January 29, 2014 10:17:21 AM
  
 

   Subject: Re: [Users] Hosted Engine adding host SSL Failure (w/ engine
   custom
   cert)
  
 

   Shame about the way the CA works, may be worth putting a reverse proxy in
   front as unsigned SSL can be a deal breaker.
  
 

  Perhaps. Would you like to open a bug?
 

   Anyway, my vdsm.log is here http://www.fpaste.org/72643/98338713/
  
 

   When it's Still waiting for VDSM host to become operational.. there is
   no
   output in vdsm.log
  
 

  Sorry, didn't follow. What this refers to? The ssl issue or something else?
 

 So the output of the install process is like this:

 [ INFO ] Engine replied: DB Up!Welcome to Health Status!
 [ ERROR ] Cannot automatically add the host to the Default cluster: Entity
 not found: Cluster: name=Default
 [ INFO ] Waiting for the host to become operational in the engine. This may
 take several minutes...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ INFO ] Still waiting for VDSM host to become operational...
 [ ERROR ] Timed out while waiting for host to start. Please check the logs.
 [ ERROR ] Unable to add HV02 to the manager
 [ INFO ] Enabling and starting HA services
 Hosted Engine successfully set up
 [ INFO ] Stage: Clean up
 [ INFO ] Stage: Pre-termination
 [ INFO ] Stage: Termination

 During that whole Still waiting for VDSM host to become operational...
 The vdsm.log doesn't report anything at all not until the timeout

Can you please post full logs of hosted-engine-setup, vdsm, and 
hosted-engine-ha? 

I looked at previous posts and only found setup logs with the external ca cert. 
Your 
http://www.fpaste.org/72643/98338713/ starts at 19:03:31 where the problem 
might 
be much earlier. 

Thanks a lot, 
-- 
Didi 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Vm's being paused

2014-01-29 Thread Dafna Ron
The reason I asked about the size if because this was the original issue 
no? vm's pausing on lack of space?


You're having a problem with your data domains.
Can you check the rout from the hosts to the storage? I think that you 
have some disconnection to the storage from the hosts
since it's random and not from all the vm's I would suggest that its a 
routing problem?

Thanks,
Dafna


On 01/29/2014 08:00 AM, Neil wrote:

Sorry, more on this issue, I see my logs are rapidly filling up my
disk space on node02 with this error in /var/log/messages...

Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
vmId=`dfa2cf7c-3f0e-42e3-b495-10ccb3e0c71b`::Stats function failed:
AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
in collect#012statsFunction()#012  File
/usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
line 513, in _highWrite#012self._vm._dom.blockInfo(vmDrive.path,
0)#012  File /usr/share/vdsm/vm.py, line 835, in f#012ret =
attr(*args, **kwargs)#012  File
/usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py, line
76, in wrapper#012ret = f(*args, **kwargs)#012  File
/usr/lib64/python2.6/site-packages/libvirt.py, line 1814, in
blockInfo#012if ret is None: raise libvirtError
('virDomainGetBlockInfo() failed', dom=self)#012libvirtError: invalid
argument: invalid path
/rhev/data-center/mnt/blockSD/0e6991ae-6238-4c61-96d2-ca8fed35161e/images/fac8a3bb-e414-43c0-affc-6e2628757a28/6c3e5ae8-23fc-4196-ba42-778bdc0fbad8
not assigned to domain
Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
vmId=`ac2a3f99-a6db-4cae-955d-efdfb901abb7`::Stats function failed:
AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
in collect#012statsFunction()#012  File
/usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
line 509, in _highWrite#012if not vmDrive.blockDev or
vmDrive.format != 'cow':#012AttributeError: 'Drive' object has no
attribute 'format'

Not sure if this is related at all though?

Thanks.

Regards.

Neil Wilson.

On Wed, Jan 29, 2014 at 9:02 AM, Neil nwilson...@gmail.com wrote:

Hi Dafna,

Thanks for clarifying that, I found the migration issue and this was
resolved once I sorted out the ISO domain problem.

I'm sorry I don't understand your last question?
 after the engine restart, do you still see a problem with the size
or did the report of size changed?

The migration issue was resolved, it's now just trying to track down
why the two VM's paused on their own, one on the 8th of Jan(I think)
and one on the 19th of Jan.

Thank you.


Regards.

Neil Wilson.


On Tue, Jan 28, 2014 at 8:18 PM, Dafna Ron d...@redhat.com wrote:

yes - engine lost communication with vdsm and it has no way of knowing if
the host is down or if there was a network issue so a network issue would
cause the same errors that I see in the logs.

The error you put on the iso is the reason the vm's have failed migration -
if a vm is run with a cd and the cd is gone than the vm will not be able to
be migrated.

after the engine restart, do you still see a problem with the size or did
the report of size changed?

Dafna


On 01/28/2014 01:02 PM, Neil wrote:

Hi Dafna,

Thanks for coming back to me. I'll try answer your queries one by one.

On Tue, Jan 28, 2014 at 1:38 PM, Dafna Ron d...@redhat.com wrote:

you had a problem with your storage on the 14th of Jan and one of the
hosts
rebooted (if you have the vdsm log from that day than I can see what
happened on vdsm side)
in engine, I could see a problem with the export domain and this should
not
have cause a reboot.

1.) I don't unfortunately have logs going back that far. Looking at
all 3 hosts uptime, the one with the least uptime is 21 days, the
others are all over 40 days, so there definitely wasn't a host that
rebooted on the 14th of Jan, would a network issue or Firewall issue
also cause the error you've seen to look as if a host rebooted? There
was a bonding mode change on the 14th of January, so perhaps this
caused the issue?



Can you tell me if you had a problem with the data
domain as well or was it just the export domain? were you having any vm's
exported/imported at that time?
In any case - this is a bug.

2.) I think this was the same day that the bonding mode was changed on
the host while the host was live (by mistake), and had SPM running on
it. I haven't done any importing or exporting for a few years on this
oVirt setup.



As for the vm's - if the vm's are no longer in migrating state than
please
restart ovirt-engine service (looks like a cache issue)

3.) Restarted ovirt-engine, logging now appears to be normal without any
errors.



if they are in migrating state - there should have been a timeout a long
time ago.
can you please run 

Re: [Users] replace engine hostname /pki

2014-01-29 Thread Sven Kieske
I updated the wiki page with:

This procedure is obsoleted by Changing_Engine_Hostname in oVirt 3.3.2

but I'm not sure this is right, since which version does this script
work? 3.3.1? 3.2.3?

Am 29.01.2014 11:34, schrieb Yedidyah Bar David:
 - Original Message -
 From: Sven Kieske s.kie...@mittwald.de
 To: Users@ovirt.org List Users@ovirt.org
 Sent: Wednesday, January 29, 2014 12:20:58 PM
 Subject: [Users] replace engine hostname /pki

 Hi,

 I'm just curious, is this wiki page
 still correct for 3.3.2 ?
 (It mentions 3.1)

 http://www.ovirt.org/How_to_change_engine_host_name
 
 It was actually replaced with a utility that does that:
 
 http://www.ovirt.org/Changing_Engine_Hostname
 
 You might want to add a link there. I noticed that there are other such
 pages and did not bother to fix them all, some in other sites :-(
 
 Best,
 

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] replace engine hostname /pki

2014-01-29 Thread Sven Kieske
Additional question regarding the certificates/pki:

the wikipage states:

The bigger concern is with the engine's certificate. Currently, to the
best of our knowledge, there is no component that actually checks this
trust.
(All three certificates (CA, httpd, engine) are for the Common Name (CN)
whose value is the hostname entered during engine-setup, which is
supposed to be the hostname of the engine's machine, exist in the dns
(forward and reverse records), and point to an IP address of the
engine's machine. )

Is there a list of values that get checked? e.g. the validity dates
before and after?

users might run into trouble in 10 years if this gets checked, because
that is the current expiration date.

if _nothing_ gets checked I wonder why the PKI is used at all ;)

(I assume at least the keys get checked)

Am 29.01.2014 11:34, schrieb Yedidyah Bar David:
 It was actually replaced with a utility that does that:
 
 http://www.ovirt.org/Changing_Engine_Hostname
 
 You might want to add a link there. I noticed that there are other such
 pages and did not bother to fix them all, some in other sites :-(

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] replace engine hostname /pki

2014-01-29 Thread Yedidyah Bar David
- Original Message -
 From: Sven Kieske s.kie...@mittwald.de
 To: Yedidyah Bar David d...@redhat.com
 Cc: Users@ovirt.org List Users@ovirt.org
 Sent: Wednesday, January 29, 2014 1:15:30 PM
 Subject: Re: [Users] replace engine hostname /pki
 
 I updated the wiki page with:
 
 This procedure is obsoleted by Changing_Engine_Hostname in oVirt 3.3.2
 
 but I'm not sure this is right, since which version does this script
 work? 3.3.1? 3.2.3?

3.3.0. Actually since the nightly builds of Aug 8 or so [1]

[1] http://gerrit.ovirt.org/17408

Thanks!
-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] How to upload images to a local storage ISO domain

2014-01-29 Thread Dresel Christopher
Hi,

when using the engine-iso-uploader (the only documented tool I found) for a 
local storage ISO domain, I get the following error:

ovirt-iso-uploader upload --iso-domain=ISOs file.iso –v

ERROR: mount.nfs: Failed to resolve server None: Name or service not known

It looks like engine-iso-uploader tries to connect as if it is an NFS storage?

What is the procedure for uploading isos to local storage domains? Are there 
other options than using the engine-iso-uploader?

Best Regards,
Christopher Dresel




Fachhochschule Wiener Neustadt für Wirtschaft und Technik Ges.m.b.H.
University of Applied Sciences Wiener Neustadt for Business and Engineering 
Ltd., Austria
Johannes Gutenberg-Straße 3
2700 Wiener Neustadt
Austria, Europe
ATU: 37772406
Firmenbuchnummer: 77005v
Firmenbuchgericht: Landesgericht Wiener Neustadt
DVR: 0769665
Der Inhalt dieses E-Mails ist ausschliesslich fuer den bezeichneten Adressaten 
bestimmt. Jede Form der Kenntnisnahme, Veroeffentlichung, Vervielfaeltigung 
oder Weitergabe des Inhalts dieses E-Mails durch unberechtigte Dritte ist 
unzulaessig. Wir bitten Sie, sich mit dem Absender des E-Mails in Verbindung zu 
setzen, falls Sie nicht der Adressat dieses E-Mails sind und das Material von 
Ihrem Computer zu loeschen.

This e-mail and any attachments are confidential and intended solely for the 
addressee. The perusal, publication, copying or dissemination of the contents 
of this e-mail by unauthorised third parties is prohibited. If you are not the 
intended recipient of this e-mail, please delete it and immediately notify the 
sender.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] How to upload images to a local storage ISO domain

2014-01-29 Thread Yedidyah Bar David
 From: Dresel Christopher dre...@fhwn.ac.at
 To: users@ovirt.org
 Sent: Wednesday, January 29, 2014 1:33:21 PM
 Subject: [Users] How to upload images to a local storage ISO domain

 Hi,

 when using the engine-iso-uploader (the only documented tool I found) for a
 local storage ISO domain, I get the following error:

 ovirt-iso-uploader upload --iso-domain=ISOs file.iso –v

 ERROR: mount.nfs: Failed to resolve server None: Name or service not known

 It looks like engine-iso-uploader tries to connect as if it is an NFS
 storage?

The manpage says: The default transport is NFS. However, you can use SSH as 
the transport instead. 

 What is the procedure for uploading isos to local storage domains? Are there
 other options than using the engine-iso-uploader?

You can also place your iso image directly in the right location, which is 
/PATH/UUID/images/----/ 
where PATH is the root of the domain , UUID is its uuid (some random string). 
-- 
Didi 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] How to upload images to a local storage ISO domain

2014-01-29 Thread Frank Wall
Hi Christopher,

On Wed, Jan 29, 2014 at 11:33:21AM +, Dresel Christopher wrote:
 What is the procedure for uploading isos to local storage domains? 
 Are there other options than using the engine-iso-uploader?

you can manually copy/rsync your files directly to the locally 
mounted directory of your ISO storage domain:

/rhev/data-center/mnt/node01:_nfs_iso/662b24c0-68bb-4dd2-a8db-5fe67276921f/images/----/

The images/----/ subfolder is where
you should place all your ISO images.


Regards
- Frank
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] How to upload images to a local storage ISO domain

2014-01-29 Thread Sven Kieske
Hi,

you do not upload to the local storage data domain

you must make sure you have an ISO-Domain (it doesn't sound like you got
one?).

also make sure to check firewalld/iptables to allow nfs.

ISO-Domains are always nfs, even in local storage dc (and can even
reside on a remote host).

HTH

Am 29.01.2014 12:33, schrieb Dresel Christopher:
 Hi,
 
 when using the engine-iso-uploader (the only documented tool I found) for a 
 local storage ISO domain, I get the following error:
 
 ovirt-iso-uploader upload --iso-domain=ISOs file.iso –v
 
 ERROR: mount.nfs: Failed to resolve server None: Name or service not known
 
 It looks like engine-iso-uploader tries to connect as if it is an NFS storage?
 
 What is the procedure for uploading isos to local storage domains? Are there 
 other options than using the engine-iso-uploader?
 
 Best Regards,
 Christopher Dresel


-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Volume Group does not exist. Blame device-mapper ?

2014-01-29 Thread Maor Lipchuk
Hi Nicolas,

Can u please attach the VDSM logs of the problematic nodes and valid
nodes, the engine log and also the sanlock log.

You wrote that many nodes suddenly began to become
unresponsive,
Do you mean that the hosts switched to non-responsive status in the engine?
I'm asking that because non-responsive status indicate that the engine
could not communicate with the hosts, it could be related to sanlock
since if the host encountered a problem to write to the master domain it
causes sanlock to restart VDSM and make the hosts non responsive.

regards,
Maor

On 01/27/2014 09:26 AM, Nicolas Ecarnot wrote:
 Le 26/01/2014 23:23, Itamar Heim a écrit :
 On 01/20/2014 12:06 PM, Nicolas Ecarnot wrote:
 Hi,

 oVirt 3.3, no big issue since the recent snapshot joke, but all in all
 running fine.

 All my VM are stored in a iSCSI SAN. The VM usually are using only one
 or two disks (1: system, 2: data) and it is OK.

 Friday, I created a new LUN. Inside a VM, I linked to it via iscsiadm
 and successfully login to the Lun (session, automatic attach on boot,
 read, write) : nice.

 Then after detaching it and shuting down the MV, and for the first time,
 I tried to make use of the feature direct attach to attach the disk
 directly from oVirt, login the session via oVirt.
 I connected nice and I saw the disk appear in my VM as /dev/sda or
 whatever. I was able to mount it, read and write.

 Then disaster stoke all this : many nodes suddenly began to become
 unresponsive, quickly migrating their VM to the remaining nodes.
 Hopefully, the migrations ran fine and I lost no VM nor downtime, but I
 had to reboot every concerned node (other actions failed).

 In the failing nodes, /var/log/messages showed the log you can read in
 the end of this message.
 I first get device-mapper warnings, then the host unable to collaborate
 with the logical volumes.

 The 3 volumes are the three main storage domains, perfectly up and
 running where I store my oVirt VMs.

 My reflexions :
 - I'm not sure device-mapper is to blame. I frequently see device mapper
 complaining and nothing is getting worse (not oVirt specifically)
 - I have not change my network settings for months (bonding, linking...)
 The only new factor is the usage of direct attach LUN.
 - This morning I was able to reproduce the bug, just by trying again
 this attachement, and booting the VM. No mounting of the LUN, just VM
 booting, waiting, and this is enough to crash oVirt.
 - when the disaster happens, usually, amongst the nodes, only three
 nodes gets stroke, the only one that run VMs. Obviously, after
 migration, different nodes are hosting the VMs, and those new nodes are
 the one that then get stroke.

 This is quite reproductible.

 And frightening.


 The log :

 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
 multipath: error getting device
 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error adding
 target to table
 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
 multipath: error getting device
 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error adding
 target to table
 Jan 20 10:20:47 serv-vm-adm11 vdsm TaskManager.Task ERROR
 Task=`847653e6-8b23-4429-ab25-257538b35293`::Unexpected
 error#012Traceback (most recent call last):#012  File
 /usr/share/vdsm/storage/task.py, line 857, in _run#012return
 fn(*args, **kargs)#012  File /usr/share/vdsm/logUtils.py, line 45, in
 wrapper#012res = f(*args, **kwargs)#012  File
 /usr/share/vdsm/storage/hsm.py, line 3053, in getVolumeSize#012
 volUUID, bs=1))#012  File /usr/share/vdsm/storage/volume.py, line 333,
 in getVSize#012mysd = sdCache.produce(sdUUID=sdUUID)#012  File
 /usr/share/vdsm/storage/sdc.py, line 98, in produce#012
 domain.getRealDomain()#012  File /usr/share/vdsm/storage/sdc.py, line
 52, in getRealDomain#012return
 self._cache._realProduce(self._sdUUID)#012  File
 /usr/share/vdsm/storage/sdc.py, line 122, in _realProduce#012 domain =
 self._findDomain(sdUUID)#012  File /usr/share/vdsm/storage/sdc.py,
 line 141, in _findDomain#012dom = findMethod(sdUUID)#012  File
 /usr/share/vdsm/storage/blockSD.py, line 1288, in findDomain#012
 return
 BlockStorageDomain(BlockStorageDomain.findDomainPath(sdUUID))#012  File
 /usr/share/vdsm/storage/blockSD.py, line 414, in __init__#012
 lvm.checkVGBlockSizes(sdUUID, (self.logBlkSize, self.phyBlkSize))#012
 File /usr/share/vdsm/storage/lvm.py, line 976, in
 checkVGBlockSizes#012raise se.VolumeGroupDoesNotExist(vg_uuid: %s
 % vgUUID)#012VolumeGroupDoesNotExist: Volume Group does not exist:
 ('vg_uuid: 1429ffe2-4137-416c-bb38-63fd73f4bcc1',)
 Jan 20 10:20:47 serv-vm-adm11 ¿11vdsm vm.Vm ERROR
 vmId=`2c0bbb51-0f94-4bf1-9579-4e897260f88e`::Unable to update the volume
 80bac371-6899-4fbe-a8e1-272037186bfb (domain:
 1429ffe2-4137-416c-bb38-63fd73f4bcc1 image:
 a5995c25-cdc9-4499-b9b4-08394a38165c) for the drive vda
 Jan 20 10:20:48 serv-vm-adm11 vdsm TaskManager.Task ERROR
 

Re: [Users] Volume Group does not exist. Blame device-mapper ?

2014-01-29 Thread Nicolas Ecarnot

Le 29/01/2014 13:29, Maor Lipchuk a écrit :

Hi Nicolas,

Can u please attach the VDSM logs of the problematic nodes and valid
nodes, the engine log and also the sanlock log.

You wrote that many nodes suddenly began to become
unresponsive,
Do you mean that the hosts switched to non-responsive status in the engine?
I'm asking that because non-responsive status indicate that the engine
could not communicate with the hosts, it could be related to sanlock
since if the host encountered a problem to write to the master domain it
causes sanlock to restart VDSM and make the hosts non responsive.

regards,
Maor


It will be hard work to provide these logs but I will try asap.
But to answer your question : the engine saw the failing nodes as 
unresponsive, but I was always fully able to ping them and ssh-log on them.


Is there some place I could read further doc about sanlock?

Nicolas Ecarnot



On 01/27/2014 09:26 AM, Nicolas Ecarnot wrote:

Le 26/01/2014 23:23, Itamar Heim a écrit :

On 01/20/2014 12:06 PM, Nicolas Ecarnot wrote:

Hi,

oVirt 3.3, no big issue since the recent snapshot joke, but all in all
running fine.

All my VM are stored in a iSCSI SAN. The VM usually are using only one
or two disks (1: system, 2: data) and it is OK.

Friday, I created a new LUN. Inside a VM, I linked to it via iscsiadm
and successfully login to the Lun (session, automatic attach on boot,
read, write) : nice.

Then after detaching it and shuting down the MV, and for the first time,
I tried to make use of the feature direct attach to attach the disk
directly from oVirt, login the session via oVirt.
I connected nice and I saw the disk appear in my VM as /dev/sda or
whatever. I was able to mount it, read and write.

Then disaster stoke all this : many nodes suddenly began to become
unresponsive, quickly migrating their VM to the remaining nodes.
Hopefully, the migrations ran fine and I lost no VM nor downtime, but I
had to reboot every concerned node (other actions failed).

In the failing nodes, /var/log/messages showed the log you can read in
the end of this message.
I first get device-mapper warnings, then the host unable to collaborate
with the logical volumes.

The 3 volumes are the three main storage domains, perfectly up and
running where I store my oVirt VMs.

My reflexions :
- I'm not sure device-mapper is to blame. I frequently see device mapper
complaining and nothing is getting worse (not oVirt specifically)
- I have not change my network settings for months (bonding, linking...)
The only new factor is the usage of direct attach LUN.
- This morning I was able to reproduce the bug, just by trying again
this attachement, and booting the VM. No mounting of the LUN, just VM
booting, waiting, and this is enough to crash oVirt.
- when the disaster happens, usually, amongst the nodes, only three
nodes gets stroke, the only one that run VMs. Obviously, after
migration, different nodes are hosting the VMs, and those new nodes are
the one that then get stroke.

This is quite reproductible.

And frightening.


The log :

Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
multipath: error getting device
Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error adding
target to table
Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
multipath: error getting device
Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error adding
target to table
Jan 20 10:20:47 serv-vm-adm11 vdsm TaskManager.Task ERROR
Task=`847653e6-8b23-4429-ab25-257538b35293`::Unexpected
error#012Traceback (most recent call last):#012  File
/usr/share/vdsm/storage/task.py, line 857, in _run#012return
fn(*args, **kargs)#012  File /usr/share/vdsm/logUtils.py, line 45, in
wrapper#012res = f(*args, **kwargs)#012  File
/usr/share/vdsm/storage/hsm.py, line 3053, in getVolumeSize#012
volUUID, bs=1))#012  File /usr/share/vdsm/storage/volume.py, line 333,
in getVSize#012mysd = sdCache.produce(sdUUID=sdUUID)#012  File
/usr/share/vdsm/storage/sdc.py, line 98, in produce#012
domain.getRealDomain()#012  File /usr/share/vdsm/storage/sdc.py, line
52, in getRealDomain#012return
self._cache._realProduce(self._sdUUID)#012  File
/usr/share/vdsm/storage/sdc.py, line 122, in _realProduce#012 domain =
self._findDomain(sdUUID)#012  File /usr/share/vdsm/storage/sdc.py,
line 141, in _findDomain#012dom = findMethod(sdUUID)#012  File
/usr/share/vdsm/storage/blockSD.py, line 1288, in findDomain#012
return
BlockStorageDomain(BlockStorageDomain.findDomainPath(sdUUID))#012  File
/usr/share/vdsm/storage/blockSD.py, line 414, in __init__#012
lvm.checkVGBlockSizes(sdUUID, (self.logBlkSize, self.phyBlkSize))#012
File /usr/share/vdsm/storage/lvm.py, line 976, in
checkVGBlockSizes#012raise se.VolumeGroupDoesNotExist(vg_uuid: %s
% vgUUID)#012VolumeGroupDoesNotExist: Volume Group does not exist:
('vg_uuid: 1429ffe2-4137-416c-bb38-63fd73f4bcc1',)
Jan 20 10:20:47 serv-vm-adm11 ¿11vdsm vm.Vm ERROR

Re: [Users] Volume Group does not exist. Blame device-mapper ?

2014-01-29 Thread Itamar Heim

On 01/29/2014 02:35 PM, Nicolas Ecarnot wrote:

Le 29/01/2014 13:29, Maor Lipchuk a écrit :

Hi Nicolas,

Can u please attach the VDSM logs of the problematic nodes and valid
nodes, the engine log and also the sanlock log.

You wrote that many nodes suddenly began to become
unresponsive,
Do you mean that the hosts switched to non-responsive status in the
engine?
I'm asking that because non-responsive status indicate that the engine
could not communicate with the hosts, it could be related to sanlock
since if the host encountered a problem to write to the master domain it
causes sanlock to restart VDSM and make the hosts non responsive.


non-resposneive for engine is if vdsm is up/responsive.
run locally;
# vdsClient -s 0 getVdsCaps

to check vdsm is ok



regards,
Maor


It will be hard work to provide these logs but I will try asap.
But to answer your question : the engine saw the failing nodes as
unresponsive, but I was always fully able to ping them and ssh-log on them.

Is there some place I could read further doc about sanlock?

Nicolas Ecarnot



On 01/27/2014 09:26 AM, Nicolas Ecarnot wrote:

Le 26/01/2014 23:23, Itamar Heim a écrit :

On 01/20/2014 12:06 PM, Nicolas Ecarnot wrote:

Hi,

oVirt 3.3, no big issue since the recent snapshot joke, but all in all
running fine.

All my VM are stored in a iSCSI SAN. The VM usually are using only one
or two disks (1: system, 2: data) and it is OK.

Friday, I created a new LUN. Inside a VM, I linked to it via iscsiadm
and successfully login to the Lun (session, automatic attach on boot,
read, write) : nice.

Then after detaching it and shuting down the MV, and for the first
time,
I tried to make use of the feature direct attach to attach the disk
directly from oVirt, login the session via oVirt.
I connected nice and I saw the disk appear in my VM as /dev/sda or
whatever. I was able to mount it, read and write.

Then disaster stoke all this : many nodes suddenly began to become
unresponsive, quickly migrating their VM to the remaining nodes.
Hopefully, the migrations ran fine and I lost no VM nor downtime,
but I
had to reboot every concerned node (other actions failed).

In the failing nodes, /var/log/messages showed the log you can read in
the end of this message.
I first get device-mapper warnings, then the host unable to
collaborate
with the logical volumes.

The 3 volumes are the three main storage domains, perfectly up and
running where I store my oVirt VMs.

My reflexions :
- I'm not sure device-mapper is to blame. I frequently see device
mapper
complaining and nothing is getting worse (not oVirt specifically)
- I have not change my network settings for months (bonding,
linking...)
The only new factor is the usage of direct attach LUN.
- This morning I was able to reproduce the bug, just by trying again
this attachement, and booting the VM. No mounting of the LUN, just VM
booting, waiting, and this is enough to crash oVirt.
- when the disaster happens, usually, amongst the nodes, only three
nodes gets stroke, the only one that run VMs. Obviously, after
migration, different nodes are hosting the VMs, and those new nodes
are
the one that then get stroke.

This is quite reproductible.

And frightening.


The log :

Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
multipath: error getting device
Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error
adding
target to table
Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
multipath: error getting device
Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error
adding
target to table
Jan 20 10:20:47 serv-vm-adm11 vdsm TaskManager.Task ERROR
Task=`847653e6-8b23-4429-ab25-257538b35293`::Unexpected
error#012Traceback (most recent call last):#012  File
/usr/share/vdsm/storage/task.py, line 857, in _run#012return
fn(*args, **kargs)#012  File /usr/share/vdsm/logUtils.py, line
45, in
wrapper#012res = f(*args, **kwargs)#012  File
/usr/share/vdsm/storage/hsm.py, line 3053, in getVolumeSize#012
volUUID, bs=1))#012  File /usr/share/vdsm/storage/volume.py, line
333,
in getVSize#012mysd = sdCache.produce(sdUUID=sdUUID)#012  File
/usr/share/vdsm/storage/sdc.py, line 98, in produce#012
domain.getRealDomain()#012  File /usr/share/vdsm/storage/sdc.py,
line
52, in getRealDomain#012return
self._cache._realProduce(self._sdUUID)#012  File
/usr/share/vdsm/storage/sdc.py, line 122, in _realProduce#012
domain =
self._findDomain(sdUUID)#012  File /usr/share/vdsm/storage/sdc.py,
line 141, in _findDomain#012dom = findMethod(sdUUID)#012  File
/usr/share/vdsm/storage/blockSD.py, line 1288, in findDomain#012
return
BlockStorageDomain(BlockStorageDomain.findDomainPath(sdUUID))#012
File
/usr/share/vdsm/storage/blockSD.py, line 414, in __init__#012
lvm.checkVGBlockSizes(sdUUID, (self.logBlkSize, self.phyBlkSize))#012
File /usr/share/vdsm/storage/lvm.py, line 976, in
checkVGBlockSizes#012raise se.VolumeGroupDoesNotExist(vg_uuid:
%s
% 

Re: [Users] Error: The following Network Interfaces were specified more than once:

2014-01-29 Thread Lior Vernia


On 29/01/14 10:27, Alan Murrell wrote:
 I have noticed what seems to me to be a quirk, but wanted to see if this
 is correct behaviour, or perhaps my configuration.
 
 I am running a test install ov oVirt 3.3.2 in an all-in-one install.  My
 test box only has one physical NIC, so I am using that for
 management/external network access, and I have create a dummy interface
 called dummy_0 to be used for the VMs.
 
 I created four networks, each assigned their own VLAN (I am wanted to
 simulate a real-world  .  I added the first network (VLAN001) to
 dummy_0 with no problem (I was able to save the configuration).  The
 system created the dummy_0.1.
 
 When I tried to add another network (VLAN002) to dummy_0 and save
 it, I got the following error:
 
 --- START ---
 Error while executing action:
 
 local_host:
 
 Cannot setup Networks. The following Network Interfaces were
 specified more than once: dummy_0.1.
 --- END ---
 
 I found that in the Setup Host Network if I dragged VLAN001 off
 dummy_0 to the right under Required, and left VLAN002 on dummy_0,
 I could save it, and VLAN001 appeared to remain active on the host. 
 Indeed, when I went back into Setup Host Network, both VLAN001 and
 VLAN002 were showing up on dummy_0.
 
 However, if I highlight VLAN002 under Networks and click on the
 Hosts tab, the Network Interface is showing a red down arrow,
 indicating it is not up.  The Network Interface for VLAN001 is a
 green up arrow.  I have refreshed several times and at this point in
 time, have left it for almost thirty minutes, but it remains as a red
 down arrow.  The same is true for the other two networks I tried adding:
 I was able to add them as above, but they are also showing a red down
 arrow.
 
 My questions:
 
   1.) Is doing what I had to do to add additional networks to an
 interface normal, or is it a quirk of using a dummy interface?
   2.) Any ideas why my additional networks are showing a red down
 arrow?  Again, maybe a quirk of using a dummy interface?

It's a quirk of oVirt at the moment :) The interface state as shown in
the GUI currently has very little to do with whether there's
connectivity on a network. There should be an open RFE for connectivity
indicators, though I can't seem to find it at the moment.

What it SHOULD tell you is whether a link is up or down on the host
(administrative state), but according to this bug it doesn't even do
that (will be fixed soon):
https://bugzilla.redhat.com/show_bug.cgi?id=1038741

 
 I am going to leave it overnight and see what happens (maybe some
 background process needs to run??), but if that does not work, I am sure
 I can scrounge up another NIC in the next couple of days and install it
 into my test server to see if it does the same thing.
 
 Thanks, in advance.
 
 -Alan
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Ovirt 3.4 - Fail to set permissions to VM

2014-01-29 Thread Jonas Israelsson


On 29/01/14 07:29, Oved Ourfalli wrote:

Hi Jonas

Apparently there is a quite new bug open about this issue 
(https://bugzilla.redhat.com/1057147).
CC-ing Juan and Yair - perhaps the'll know what's the source of the issue, as I 
think they were the last ones to make changes in it.

Jupp, got it.

Sorry for not checking there first..


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Volume Group does not exist. Blame device-mapper ?

2014-01-29 Thread Nicolas Ecarnot

Le 29/01/2014 13:36, Itamar Heim a écrit :

On 01/29/2014 02:35 PM, Nicolas Ecarnot wrote:

Le 29/01/2014 13:29, Maor Lipchuk a écrit :

Hi Nicolas,

Can u please attach the VDSM logs of the problematic nodes and valid
nodes, the engine log and also the sanlock log.

You wrote that many nodes suddenly began to become
unresponsive,
Do you mean that the hosts switched to non-responsive status in the
engine?
I'm asking that because non-responsive status indicate that the engine
could not communicate with the hosts, it could be related to sanlock
since if the host encountered a problem to write to the master domain it
causes sanlock to restart VDSM and make the hosts non responsive.


non-resposneive for engine is if vdsm is up/responsive.
run locally;
# vdsClient -s 0 getVdsCaps

to check vdsm is ok


When I find the time for it, I'll reproduce the crash and run this 
command and let you know.

I must admit this was scary.

--
Nicolas Ecarnot
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] [vdsm] ovirtmgmt vanishes after reboot

2014-01-29 Thread Francisco Pérez
OK. thanks all of you for your help. I'll wait until the new release.



On Wed, Jan 29, 2014 at 7:07 AM, Fabian Deutsch fabi...@redhat.com wrote:

 Am Mittwoch, den 29.01.2014, 05:02 -0500 schrieb Antoni Segura Puimedon:
  - Original Message -
   From: Fabian Deutsch fabi...@redhat.com
   To: Antoni Segura Puimedon asegu...@redhat.com
   Cc: Dan Kenigsberg dan...@redhat.com, Francisco Pérez
  fpere...@gmail.com, amul...@redhat.com, VDSM Project
   Development vdsm-de...@lists.fedorahosted.org, users@ovirt.org
   Sent: Wednesday, January 29, 2014 10:59:43 AM
   Subject: Re: [vdsm] [Users] ovirtmgmt vanishes after reboot
  
   Am Dienstag, den 28.01.2014, 17:42 -0500 schrieb Antoni Segura
  Puimedon:
   
We made some more tests with Francisco on #ovirt today and we saw
  that
for some unknown reason, when rebooting the machine the
ifcfg-YOURNET
files disappear while the rule-YOURNET and route-YOURNET stay.
That
happens even when the both ifcfg, rule and route have the correct
entry
in /config/files and are bound to /config/etc/sysconfig/
   
After reboot it returns to eth0 having the connectivity as it was
defined
with the TUI.
   
For libvirt networks a bit of the same as for ifcfg files. Before
reboot
/etc/libvirt/qemu/networks is present in /config/files and findmnt
reports
that /etc/libvirt/qemu/networks is a bound mount
of /config/etc/libvirt/qemu/networks.
After reboot it is not the case anymore.
   
@Fabian: Do you think there is something on the reboot that
  restores
the
network conf to the TUI settings?
  
   Hey Antoni,
  
   I vaguely remember that we had a problem with Node removing some
  ifcfg-*
   files. But this should have been solved in the TestDay iso.
  
   What ISO did you use to reproduce this problem?
 
  We reproduced on Francisco's setup, I think it was oVirt node 3.0.1

 Right.
 I believe the bug you are seeing is:
 http://gerrit.ovirt.org/#/c/20068/

 This has been merged into out stable branch, but the release of a new
 oVirt Node ISO for 3.3 is pending because of the vdsm-python-cpopen
 dependency problem.

 We plan to do a build early next week.

 - fabian

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Volume Group does not exist. Blame device-mapper ?

2014-01-29 Thread Maor Lipchuk
Hi,

Please see inline response.

Regards,
Maor

On 01/29/2014 02:35 PM, Nicolas Ecarnot wrote:
 Le 29/01/2014 13:29, Maor Lipchuk a écrit :
 Hi Nicolas,

 Can u please attach the VDSM logs of the problematic nodes and valid
 nodes, the engine log and also the sanlock log.

 You wrote that many nodes suddenly began to become
 unresponsive,
 Do you mean that the hosts switched to non-responsive status in the
 engine?
 I'm asking that because non-responsive status indicate that the engine
 could not communicate with the hosts, it could be related to sanlock
 since if the host encountered a problem to write to the master domain it
 causes sanlock to restart VDSM and make the hosts non responsive.

 regards,
 Maor
 
 It will be hard work to provide these logs but I will try asap.
 But to answer your question : the engine saw the failing nodes as
 unresponsive, but I was always fully able to ping them and ssh-log on them.
Sorry, I was not that clear, as Itamar wrote before,
vdsClient -s 0 getVdsCaps should indicate if the Host is non responsive
or not.
If the VDSM service is down or the host could not be reached then the
host will be non responsive as well.
 
 Is there some place I could read further doc about sanlock?
you can check man sanlock or https://fedorahosted.org/sanlock/
and http://www.ovirt.org/SANLock
 
 Nicolas Ecarnot
 

 On 01/27/2014 09:26 AM, Nicolas Ecarnot wrote:
 Le 26/01/2014 23:23, Itamar Heim a écrit :
 On 01/20/2014 12:06 PM, Nicolas Ecarnot wrote:
 Hi,

 oVirt 3.3, no big issue since the recent snapshot joke, but all in all
 running fine.

 All my VM are stored in a iSCSI SAN. The VM usually are using only one
 or two disks (1: system, 2: data) and it is OK.

 Friday, I created a new LUN. Inside a VM, I linked to it via iscsiadm
 and successfully login to the Lun (session, automatic attach on boot,
 read, write) : nice.

 Then after detaching it and shuting down the MV, and for the first
 time,
 I tried to make use of the feature direct attach to attach the disk
 directly from oVirt, login the session via oVirt.
 I connected nice and I saw the disk appear in my VM as /dev/sda or
 whatever. I was able to mount it, read and write.

 Then disaster stoke all this : many nodes suddenly began to become
 unresponsive, quickly migrating their VM to the remaining nodes.
 Hopefully, the migrations ran fine and I lost no VM nor downtime,
 but I
 had to reboot every concerned node (other actions failed).

 In the failing nodes, /var/log/messages showed the log you can read in
 the end of this message.
 I first get device-mapper warnings, then the host unable to
 collaborate
 with the logical volumes.

 The 3 volumes are the three main storage domains, perfectly up and
 running where I store my oVirt VMs.

 My reflexions :
 - I'm not sure device-mapper is to blame. I frequently see device
 mapper
 complaining and nothing is getting worse (not oVirt specifically)
 - I have not change my network settings for months (bonding,
 linking...)
 The only new factor is the usage of direct attach LUN.
 - This morning I was able to reproduce the bug, just by trying again
 this attachement, and booting the VM. No mounting of the LUN, just VM
 booting, waiting, and this is enough to crash oVirt.
 - when the disaster happens, usually, amongst the nodes, only three
 nodes gets stroke, the only one that run VMs. Obviously, after
 migration, different nodes are hosting the VMs, and those new nodes
 are
 the one that then get stroke.

 This is quite reproductible.

 And frightening.


 The log :

 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
 multipath: error getting device
 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error
 adding
 target to table
 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: table: 253:36:
 multipath: error getting device
 Jan 20 10:20:45 serv-vm-adm11 kernel: device-mapper: ioctl: error
 adding
 target to table
 Jan 20 10:20:47 serv-vm-adm11 vdsm TaskManager.Task ERROR
 Task=`847653e6-8b23-4429-ab25-257538b35293`::Unexpected
 error#012Traceback (most recent call last):#012  File
 /usr/share/vdsm/storage/task.py, line 857, in _run#012return
 fn(*args, **kargs)#012  File /usr/share/vdsm/logUtils.py, line
 45, in
 wrapper#012res = f(*args, **kwargs)#012  File
 /usr/share/vdsm/storage/hsm.py, line 3053, in getVolumeSize#012
 volUUID, bs=1))#012  File /usr/share/vdsm/storage/volume.py, line
 333,
 in getVSize#012mysd = sdCache.produce(sdUUID=sdUUID)#012  File
 /usr/share/vdsm/storage/sdc.py, line 98, in produce#012
 domain.getRealDomain()#012  File /usr/share/vdsm/storage/sdc.py,
 line
 52, in getRealDomain#012return
 self._cache._realProduce(self._sdUUID)#012  File
 /usr/share/vdsm/storage/sdc.py, line 122, in _realProduce#012
 domain =
 self._findDomain(sdUUID)#012  File /usr/share/vdsm/storage/sdc.py,
 line 141, in _findDomain#012dom = findMethod(sdUUID)#012  File
 /usr/share/vdsm/storage/blockSD.py, line 1288, in findDomain#012
 

Re: [Users] replace engine hostname /pki

2014-01-29 Thread Yedidyah Bar David
(Following a discussion with Alon)

- Original Message -
 From: Sven Kieske s.kie...@mittwald.de
 To: Yedidyah Bar David d...@redhat.com
 Cc: Users@ovirt.org List Users@ovirt.org
 Sent: Wednesday, January 29, 2014 1:24:40 PM
 Subject: Re: [Users] replace engine hostname /pki
 
 Additional question regarding the certificates/pki:
 
 the wikipage states:
 
 The bigger concern is with the engine's certificate. Currently, to the
 best of our knowledge, there is no component that actually checks this
 trust.

Well, this is not accurate. The trust path _is_ checked, but against the
saved ca cert. On host deploy the host saves the ca cert and so can verify
the trust path even if the ca's hostname does not exist any more and can't
be connected to to get /ca.crt .

The point was that if there is something (e.g. spice client, web browser)
that checks the trust path, this will fail, if this client did not have the
ca cert, or tries to download it again after the rename.

 (All three certificates (CA, httpd, engine) are for the Common Name (CN)
 whose value is the hostname entered during engine-setup, which is
 supposed to be the hostname of the engine's machine, exist in the dns
 (forward and reverse records), and point to an IP address of the
 engine's machine. )
 
 Is there a list of values that get checked? e.g. the validity dates
 before and after?

Yes, these are checked.

 
 users might run into trouble in 10 years if this gets checked, because
 that is the current expiration date.

Indeed. If ovirt systems will live 10 years, 1. We'll be very happy :-),
2. all certificates will need to be reissued. You can verify this today
by moving the clock.

 
 if _nothing_ gets checked I wonder why the PKI is used at all ;)
 
 (I assume at least the keys get checked)

Yes.

Alon also added: Revocations are not checked. This means that if someone
breaks into your engine, there is no simple way to tell the hosts to not
trust the old engine key anymore.
-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] replace engine hostname /pki

2014-01-29 Thread Alon Bar-Lev


- Original Message -
 From: Yedidyah Bar David d...@redhat.com
 To: Sven Kieske s.kie...@mittwald.de
 Cc: Users@ovirt.org List Users@ovirt.org, Alon Bar-Lev 
 alo...@redhat.com
 Sent: Wednesday, January 29, 2014 3:12:21 PM
 Subject: Re: [Users] replace engine hostname /pki
 
 (Following a discussion with Alon)

Hi,

I hope you find this[1] helpful, if not we should work to make it better.

Thanks,

[1] http://www.ovirt.org/Features/PKI

 
 - Original Message -
  From: Sven Kieske s.kie...@mittwald.de
  To: Yedidyah Bar David d...@redhat.com
  Cc: Users@ovirt.org List Users@ovirt.org
  Sent: Wednesday, January 29, 2014 1:24:40 PM
  Subject: Re: [Users] replace engine hostname /pki
  
  Additional question regarding the certificates/pki:
  
  the wikipage states:
  
  The bigger concern is with the engine's certificate. Currently, to the
  best of our knowledge, there is no component that actually checks this
  trust.
 
 Well, this is not accurate. The trust path _is_ checked, but against the
 saved ca cert. On host deploy the host saves the ca cert and so can verify
 the trust path even if the ca's hostname does not exist any more and can't
 be connected to to get /ca.crt .
 
 The point was that if there is something (e.g. spice client, web browser)
 that checks the trust path, this will fail, if this client did not have the
 ca cert, or tries to download it again after the rename.
 
  (All three certificates (CA, httpd, engine) are for the Common Name (CN)
  whose value is the hostname entered during engine-setup, which is
  supposed to be the hostname of the engine's machine, exist in the dns
  (forward and reverse records), and point to an IP address of the
  engine's machine. )
  
  Is there a list of values that get checked? e.g. the validity dates
  before and after?
 
 Yes, these are checked.
 
  
  users might run into trouble in 10 years if this gets checked, because
  that is the current expiration date.
 
 Indeed. If ovirt systems will live 10 years, 1. We'll be very happy :-),
 2. all certificates will need to be reissued. You can verify this today
 by moving the clock.
 
  
  if _nothing_ gets checked I wonder why the PKI is used at all ;)
  
  (I assume at least the keys get checked)
 
 Yes.
 
 Alon also added: Revocations are not checked. This means that if someone
 breaks into your engine, there is no simple way to tell the hosts to not
 trust the old engine key anymore.
 --
 Didi
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] replace engine hostname /pki

2014-01-29 Thread Yedidyah Bar David
- Original Message -
 From: Alon Bar-Lev alo...@redhat.com
 To: Yedidyah Bar David d...@redhat.com
 Cc: Sven Kieske s.kie...@mittwald.de, Users@ovirt.org List 
 Users@ovirt.org
 Sent: Wednesday, January 29, 2014 3:23:10 PM
 Subject: Re: [Users] replace engine hostname /pki
 
 
 
 - Original Message -
  From: Yedidyah Bar David d...@redhat.com
  To: Sven Kieske s.kie...@mittwald.de
  Cc: Users@ovirt.org List Users@ovirt.org, Alon Bar-Lev
  alo...@redhat.com
  Sent: Wednesday, January 29, 2014 3:12:21 PM
  Subject: Re: [Users] replace engine hostname /pki
  
  (Following a discussion with Alon)
 
 Hi,
 
 I hope you find this[1] helpful, if not we should work to make it better.
 
 Thanks,
 
 [1] http://www.ovirt.org/Features/PKI

Thanks, I didn't know about that page. Added a link to it from the rename page.
-- 
Didi
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Notes on setting up spice-proxy console option

2014-01-29 Thread Marian Krcmarik


- Original Message -
 From: David Li david...@sbcglobal.net
 To: Marian Krcmarik mkrcm...@redhat.com
 Cc: users@ovirt.org
 Sent: Tuesday, January 28, 2014 10:37:18 PM
 Subject: Re: [Users] Notes on setting up spice-proxy console option
 
 Hi Marian,
 
 Thanks a lot for the clarification!
 
 Another question: In my spice-xpi console window, I can't get out of it by
 SHIFT+F12.  Should I use something else?

It should work, You mentioned you used RHEL/CentOS 6.5 as the client (what you 
call Browser) machine, In that case Can you edit /etc/spice/logger.ini file and 
change in the line log4j.rootCategory=INFO, R INFO to DEBUG? Restart Firefox, 
make connection to the VM and check ~/.spicec/spice-xpi.log file for line 
DEBUG nsPluginInstance::SetHotKeys: 
release-cursor=shift+f12,toggle-fullscreen=shift+f11 (looking for value of 
release-cursor). If the value is not shift+f12, can you check value of 
SpiceReleaseCursorKeys on the engine with using ovirt-config tool? If the 
value is shift-f12, can you run on the VM, xev tool and check what keys It can 
see pressed when pressing shift-f12 combo?

 
 David
 
 
 
 - Original Message -
  From: Marian Krcmarik mkrcm...@redhat.com
  To: David Li david...@sbcglobal.net
  Cc: users@ovirt.org
  Sent: Tuesday, January 28, 2014 11:14 AM
  Subject: Re: [Users] Notes on setting spice-proxy console option
  
  
  
  - Original Message -
   From: David Li david...@sbcglobal.net
   To: users@ovirt.org
   Cc: david li david...@sbcglobal.net
   Sent: Tuesday, January 28, 2014 7:41:26 PM
   Subject: [Users] Notes on setting spice-proxy console option
  
   Hi,
  
   I have struggled quite a bit to get it up and running. Over the time, I
  have
   accumulated some notes on various things I did so to share with everyone
  who
   is interested in doing this. This complements the online doc in a way
   that
   might give me a complete picture in one place. However I need some
   clarifications as I might have forgotten to document certain steps or
   certain steps I did turn out to be not necessary in the end. It will be
   great if experts here can help me get the things straight.
  
  
   My setup is like:
  
   Browser (firefox 24.2 on RHEL6)  ovirt-engine (3.3.2)
    ovirt-node (3.0.3)
  
   No direct network connectivity from the browser machine to the node
  machine.
  
   These are the major things I installed for spice-proxy to work:
  
   * On ovirt-engine:
          yum install spice-gtk, virt-viewer, spice-xpi
  These components are client components (what you call Browser machine).
          yum-install squid
         /etc/squid/squid.conf updates:
   acl localhost src browser IP addr
           #http_access deny CONNECT !SSL_ports
  I would rather allow CONNECT to specific Spice ports only 5634-6166:
  acl Spice_ports port 5634-6166
  http_access denny CONNECT !Spice_ports
           http_access deny !Safe_ports
           http_port 3128
       
          service squid restart
          make sure iptables allow 3128
  
          engine-config -s SpiceProxyDefault=
  http://ovirt-engine-IP:3128
          service ovirt-engine restart
      
      *  On browser machine running firefox 24.2.0 on RHEL6 for running
  browser
      console plugin client
         yum install spice-xpi.
  spice-xpi should bring its dependencies virt-viewer - spice-gtk - etc.
  but If you do not wish to use the plugin launch type, you may install only
  virt-viewer (without spice-xpi) and use what I guess is called Native
  client launch type.
         make sure VM's console option is set to SPICE
         
   Are the above steps reasonable? any missing or redundant?
  Seems fine, just no need the client packages on the engine.
  
   Additional questions:
  
   1. Will spice-proxy work with the Spice HTML5 client in the browser?
  Probably, but you would need to set the websocket proxy which is part of
  installation steps for engine as well (I believe).
   2. Is the spice-proxy architecture diagram like:  browser - squid
   proxy - spice-proxy -- VM
  Browser plugin spice-xpi invokes start of Spice client (virt-viewer) which
  makes
  CONNECT to Host machine (where the VM is hosted) through the HTTP proxy (in
  your
  case squid).
  Client machine --- Squid --- Host (where the VM is hosted).
   3. I didn't explicitly install any certs for the squid proxy. Is it
   automatically taken care of?
  No, no authentication to Squid is supported with Spice now. So If It is
  publicly
  visible proxy It's important to set careful proxy rules.
  
  
   References:
  
   http://www.ovirt.org/Console_Client_Resources
  
   http://www.ovirt.org/Features/Spice_Proxy
  
  
  https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Virtualization/3.3/pdf/Installation_Guide/Red_Hat_Enterprise_Virtualization-3.3-Installation_Guide-en-US.pdf
  
  
   Thanks.
  
   David
  
   

Re: [Users] Notes on setting spice-proxy console option

2014-01-29 Thread Michal Skrivanek

On Jan 28, 2014, at 20:14 , Marian Krcmarik mkrcm...@redhat.com wrote:

 
 
 - Original Message -
 From: David Li david...@sbcglobal.net
 To: users@ovirt.org
 Cc: david li david...@sbcglobal.net
 Sent: Tuesday, January 28, 2014 7:41:26 PM
 Subject: [Users] Notes on setting spice-proxy console option
 
 Hi,
 
 I have struggled quite a bit to get it up and running. Over the time, I have
 accumulated some notes on various things I did so to share with everyone who
 is interested in doing this. This complements the online doc in a way that
 might give me a complete picture in one place. However I need some
 clarifications as I might have forgotten to document certain steps or
 certain steps I did turn out to be not necessary in the end. It will be
 great if experts here can help me get the things straight.
 
 
 My setup is like:
 
 Browser (firefox 24.2 on RHEL6)  ovirt-engine (3.3.2)
  ovirt-node (3.0.3)
 
 No direct network connectivity from the browser machine to the node machine.
 
 These are the major things I installed for spice-proxy to work:
 
 * On ovirt-engine:
yum install spice-gtk, virt-viewer, spice-xpi
 These components are client components (what you call Browser machine).
yum-install squid
   /etc/squid/squid.conf updates:
 acl localhost src browser IP addr
 #http_access deny CONNECT !SSL_ports
 I would rather allow CONNECT to specific Spice ports only 5634-6166:
 acl Spice_ports port 5634-6166
 http_access denny CONNECT !Spice_ports
 http_access deny !Safe_ports
 http_port 3128
 
service squid restart
make sure iptables allow 3128
 
engine-config -s SpiceProxyDefault= http://ovirt-engine-IP:3128
service ovirt-engine restart

*  On browser machine running firefox 24.2.0 on RHEL6 for running browser
console plugin client
   yum install spice-xpi.
 spice-xpi should bring its dependencies virt-viewer - spice-gtk - etc. but 
 If you do not wish to use the plugin launch type, you may install only 
 virt-viewer (without spice-xpi) and use what I guess is called Native 
 client launch type.
   make sure VM's console option is set to SPICE
   
 Are the above steps reasonable? any missing or redundant?
 Seems fine, just no need the client packages on the engine.
 
 Additional questions:
 
 1. Will spice-proxy work with the Spice HTML5 client in the browser?
 Probably, but you would need to set the websocket proxy which is part of 
 installation steps for engine as well (I believe).

since it's a proxy as well I'd recommend to use it instead of squid, not in 
addition. By default it's installed on the engine host but it can be deployed 
anywhere else.

 2. Is the spice-proxy architecture diagram like:  browser - squid
 proxy - spice-proxy -- VM
 Browser plugin spice-xpi invokes start of Spice client (virt-viewer) which 
 makes CONNECT to Host machine (where the VM is hosted) through the HTTP proxy 
 (in your case squid).
 Client machine --- Squid --- Host (where the VM is hosted).
 3. I didn't explicitly install any certs for the squid proxy. Is it
 automatically taken care of?
 No, no authentication to Squid is supported with Spice now. So If It is 
 publicly visible proxy It's important to set careful proxy rules.
 
 
 References:
 
 http://www.ovirt.org/Console_Client_Resources
 
 http://www.ovirt.org/Features/Spice_Proxy
 
 https://access.redhat.com/site/documentation/en-US/Red_Hat_Enterprise_Virtualization/3.3/pdf/Installation_Guide/Red_Hat_Enterprise_Virtualization-3.3-Installation_Guide-en-US.pdf
 
 
 Thanks.
 
 David
 
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
 
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] replace engine hostname /pki

2014-01-29 Thread Sven Kieske
Thanks for the link,

I will work through the page and see if any questions pop up.
also thanks to yedidyah for the clarification!

Am 29.01.2014 14:23, schrieb Alon Bar-Lev:
 
 
 - Original Message -
 From: Yedidyah Bar David d...@redhat.com
 To: Sven Kieske s.kie...@mittwald.de
 Cc: Users@ovirt.org List Users@ovirt.org, Alon Bar-Lev 
 alo...@redhat.com
 Sent: Wednesday, January 29, 2014 3:12:21 PM
 Subject: Re: [Users] replace engine hostname /pki

 (Following a discussion with Alon)
 
 Hi,
 
 I hope you find this[1] helpful, if not we should work to make it better.
 
 Thanks,
 
 [1] http://www.ovirt.org/Features/PKI
 

 - Original Message -
 From: Sven Kieske s.kie...@mittwald.de
 To: Yedidyah Bar David d...@redhat.com
 Cc: Users@ovirt.org List Users@ovirt.org
 Sent: Wednesday, January 29, 2014 1:24:40 PM
 Subject: Re: [Users] replace engine hostname /pki

 Additional question regarding the certificates/pki:

 the wikipage states:

 The bigger concern is with the engine's certificate. Currently, to the
 best of our knowledge, there is no component that actually checks this
 trust.

 Well, this is not accurate. The trust path _is_ checked, but against the
 saved ca cert. On host deploy the host saves the ca cert and so can verify
 the trust path even if the ca's hostname does not exist any more and can't
 be connected to to get /ca.crt .

 The point was that if there is something (e.g. spice client, web browser)
 that checks the trust path, this will fail, if this client did not have the
 ca cert, or tries to download it again after the rename.

 (All three certificates (CA, httpd, engine) are for the Common Name (CN)
 whose value is the hostname entered during engine-setup, which is
 supposed to be the hostname of the engine's machine, exist in the dns
 (forward and reverse records), and point to an IP address of the
 engine's machine. )

 Is there a list of values that get checked? e.g. the validity dates
 before and after?

 Yes, these are checked.


 users might run into trouble in 10 years if this gets checked, because
 that is the current expiration date.

 Indeed. If ovirt systems will live 10 years, 1. We'll be very happy :-),
 2. all certificates will need to be reissued. You can verify this today
 by moving the clock.


 if _nothing_ gets checked I wonder why the PKI is used at all ;)

 (I assume at least the keys get checked)

 Yes.

 Alon also added: Revocations are not checked. This means that if someone
 breaks into your engine, there is no simple way to tell the hosts to not
 trust the old engine key anymore.
 --
 Didi

 
 
 

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Vm's being paused

2014-01-29 Thread Neil
Sorry for the re-post, I was suddenly unsubscribed from the oVirt
users list for the 3rd time this month.

Regards.

Neil Wilson.


On Wed, Jan 29, 2014 at 4:16 PM, Neil nwilson...@gmail.com wrote:
 Hi Dafna,


 On Wed, Jan 29, 2014 at 1:14 PM, Dafna Ron d...@redhat.com wrote:
 The reason I asked about the size if because this was the original issue no?
 vm's pausing on lack of space?

 Apologies, I just wanted to make sure it was still about this pausing
 and not the original migration issue that I think you were also
 helping me with a few weeks back.

 You're having a problem with your data domains.
 Can you check the rout from the hosts to the storage? I think that you have
 some disconnection to the storage from the hosts
 since it's random and not from all the vm's I would suggest that its a
 routing problem?
 Thanks,
 Dafna

 The connections to the main data domain is 8Gb Fibre Channel directly
 from each of the hosts to the FC SAN, so if it is a connection issue
 then I can't understand how anything would be working. Or am I barking
 up the wrong tree completely? There were some ethernet network
 bridging changes on each of the hosts in early January, but this would
 only affect the NFS mounted ISO domain, or could this be the cause of
 the problems?

 Is this disconnection causing the huge log files that I sent previously?

 Thank you.

 Regards.

 Neil Wilson.



 On 01/29/2014 08:00 AM, Neil wrote:

 Sorry, more on this issue, I see my logs are rapidly filling up my
 disk space on node02 with this error in /var/log/messages...

 Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
 vmId=`dfa2cf7c-3f0e-42e3-b495-10ccb3e0c71b`::Stats function failed:
 AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
 recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
 in collect#012statsFunction()#012  File
 /usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
 self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
 line 513, in _highWrite#012self._vm._dom.blockInfo(vmDrive.path,
 0)#012  File /usr/share/vdsm/vm.py, line 835, in f#012ret =
 attr(*args, **kwargs)#012  File
 /usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py, line
 76, in wrapper#012ret = f(*args, **kwargs)#012  File
 /usr/lib64/python2.6/site-packages/libvirt.py, line 1814, in
 blockInfo#012if ret is None: raise libvirtError
 ('virDomainGetBlockInfo() failed', dom=self)#012libvirtError: invalid
 argument: invalid path

 /rhev/data-center/mnt/blockSD/0e6991ae-6238-4c61-96d2-ca8fed35161e/images/fac8a3bb-e414-43c0-affc-6e2628757a28/6c3e5ae8-23fc-4196-ba42-778bdc0fbad8
 not assigned to domain
 Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
 vmId=`ac2a3f99-a6db-4cae-955d-efdfb901abb7`::Stats function failed:
 AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
 recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
 in collect#012statsFunction()#012  File
 /usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
 self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
 line 509, in _highWrite#012if not vmDrive.blockDev or
 vmDrive.format != 'cow':#012AttributeError: 'Drive' object has no
 attribute 'format'

 Not sure if this is related at all though?

 Thanks.

 Regards.

 Neil Wilson.

 On Wed, Jan 29, 2014 at 9:02 AM, Neil nwilson...@gmail.com wrote:

 Hi Dafna,

 Thanks for clarifying that, I found the migration issue and this was
 resolved once I sorted out the ISO domain problem.

 I'm sorry I don't understand your last question?
  after the engine restart, do you still see a problem with the size
 or did the report of size changed?

 The migration issue was resolved, it's now just trying to track down
 why the two VM's paused on their own, one on the 8th of Jan(I think)
 and one on the 19th of Jan.

 Thank you.


 Regards.

 Neil Wilson.


 On Tue, Jan 28, 2014 at 8:18 PM, Dafna Ron d...@redhat.com wrote:

 yes - engine lost communication with vdsm and it has no way of knowing
 if
 the host is down or if there was a network issue so a network issue
 would
 cause the same errors that I see in the logs.

 The error you put on the iso is the reason the vm's have failed
 migration -
 if a vm is run with a cd and the cd is gone than the vm will not be able
 to
 be migrated.

 after the engine restart, do you still see a problem with the size or
 did
 the report of size changed?

 Dafna


 On 01/28/2014 01:02 PM, Neil wrote:

 Hi Dafna,

 Thanks for coming back to me. I'll try answer your queries one by one.

 On Tue, Jan 28, 2014 at 1:38 PM, Dafna Ron d...@redhat.com wrote:

 you had a problem with your storage on the 14th of Jan and one of the
 hosts
 rebooted (if you have the vdsm log from that day than I can see what
 happened on vdsm side)
 in engine, I could see a problem with the export domain and this
 should
 not
 have cause a reboot.

 1.) I don't unfortunately have logs going back that far. Looking at
 all 3 hosts 

Re: [Users] Error: The following Network Interfaces were specified more than once:

2014-01-29 Thread Alan Murrell

Quoting Assaf Muller amul...@redhat.com:


Please run:
vdsClient -s 0 getVdsCaps


Here you go:

--- START ---
HBAInventory = {'FC': [], 'iSCSI': [{'InitiatorName':  
'iqn.1994-05.com.redhat:63b5586465eb'}]}

ISCSIInitiatorName = 'iqn.1994-05.com.redhat:63b5586465eb'
bondings = {'bond0': {'addr': '',
  'cfg': {},
  'hwaddr': '00:00:00:00:00:00',
  'ipv6addrs': [],
  'mtu': '1500',
  'netmask': '',
  'slaves': []},
'bond1': {'addr': '',
  'cfg': {},
  'hwaddr': '00:00:00:00:00:00',
  'ipv6addrs': [],
  'mtu': '1500',
  'netmask': '',
  'slaves': []},
'bond2': {'addr': '',
  'cfg': {},
  'hwaddr': '00:00:00:00:00:00',
  'ipv6addrs': [],
  'mtu': '1500',
  'netmask': '',
  'slaves': []},
'bond3': {'addr': '',
  'cfg': {},
  'hwaddr': '00:00:00:00:00:00',
  'ipv6addrs': [],
  'mtu': '1500',
  'netmask': '',
  'slaves': []},
'bond4': {'addr': '',
  'cfg': {},
  'hwaddr': '00:00:00:00:00:00',
  'ipv6addrs': [],
  'mtu': '1500',
  'netmask': '',
  'slaves': []}}
bridges = {'VLAN001': {'addr': '',
  'cfg': {'DEFROUTE': 'no',
  'DELAY': '0',
  'DEVICE': 'VLAN001',
  'NM_CONTROLLED': 'no',
  'ONBOOT': 'yes',
  'STP': 'no',
  'TYPE': 'Bridge'},
  'gateway': '0.0.0.0',
  'ipv6addrs': ['fe80::2ccd:eff:fee5:6a65/64'],
  'ipv6gateway': '::',
  'mtu': '1500',
  'netmask': '',
  'ports': ['dummy_0.1'],
  'stp': 'off'},
  'VLAN002': {'addr': '',
  'cfg': {'DEFROUTE': 'no',
  'DELAY': '0',
  'DEVICE': 'VLAN002',
  'NM_CONTROLLED': 'no',
  'ONBOOT': 'yes',
  'STP': 'no',
  'TYPE': 'Bridge'},
  'gateway': '0.0.0.0',
  'ipv6addrs': ['fe80::2ccd:eff:fee5:6a65/64'],
  'ipv6gateway': '::',
  'mtu': '1500',
  'netmask': '',
  'ports': ['dummy_0.2'],
  'stp': 'off'},
  'VLAN009': {'addr': '',
  'cfg': {'DEFROUTE': 'no',
  'DELAY': '0',
  'DEVICE': 'VLAN009',
  'NM_CONTROLLED': 'no',
  'ONBOOT': 'yes',
  'STP': 'no',
  'TYPE': 'Bridge'},
  'gateway': '0.0.0.0',
  'ipv6addrs': ['fe80::2ccd:eff:fee5:6a65/64'],
  'ipv6gateway': '::',
  'mtu': '1500',
  'netmask': '',
  'ports': ['dummy_0.9'],
  'stp': 'off'},
  'VLAN010': {'addr': '',
  'cfg': {'DEFROUTE': 'no',
  'DELAY': '0',
  'DEVICE': 'VLAN010',
  'NM_CONTROLLED': 'no',
  'ONBOOT': 'yes',
  'STP': 'no',
  'TYPE': 'Bridge'},
  'gateway': '0.0.0.0',
  'ipv6addrs': ['fe80::2ccd:eff:fee5:6a65/64'],
  'ipv6gateway': '::',
  'mtu': '1500',
  'netmask': '',
  'ports': ['dummy_0.10'],
  'stp': 'off'},
  'ovirtmgmt': {'addr': '10.20.37.104',
'cfg': {'BOOTPROTO': 'dhcp',
'DEFROUTE': 'yes',
'DELAY': '0',
'DEVICE': 

Re: [Users] Error: The following Network Interfaces were specified more than once:

2014-01-29 Thread Alan Murrell

Quoting Lior Vernia lver...@redhat.com:


It's a quirk of oVirt at the moment :) The interface state as shown in
the GUI currently has very little to do with whether there's
connectivity on a network. There should be an open RFE for connectivity


OK, so at this point it is a cosmetic issue as opposed to a functional one?

-Alan

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] oVirt vs RHEV docs

2014-01-29 Thread David Li
Hi,

Maybe I have missed this discussion before.

It seems RHEV has a much richer collection of docs than oVIrt. I understand 
RHEV might be a little behind. As a guideline I wonder if we should always use 
RHEV docs if it's missing from oVirt. Are they always compatible? 
Also in RHEV docs, how should we map the commands back to the oVirt world if 
they are different?

Thanks.

David

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Error: The following Network Interfaces were specified more than once:

2014-01-29 Thread Assaf Muller
Thanks Alan, that confirms that it was the issue I had in mind.

This bug is fixed in oVirt 3.4, which has a first beta release out.


Assaf Muller, Cloud Networking Engineer 
Red Hat 

- Original Message -
From: Alan Murrell li...@murrell.ca
To: users@ovirt.org
Sent: Wednesday, January 29, 2014 4:58:59 PM
Subject: Re: [Users] Error: The following Network Interfaces were  
specified   more than   once:

Quoting Assaf Muller amul...@redhat.com:

 Please run:
 vdsClient -s 0 getVdsCaps

Here you go:

--- START ---
 HBAInventory = {'FC': [], 'iSCSI': [{'InitiatorName':  
'iqn.1994-05.com.redhat:63b5586465eb'}]}
 ISCSIInitiatorName = 'iqn.1994-05.com.redhat:63b5586465eb'
 bondings = {'bond0': {'addr': '',
   'cfg': {},
   'hwaddr': '00:00:00:00:00:00',
   'ipv6addrs': [],
   'mtu': '1500',
   'netmask': '',
   'slaves': []},
 'bond1': {'addr': '',
   'cfg': {},
   'hwaddr': '00:00:00:00:00:00',
   'ipv6addrs': [],
   'mtu': '1500',
   'netmask': '',
   'slaves': []},
 'bond2': {'addr': '',
   'cfg': {},
   'hwaddr': '00:00:00:00:00:00',
   'ipv6addrs': [],
   'mtu': '1500',
   'netmask': '',
   'slaves': []},
 'bond3': {'addr': '',
   'cfg': {},
   'hwaddr': '00:00:00:00:00:00',
   'ipv6addrs': [],
   'mtu': '1500',
   'netmask': '',
   'slaves': []},
 'bond4': {'addr': '',
   'cfg': {},
   'hwaddr': '00:00:00:00:00:00',
   'ipv6addrs': [],
   'mtu': '1500',
   'netmask': '',
   'slaves': []}}
 bridges = {'VLAN001': {'addr': '',
   'cfg': {'DEFROUTE': 'no',
   'DELAY': '0',
   'DEVICE': 'VLAN001',
   'NM_CONTROLLED': 'no',
   'ONBOOT': 'yes',
   'STP': 'no',
   'TYPE': 'Bridge'},
   'gateway': '0.0.0.0',
   'ipv6addrs': ['fe80::2ccd:eff:fee5:6a65/64'],
   'ipv6gateway': '::',
   'mtu': '1500',
   'netmask': '',
   'ports': ['dummy_0.1'],
   'stp': 'off'},
   'VLAN002': {'addr': '',
   'cfg': {'DEFROUTE': 'no',
   'DELAY': '0',
   'DEVICE': 'VLAN002',
   'NM_CONTROLLED': 'no',
   'ONBOOT': 'yes',
   'STP': 'no',
   'TYPE': 'Bridge'},
   'gateway': '0.0.0.0',
   'ipv6addrs': ['fe80::2ccd:eff:fee5:6a65/64'],
   'ipv6gateway': '::',
   'mtu': '1500',
   'netmask': '',
   'ports': ['dummy_0.2'],
   'stp': 'off'},
   'VLAN009': {'addr': '',
   'cfg': {'DEFROUTE': 'no',
   'DELAY': '0',
   'DEVICE': 'VLAN009',
   'NM_CONTROLLED': 'no',
   'ONBOOT': 'yes',
   'STP': 'no',
   'TYPE': 'Bridge'},
   'gateway': '0.0.0.0',
   'ipv6addrs': ['fe80::2ccd:eff:fee5:6a65/64'],
   'ipv6gateway': '::',
   'mtu': '1500',
   'netmask': '',
   'ports': ['dummy_0.9'],
   'stp': 'off'},
   'VLAN010': {'addr': '',
   'cfg': {'DEFROUTE': 'no',
   'DELAY': '0',
   'DEVICE': 'VLAN010',
   'NM_CONTROLLED': 'no',
   'ONBOOT': 'yes',
   'STP': 'no',
   'TYPE': 'Bridge'},
   'gateway': '0.0.0.0',
   'ipv6addrs': 

[Users] Failed to execute stage 'Misc configuration': Command '/bin/systemctl' failed to execute

2014-01-29 Thread Ertan ERBEK

Hi All,


I want use oVirt instead of Vmware and Virtualbox but I couldn't 
complate setup exacly any time actually I try with four diffrent 
computer and everytime get same roblem and can't resolve this problem, 
could you assit me about this issue ?


[ INFO  ] Stage: Transaction setup
[ INFO  ] Stopping engine service
[ INFO  ] Stopping websocket-proxy service
[ INFO  ] Stage: Misc configuration
[ INFO  ] Stage: Package installation
[ INFO  ] Stage: Misc configuration
[ INFO  ] Creating PostgreSQL database
[ ERROR ] Failed to execute stage 'Misc configuration': Command 
'/bin/systemctl' failed to execute

[ INFO  ] Yum Performing yum transaction rollback
[ INFO  ] Stage: Clean up
  Log file is located at 
/var/log/ovirt-engine/setup/ovirt-engine-setup-20140128142352.log

[ INFO  ] Stage: Pre-termination
[ INFO  ] Stage: Termination
[ ERROR ] Execution of setup failed

OvirtLog
http://paste.fedoraproject.org/72322/90912140/
RPM Packet log
http://paste.fedoraproject.org/72327/12656139/
http://paste.fedoraproject.org/72328/39091266/
hardwareInformation
http://paste.fedoraproject.org/72332/39091279
http://paste.fedoraproject.org/72333/0912839
http://paste.fedoraproject.org/72334/39091284

Domain

[root@ovirt zone]# cat /etc/hosts
127.0.0.1   localhost localhost.localdomain localhost4 
localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 
localhost6.localdomain6

192.168.0.207ovirt.ovirt.localovirt
[root@ovirt zone]# cat /etc/hostname
ovirt
[root@ovirt zone]# nslookup ovirt.ovirt.local
Server:127.0.0.1
Address:127.0.0.1#53

Name:ovirt.ovirt.local
Address: 192.168.0.207

[root@ovirt zone]# nslookup
 set type=ptr
 192.168.0.207
Server:127.0.0.1
Address:127.0.0.1#53

207.0.168.192.in-addr.arpaname = ovirt.ovirt.local.
207.0.168.192.in-addr.arpaname = ns1.ovirt.local.


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] two node ovirt cluster with HA

2014-01-29 Thread Tareq Alayan

Adding Eli.


On 01/27/2014 02:50 PM, Andrew Lau wrote:

Hi,

I think he was asking what if the power management device reported 
that the host was powered off. Then VMs should be brought back up as 
being off would essentially be the same as running a power cycle/reboot?


Another example I'm seeing is what happens if the whole host loses 
power and it's power management device then becomes unavailable (ie. 
not reachable) then you're stuck in the case where it requires manual 
intervention.


I would be interested to potentially see something like a timeout on 
those problematic VMs (eg. if nothing was read or write after x amount 
of time) then you could consider the host as offline? I guess then 
that adds a lot of risk..



On Mon, Jan 27, 2014 at 11:43 PM, Tareq Alayan tala...@redhat.com 
mailto:tala...@redhat.com wrote:


Hi,

Power management makes use of special *dedicated* hardware in
order to restart hosts independently of host OS. The engine
connects to a power management devices using a *dedicated* network
IP address.
The engine is capable of rebooting hosts that have entered a
non-operational or non-responsive state,
The abilities provided by all power management devices are: check
status, start, stop and recycle (restart)...

In the case of non-responsive host: all of the VMs that are
currently running on that host can also become non-responsive.
However, the non-responsive host keeps locking the VM hard disk
for all VMs it is running. Attempting to start a VM on a different
host and assign the second host write privileges for the virtual
machine hard disk image can cause data corruption.
Rebooting allows the engine to assume that the lock on a VM hard
disk image has been released.
The engine can know for sure that the problematic host has been
rebooted via the power management device and then it can start a
VM from the problematic host on another host without risking data
corruption.
Important note: A virtual machine that has been marked
highly-available can not be safely started on a different host
without the certainty that doing so will not cause data corruption.

N-joy,

--Tareq




On 01/27/2014 02:05 PM, Dafna Ron wrote:

I am adding Tareq for the Power Management implementation.

Dafna


On 01/27/2014 11:48 AM, Karli Sjöberg wrote:

On Mon, 2014-01-27 at 11:11 +, Dafna Ron wrote:

Powering off the host will never trigger vm migration.
As far as engine is concerned it just lost connection
to the host, but
has no way of telling if the host is down or if a
router is down.

Can´t it at least check with power management if the Host
status is down
first?

I mean, if the network is down there will be no response
from either PM
or Host. But if PM is up and can tell you that the Host is
down, sounds
rather clear cut to me...

Seems to me the VM's would be restarted sooner if the flow
was altered
to first check with PM if it´s a network or Host issue,
and if Host
issue, immediately restart VM's on another Host, instead
of waiting for
a potentially problematic Host to boot up eventually.

/K

since vm's can continue running on the host even if
engine has no access
to it, starting the vm's on the second host can cause
split brain and
data corruption.

The way that the engine knows what's going on is by
sending heath check
queries to the vdsm.
Power management will try to reboot a host when the
health checks to
vdsm will not be answered.
So... if engine gets no reply and has no way of
rebooting the host, the
host status will be changed to Non-Responsive and the
vm's will be
unknown because engine has no way of knowing what's
happening with the
vm's.
Since reboot of the host will kill the vm's running on
it - this will
never cause any vm migration but... along with the
High-Availability vm
feature, you will be able to have some of the vm's
re-started on the
second host after the host reboot (and that is only if
Power Management
was confirmed as successful).

VM migration is only triggered when:
1. Cluster configuration states that the vm should be
migrated in case
of failure
2. 

Re: [Users] Vm's being paused

2014-01-29 Thread Neil
Hi Dafna,


On Wed, Jan 29, 2014 at 1:14 PM, Dafna Ron d...@redhat.com wrote:
 The reason I asked about the size if because this was the original issue no?
 vm's pausing on lack of space?

Apologies, I just wanted to make sure it was still about this pausing
and not the original migration issue that I think you were also
helping me with a few weeks back.

 You're having a problem with your data domains.
 Can you check the rout from the hosts to the storage? I think that you have
 some disconnection to the storage from the hosts
 since it's random and not from all the vm's I would suggest that its a
 routing problem?
 Thanks,
 Dafna

The connections to the main data domain is 8Gb Fibre Channel directly
from each of the hosts to the FC SAN, so if it is a connection issue
then I can't understand how anything would be working. Or am I barking
up the wrong tree completely? There were some ethernet network
bridging changes on each of the hosts in early January, but this would
only affect the NFS mounted ISO domain, or could this be the cause of
the problems?

Is this disconnection causing the huge log files that I sent previously?

Thank you.

Regards.

Neil Wilson.



 On 01/29/2014 08:00 AM, Neil wrote:

 Sorry, more on this issue, I see my logs are rapidly filling up my
 disk space on node02 with this error in /var/log/messages...

 Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
 vmId=`dfa2cf7c-3f0e-42e3-b495-10ccb3e0c71b`::Stats function failed:
 AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
 recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
 in collect#012statsFunction()#012  File
 /usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
 self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
 line 513, in _highWrite#012self._vm._dom.blockInfo(vmDrive.path,
 0)#012  File /usr/share/vdsm/vm.py, line 835, in f#012ret =
 attr(*args, **kwargs)#012  File
 /usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py, line
 76, in wrapper#012ret = f(*args, **kwargs)#012  File
 /usr/lib64/python2.6/site-packages/libvirt.py, line 1814, in
 blockInfo#012if ret is None: raise libvirtError
 ('virDomainGetBlockInfo() failed', dom=self)#012libvirtError: invalid
 argument: invalid path

 /rhev/data-center/mnt/blockSD/0e6991ae-6238-4c61-96d2-ca8fed35161e/images/fac8a3bb-e414-43c0-affc-6e2628757a28/6c3e5ae8-23fc-4196-ba42-778bdc0fbad8
 not assigned to domain
 Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
 vmId=`ac2a3f99-a6db-4cae-955d-efdfb901abb7`::Stats function failed:
 AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
 recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
 in collect#012statsFunction()#012  File
 /usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
 self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
 line 509, in _highWrite#012if not vmDrive.blockDev or
 vmDrive.format != 'cow':#012AttributeError: 'Drive' object has no
 attribute 'format'

 Not sure if this is related at all though?

 Thanks.

 Regards.

 Neil Wilson.

 On Wed, Jan 29, 2014 at 9:02 AM, Neil nwilson...@gmail.com wrote:

 Hi Dafna,

 Thanks for clarifying that, I found the migration issue and this was
 resolved once I sorted out the ISO domain problem.

 I'm sorry I don't understand your last question?
  after the engine restart, do you still see a problem with the size
 or did the report of size changed?

 The migration issue was resolved, it's now just trying to track down
 why the two VM's paused on their own, one on the 8th of Jan(I think)
 and one on the 19th of Jan.

 Thank you.


 Regards.

 Neil Wilson.


 On Tue, Jan 28, 2014 at 8:18 PM, Dafna Ron d...@redhat.com wrote:

 yes - engine lost communication with vdsm and it has no way of knowing
 if
 the host is down or if there was a network issue so a network issue
 would
 cause the same errors that I see in the logs.

 The error you put on the iso is the reason the vm's have failed
 migration -
 if a vm is run with a cd and the cd is gone than the vm will not be able
 to
 be migrated.

 after the engine restart, do you still see a problem with the size or
 did
 the report of size changed?

 Dafna


 On 01/28/2014 01:02 PM, Neil wrote:

 Hi Dafna,

 Thanks for coming back to me. I'll try answer your queries one by one.

 On Tue, Jan 28, 2014 at 1:38 PM, Dafna Ron d...@redhat.com wrote:

 you had a problem with your storage on the 14th of Jan and one of the
 hosts
 rebooted (if you have the vdsm log from that day than I can see what
 happened on vdsm side)
 in engine, I could see a problem with the export domain and this
 should
 not
 have cause a reboot.

 1.) I don't unfortunately have logs going back that far. Looking at
 all 3 hosts uptime, the one with the least uptime is 21 days, the
 others are all over 40 days, so there definitely wasn't a host that
 rebooted on the 14th of Jan, would a network issue or Firewall issue
 also cause the error 

Re: [Users] two node ovirt cluster with HA

2014-01-29 Thread Tareq Alayan

Hi,

Power management makes use of special *dedicated* hardware in order to 
restart hosts independently of host OS. The engine connects to a power 
management devices using a *dedicated* network IP address.
The engine is capable of rebooting hosts that have entered a 
non-operational or non-responsive state,
The abilities provided by all power management devices are: check 
status, start, stop and recycle (restart)...


In the case of non-responsive host: all of the VMs that are currently 
running on that host can also become non-responsive. However, the 
non-responsive host keeps locking the VM hard disk for all VMs it is 
running. Attempting to start a VM on a different host and assign the 
second host write privileges for the virtual machine hard disk image can 
cause data corruption.
Rebooting allows the engine to assume that the lock on a VM hard disk 
image has been released.
The engine can know for sure that the problematic host has been rebooted 
via the power management device and then it can start a VM from the 
problematic host on another host without risking data corruption.
Important note: A virtual machine that has been marked highly-available 
can not be safely started on a different host without the certainty that 
doing so will not cause data corruption.


N-joy,

--Tareq



On 01/27/2014 02:05 PM, Dafna Ron wrote:

I am adding Tareq for the Power Management implementation.

Dafna


On 01/27/2014 11:48 AM, Karli Sjöberg wrote:

On Mon, 2014-01-27 at 11:11 +, Dafna Ron wrote:

Powering off the host will never trigger vm migration.
As far as engine is concerned it just lost connection to the host, but
has no way of telling if the host is down or if a router is down.

Can´t it at least check with power management if the Host status is down
first?

I mean, if the network is down there will be no response from either PM
or Host. But if PM is up and can tell you that the Host is down, sounds
rather clear cut to me...

Seems to me the VM's would be restarted sooner if the flow was altered
to first check with PM if it´s a network or Host issue, and if Host
issue, immediately restart VM's on another Host, instead of waiting for
a potentially problematic Host to boot up eventually.

/K

since vm's can continue running on the host even if engine has no 
access

to it, starting the vm's on the second host can cause split brain and
data corruption.

The way that the engine knows what's going on is by sending heath check
queries to the vdsm.
Power management will try to reboot a host when the health checks to
vdsm will not be answered.
So... if engine gets no reply and has no way of rebooting the host, the
host status will be changed to Non-Responsive and the vm's will be
unknown because engine has no way of knowing what's happening with the
vm's.
Since reboot of the host will kill the vm's running on it - this will
never cause any vm migration but... along with the High-Availability vm
feature, you will be able to have some of the vm's re-started on the
second host after the host reboot (and that is only if Power Management
was confirmed as successful).

VM migration is only triggered when:
1. Cluster configuration states that the vm should be migrated in case
of failure
2. Engine has access to the host - so the failure is on the storage 
side

and not the host side.
3. the vms are not actively writing (although there might be a new RFE
for it).

hope this clears things up

Dafna



On 01/27/2014 10:11 AM, Andrew Lau wrote:

Hi,

Have you got power management enabled?

That's the fencing feature required for the engine to ensure that the
host is actually offline. It won't resume any other VMs to prevent
potential VM corruption (eg. VM running on multiple hosts).

Andrew.

On Jan 27, 2014 5:12 PM, Jaison peter urotr...@gmail.com
mailto:urotr...@gmail.com wrote:

 Hi all ,

 I was setting a two node ovirt cluster with ovirt engine on
 seperate node . I completed the configuration and tested VM  live
 migrations with out any issues . Then for checking cluster HA I
 powered down one host and expected vms running on that host to be
 migrated to the other one . But nothing happened , Engine 
detected
 host as un-rechable and marked it as non-operational and vm 
ran on

 that host went to 'unknown state' . Is that not possible to setup
 a fully HA ovirt cluster with two nodes ? or else is that my
 configuration problem ? please advice .

 Thanks  Regards

 Alex

 ___
 Users mailing list
 Users@ovirt.org mailto:Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


--
Dafna Ron
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users









Re: [Users] Notes on setting spice-proxy console option

2014-01-29 Thread David Li




- Original Message -
 From: Michal Skrivanek michal.skriva...@redhat.com
 To: Marian Krcmarik mkrcm...@redhat.com; David Li david...@sbcglobal.net
 Cc: users@ovirt.org
 Sent: Wednesday, January 29, 2014 6:00 AM
 Subject: Re: [Users] Notes on setting spice-proxy console option
 
 

 
  1. Will spice-proxy work with the Spice HTML5 client in the browser?
  Probably, but you would need to set the websocket proxy which is part of 
 installation steps for engine as well (I believe).
 
 since it's a proxy as well I'd recommend to use it instead of squid, not 
 in addition. By default it's installed on the engine host but it can be 
 deployed anywhere else.

I am interested in trying this. I tried before but didn't succeed. Is it enough 
to configure websocket proxy on the engine like this?

WebSocketProxy: engine-IP-addr:6100 version: general
WebSocketProxyTicketValiditySeconds: 120 version: general

Any pointer to a good setup guide?

David
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Vm's being paused

2014-01-29 Thread Dafna Ron
mmm... I think that there is a bug with the iso domain and I am not 
sure if it was already opened.


can you help me to debug this and see if its related? :)

I think that you have some intermittent network issues to the iso domain 
and every time it happens, the vms that have booted with a cd (even if 
you detached it) would pause.


I have a second suspicion... is it possible that the vms that pause had 
a cd and you ejected it at some point? perhaps after or during the 
network issues you had on the 14th?
can you run dumpxml from libvirt? let me know if you need help with this 
command.


Thanks,

Dafna

On 01/29/2014 02:16 PM, Neil wrote:

Hi Dafna,


On Wed, Jan 29, 2014 at 1:14 PM, Dafna Ron d...@redhat.com wrote:

The reason I asked about the size if because this was the original issue no?
vm's pausing on lack of space?

Apologies, I just wanted to make sure it was still about this pausing
and not the original migration issue that I think you were also
helping me with a few weeks back.


You're having a problem with your data domains.
Can you check the rout from the hosts to the storage? I think that you have
some disconnection to the storage from the hosts
since it's random and not from all the vm's I would suggest that its a
routing problem?
Thanks,
Dafna

The connections to the main data domain is 8Gb Fibre Channel directly
from each of the hosts to the FC SAN, so if it is a connection issue
then I can't understand how anything would be working. Or am I barking
up the wrong tree completely? There were some ethernet network
bridging changes on each of the hosts in early January, but this would
only affect the NFS mounted ISO domain, or could this be the cause of
the problems?

Is this disconnection causing the huge log files that I sent previously?

Thank you.

Regards.

Neil Wilson.



On 01/29/2014 08:00 AM, Neil wrote:

Sorry, more on this issue, I see my logs are rapidly filling up my
disk space on node02 with this error in /var/log/messages...

Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
vmId=`dfa2cf7c-3f0e-42e3-b495-10ccb3e0c71b`::Stats function failed:
AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
in collect#012statsFunction()#012  File
/usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
line 513, in _highWrite#012self._vm._dom.blockInfo(vmDrive.path,
0)#012  File /usr/share/vdsm/vm.py, line 835, in f#012ret =
attr(*args, **kwargs)#012  File
/usr/lib64/python2.6/site-packages/vdsm/libvirtconnection.py, line
76, in wrapper#012ret = f(*args, **kwargs)#012  File
/usr/lib64/python2.6/site-packages/libvirt.py, line 1814, in
blockInfo#012if ret is None: raise libvirtError
('virDomainGetBlockInfo() failed', dom=self)#012libvirtError: invalid
argument: invalid path

/rhev/data-center/mnt/blockSD/0e6991ae-6238-4c61-96d2-ca8fed35161e/images/fac8a3bb-e414-43c0-affc-6e2628757a28/6c3e5ae8-23fc-4196-ba42-778bdc0fbad8
not assigned to domain
Jan 29 09:56:53 node02 vdsm vm.Vm ERROR
vmId=`ac2a3f99-a6db-4cae-955d-efdfb901abb7`::Stats function failed:
AdvancedStatsFunction _highWrite at 0x1c2fb90#012Traceback (most
recent call last):#012  File /usr/share/vdsm/sampling.py, line 351,
in collect#012statsFunction()#012  File
/usr/share/vdsm/sampling.py, line 226, in __call__#012retValue =
self._function(*args, **kwargs)#012  File /usr/share/vdsm/vm.py,
line 509, in _highWrite#012if not vmDrive.blockDev or
vmDrive.format != 'cow':#012AttributeError: 'Drive' object has no
attribute 'format'

Not sure if this is related at all though?

Thanks.

Regards.

Neil Wilson.

On Wed, Jan 29, 2014 at 9:02 AM, Neil nwilson...@gmail.com wrote:

Hi Dafna,

Thanks for clarifying that, I found the migration issue and this was
resolved once I sorted out the ISO domain problem.

I'm sorry I don't understand your last question?
 after the engine restart, do you still see a problem with the size
or did the report of size changed?

The migration issue was resolved, it's now just trying to track down
why the two VM's paused on their own, one on the 8th of Jan(I think)
and one on the 19th of Jan.

Thank you.


Regards.

Neil Wilson.


On Tue, Jan 28, 2014 at 8:18 PM, Dafna Ron d...@redhat.com wrote:

yes - engine lost communication with vdsm and it has no way of knowing
if
the host is down or if there was a network issue so a network issue
would
cause the same errors that I see in the logs.

The error you put on the iso is the reason the vm's have failed
migration -
if a vm is run with a cd and the cd is gone than the vm will not be able
to
be migrated.

after the engine restart, do you still see a problem with the size or
did
the report of size changed?

Dafna


On 01/28/2014 01:02 PM, Neil wrote:

Hi Dafna,

Thanks for coming back to me. I'll try answer your queries one by one.

On Tue, Jan 28, 2014 at 1:38 PM, Dafna Ron d...@redhat.com 

Re: [Users] Failed to execute stage 'Misc configuration': Command '/bin/systemctl' failed to execute

2014-01-29 Thread Einav Cohen
maybe related to [1]?
try running engine-setup as recommended in comment #11 in [1].

[1] Bug 1039616 - Setting shmmax on F19 is not enough for starting postgres
[https://bugzilla.redhat.com/show_bug.cgi?id=1039616]

- Original Message -
 From: Ertan ERBEK ertaner...@gmail.com
 To: users@ovirt.org
 Sent: Tuesday, January 28, 2014 7:42:59 AM
 Subject: [Users] Failed to execute stage 'Misc configuration': Command 
 '/bin/systemctl' failed to execute
 
 Hi All,
 
 
 I want use oVirt instead of Vmware and Virtualbox but I couldn't
 complate setup exacly any time actually I try with four diffrent
 computer and everytime get same roblem and can't resolve this problem,
 could you assit me about this issue ?
 
 [ INFO  ] Stage: Transaction setup
 [ INFO  ] Stopping engine service
 [ INFO  ] Stopping websocket-proxy service
 [ INFO  ] Stage: Misc configuration
 [ INFO  ] Stage: Package installation
 [ INFO  ] Stage: Misc configuration
 [ INFO  ] Creating PostgreSQL database
 [ ERROR ] Failed to execute stage 'Misc configuration': Command
 '/bin/systemctl' failed to execute
 [ INFO  ] Yum Performing yum transaction rollback
 [ INFO  ] Stage: Clean up
Log file is located at
 /var/log/ovirt-engine/setup/ovirt-engine-setup-20140128142352.log
 [ INFO  ] Stage: Pre-termination
 [ INFO  ] Stage: Termination
 [ ERROR ] Execution of setup failed
 
 OvirtLog
 http://paste.fedoraproject.org/72322/90912140/
 RPM Packet log
 http://paste.fedoraproject.org/72327/12656139/
 http://paste.fedoraproject.org/72328/39091266/
 hardwareInformation
 http://paste.fedoraproject.org/72332/39091279
 http://paste.fedoraproject.org/72333/0912839
 http://paste.fedoraproject.org/72334/39091284
 
 Domain
 
 [root@ovirt zone]# cat /etc/hosts
 127.0.0.1   localhost localhost.localdomain localhost4
 localhost4.localdomain4
 ::1 localhost localhost.localdomain localhost6
 localhost6.localdomain6
 192.168.0.207ovirt.ovirt.localovirt
 [root@ovirt zone]# cat /etc/hostname
 ovirt
 [root@ovirt zone]# nslookup ovirt.ovirt.local
 Server:127.0.0.1
 Address:127.0.0.1#53
 
 Name:ovirt.ovirt.local
 Address: 192.168.0.207
 
 [root@ovirt zone]# nslookup
   set type=ptr
   192.168.0.207
 Server:127.0.0.1
 Address:127.0.0.1#53
 
 207.0.168.192.in-addr.arpaname = ovirt.ovirt.local.
 207.0.168.192.in-addr.arpaname = ns1.ovirt.local.
 
 
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Storage unresponsive after sanlock

2014-01-29 Thread Trey Dockendorf
On Wed, Jan 29, 2014 at 4:33 AM, Maor Lipchuk mlipc...@redhat.com wrote:
 The VDSM log seems to be from the 26th and from the engine logs it seems
 that the incident occurred at the 24th, so I can't really see the what
 happened in VDSM that time.

 From the engine logs it seems that at around 2014-01-24 16:59 the master
 storage domain was in maintenance and then there was an attempt to
 activate it, but VDSM threw an exception that it cannot find master
 domain with the arguments of
 spUUID=5849b030-626e-47cb-ad90-3ce782d831b3,
 msdUUID=7c49750d-7eae-4cd2-9b63-1dc71f357b88'

 This could be happen from various reasons, for example a failure in
 connecting the storage (for example see https://bugzilla.redhat.com/782864)


Some errors on my part that occurred before the sanlock issue were
having all the NFS exports with same fsid, as well as initial
failures to correctly pass custom NFS options to VDSM.  The sanlock
issue was not present as late as 18:00 on 2014-01-24 as I was still
working in the web interface at that time and saw no issues.

 Since you mentioned that once you have added a second node and it
 worked, it seems like to origin of the problem is in the Host it self.

 what are the differences between the two hosts (VDSM version, OS version)

There should be no differences.  They are identical hardware and
provisioned and configured using Puppet.

* vdsm-4.13.3-2.el6.x86_64
* OS is CentOS 6.5 - 2.6.32-431.3.1.el6.x86_64

 Does the first host succeeded to work on other DC?

I only have the default DC defined.  Would it be worth setting up
another DC for the sake of troubleshooting?

 Have you tried to reinstall it?

Not yet.  The install processes is automated as well as the
configuration, so whatever issues I'm running into SHOULD be present
upon re-install.  If there is a possibility a fresh install could
somehow fix this, I can re-provision.

I just noticed the 2nd host (vm02) added to the default cluster has
become Non Operational and the VM on that host failed to migrate to
the 1st host (vm01) which became SPM and is marked as Up.  The logs
on vm02 are full of sanlock messages.  What concerns me is the VM I
have running for testing is non responsive and vm01 shows messages
such as Time out during operation: cannot acquire state change lock.

I can't yet pinpoint when the failure occurred and to avoid sending 3
days worth of logs from 3 hosts I'll reset everything and try to
reproduce this with some monitoring to get a timestamp for approximate
time of failure.

Thanks
- Trey


 Regards,
 Maor




 On 01/29/2014 02:50 AM, Trey Dockendorf wrote:
 See attached.  The event seems to have begun around 06:00:00 on
 2014-01-26.  I was unable to get the single node cluster back online
 so I provisioned another node to add to the cluster, which became the
 SPM.  Adding the second node worked and I had to power cycle the node
 that hung as sanlock was in a zombie state.  This is my first attempt
 at production use of NFS over RDMA and I'd like to rule out that being
 the cause.  Since the issue I've changed the 'nfs_mount_options' in
 /etc/vdsm/vdsm.conf to 'soft,nosharecache,rdma,port=20049'.  The
 options during the crash were only 'rdma,port=20049'.  I am also
 forcing NFSv3 by setting 'Nfsvers=3' in /etc/nfsmount.conf, which is
 still in place and was in place during the crash.

 Thanks
 - Trey

 On Tue, Jan 28, 2014 at 2:45 AM, Maor Lipchuk mlipc...@redhat.com wrote:
 Hi Trey,

 Can you please also attach the engine/vdsm logs.

 Thanks,
 Maor

 On 01/27/2014 06:12 PM, Trey Dockendorf wrote:
 I setup my first oVirt instance since 3.0 a few days ago and it went
 very well, and I left the single host cluster running with 1 VM over
 the weekend.  Today I come back and the primary data storage is marked
 as unresponsive.  The logs are full of entries [1] that look very
 similar to a knowledge base article on RHEL's website [2].

 This setup is using NFS over RDMA and so far the ib interfaces report
 no errors (via `ibcheckerrs -v LID 1`).  Based on a doc on ovirt
 site [3] it seems this could be due to response problems.  The storage
 system is a new purchase and not yet in production so if there's any
 advice on how to track down the cause that would be very helpful.
 Please let me know what additional information would be helpful as
 it's been about a year since I've been active in the oVirt community.

 Thanks
 - Trey

 [1]: http://pastebin.com/yRpSLKxJ

 [2]: https://access.redhat.com/site/solutions/400463

 [3]: http://www.ovirt.org/SANLock
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users




___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] oVirt Weekly Meeting Minutes -- 2014-01-29

2014-01-29 Thread Doron Fediuck
Hi All,
Due to an issue with the ovirt bot, we'll have a digest only this week.
Also, please note that next week's meeting is cancelled. See you in Feb 12.

#startmeeting oVirt Weekly Sync
  #topic Agenda and roll Call
#info infra update
#info 3.3 update releases
#info 3.4 progress
#info conferences and workshops
#info other topics
  #topic infra update
#info some per-patch jobs disabled to decrease the load. currently Jenkins 
stabilized.
  #topic 3.3 releases update
#action bkp to add a warning about python-cpopen, so users will not fail 
installing 3.3.2.
#action sbonazzo to push forward on stable 3.3.3 with python-cpopen fix.
  #topic 3.4 progress
#info 3.4 tracker https://bugzilla.redhat.com/show_bug.cgi?id=1024889'
#info blocker updates: 2 integration BZs in POST being tracked. 2 network 
BZs in NEW being tracked.
#info 3.4  beta refresh on 6th and second test day on 11th
#info gluster 3,4 updates: one issue found, patch exists and should be 
handled in time.
#info infra 3.4 updates: 3 features still in progress.
#info integration 3.4 updates: report setup and hosted engine being 
handled. no news for virt appliance.
#info network 3.4 updates: Issue found with libvirt implementation 
regarding host network QoS. currently being tracked.
#info ppc 3.4 updates: a few patches require reviews, as well as libguestfs 
currently being checked for Fedore 19/20.
#info SLA 3.4 updates: hosted engine fix still needs vdsm attention. 
Currently in progress. No other updates.
#info storage 3.4 updates: 2 features pushed forward to 3.5  (get rid of 
pool md, ovf on any domain). No other updates.
#info ux 3.4 updates: none. No known blockers.
#info virt 3.4 updates: mising REST API of tempalte versions. No known 
blockers.
  #topic conferences and workshops
#info London Cloud Summit currently running. FOSDEM, cfgmgmtcmp and 
infra.next prepared.
#info expecting an oVirt event in Korea soon.
  #topic other topics
#info due to various conferences and events next week's meeting is 
canceled. Next meeting will be on Feb 12.
#endmeeting
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] How to upload images to a local storage ISO domain

2014-01-29 Thread Dresel Christopher
Thanks to all replies.

From: users-boun...@ovirt.org [mailto:users-boun...@ovirt.org] On Behalf Of 
Dresel Christopher
Sent: Mittwoch, 29. Jänner 2014 12:33
To: users@ovirt.org
Subject: [Users] How to upload images to a local storage ISO domain

Hi,

when using the engine-iso-uploader (the only documented tool I found) for a 
local storage ISO domain, I get the following error:

ovirt-iso-uploader upload --iso-domain=ISOs file.iso –v

ERROR: mount.nfs: Failed to resolve server None: Name or service not known

It looks like engine-iso-uploader tries to connect as if it is an NFS storage?

What is the procedure for uploading isos to local storage domains? Are there 
other options than using the engine-iso-uploader?

Best Regards,
Christopher Dresel




Fachhochschule Wiener Neustadt für Wirtschaft und Technik Ges.m.b.H.
University of Applied Sciences Wiener Neustadt for Business and Engineering 
Ltd., Austria
Johannes Gutenberg-Straße 3
2700 Wiener Neustadt
Austria, Europe
ATU: 37772406
Firmenbuchnummer: 77005v
Firmenbuchgericht: Landesgericht Wiener Neustadt
DVR: 0769665
Der Inhalt dieses E-Mails ist ausschliesslich fuer den bezeichneten Adressaten 
bestimmt. Jede Form der Kenntnisnahme, Veroeffentlichung, Vervielfaeltigung 
oder Weitergabe des Inhalts dieses E-Mails durch unberechtigte Dritte ist 
unzulaessig. Wir bitten Sie, sich mit dem Absender des E-Mails in Verbindung zu 
setzen, falls Sie nicht der Adressat dieses E-Mails sind und das Material von 
Ihrem Computer zu loeschen.

This e-mail and any attachments are confidential and intended solely for the 
addressee. The perusal, publication, copying or dissemination of the contents 
of this e-mail by unauthorised third parties is prohibited. If you are not the 
intended recipient of this e-mail, please delete it and immediately notify the 
sender.



Fachhochschule Wiener Neustadt für Wirtschaft und Technik Ges.m.b.H.
University of Applied Sciences Wiener Neustadt for Business and Engineering 
Ltd., Austria
Johannes Gutenberg-Straße 3
2700 Wiener Neustadt
Austria, Europe
ATU: 37772406
Firmenbuchnummer: 77005v
Firmenbuchgericht: Landesgericht Wiener Neustadt
DVR: 0769665
Der Inhalt dieses E-Mails ist ausschliesslich fuer den bezeichneten Adressaten 
bestimmt. Jede Form der Kenntnisnahme, Veroeffentlichung, Vervielfaeltigung 
oder Weitergabe des Inhalts dieses E-Mails durch unberechtigte Dritte ist 
unzulaessig. Wir bitten Sie, sich mit dem Absender des E-Mails in Verbindung zu 
setzen, falls Sie nicht der Adressat dieses E-Mails sind und das Material von 
Ihrem Computer zu loeschen.

This e-mail and any attachments are confidential and intended solely for the 
addressee. The perusal, publication, copying or dissemination of the contents 
of this e-mail by unauthorised third parties is prohibited. If you are not the 
intended recipient of this e-mail, please delete it and immediately notify the 
sender.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] How to setup FreeBSD 8.3 Guest on oVirt 3.3

2014-01-29 Thread Dresel Christopher
Hi,

I’m using oVirt 3.3 and trying to setup a FreeBSD 8.3 Guest machine. The 
machine hangs after the booting screen -  I already googled about this, but 
didn’t find any helpful articles.

Cluster uses “Intel Penryn Family“, the VM is configured with 1 virtual CPU as 
suggested in one of the threads I found on the internet.

Still no luck :/



Fachhochschule Wiener Neustadt für Wirtschaft und Technik Ges.m.b.H.
University of Applied Sciences Wiener Neustadt for Business and Engineering 
Ltd., Austria
Johannes Gutenberg-Straße 3
2700 Wiener Neustadt
Austria, Europe
ATU: 37772406
Firmenbuchnummer: 77005v
Firmenbuchgericht: Landesgericht Wiener Neustadt
DVR: 0769665
Der Inhalt dieses E-Mails ist ausschliesslich fuer den bezeichneten Adressaten 
bestimmt. Jede Form der Kenntnisnahme, Veroeffentlichung, Vervielfaeltigung 
oder Weitergabe des Inhalts dieses E-Mails durch unberechtigte Dritte ist 
unzulaessig. Wir bitten Sie, sich mit dem Absender des E-Mails in Verbindung zu 
setzen, falls Sie nicht der Adressat dieses E-Mails sind und das Material von 
Ihrem Computer zu loeschen.

This e-mail and any attachments are confidential and intended solely for the 
addressee. The perusal, publication, copying or dissemination of the contents 
of this e-mail by unauthorised third parties is prohibited. If you are not the 
intended recipient of this e-mail, please delete it and immediately notify the 
sender.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] [vdsm] ovirtmgmt vanishes after reboot

2014-01-29 Thread Dan Kenigsberg
On Wed, Jan 29, 2014 at 11:07:08AM +0100, Fabian Deutsch wrote:
 Am Mittwoch, den 29.01.2014, 05:02 -0500 schrieb Antoni Segura Puimedon:
 Right.
 I believe the bug you are seeing is:
 http://gerrit.ovirt.org/#/c/20068/
 
 This has been merged into out stable branch, but the release of a new
 oVirt Node ISO for 3.3 is pending because of the vdsm-python-cpopen
 dependency problem.

Would you agree to patch `yum` on your build machine? I do not fully
understand the issue, but with the following patch, the dependency
problem is resolved for me:

diff --git a/yum/depsolve.py b/yum/depsolve.py
index 95c21bc..57cf379 100644
--- a/yum/depsolve.py
+++ b/yum/depsolve.py
@@ -720,7 +720,7 @@ class Depsolve(object):
 else:
 self.verbose_logger.debug(_('TSINFO: Marking %s as install for 
%s'), best,
 requiringPo)
-reqtuple = misc.string_to_prco_tuple(needname + str(needflags) + 
needversion)
+reqtuple = misc.string_to_prco_tuple(requirement)
 txmbrs = self.install(best, provides_for=reqtuple)
 for txmbr in txmbrs:
 txmbr.setAsDep(po=requiringPo)

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Extremely poor disk access speeds in Windows guest

2014-01-29 Thread Steve Dainard
On Wed, Jan 29, 2014 at 5:11 AM, Vadim Rozenfeld vroze...@redhat.comwrote:

 On Wed, 2014-01-29 at 11:30 +0200, Ronen Hod wrote:
  Adding the virtio-scsi developers.
  Anyhow, virtio-scsi is newer and less established than viostor (the
  block device), so you might want to try it out.

 [VR]
 Was it SCSI Controller or SCSI pass-through controller?
 If it's SCSI Controller then it will be viostor (virtio-blk) device
 driver.


SCSI Controller is listed in device manager.

Hardware ID's:
PCI\VEN_1AF4DEV_1004SUBSYS_00081AF4REV_00
PCI\VEN_1AF4DEV_1004SUBSYS_00081AF4
PCI\VEN_1AF4DEV_1004CC_01
PCI\VEN_1AF4DEV_1004CC_0100




  A disclaimer: There are time and patches gaps between RHEL and other
  versions.
 
  Ronen.
 
  On 01/28/2014 10:39 PM, Steve Dainard wrote:
 
   I've had a bit of luck here.
  
  
   Overall IO performance is very poor during Windows updates, but a
   contributing factor seems to be the SCSI Controller device in the
   guest. This last install I didn't install a driver for that device,

 [VR]
 Does it mean that your system disk is IDE and the data disk (virtio-blk)
 is not accessible?


In Ovirt 3.3.2-1.el6 I do not have an option to add a virtio-blk device:
Screenshot here:
https://dl.dropboxusercontent.com/u/21916057/Screenshot%20from%202014-01-29%2010%3A04%3A57.png

VM disk drive is Red Hat VirtIO SCSI Disk Device, storage controller is
listed as Red Hat VirtIO SCSI Controller as shown in device manager.
Screenshot here:
https://dl.dropboxusercontent.com/u/21916057/Screenshot%20from%202014-01-29%2009%3A57%3A24.png

In Ovirt manager the disk interface is listed as VirtIO.
Screenshot here:
https://dl.dropboxusercontent.com/u/21916057/Screenshot%20from%202014-01-29%2009%3A58%3A35.png



and my performance is much better. Updates still chug along quite
   slowly, but I seem to have more than the  100KB/s write speeds I
   was seeing previously.
  
  
   Does anyone know what this device is for? I have the Red Hat VirtIO
   SCSI Controller listed under storage controllers.

 [VR]
 It's a virtio-blk device. OS cannot see this volume unless you have
 viostor.sys driver installed on it.


Interesting that my VM's can see the controller, but I can't add a disk for
that controller in Ovirt. Is there a package I have missed on install?

rpm -qa | grep ovirt
ovirt-host-deploy-java-1.1.3-1.el6.noarch
ovirt-engine-backend-3.3.2-1.el6.noarch
ovirt-engine-lib-3.3.2-1.el6.noarch
ovirt-engine-restapi-3.3.2-1.el6.noarch
ovirt-engine-sdk-python-3.3.0.8-1.el6.noarch
ovirt-log-collector-3.3.2-2.el6.noarch
ovirt-engine-dbscripts-3.3.2-1.el6.noarch
ovirt-engine-webadmin-portal-3.3.2-1.el6.noarch
ovirt-host-deploy-1.1.3-1.el6.noarch
ovirt-image-uploader-3.3.2-1.el6.noarch
ovirt-engine-websocket-proxy-3.3.2-1.el6.noarch
ovirt-engine-userportal-3.3.2-1.el6.noarch
ovirt-engine-setup-3.3.2-1.el6.noarch
ovirt-iso-uploader-3.3.2-1.el6.noarch
ovirt-engine-cli-3.3.0.6-1.el6.noarch
ovirt-engine-3.3.2-1.el6.noarch
ovirt-engine-tools-3.3.2-1.el6.noarch


  
   I've setup a NFS storage domain on my desktops SSD.
   I've re-installed
   win 2008 r2 and initially it was running smoother.
  
   Disk performance peaks at 100MB/s.
  
   If I copy a 250MB file from a share into the Windows
   VM, it writes out
 [VR]
 Do you copy it with Explorer or any other copy program?


Windows Explorer only.


 Do you have HPET enabled?


I can't find it in the guest 'system devices'. On the hosts the current
clock source is 'tsc', although 'hpet' is an available option.


 How does it work with if you copy from/to local (non-NFS) storage?


Not sure, this is a royal pain to setup. Can I use my ISO domain in two
different data centers at the same time? I don't have an option to create
an ISO / NFS domain in the local storage DC.

When I use the import option with the default DC's ISO domain, I get an
error There is no storage domain under the specified path. Check event log
for more details. VDMS logs show Resource namespace
0e90e574-b003-4a62-867d-cf274b17e6b1_imageNS already registered so I'm
guessing the answer is no.

I tried to deploy with WDS, but the 64bit drivers apparently aren't signed,
and on x86 I get an error about the NIC not being supported even with the
drivers added to WDS.



 What is your virtio-win drivers package origin and version?


virtio-win-0.1-74.iso -
http://alt.fedoraproject.org/pub/alt/virtio-win/latest/images/



 Thanks,
 Vadim.



Appreciate it,
Steve
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Notes on setting spice-proxy console option

2014-01-29 Thread Michal Skrivanek


 On 29 Jan 2014, at 16:59, David Li david...@sbcglobal.net wrote:
 
 
 
 
 
 - Original Message -
 From: Michal Skrivanek michal.skriva...@redhat.com
 To: Marian Krcmarik mkrcm...@redhat.com; David Li david...@sbcglobal.net
 Cc: users@ovirt.org
 Sent: Wednesday, January 29, 2014 6:00 AM
 Subject: Re: [Users] Notes on setting spice-proxy console option
 
 
 
 
 1. Will spice-proxy work with the Spice HTML5 client in the browser?
 Probably, but you would need to set the websocket proxy which is part of 
 installation steps for engine as well (I believe).
 
 since it's a proxy as well I'd recommend to use it instead of squid, not 
 in addition. By default it's installed on the engine host but it can be 
 deployed anywhere else.
 
 I am interested in trying this. I tried before but didn't succeed. Is it 
 enough to configure websocket proxy on the engine like this?

Well, this is the default deployment on engine. The engine host is the proxy. 
If that's what you need there is no additional configuration needed. Just say 
yes to websocket proxy during installation and import ca.crt to your browser

Only if you want to run the proxy elsewhere you need to update the below 
parameters accordingly

Thanks,
michal
 
 WebSocketProxy: engine-IP-addr:6100 version: general
 WebSocketProxyTicketValiditySeconds: 120 version: general
 
 Any pointer to a good setup guide?
 
 David
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] How to setup FreeBSD 8.3 Guest on oVirt 3.3

2014-01-29 Thread Joop
Dresel Christopher dre...@fhwn.ac.at wrote:
Hi,

I’m using oVirt 3.3 and trying to setup a FreeBSD 8.3 Guest machine.
The machine hangs after the booting screen -  I already googled about
this, but didn’t find any helpful articles.

Cluster uses “Intel Penryn Family“, the VM is configured with 1 virtual
CPU as suggested in one of the threads I found on the internet.
You're probably looking at x2pic cpu flag problems. A vdsm hook might help. 
Have posted to proxmox forum about this. Also, solaris seems affected, has been 
discussed on this list.

Joop

-- 
Sent from my Android tablet with K-9 Mail. Please excuse my brevity.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] What settings are needed to run windows 98 on a VM?

2014-01-29 Thread Eliezer Croitoru

Thanks,

I have used kvm with command line in the past but I an using now 
virt-manager.
In virt-manager I am almost sure that I will need to enter it somewhere 
but I am not sure where it would be.
I have changed the CPU type but I needed to change the domain type from 
kvm to qemu.

Using this line
domain type='qemu'
instead of
domain type='kvm'

Which I was unable to understand how to change using the virt-manager menus.

Thanks,
Eliezer

On 21/01/14 16:52, Sander Grendelman wrote:

It looks like this (kind of) works on ovirt already.
I was able to install win98 from cdrom on a VM with IDE disk.

There are some issues:
- The VM tends to hang during boot, choosing step by step confirmation
(my first suspect was javasup.vxd) seems to fix this, this doesn't
happen if I use the -no-kvm option with a local kvm install.
- No network (no e1000 for win98 it seems) this can probably be fixed
with a hook to add a ne2k_pci nic
- Slow graphics, this can probably be fixed with a hook to use the
cirrus adapter in stead of qlx or vga

Working cli kvm start:
qemu-kvm -m 512 -hda /var/tmp/win98.qcow2 -no-kvm -cdrom
/var/tmp/win98se.iso -vga cirrus

Some more info here: http://ubuntuforums.org/showthread.php?t=774745

On Tue, Jan 21, 2014 at 7:27 AM, Eliezer Croitoru elie...@ngtech.co.il wrote:

I wanted to make sure I understand.
This guy comes to me with windows98 and says he must run this machine.
I don't really care about this machine but this is what he wants..
Is there any options else then XEN that will run windows98 on a VM?

Thanks,
Eliezer
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Extremely poor disk access speeds in Windows guest

2014-01-29 Thread Itamar Heim

On 01/29/2014 07:35 PM, Steve Dainard wrote:
...

In Ovirt 3.3.2-1.el6 I do not have an option to add a virtio-blk device:
Screenshot here:
https://dl.dropboxusercontent.com/u/21916057/Screenshot%20from%202014-01-29%2010%3A04%3A57.png


virtio is virtio-blk (in the beginning, there was only one virtio, 
virtio-blk)



...
Do you have HPET enabled?


I can't find it in the guest 'system devices'. On the hosts the current
clock source is 'tsc', although 'hpet' is an available option.

How does it work with if you copy from/to local (non-NFS) storage?


Not sure, this is a royal pain to setup. Can I use my ISO domain in two
different data centers at the same time? I don't have an option to
create an ISO / NFS domain in the local storage DC.


an iso domain can be associated to two data centers (or more, or 
different engines, etc.)




When I use the import option with the default DC's ISO domain, I get an
error There is no storage domain under the specified path. Check event
log for more details. VDMS logs show Resource namespace
0e90e574-b003-4a62-867d-cf274b17e6b1_imageNS already registered so I'm
guessing the answer is no.


the answer is yes, please open a separate thread on this issue to make 
it easier to troubleshoot it.


thanks



I tried to deploy with WDS, but the 64bit drivers apparently aren't
signed, and on x86 I get an error about the NIC not being supported even
with the drivers added to WDS.

What is your virtio-win drivers package origin and version?


virtio-win-0.1-74.iso -
http://alt.fedoraproject.org/pub/alt/virtio-win/latest/images/


Thanks,
Vadim.



Appreciate it,
Steve


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Extremely poor disk access speeds in Windows guest

2014-01-29 Thread Alan Murrell
I notced updates on a Win7 VM I ceated previously were *really* slow,  
but when I logged in to it remotely for daily use, it seemed pretty  
snappy.  I did not do any significant data transfers, however.  I had  
the same latest virtio-win drivers installed, and in oVirt, the disk  
was of type VIRTIO (and not VIRTIO-SCSI).


For other reasons, I have rebuilt my test host, and am going to be  
installing a new Windows 7 VM.  Is there anything I can do in this  
process to provide more data and help with this troubleshooting?


-Alan

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] How to setup FreeBSD 8.3 Guest on oVirt 3.3

2014-01-29 Thread Dresel Christopher
Thank you, I already found your repo and I'm testing.

The custom property is selectable via the GUI, the hook gets called, but I 
don't know why it don’t work - it seems like the custom property does not get 
passed to the hook :/

-Original Message-
From: René Koch [mailto:rk...@linuxland.at]
Sent: Mittwoch, 29. Jänner 2014 20:51
To: Joop; Dresel Christopher
Cc: users@ovirt.org
Subject: Re: [Users] How to setup FreeBSD 8.3 Guest on oVirt 3.3

On Wed, 2014-01-29 at 19:27 +0100, Joop wrote:
 Dresel Christopher dre...@fhwn.ac.at wrote:
 Hi,
 
 I’m using oVirt 3.3 and trying to setup a FreeBSD 8.3 Guest machine.
 The machine hangs after the booting screen -  I already googled about
 this, but didn’t find any helpful articles.
 
 Cluster uses “Intel Penryn Family“, the VM is configured with 1
 virtual CPU as suggested in one of the threads I found on the internet.
 You're probably looking at x2pic cpu flag problems. A vdsm hook might help. 
 Have posted to proxmox forum about this. Also, solaris seems affected, has 
 been discussed on this list.


For Solaris I wrote a hook which can be found here (works fine for Solaris 
10/11 on Intel CPUs):
https://github.com/ovido/ovirt-stuff/tree/master/hooks

Don't know if this is needed for (older versions of) FreeBSD, too.


Regards,
René


 Joop






Fachhochschule Wiener Neustadt für Wirtschaft und Technik Ges.m.b.H.
University of Applied Sciences Wiener Neustadt for Business and Engineering 
Ltd., Austria
Johannes Gutenberg-Straße 3
2700 Wiener Neustadt
Austria, Europe
ATU: 37772406
Firmenbuchnummer: 77005v
Firmenbuchgericht: Landesgericht Wiener Neustadt
DVR: 0769665
Der Inhalt dieses E-Mails ist ausschliesslich fuer den bezeichneten Adressaten 
bestimmt. Jede Form der Kenntnisnahme, Veroeffentlichung, Vervielfaeltigung 
oder Weitergabe des Inhalts dieses E-Mails durch unberechtigte Dritte ist 
unzulaessig. Wir bitten Sie, sich mit dem Absender des E-Mails in Verbindung zu 
setzen, falls Sie nicht der Adressat dieses E-Mails sind und das Material von 
Ihrem Computer zu loeschen.

This e-mail and any attachments are confidential and intended solely for the 
addressee. The perusal, publication, copying or dissemination of the contents 
of this e-mail by unauthorised third parties is prohibited. If you are not the 
intended recipient of this e-mail, please delete it and immediately notify the 
sender.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Storage unresponsive after sanlock

2014-01-29 Thread Trey Dockendorf
On Wed, Jan 29, 2014 at 4:33 AM, Maor Lipchuk mlipc...@redhat.com wrote:
 The VDSM log seems to be from the 26th and from the engine logs it seems
 that the incident occurred at the 24th, so I can't really see the what
 happened in VDSM that time.

 From the engine logs it seems that at around 2014-01-24 16:59 the master
 storage domain was in maintenance and then there was an attempt to
 activate it, but VDSM threw an exception that it cannot find master
 domain with the arguments of
 spUUID=5849b030-626e-47cb-ad90-3ce782d831b3,
 msdUUID=7c49750d-7eae-4cd2-9b63-1dc71f357b88'


The actual error was higher in the logs after I tried activating this
host. Puppet had removed the unmanaged /etc/sudoers.d/50_vdsm file and
that was preventing vdsm from being able to execute any mount
commands.  The issues with vm02 are likely all due to that mistake on
my part.  My apologies.

- Trey

 This could be happen from various reasons, for example a failure in
 connecting the storage (for example see https://bugzilla.redhat.com/782864)

 Since you mentioned that once you have added a second node and it
 worked, it seems like to origin of the problem is in the Host it self.

 what are the differences between the two hosts (VDSM version, OS version)
 Does the first host succeeded to work on other DC?
 Have you tried to reinstall it?

 Regards,
 Maor




 On 01/29/2014 02:50 AM, Trey Dockendorf wrote:
 See attached.  The event seems to have begun around 06:00:00 on
 2014-01-26.  I was unable to get the single node cluster back online
 so I provisioned another node to add to the cluster, which became the
 SPM.  Adding the second node worked and I had to power cycle the node
 that hung as sanlock was in a zombie state.  This is my first attempt
 at production use of NFS over RDMA and I'd like to rule out that being
 the cause.  Since the issue I've changed the 'nfs_mount_options' in
 /etc/vdsm/vdsm.conf to 'soft,nosharecache,rdma,port=20049'.  The
 options during the crash were only 'rdma,port=20049'.  I am also
 forcing NFSv3 by setting 'Nfsvers=3' in /etc/nfsmount.conf, which is
 still in place and was in place during the crash.

 Thanks
 - Trey

 On Tue, Jan 28, 2014 at 2:45 AM, Maor Lipchuk mlipc...@redhat.com wrote:
 Hi Trey,

 Can you please also attach the engine/vdsm logs.

 Thanks,
 Maor

 On 01/27/2014 06:12 PM, Trey Dockendorf wrote:
 I setup my first oVirt instance since 3.0 a few days ago and it went
 very well, and I left the single host cluster running with 1 VM over
 the weekend.  Today I come back and the primary data storage is marked
 as unresponsive.  The logs are full of entries [1] that look very
 similar to a knowledge base article on RHEL's website [2].

 This setup is using NFS over RDMA and so far the ib interfaces report
 no errors (via `ibcheckerrs -v LID 1`).  Based on a doc on ovirt
 site [3] it seems this could be due to response problems.  The storage
 system is a new purchase and not yet in production so if there's any
 advice on how to track down the cause that would be very helpful.
 Please let me know what additional information would be helpful as
 it's been about a year since I've been active in the oVirt community.

 Thanks
 - Trey

 [1]: http://pastebin.com/yRpSLKxJ

 [2]: https://access.redhat.com/site/solutions/400463

 [3]: http://www.ovirt.org/SANLock
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users




___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] engine reports and dwh setup in 3.3.2

2014-01-29 Thread Gianluca Cecchi
On Thu, Jan 16, 2014 at 4:57 PM, Yaniv Dary  wrote:
 The oVirt 3.3.3 release should be working for all install and upgrade flows.
 Please use that on your setup once it's released.



 Yaniv

 - Original Message -
 From: Yaniv Dary yd...@redhat.com
 To: Gianluca Cecchi gianluca.cec...@gmail.com
 Cc: users users@ovirt.org
 Sent: Sunday, January 5, 2014 3:54:40 PM
 Subject: Re: [Users] engine reports and dwh setup in 3.3.2



 - Original Message -
  From: Gianluca Cecchi gianluca.cec...@gmail.com
  To: Yaniv Dary yd...@redhat.com
  Cc: users users@ovirt.org
  Sent: Saturday, January 4, 2014 12:59:46 AM
  Subject: Re: [Users] engine reports and dwh setup in 3.3.2
 
  On Wed, Dec 25, 2013 at 1:00 AM, Yaniv Dary wrote:
 
  
   Hi,
   We have found a few blockers on the setup of dwh and reports.
   We hope to resolve the issues in the next few days. If you restore your
   environment using the backups, you will be able to upgrade.
   I've put a note to let you know when new packages are available.
 
  Ok, I'll wait good news about these items
  Just a question: when you write about blockers, are you referring to
  updates only or also to new setups directly made in 3.3.2
  environments?

 Local fresh installs should work and upgrades\remote fresh install will not
 probably.


 
  thanks in advance,
  Gianluca
 

Just to note that on my oVirt 3.3.2-1 allinone  on fedora 19, after
enabling updates-testing I get the same failure using 3.3.3rc

[root@tekkaman ~]# engine-setup
[ INFO  ] Stage: Initializing
[ INFO  ] Stage: Environment setup
  Configuration files:
['/etc/ovirt-engine-setup.conf.d/10-packaging-aio.conf',
'/etc/ovirt-engine-setup.conf.d/10-packaging.conf',
'/etc/ovirt-engine-setup.conf.d/20-setup-aio.conf',
'/etc/ovirt-engine-setup.conf.d/20-setup-ovirt-post.conf']
  Log file:
/var/log/ovirt-engine/setup/ovirt-engine-setup-20140129214203.log
  Version: otopi-1.1.2 (otopi-1.1.2-1.fc19)
[ INFO  ] Stage: Environment packages setup
[ INFO  ] Stage: Programs detection
[ INFO  ] Stage: Environment setup
[ INFO  ] Stage: Environment customization

  --== PACKAGES ==--

[ INFO  ] Checking for product updates...
  Setup has found updates for some packages, do you wish to
update them now? (Yes, No) [Yes]:
[ INFO  ] Checking for an update for Setup...

  --== NETWORK CONFIGURATION ==--

[WARNING] Failed to resolve tekkaman.localdomain.local using DNS, it
can be resolved only locally
  Setup can automatically configure the firewall on this system.
  Note: automatic configuration of the firewall may overwrite
current settings.
  Do you want Setup to configure the firewall? (Yes, No) [Yes]:
[ INFO  ] iptables will be configured as firewall manager.

  --== DATABASE CONFIGURATION ==--


  --== OVIRT ENGINE CONFIGURATION ==--

  Skipping storing options as database already prepared

  --== PKI CONFIGURATION ==--

  PKI is already configured

  --== APACHE CONFIGURATION ==--


  --== SYSTEM CONFIGURATION ==--


  --== END OF CONFIGURATION ==--

[ INFO  ] Stage: Setup validation
  During execution engine service will be stopped (OK, Cancel) [OK]:
[ INFO  ] Cleaning stale zombie tasks

  --== CONFIGURATION PREVIEW ==--

  Database name  : engine
  Database secured connection: False
  Database host  : localhost
  Database user name : engine
  Database host name validation  : False
  Datbase port   : 5432
  NFS setup  : True
  Firewall manager   : iptables
  Update Firewall: True
  Configure WebSocket Proxy  : True
  Host FQDN  : tekkaman.localdomain.local
  NFS mount point: /ISO
  Set application as default page: True
  Configure Apache SSL   : False
  Upgrade packages   : True

  Please confirm installation settings (OK, Cancel) [OK]:
[ INFO  ] Cleaning async tasks and compensations
[ INFO  ] Checking the DB consistency
[ INFO  ] Stage: Transaction setup
[ INFO  ] Stopping engine service
[ INFO  ] Stopping websocket-proxy service
[ INFO  ] Stage: Misc configuration
[ INFO  ] Fixing DB inconsistencies
[ INFO  ] Stage: Package installation
[ INFO  ] Yum Status: Downloading Packages
[ INFO  ] Yum Download/Verify: ovirt-engine-3.3.3-1.fc19.noarch
[ INFO  ] Yum Downloading: (2/8):
ovirt-engine-backend-3.3.3-1.fc19.noarch.rpm 1.6 M(44%)
[ INFO  ] Yum Downloading: (2/8):
ovirt-engine-backend-3.3.3-1.fc19.noarch.rpm 2.8 M(76%)
[ INFO  ] Yum Download/Verify: ovirt-engine-backend-3.3.3-1.fc19.noarch
[ INFO  ] Yum Download/Verify: ovirt-engine-dbscripts-3.3.3-1.fc19.noarch
[ INFO  ] Yum Downloading: (4/8):

Re: [Users] engine reports and dwh setup in 3.3.2

2014-01-29 Thread Gianluca Cecchi
On Wed, Jan 29, 2014 at 10:49 PM, Gianluca Cecchi  wrote:

 [ INFO  ] Backing up database to
 '/var/lib/ovirt-engine/backups/engine-20140129223909.I3iTp5.sql'.
 [ INFO  ] Updating database schema
 [ ERROR ] Failed to execute stage 'Misc configuration': [Errno 2] No
 such file or directory:
 '/var/lib/ovirt-engine/deployments/ovirt-engine-reports.war'
 [ INFO  ] Yum Performing yum transaction rollback

 Is the patch to be put yet for final 3.3.3?
 Or will there be any particular note in release (for example to
 deinstall reports..)?

 Gianluca

Note that my current situation is:

[root@tekkaman ~]# ll /var/lib/ovirt-engine/deployments
total 8
lrwxrwxrwx. 1 root  root  34 Jan  7  2013 engine.ear -
/usr/share/ovirt-engine/engine.ear
-rw-r--r--. 1 ovirt ovirt 10 Dec 21 12:28 engine.ear.deployed
lrwxrwxrwx. 1 root  root  48 Feb 19  2013 ovirt-engine-reports.war -
/usr/share/ovirt-engine/ovirt-engine-reports.war
-rw-r--r--. 1 ovirt ovirt 24 Dec 21 15:43 ovirt-engine-reports.war.dodeploy

with  /usr/share/ovirt-engine/ovirt-engine-reports.war being a broken link

[root@tekkaman ~]# ll /usr/share/ovirt-engine
total 52
drwxr-xr-x. 2 root  root  4096 Jan 29 22:38 bin
drwxr-xr-x. 3 root  root  4096 Jan 21 11:10 branding
drwxr-xr-x. 2 root  root  4096 Jan 21 11:10 conf
drwxr-xr-x. 3 root  root  4096 Jan 29 22:39 dbscripts
drwxr-xr-x. 8 ovirt ovirt 4096 Jan 29 22:38 engine.ear
drwxr-xr-x. 2 root  root  4096 Jan 29 22:38 files
drwxr-xr-x. 4 root  root  4096 Jan 21 11:10 firewalld
drwxr-xr-x. 2 root  root  4096 Jan 29 22:38 manual
drwxr-xr-x. 4 root  root  4096 Jan 21 11:10 modules
drwxr-xr-x. 3 root  root  4096 Nov 24 20:51 scripts
drwxr-xr-x. 5 root  root  4096 Jan 21 11:10 services
drwxr-xr-x. 6 root  root  4096 Jan 21 11:10 setup
drwxr-xr-x. 2 root  root  4096 Jan 21 11:10 ui-plugins

Gianluca
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] engine reports and dwh setup in 3.3.2

2014-01-29 Thread Alon Bar-Lev


- Original Message -
 From: Gianluca Cecchi gianluca.cec...@gmail.com
 To: Yaniv Dary yd...@redhat.com
 Cc: users users@ovirt.org
 Sent: Wednesday, January 29, 2014 11:49:04 PM
 Subject: Re: [Users] engine reports and dwh setup in 3.3.2
 
 Just to note that on my oVirt 3.3.2-1 allinone  on fedora 19, after
 enabling updates-testing I get the same failure using 3.3.3rc

Can it be that the following is dead symlink? if so please remove.

/var/lib/ovirt-engine/deployments/ovirt-engine-reports.war
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] engine reports and dwh setup in 3.3.2

2014-01-29 Thread Gianluca Cecchi
On Wed, Jan 29, 2014 at 10:55 PM, Alon Bar-Lev  wrote:


 - Original Message -
 From: Gianluca Cecchi
 To: Yaniv Dary
 Cc: users users@ovirt.org
 Sent: Wednesday, January 29, 2014 11:49:04 PM
 Subject: Re: [Users] engine reports and dwh setup in 3.3.2

 Just to note that on my oVirt 3.3.2-1 allinone  on fedora 19, after
 enabling updates-testing I get the same failure using 3.3.3rc

 Can it be that the following is dead symlink? if so please remove.

 /var/lib/ovirt-engine/deployments/ovirt-engine-reports.war

I confirm that
- rollback (made automatically during the failed engine-setup first run)
- rm /var/lib/ovirt-engine/deployments/ovirt-engine-reports.war
- engine-setup again
now it completes correctly

Probably it is worth to put a note in release notes.
But I don't know where:
http://wiki.ovirt.org/OVirt_3.3.3_release_notes
or
http://wiki.ovirt.org/OVirt_3.2_to_3.3_upgrade
?
Because I previously used (in December I think) the latest link to
migrate from f18+3.2 to f19+3.3.2

Gianluca
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] engine reports and dwh setup in 3.3.2

2014-01-29 Thread Alon Bar-Lev


- Original Message -
 From: Gianluca Cecchi gianluca.cec...@gmail.com
 To: Alon Bar-Lev alo...@redhat.com
 Cc: Yaniv Dary yd...@redhat.com, users users@ovirt.org
 Sent: Thursday, January 30, 2014 12:20:46 AM
 Subject: Re: [Users] engine reports and dwh setup in 3.3.2
 
 On Wed, Jan 29, 2014 at 10:55 PM, Alon Bar-Lev  wrote:
 
 
  - Original Message -
  From: Gianluca Cecchi
  To: Yaniv Dary
  Cc: users users@ovirt.org
  Sent: Wednesday, January 29, 2014 11:49:04 PM
  Subject: Re: [Users] engine reports and dwh setup in 3.3.2
 
  Just to note that on my oVirt 3.3.2-1 allinone  on fedora 19, after
  enabling updates-testing I get the same failure using 3.3.3rc
 
  Can it be that the following is dead symlink? if so please remove.
 
  /var/lib/ovirt-engine/deployments/ovirt-engine-reports.war
 
 I confirm that
 - rollback (made automatically during the failed engine-setup first run)
 - rm /var/lib/ovirt-engine/deployments/ovirt-engine-reports.war
 - engine-setup again
 now it completes correctly
 
 Probably it is worth to put a note in release notes.
 But I don't know where:
 http://wiki.ovirt.org/OVirt_3.3.3_release_notes
 or
 http://wiki.ovirt.org/OVirt_3.2_to_3.3_upgrade
 ?
 Because I previously used (in December I think) the latest link to
 migrate from f18+3.2 to f19+3.3.2

I guess both... as people can upgrade directly to 3.3.3
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] engine-backup restore how to

2014-01-29 Thread Steve Dainard
There doesn't seem to be any solid documentation on how to use the
engine-backup restore function, and I'm not able to restore a backup.

The best I've come up with is:
1. Install engine on new host
2. Stop engine
3. run engine-backup --mode=restore --file=filename --log=logfile

Fail.

Log shows:
psql: FATAL:  password authentication failed for user engine
2014-01-29 18:20:30 10285: FATAL: Can't connect to the database

4. engine-backup --mode=restore --file=engine.bak --log=logfile
--change-db-credentials --db-host=localhost --db-user=engine
--db-name=engine --db-password='newpassword'

Fails with same error.

5. change user to postgres, drop the old db, create a new db named engine,
set password for engine user same as 'newpassword'

6. engine-backup --mode=restore --file=engine.bak --log=logfile
--change-db-credentials --db-host=localhost --db-user=engine
--db-name=engine --db-password='newpassword'

Restoring...
Rewriting /etc/ovirt-engine/engine.conf.d/10-setup-database.conf
Note: you might need to manually fix:
- iptables/firewalld configuration
- autostart of ovirt-engine service
You can now start the engine service and then restart httpd
Done.

7. start ovirt-engine, restart httpd, browse to web ui

Blank page, no content.

8. stop firewall, browse to web ui

Blank page, no content

9. Engine log contains:

2014-01-29 18:35:56,973 INFO  [org.ovirt.engine.core.utils.LocalConfig]
(MSC service
thread 1-40) Value of property SENSITIVE_KEYS is
,ENGINE_DB_PASSWORD,ENGINE_PKI_TR
UST_STORE_PASSWORD,ENGINE_PKI_ENGINE_STORE_PASSWORD.
2014-01-29 18:35:57,330 ERROR [org.ovirt.engine.core.bll.Backend] (MSC
service thread
 1-25) Error in getting DB connection. The database is inaccessible.
Original exception is: BadSqlGrammarException: CallableStatementCallback;
bad SQL grammar [{call checkdbconnection()}]; nested exception is
org.postgresql.util.PSQLException: ERROR: function checkdbconnection() does
not exist
  Hint: No function matches the given name and argument types. You might
need to add explicit type casts.
  Position: 15
2014-01-29 18:35:58,336 ERROR [org.ovirt.engine.core.bll.Backend] (MSC
service thread 1-25) Error in getting DB connection. The database is
inaccessible. Original exception is: UncategorizedSQLException:
CallableStatementCallback; uncategorized SQLException for SQL [{call
checkdbconnection()}]; SQL state [25P02]; error code [0]; ERROR: current
transaction is aborted, commands ignored until end of transaction block;
nested exception is org.postgresql.util.PSQLException: ERROR: current
transaction is aborted, commands ignored until end of transaction block


*Steve Dainard *
IT Infrastructure Manager
Miovision http://miovision.com/ | *Rethink Traffic*
519-513-2407 ex.250
877-646-8476 (toll-free)

*Blog http://miovision.com/blog  |  **LinkedIn
https://www.linkedin.com/company/miovision-technologies  |  Twitter
https://twitter.com/miovision  |  Facebook
https://www.facebook.com/miovision*
--
 Miovision Technologies Inc. | 148 Manitou Drive, Suite 101, Kitchener, ON,
Canada | N2C 1L3
This e-mail may contain information that is privileged or confidential. If
you are not the intended recipient, please delete the e-mail and any
attachments and notify us immediately.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] engine-backup restore how to

2014-01-29 Thread Alon Bar-Lev


- Original Message -
 From: Steve Dainard sdain...@miovision.com
 To: users users@ovirt.org
 Sent: Thursday, January 30, 2014 1:59:08 AM
 Subject: [Users] engine-backup restore how to
 
 There doesn't seem to be any solid documentation on how to use the
 engine-backup restore function, and I'm not able to restore a backup.
 
 The best I've come up with is:
 1. Install engine on new host
 2. Stop engine
 3. run engine-backup --mode=restore --file=filename --log=logfile
 
 Fail.
 
 Log shows:
 psql: FATAL: password authentication failed for user engine
 2014-01-29 18:20:30 10285: FATAL: Can't connect to the database
 
 4. engine-backup --mode=restore --file=engine.bak --log=logfile
 --change-db-credentials --db-host=localhost --db-user=engine
 --db-name=engine --db-password='newpassword'
 
 Fails with same error.

the --db-password must match the user's actual password within database, 
--change-db-credentials does not change the password in database but the 
host/port/user/password that are used by enigne.

 
 5. change user to postgres, drop the old db, create a new db named engine,
 set password for engine user same as 'newpassword'
 
 6. engine-backup --mode=restore --file=engine.bak --log=logfile
 --change-db-credentials --db-host=localhost --db-user=engine
 --db-name=engine --db-password='newpassword'

Ok, this is correct now.

 Restoring...
 Rewriting /etc/ovirt-engine/engine.conf.d/10-setup-database.conf
 Note: you might need to manually fix:
 - iptables/firewalld configuration
 - autostart of ovirt-engine service
 You can now start the engine service and then restart httpd
 Done.
 
 7. start ovirt-engine, restart httpd, browse to web ui
 
 Blank page, no content.
 
 8. stop firewall, browse to web ui
 
 Blank page, no content
 
 9. Engine log contains:
 
 2014-01-29 18:35:56,973 INFO [org.ovirt.engine.core.utils.LocalConfig] (MSC
 service
 thread 1-40) Value of property SENSITIVE_KEYS is
 ,ENGINE_DB_PASSWORD,ENGINE_PKI_TR
 UST_STORE_PASSWORD,ENGINE_PKI_ENGINE_STORE_PASSWORD.
 2014-01-29 18:35:57,330 ERROR [org.ovirt.engine.core.bll.Backend] (MSC
 service thread
 1-25) Error in getting DB connection. The database is inaccessible. Original
 exception is: BadSqlGrammarException: CallableStatementCallback; bad SQL
 grammar [{call checkdbconnection()}]; nested exception is
 org.postgresql.util.PSQLException: ERROR: function checkdbconnection() does
 not exist
 Hint: No function matches the given name and argument types. You might need
 to add explicit type casts.
 Position: 15
 2014-01-29 18:35:58,336 ERROR [org.ovirt.engine.core.bll.Backend] (MSC
 service thread 1-25) Error in getting DB connection. The database is
 inaccessible. Original exception is: UncategorizedSQLException:
 CallableStatementCallback; uncategorized SQLException for SQL [{call
 checkdbconnection()}]; SQL state [25P02]; error code [0]; ERROR: current
 transaction is aborted, commands ignored until end of transaction block;
 nested exception is org.postgresql.util.PSQLException: ERROR: current
 transaction is aborted, commands ignored until end of transaction block
 

It looks like the database is empty... are you sure the engine.bak is intact? 
does you see CheckDBConnection there?

Thanks,
Alon
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Ovirt 3.4 - Fail to set permissions to VM

2014-01-29 Thread Yair Zaslavsky
Yes,
A fix was already submitted for review.


- Original Message -
 From: Jonas Israelsson jo...@israelsson.com
 To: Oved Ourfalli ov...@redhat.com
 Cc: users@ovirt.org, Juan Hernandez jhern...@redhat.com, Yair Zaslavsky 
 yzasl...@redhat.com
 Sent: Wednesday, January 29, 2014 2:44:46 PM
 Subject: Re: [Users] Ovirt 3.4 - Fail to set permissions to VM
 
 
 On 29/01/14 07:29, Oved Ourfalli wrote:
  Hi Jonas
 
  Apparently there is a quite new bug open about this issue
  (https://bugzilla.redhat.com/1057147).
  CC-ing Juan and Yair - perhaps the'll know what's the source of the issue,
  as I think they were the last ones to make changes in it.
 Jupp, got it.
 
 Sorry for not checking there first..
 
 
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] engine-backup restore how to

2014-01-29 Thread Steve Dainard
I also see this error in engine.log which repeats every second if I am
trying to access the web ui.

2014-01-29 18:59:47,531 ERROR [org.ovirt.engine.core.bll.Backend]
(ajp--127.0.0.1-8702-4) Error in getting DB connection. The database is
inaccessible. Original exception is: UncategorizedSQLException:
CallableStatementCallback; uncategorized SQLException for SQL [{call
checkdbconnection()}]; SQL state [25P02]; error code [0]; ERROR: current
transaction is aborted, commands ignored until end of transaction block;
nested exception is org.postgresql.util.PSQLException: ERROR: current
transaction is aborted, commands ignored until end of transaction block

It looks like the db inserted correctly, I took a quick look through some
tables and can see the valid admin user, and snapshots. But I can't say for
certain.

The IP address of the new server does not match the IP of the old (backup
file) server, would this have any impact? I would think not as its a local
db.

When I changed the password for the psql engine user, is there any config
file this is referenced in that may not have been updated?

Thanks,

*Steve Dainard *
IT Infrastructure Manager
Miovision http://miovision.com/ | *Rethink Traffic*
519-513-2407 ex.250
877-646-8476 (toll-free)

*Blog http://miovision.com/blog  |  **LinkedIn
https://www.linkedin.com/company/miovision-technologies  |  Twitter
https://twitter.com/miovision  |  Facebook
https://www.facebook.com/miovision*
--
 Miovision Technologies Inc. | 148 Manitou Drive, Suite 101, Kitchener, ON,
Canada | N2C 1L3
This e-mail may contain information that is privileged or confidential. If
you are not the intended recipient, please delete the e-mail and any
attachments and notify us immediately.


On Wed, Jan 29, 2014 at 7:06 PM, Alon Bar-Lev alo...@redhat.com wrote:



 - Original Message -
  From: Steve Dainard sdain...@miovision.com
  To: users users@ovirt.org
  Sent: Thursday, January 30, 2014 1:59:08 AM
  Subject: [Users] engine-backup restore how to
 
  There doesn't seem to be any solid documentation on how to use the
  engine-backup restore function, and I'm not able to restore a backup.
 
  The best I've come up with is:
  1. Install engine on new host
  2. Stop engine
  3. run engine-backup --mode=restore --file=filename --log=logfile
 
  Fail.
 
  Log shows:
  psql: FATAL: password authentication failed for user engine
  2014-01-29 18:20:30 10285: FATAL: Can't connect to the database
 
  4. engine-backup --mode=restore --file=engine.bak --log=logfile
  --change-db-credentials --db-host=localhost --db-user=engine
  --db-name=engine --db-password='newpassword'
 
  Fails with same error.

 the --db-password must match the user's actual password within database,
 --change-db-credentials does not change the password in database but the
 host/port/user/password that are used by enigne.

 
  5. change user to postgres, drop the old db, create a new db named
 engine,
  set password for engine user same as 'newpassword'
 
  6. engine-backup --mode=restore --file=engine.bak --log=logfile
  --change-db-credentials --db-host=localhost --db-user=engine
  --db-name=engine --db-password='newpassword'

 Ok, this is correct now.

  Restoring...
  Rewriting /etc/ovirt-engine/engine.conf.d/10-setup-database.conf
  Note: you might need to manually fix:
  - iptables/firewalld configuration
  - autostart of ovirt-engine service
  You can now start the engine service and then restart httpd
  Done.
 
  7. start ovirt-engine, restart httpd, browse to web ui
 
  Blank page, no content.
 
  8. stop firewall, browse to web ui
 
  Blank page, no content
 
  9. Engine log contains:
 
  2014-01-29 18:35:56,973 INFO [org.ovirt.engine.core.utils.LocalConfig]
 (MSC
  service
  thread 1-40) Value of property SENSITIVE_KEYS is
  ,ENGINE_DB_PASSWORD,ENGINE_PKI_TR
  UST_STORE_PASSWORD,ENGINE_PKI_ENGINE_STORE_PASSWORD.
  2014-01-29 18:35:57,330 ERROR [org.ovirt.engine.core.bll.Backend] (MSC
  service thread
  1-25) Error in getting DB connection. The database is inaccessible.
 Original
  exception is: BadSqlGrammarException: CallableStatementCallback; bad SQL
  grammar [{call checkdbconnection()}]; nested exception is
  org.postgresql.util.PSQLException: ERROR: function checkdbconnection()
 does
  not exist
  Hint: No function matches the given name and argument types. You might
 need
  to add explicit type casts.
  Position: 15
  2014-01-29 18:35:58,336 ERROR [org.ovirt.engine.core.bll.Backend] (MSC
  service thread 1-25) Error in getting DB connection. The database is
  inaccessible. Original exception is: UncategorizedSQLException:
  CallableStatementCallback; uncategorized SQLException for SQL [{call
  checkdbconnection()}]; SQL state [25P02]; error code [0]; ERROR: current
  transaction is aborted, commands ignored until end of transaction block;
  nested exception is org.postgresql.util.PSQLException: ERROR: current
  transaction is aborted, commands ignored until end of transaction 

Re: [Users] engine-backup restore how to

2014-01-29 Thread Yedidyah Bar David
 From: Steve Dainard sdain...@miovision.com
 To: Alon Bar-Lev alo...@redhat.com
 Cc: users users@ovirt.org, Yedidyah Bar David d...@redhat.com, Eli
 Mesika emes...@redhat.com
 Sent: Thursday, January 30, 2014 7:44:01 AM
 Subject: Re: [Users] engine-backup restore how to

 I also see this error in engine.log which repeats every second if I am trying
 to access the web ui.

 2014-01-29 18:59:47,531 ERROR [org.ovirt.engine.core.bll.Backend]
 (ajp--127.0.0.1-8702-4) Error in getting DB connection. The database is
 inaccessible. Original exception is: UncategorizedSQLException:
 CallableStatementCallback; uncategorized SQLException for SQL [{call
 checkdbconnection()}]; SQL state [25P02]; error code [0]; ERROR: current
 transaction is aborted, commands ignored until end of transaction block;
 nested exception is org.postgresql.util.PSQLException: ERROR: current
 transaction is aborted, commands ignored until end of transaction block

 It looks like the db inserted correctly, I took a quick look through some
 tables and can see the valid admin user, and snapshots. But I can't say for
 certain.

 The IP address of the new server does not match the IP of the old (backup
 file) server, would this have any impact? I would think not as its a local
 db.

 When I changed the password for the psql engine user, is there any config
 file this is referenced in that may not have been updated?

In principle, the only needed file is 
/etc/ovirt-engine/engine.conf.d/10-setup-database.conf 
which was updated by restore. Can you please verify that you can connect to the 
database 
using the credentials in this file? What are its permissions/owner? 

Thanks, 
-- 
Didi 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users