Re: [Users] Use Host

2014-02-21 Thread Darrell Budic
The host you choose is used to make the initial mount, test the volume, and 
create some initial directory structure and files for the backend. Doesn’t much 
matter in my experience, but can be useful if you want to avoid doing the work 
on a busy host.

Once it’s been added to the cluster, it gets mounted on all your host nodes and 
doesn’t matter anymore.

  -Darrell

On Feb 21, 2014, at 10:16 AM, Maurice James midnightst...@msn.com wrote:

 Can someone explain what the purpose of Use Host is when creating a new 
 storage domain?
 
 If the NFS host is remote and not part of the cluster, why do I have to 
 choose a single cluster host to attach it to?
 Does this mean that if that particular host goes offline (turned off), the 
 other cluster members will no longer be able to access that NFS storage?
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users



Re: [Users] oVirt 3.5 planning

2014-02-25 Thread Darrell Budic

On Feb 25, 2014, at 1:38 PM, Itamar Heim ih...@redhat.com wrote:

 On 02/25/2014 09:20 PM, Markus Stockhausen wrote:
 3) Shared console access for two or more administrators
 
 https://bugzilla.redhat.com/show_bug.cgi?id=1060854
 
 michal - is this just a config in the .vv file? I'd expect qemu needs to 
 support this as well? especially since the ticketing mechanism is designed to 
 disconnect previous user?

+1. Would like to be able to make it available to power users as well.


And another minor cleanup on the admin GUI: it’d be nice if the %memory graphic 
had a mouseover for “% of what” as well, maybe similar for %CPU & %network. 
Thinking mouseovers so as not to take more real estate or clutter the columns 
any more.

  -Darrell


Re: [Users] [ANN] oVirt 3.4.0 Release Candidate is now available

2014-02-28 Thread Darrell Budic
Started testing this on two self-hosted clusters, with mixed results. These 
were updates from 3.4.0 beta 3.

On both, got informed the system was going to reboot in 2 minutes while it was 
still installing yum updates.

On the faster system, the whole update process finished before the 2 minutes 
were up, the VM restarted, and all appears normal.

On the other, slower cluster, the 2 minutes hit while the yum updates were 
still being installed, and the system rebooted. It continued rebooting every 3 
minutes or so, and the engine console web pages were not available because the 
engine didn’t start. It did this at least 3 times before I went ahead and 
reran engine-setup, which completed successfully. The system stopped restarting 
and the web interface was available again. A quick perusal of system logs and 
engine-setup logs didn’t reveal what requested the reboot.

That was rather impolite of something to do that without warning :) At least it 
was recoverable. Scheduling the reboot while the yum updates were still running 
seems like a poor idea as well.

  -Darrell

On Feb 28, 2014, at 10:11 AM, Sandro Bonazzola sbona...@redhat.com wrote:

 The oVirt team is pleased to announce that the 3.4.0 Release Candidate is now 
 available for testing.
 
 Release notes and information on the changes for this update are still being 
 worked on and will be available soon on the wiki[1].
 Please be sure to follow the install instructions from the release notes if 
 you're going to test it.
 The existing repository ovirt-3.4.0-prerelease has been updated for 
 delivering this release candidate and future refreshes until final release.
 
 An oVirt Node iso is already available, unchanged from third beta.
 
 You're welcome to join us in testing this release candidate at next week's 
 test day [2], scheduled for 2014-03-06!
 
 
 [1] http://www.ovirt.org/OVirt_3.4.0_release_notes
 [2] http://www.ovirt.org/OVirt_3.4_Test_Day
 
 -- 
 Sandro Bonazzola
 Better technology. Faster innovation. Powered by community collaboration.
 See how it works at redhat.com


Re: [Users] hosted-engine rebooting in the middle of setup (was: [vdsm] [ANN] oVirt 3.4.0 Release Candidate is now available)

2014-03-04 Thread Darrell Budic
Whups, yes, that was it:

MainThread::INFO::2014-02-28 17:23:03,546::hosted_engine::1311::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_stop_engine_vm) Shutting down vm using `/usr/sbin/hosted-engine --vm-shutdown`
MainThread::INFO::2014-02-28 17:23:04,500::hosted_engine::1315::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_stop_engine_vm) stdout: Machine shut down

which also explains why I didn’t see anything in the engine logs: it was the 
self-hosted HA agent triggering the reboot when the engine shut down for the 
upgrade. And I do remember the note about putting it into global maintenance 
before upgrading. Now ;)
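
For anyone else hunting for the same thing: a minimal sketch for pulling these
entries out of the HA agent logs under /var/log/ovirt-hosted-engine-ha/. The
field layout is only inferred from the two lines quoted above, and the names
parse_agent_line and engine_vm_shutdowns are made up for illustration.

```python
import re

# Layout inferred from the quoted lines (an assumption, not a documented format):
# thread::LEVEL::timestamp::module::lineno::logger::(function) message
AGENT_LINE = re.compile(
    r"^(?P<thread>[^:]+)::(?P<level>[A-Z]+)::"
    r"(?P<timestamp>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3})::"
    r"(?P<module>[^:]+)::(?P<lineno>\d+)::"
    r"(?P<logger>[^:]+)::\((?P<function>[^)]+)\)\s*(?P<message>.*)$"
)


def parse_agent_line(line):
    """Return the fields of one agent log line as a dict, or None if it does not match."""
    match = AGENT_LINE.match(line.strip())
    return match.groupdict() if match else None


def engine_vm_shutdowns(lines):
    """Keep only the entries logged from _stop_engine_vm, i.e. HA-initiated shutdowns."""
    records = (parse_agent_line(line) for line in lines)
    return [rec for rec in records if rec and rec["function"] == "_stop_engine_vm"]
```

Feeding it the lines of the agent log (each entry on one line, as in the file
itself) should list every time the HA agent shut the engine VM down.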

Don’t know if the engine is aware it’s on an HA setup; if it is, it might be a 
good thing to check for, and maybe have it enable global maintenance itself 
during the upgrade?

Are there any other special procedures to be aware of in a self-hosted setup? I 
haven’t tried updating the VDSM hosts for these yet, for instance. Seems like I 
shouldn’t enable global maintenance there, so the engine switches hosts 
properly?

Thanks.

  -Darrell

On Mar 2, 2014, at 2:35 PM, Liviu Elama liviu.el...@gmail.com wrote:

 Sounds like your hosts were not in maintenance mode while you were upgrading 
 the engine which explains the 2 min reboot.
 
 This should be revealed by logs 
 
 Regards 
 Liviu
 
 
 On Sun, Mar 2, 2014 at 10:32 PM, Yedidyah Bar David d...@redhat.com wrote:
 - Original Message -
  From: Darrell Budic darrell.bu...@zenfire.com
  To: Sandro Bonazzola sbona...@redhat.com
  Cc: annou...@ovirt.org, engine-devel engine-de...@ovirt.org, arch 
  a...@ovirt.org, Users@ovirt.org, VDSM
  Project Development vdsm-de...@lists.fedorahosted.org
  Sent: Saturday, March 1, 2014 1:56:23 AM
  Subject: Re: [vdsm] [Users] [ANN] oVirt 3.4.0 Release Candidate is now  
available
 
  Started testing this on two self-hosted clusters, with mixed results. There
  were updates from 3.4.0 beta 3.
 
  On both, got informed the system was going to reboot in 2 minutes while it
  was still installing yum updates.
 
  On the faster system, the whole update process finished before the 2 minutes
  were up, the VM restarted, and all appears normal.
 
  On the other, slower cluster, the 2 minutes hit while the yum updates were
  still being installed, and the system rebooted. It continued rebooting every
  3 minutes or so, and the engine console web pages are not available because
  the engine doesn’t start. it did this at least 3 times before I went ahead
  and reran engine-setup, which completed successfully. The system stopped
  restarting and the web interface was available again. A quick perusal of
  system logs and engine-setup logs didn’t reveal what requested the reboot.
 
  That was rather impolite of something to do that without warning :) At least
  it was recoverable. Seems like scheduling the reboot while the yum updates
  were still running seems like a poor idea as well.
 
 Can you please post relevant logs?
 hosts: /var/log/ovirt-hosted-engine-setup/*, 
 /var/log/ovirt-hosted-engine-ha/*,
 /var/log/vdsm/*
 engine: /var/log/ovirt-engine/setup/*, /var/log/ovirt-engine/*
 
 You can of course open a bug on bugzilla and attach there logs if you want.
 
 Thanks, and thanks for the report!
 --
 Didi


Re: [Users] TSC clocksource gets lost after live migration

2014-04-04 Thread Darrell Budic
I see this on some guests as well, possibly relating to moving between hosts 
with the same family of CPU but different absolute CPU speeds?

  -Darrell

On Apr 4, 2014, at 8:33 AM, Michal Skrivanek michal.skriva...@redhat.com 
wrote:

 Hi,
 this is more for the KVM folks I suppose…can you get the qemu process cmdline 
 please?
 
 Thanks,
 michal
 
 On 3 Apr 2014, at 12:13, Markus Stockhausen wrote:
 
 Hello,
 
 we have an up-to-date oVirt 3.4 installation. Inside we are running SLES11 SP3
 VMs (Kernel 3.0.76-0.11). After live migration of these VMs they all of a
 sudden do not react any longer and CPU usage of the VM goes to 100%.
 
 We identified the kvm-clock source to be the culprit and therefore switched to
 another clocksource. We ended up with hpet but are not happy with that, as our
 initial goal was to use the more simply designed TSC clocksource.
 
 The reason behind that is the question I have for you experts.
 
 Our hosts all have the constant_tsc CPU flag available. Just to mention these
 are not identical hosts. We have a mix of Xeon 5500 and 5600 machines. E.G.
 [root@colovn01 ~]# cat /proc/cpuinfo | grep constant_tsc | wc -l
 8
 
 When we start the VM the client sees TSC as an available clocksource:
 
 colvm53:~ # cat /sys/devices/system/clocksource/clocksource0/available_clocksource
 kvm-clock tsc hpet acpi_pm
 
 After the first live migration to another host that also has constant_tsc
 (see above), that flag is lost inside the VM.
 
 colvm53:~ # cat /sys/devices/system/clocksource/clocksource0/available_clocksource
 kvm-clock hpet acpi_pm
 
 Any ideas?
 
 Markus
 
 


Re: [Users] TSC clocksource gets lost after live migration

2014-04-05 Thread Darrell Budic
Whups, should note that I don’t see the CPU utilization issues, just the clock 
source errors. This is on a 3.3.4 install with centos 6.4 guests.
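
If anyone wants to check their own guests, here is a rough sketch of the same
before/after comparison Markus did by hand. The sysfs path is the one from his
mail; the function names are made up for illustration.

```python
from pathlib import Path

# Path taken from the original report; present on Linux guests.
CLOCKSOURCE_DIR = Path("/sys/devices/system/clocksource/clocksource0")


def parse_clocksources(text):
    """Split the whitespace-separated contents of available_clocksource."""
    return text.split()


def read_available(base=CLOCKSOURCE_DIR):
    """Read the clocksources the running kernel currently offers."""
    return parse_clocksources((base / "available_clocksource").read_text())


def lost_clocksources(before, after):
    """Clocksources that were offered before a migration but are gone afterwards."""
    return [name for name in before if name not in after]
```

Call read_available() inside the guest before and after a migration and pass
both results to lost_clocksources(); with the values from the report it comes
back with just tsc.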



Re: [ovirt-users] Network Security / Seperation

2014-04-25 Thread Darrell Budic
Check out the VDSM hooks, the isolatedprivatevlan hook will probably accomplish 
what you want.

  -Darrell

On Apr 24, 2014, at 2:08 AM, squadra squa...@gmail.com wrote:

 Hi Folks,
 
 I am currently looking for a way to isolate each VM's network traffic
 so none can sniff the others' network traffic. Currently I am playing
 around with the neutron integration, which gives me more question
 marks than answers for now (even the documentation seems to be incomplete
 / outdated).
 
 Is there any other solution, which does not require creating a new
 VLAN for each VM, to make sure that no one can sniff the others' traffic?
 
 Cheers,
 
 Juergen
 
 -- 
 Sent from the Delta quadrant using Borg technology!


[ovirt-users] help with sriov vdsm hook?

2014-05-08 Thread Darrell Budic
Anyone gotten the SRIOV hook to work with Solar Flare 10g cards? Any advice?

I've gotten the hooks installed on my host nodes, and added a string to my 
engine. The card is up in SRIOV mode on the host and virtual instances are 
available. I can see the hooks running in the vdsm logs, but can't tell what 
they are doing from there. They are not attaching the sriov card instance to my 
VM though, so I'm at a loss.

As an aside, it'd be really nice to have hooks register with the engine and add 
proper custom strings to the setup. Failing that, some explicit documentation 
on EXACTLY what string should be used for the hook would be nice. My first pass 
at it resulted in an improper string that caused my host to crash/hang and 
reboot when I attempted to start a VM with it the first time.

Speaking of which, I added my custom string to UserDefinedVMProperties, not 
CustomDeviceProperties. Was that correct?

Thanks,

  -Darrell


Re: [ovirt-users] Ovirt-3.4.1: How to: Upgrading Hosted Engine Cluster

2014-05-12 Thread Darrell Budic

On May 10, 2014, at 1:47 AM, Doron Fediuck dfedi...@redhat.com wrote:

 
 
 - Original Message -
 From: Daniel Helgenberger daniel.helgenber...@m-box.de
 To: users@ovirt.org
 Sent: Friday, May 9, 2014 6:45:36 PM
 Subject: [ovirt-users] Ovirt-3.4.1: How to: Upgrading Hosted Engine Cluster
 
 1. Set cluster to global maintenance.
 2. Log in to the oVirt engine and do the upgrade according to the release
    notes.
 3. After the upgrade is finished and the engine is running, set the
    first Node to local maintenance.
 4. Log in to the first node and yum update (with the removal of
    ovirt-release as mentioned in release notes).* I rebooted the
    node because of the kernel update.
 5. Return to oVirt and reinstall the Node from the GUI; it will be set
    to operational automatically.**
 6. Repeat steps 3-5 for the rest of the Nodes.
 7. Remove global maintenance.
 8. Update the last Node.***
 
 
 Hi Martin,
 Thanks for sharing!
 
 A few notes on your nice 8 steps ;)
 
 - There are 2 maintenance modes to cover host maintenance (local) and
  VM maintenance (global). Global maintenance disarms all HA hosts in
  the cluster, so use it with caution as there's no fail over in this
  mode.
 
 - Initially these were available only as a command line[1]. Since 3.4.0,
  this was integrated into the UI, so all you need to do is move a host
  to maintenance in order to achieve local maintenance, and activate it
  to remove the maintenance mode. For global maintenance, right click
  the engine VM, and you will see enable/disable ha-maintenance for
  the global mode.
 
 - No need to re-install nodes. All you need to do is activate it.
 
 - Basically a standard procedure should include:
  * Move host to maintenance, login and update the host, activate the host.
  * Follow the above for all other HA hosts
  * Set the engine VM to ha maintenance (global), login to the vm and
upgrade it, unset vm's ha-maintenance.
 
 Appreciate your feedback.
 Also, were you aware of [1] or did you look for info elsewhere? I'd like
 to know what we can do to improve documentation.

One thing that’s still unclear to me in this setup is what’s needed to ensure 
all the nodes get updates. Just do yum updates? I’ve had mixed success with 
the “reinstall or update” link in the GUI; it seemed to work properly when I 
updated to 3.4.0, but when I tried the 3.4.1 update it failed for no apparent 
reason and I wound up doing yum updates on the host nodes…
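
For reference, the order Doron describes above can be written down as a plan.
This is only a sketch under assumptions: it uses the hosted-engine
--set-maintenance CLI in place of the GUI actions, assumes a plain yum update
is enough on the hosts, and uses engine-setup as a stand-in for "upgrade per
the release notes". The function name and the "engine-vm" label are made up.
It builds the list of steps and runs nothing.

```python
def hosted_engine_upgrade_plan(hosts):
    """Return (where, command) steps in the order described above; nothing is executed."""
    if not hosts:
        raise ValueError("need at least one HA host")
    steps = []
    # Hosts first, one at a time, each under local maintenance.
    for host in hosts:
        steps.append((host, ["hosted-engine", "--set-maintenance", "--mode=local"]))
        steps.append((host, ["yum", "update"]))  # assumption: yum alone is sufficient
        steps.append((host, ["hosted-engine", "--set-maintenance", "--mode=none"]))
    # Then the engine VM, with the HA agents disarmed cluster-wide.
    any_host = hosts[0]
    steps.append((any_host, ["hosted-engine", "--set-maintenance", "--mode=global"]))
    steps.append(("engine-vm", ["engine-setup"]))  # stand-in for the release-notes procedure
    steps.append((any_host, ["hosted-engine", "--set-maintenance", "--mode=none"]))
    return steps
```

Printing the plan for a two-host cluster gives nine steps, with global
maintenance wrapped tightly around the engine upgrade so there is as little
time as possible without failover.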

  -Darrell



Re: [ovirt-users] help with sriov vdsm hook?

2014-05-12 Thread Darrell Budic

On May 12, 2014, at 7:32 AM, Itamar Heim ih...@redhat.com wrote:

 On 05/12/2014 07:57 AM, Dan Kenigsberg wrote:
 On Sun, May 11, 2014 at 05:44:35PM -0400, Itamar Heim wrote:
 On 05/08/2014 02:54 PM, Darrell Budic wrote:
 Anyone gotten the SRIOV hook to work with Solar Flare 10g cards? Any 
 advice?
 
 I've gotten the hooks installed on my host nodes, and added a string to my 
 engine. The card is up in SRIOV mode on the host and virtual instances 
 are available. I can see the hooks running in the vdsm logs, but can't 
 tell what they are doing from there. They are not attaching the sriov card 
 instance to my VM though, so I'm at a loss.
 
 you can compare the generated libvirt xml sent to qemu by vdsm with
 what you expected the hook to produce.
 
 You may also share the bit of vdsm.log from vmCreate until failure.

I haven’t had time to get back to this yet, but I’ll share logs the next time I 
get to try it. May not be for a couple of weeks, about to take a vacation. But 
from memory, it’s not adding anything at all to the xml, so I’m not sure if my 
custom property string is wrong, or if it’s just not working for some other 
reason.


 As an aside, it'd be really nice to have hooks register with the engine and 
 add proper custom strings to the setup. Failing that, some explicit 
 documentation on EXACTLY what string should be used for the hook would be 
 nice. My first pass at it resulted in an improper string that caused my host 
 to crash/hang and reboot when I attempted to start a VM with it the first 
 time.
 
 
 each hook has a README - maybe send a patch to elaborate more on the
 one not clear enough to you?
 
 I think that Darrell means something like
 
 Bug 1086329 - [RFE] semi-automatic hook registration
 
 (or maybe a bit less fancy). It would be cool to add some introspection
 to the hook framework, so that Engine learns about available hooks and
 their custom properties from the nodes.
 
 
 engine learns about deployed hooks (and shows them in the gui) based on vdsm 
 reporting them.
 vdsm could report also the custom properties regex string, but would probably 
 need to version it, since engine would need to pick the 'one' regex from 
 multiple hosts reporting different things for same hook.

Exactly this. I didn’t find any documentation for what the strings should be, 
so I’m guessing a bit based on reading the code. My first try forgot to include 
the “sriov=“ bit, and if you just pass “eth0” to that hook, you’ll crash the 
whole node. Or at least I did once (if there were other vms running on that 
node), and just dropped networking on another occasion before I realized I’d 
passed an incomplete string. Seems like something that would be really nice to 
have auto set so you can’t screw it up, especially if it could crash more than 
just the VM.
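
Until something like that exists, a sanity check before the string ever
reaches a host would have saved me. A minimal sketch: only the "sriov=" prefix
and the eth0 example come from my own attempts; the allowed character set is a
guess on my part, not what the hook actually accepts, and the names are made
up.

```python
import re

# Hypothetical pattern: "sriov=" followed by a plausible NIC name.
# The character set is an assumption, not taken from the hook's code.
SRIOV_PROPERTY = re.compile(r"sriov=[A-Za-z0-9_.-]+")


def is_valid_sriov_property(value):
    """True only if the whole string looks like sriov=<device>."""
    return SRIOV_PROPERTY.fullmatch(value) is not None
```

With this, "sriov=eth0" passes and a bare "eth0" (my first mistake) is
rejected before it can get anywhere near a node.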

I think it’d be fine to not have a property be available if all the host nodes 
in a cluster didn’t agree on what the string/version number should be. Display 
an alert for the node if it’s out of sync or something so it’s easy to track 
down.




[Users] disk image grew after moving to new storage domain?

2012-08-15 Thread Darrell Budic
I had a Thin Provision disk that was about 6GB prior to moving it to a new 
storage domain. Now it's 40GB (the full size of the volume) even though it 
still says it's a Thin Provision allocation. Is this expected or is there any 
way to avoid it? This was an oVirt 3.0 setup.

The move was to a new storage system with a 10g connection. I don't suppose 
there's any way to change the address of an existing storage domain to reflect 
new connectivity, is there?

Thanks!


Darrell


[Users] SPM problems after upgrade to 3.1

2012-08-20 Thread Darrell Budic
.ch1, 
spmStatus Unknown_Pool, storage pool Default
2012-08-20 10:06:22,170 INFO  [org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand] (QuartzScheduler_Worker-100) [f98d712] START, ConnectStoragePoolVDSCommand(vdsId = d38856e2-c2f1-11e1-9952-00188bf945ff, storagePoolId = de911214-832b-11e1-ab21-00188bf945ff, vds_spm_id = 3, masterDomainId = e6ba97ae-7ccc-42ed-8739-f05b7a90d82c, masterVersion = 178), log id: 51638c25
2012-08-20 10:06:22,737 INFO  [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (QuartzScheduler_Worker-100) [f98d712] Command org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand return value 
 Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusOnlyReturnForXmlRpc
mStatus   Class Name: org.ovirt.engine.core.vdsbroker.vdsbroker.StatusForXmlRpc
mCode 324
mMessage  Wrong Master domain or its version: 'SD=e6ba97ae-7ccc-42ed-8739-f05b7a90d82c, pool=de911214-832b-11e1-ab21-00188bf945ff'


2012-08-20 10:06:22,739 INFO  [org.ovirt.engine.core.vdsbroker.vdsbroker.BrokerCommandBase] (QuartzScheduler_Worker-100) [f98d712] Vds: virt2.ch1
2012-08-20 10:06:22,740 ERROR [org.ovirt.engine.core.vdsbroker.VDSCommandBase] (QuartzScheduler_Worker-100) [f98d712] Command ConnectStoragePoolVDS execution failed. Exception: IRSNoMasterDomainException: IRSGenericException: IRSErrorException: IRSNoMasterDomainException: Wrong Master domain or its version: 'SD=e6ba97ae-7ccc-42ed-8739-f05b7a90d82c, pool=de911214-832b-11e1-ab21-00188bf945ff'
2012-08-20 10:06:22,741 INFO  [org.ovirt.engine.core.vdsbroker.vdsbroker.ConnectStoragePoolVDSCommand] (QuartzScheduler_Worker-100) [f98d712] FINISH, ConnectStoragePoolVDSCommand, log id: 51638c25




Darrell Budic
Bigwells Technology LLC
office: 312.529.7816
cell: 608.239.4628





Re: [Users] HA: Re: HA: Re: HP Integrated Lights Out 3

2012-09-13 Thread Darrell Budic
That fix worked for me (ipmilan wise, anyway. Still no go on ilo, but we knew 
that, right?). Thanks Itamar!

Dmitriy, make sure you do this to all your host nodes, since it may run the 
test from any of them. You'll also want to be sure you delete 
/usr/share/vdsm/BindingXMLRPC.pyc and .pyo, otherwise the compiled Python is 
likely to still get used. Finally, I did need to restart vdsmd on all my nodes 
("service vdsmd restart" on my CentOS 6.3 system). Glad to know you can do that 
without causing problems for running VMs.
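
The cleanup step is easy to forget on one node, so here is a small sketch of
it. It assumes the compiled files sit next to the source, as they do for
/usr/share/vdsm/BindingXMLRPC.py here; the function name is made up. vdsmd
still needs restarting afterwards.

```python
from pathlib import Path


def remove_stale_bytecode(source):
    """Delete the .pyc/.pyo files sitting next to a hand-patched .py file.

    Returns the paths that were removed; the source file itself is untouched.
    """
    source = Path(source)
    removed = []
    for suffix in (".pyc", ".pyo"):
        compiled = source.with_suffix(suffix)
        if compiled.exists():
            compiled.unlink()
            removed.append(compiled)
    return removed
```

Run as root on each host with the patched file's path, for example
remove_stale_bytecode("/usr/share/vdsm/BindingXMLRPC.py").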

I did notice that the oVirt management GUI still shows 3 Alerts in the alert 
area, and they are all "Power Management test failed" errors dated from the 
first time their particular node was added to the cluster. This is even after 
restarting vdsmd again and seeing "Host xxx power management was verified 
successfully." in the event log.

  -Darrell

On Sep 13, 2012, at 5:45 PM, Itamar Heim wrote:

 On 09/13/2012 08:42 AM, Dmitriy A Pyryakov wrote:
 Itamar Heim ih...@redhat.com wrote on 13.09.2012 11:09:24:
 
  From: Itamar Heim ih...@redhat.com
  To: Dmitriy A Pyryakov dpyrya...@ekb.beeline.ru
  Cc: Darrell Budic darrell.bu...@bigwells.net, users@ovirt.org
  Date: 13.09.2012 11:09
  Subject: Re: [Users] HA: Re: HP Integrated Lights Out 3
 
  On 09/13/2012 06:00 AM, Dmitriy A Pyryakov wrote:
   Darrell Budic darrell.bu...@bigwells.net wrote on 13.09.2012 07:43:44:
  
From: Darrell Budic darrell.bu...@bigwells.net
To: Dmitriy A Pyryakov dpyrya...@ekb.beeline.ru
Cc: Eli Mesika emes...@redhat.com, users@ovirt.org
Date: 13.09.2012 07:43
Subject: Re: [Users] HP Integrated Lights Out 3
   
I have this problem too. I actually tracked it down to the engine
not passing the arguments to the fence scripts but then got
distracted and never followed up with a report. In my case, the
database was correct, and if I ran the fence script by hand, it would
work, but vdsm wasn't running it with the options or even all
the arguments (username/pw, etc). I've tried it with ilo3 and
ipmilan both, same issue.
   
If you'd like, I can recreate some of my debugging. I'd gotten so
far as to hack some print statements into the fence scripts to
demonstrate what was happening. Lost that with some rebuilds, but
easy enough to recreate...
   
-Darrell
  
   Hello, Darrell.
  
   It would be great!
 
  may i suggest you first try to apply this patch to vdsm (just edit the
  relevant line in vdsm)
 
  commit 59934118e3a30c57539d2b71016532bdd9c4ab17
  Author: Roy Golan rgo...@redhat.com
  Date: Thu Aug 9 16:34:10 2012 +0300
 
  fenceNode API is missing the options argument
 
  Change-Id: Ib2ce9b0f71040f9198413fa06c5d8768994842ec
  Signed-off-by: Roy Golan rgo...@redhat.com
  Reviewed-on: http://gerrit.ovirt.org/7058
  Reviewed-by: Dan Kenigsberg dan...@redhat.com
  Reviewed-by: Omer Frenkel ofren...@redhat.com
  Tested-by: Omer Frenkel ofren...@redhat.com
 
  diff --git a/vdsm/BindingXMLRPC.py b/vdsm/BindingXMLRPC.py
  index cc5300f..8b548e4 100644
  --- a/vdsm/BindingXMLRPC.py
  +++ b/vdsm/BindingXMLRPC.py
  @@ -357,7 +357,7 @@ class BindingXMLRPC(object):
                     secure=False, options=''):
           api = API.Global()
           return api.fenceNode(addr, port, agent, username, password,
  -                action, secure)
  +                action, secure, options)
   
       def setLogLevel(self, level):
           api = API.Global()
 
 
 Here is the relevant part of the old /usr/share/vdsm/BindingXMLRPC.py file
 from the proxy host:
 
     def fenceNode(self, addr, port, agent, username, password, action,
                   secure=False, options=''):
         api = API.Global(self.cif)
         return api.fenceNode(addr, port, agent, username, password,
                              action, secure)
 
 which I replaced with:
 
     def fenceNode(self, addr, port, agent, username, password, action,
                   secure=False, options=''):
         api = API.Global(self.cif)
         return api.fenceNode(addr, port, agent, username, password,
                              action, secure, options)
 
 I restarted ovirt-engine and still see no options present in vdsm.log.
 The test still fails.
 
 this is a vdsm change, not an ovirt-engine (restart vdsm?)
 is this ovirt node or plain fedora/el6?

Darrell Budic
Bigwells Technology LLC
office: 312.529.7816
cell: 608.239.4628





Re: [Users] HA: Re: HP Integrated Lights Out 3

2012-09-14 Thread Darrell Budic
I had tried that and it hadn't changed, but looking at it this morning, it 
cleared those. Maybe it only checked on login? Anyway, now it's showing me a 
really old alert from a month or two ago about "No other host in cluster to 
test power management from" (or whatever the real verbiage is for that). Kind 
of strange since they've all got power management now :)

  -Darrell

On Sep 14, 2012, at 2:46 AM, Itamar Heim wrote:

 On 09/14/2012 02:32 AM, Darrell Budic wrote:
 That fix worked for me (ipmilan wise, anyway. Still no go on ilo, but we
 knew that, right?). Thanks Itamar!
 
 Dmitriy, make sure you do this to all your host nodes, it may run the
 test from any of them. You'll also want to be sure you delete
 /usr/share/vdsm/BindingXMLRPC.pyc and .pyo, otherwise the compiled
 python is likely to still get used. Finally, I did need to restart vdsmd
 on all my nodes, service vdsmd restart on my Centos 6.3 system. Glad
 to know you can do that without causing problems for running vms.
 
 I did notice that the ovirt management GUI still shows 3 Alerts in the
 alert area, and they are all Power Management test failed errors dated
 from the first time their particular node was added to the cluster. This
 is even after restarting a vdsmd again and seeing Host xxx power
 management was verified successfully. in the event log.
 
 because the engine doesn't go and run 'test power management' all the time...
 click edit host, power management tab, click 'test'.
 

Darrell Budic
Bigwells Technology LLC



Re: [ovirt-users] Glusterfs HA with Ovirt

2014-07-03 Thread Darrell Budic
You need to set up a virtual IP to use as the mount point; most people use 
keepalived to provide a virtual IP via VRRP for this. Set up something like 
10.10.10.10 and use that for your mounts.
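
A rough sketch of what that looks like, written as a small generator since
each storage node gets the same stanza with a different priority. The
directives are standard keepalived.conf ones; the instance name gluster_vip,
the eth0 interface, router id 51 and the priorities are placeholders I picked,
not values from your setup.

```python
def keepalived_vrrp_stanza(vip, interface, priority, router_id=51, state="BACKUP"):
    """Render one vrrp_instance block for keepalived.conf on a single storage node."""
    lines = [
        "vrrp_instance gluster_vip {",
        f"    state {state}",
        f"    interface {interface}",
        f"    virtual_router_id {router_id}",  # must match on every node sharing the VIP
        f"    priority {priority}",            # highest priority currently holds the VIP
        "    advert_int 1",
        "    virtual_ipaddress {",
        f"        {vip}",
        "    }",
        "}",
    ]
    return "\n".join(lines) + "\n"
```

For example, keepalived_vrrp_stanza("10.10.10.10", "eth0", 100) on one node
and the same call with priority 90 on the next; whichever node is up with the
highest priority answers on 10.10.10.10.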

Right now, if 10.10.10.2 goes away, all your gluster mounts go away and your 
VMs get paused because the hypervisors can’t access the storage. Your gluster 
storage is still fine, but ovirt can’t talk to it because 10.10.10.2 isn’t 
there.

If the SPM goes down, the other hypervisor hosts will elect a new one (under 
control of the oVirt engine).

Same scenario if storage & compute are on the same server: you still need a 
VIP address for the storage portion to serve as the mount point so it’s not 
dependent on any one server.

-Darrell

On Jul 3, 2014, at 1:14 AM, Punit Dambiwal hypu...@gmail.com wrote:

 Hi,
 
 I have some HA-related concerns about glusterfs with oVirt. Let's say I have 
 4 storage nodes with gluster bricks as below:
 
 1. 10.10.10.1 to 10.10.10.4 with 2 bricks each, in a distributed 
 replicated architecture.
 2. This gluster storage is attached to ovirt-engine with the following mount 
 point: 10.10.10.2/vol1
 3. In my cluster I have 3 hypervisor hosts (10.10.10.5 to 10.10.10.7); the 
 SPM is on 10.10.10.5.
 4. What happens if 10.10.10.2 goes down? Can the hypervisor hosts 
 still access the storage?
 5. What happens if the SPM goes down?
 
 Note: What happens for points 4 & 5 if storage and compute are both running 
 on the same server?
 
 Thanks,
 Punit 


[ovirt-users] video driver selection for VNC consoles

2014-07-08 Thread Darrell Budic
I’d like to use “-vga vga” for my VNC console VMs instead of “-vga cirrus”, any 
easy way to do this?

  -Darrell

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] video driver selection for VNC consoles

2014-07-09 Thread Darrell Budic
Nice, I missed this somewhere along the line.

Unfortunately, it’s not working for me. I created a 10-osinfo.properties in 
/usr/share/ovirt-engine/conf and symlinked it into 
/etc/ovirt-engine/osinfo.conf.d for good measure, restarted jboss-as and 
ovirt-engine, and started up a VM but it still got a -vga cirrus device.

Contents of my 10-osinfo.properties:
os.other.devices.display.protocols.value = vnc/vga,qxl/qxl

I notice I didn’t have an os.other.devices.display.protocols in my existing 
00-defaults.properties. I’m running 3.3.4 on this cluster, is this fully 
supported in that version?
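
To show what I expect the override to do, here is a sketch of how I read the
layering: later files win over earlier ones, and the value is a comma-separated
list of protocol/device pairs. The parsing below is my own simplification, not
the engine's code, and the vnc/cirrus default line is assumed from Omer's
description rather than copied from a real defaults file.

```python
def parse_properties(text):
    """Parse simple key = value lines, skipping blanks and # comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def merge_layers(*layers):
    """Later layers (e.g. 10-osinfo.properties) override earlier ones (00-defaults)."""
    merged = {}
    for layer in layers:
        merged.update(layer)
    return merged


def display_pairs(value):
    """Split 'vnc/vga,qxl/qxl' into [('vnc', 'vga'), ('qxl', 'qxl')]."""
    return [tuple(item.strip().split("/", 1)) for item in value.split(",") if item.strip()]
```

Merging an assumed default of vnc/cirrus,qxl/qxl with my override line should
leave vnc paired with vga, which is the result I am not seeing on 3.3.4.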


On Jul 9, 2014, at 4:42 AM, Omer Frenkel ofren...@redhat.com wrote:

 
 
 - Original Message -
 From: Darrell Budic darrell.bu...@zenfire.com
 To: users users@ovirt.org
 Sent: Wednesday, July 9, 2014 12:30:23 AM
 Subject: [ovirt-users] video driver selection for VNC consoles
 
 I’d like to use “-vga vga” for my VNC console VMs instead of “-vga cirrus”,
 any easy way to do this?
 
  -Darrell
 
 
 very easily, use os-info to change vnc/cirrus to vnc/vga for the OS you like
 you need to create 10-osinfo.properties under share/ovirt-engine/conf/ to 
 override os configurations
 
 (look at osinfo-defaults.properties and in http://www.ovirt.org/OS_info for 
 more info )
 
 



Re: [ovirt-users] video driver selection for VNC consoles

2014-07-14 Thread Darrell Budic

On Jul 14, 2014, at 6:30 AM, Omer Frenkel ofren...@redhat.com wrote:

 
 
 - Original Message -
 From: Darrell Budic darrell.bu...@zenfire.com
 To: Omer Frenkel ofren...@redhat.com
 Cc: users users@ovirt.org, Roy Golan rgo...@redhat.com
 Sent: Wednesday, July 9, 2014 6:20:16 PM
 Subject: Re: [ovirt-users] video driver selection for VNC consoles
 
 Nice, I missed this somewhere along the line.
 
 Unfortunately, it’s not working for me. I created a 10-osinfo.properties in
 /usr/share/ovirt-engine/conf and symlinked it into
 /etc/ovirt-engine/osinfo.conf.d for good measure, restarted jboss-as and
 ovirt-engine, and started up a VM but it still got a -vga cirrus device.
 
 Contents of my 10-osinfo.properties:
 os.other.devices.display.protocols.value = vnc/vga,qxl/qxl
 
 I notice I didn’t have an os.other.devices.display.protocols in my existing
 00-defaults.properties. I’m running 3.3.4 on this cluster, is this fully
 supported in that version?
 
 
 Well, I forgot to mention that this will only work for new VMs, or if you edit 
 the VM and switch the display type (to SPICE and then back to VNC), 
 because this value is saved in the DB the moment the display device is 
 created (add VM or update of display type).
 
 Please note that win8, win8x64, and win2012x64 also override this value.

Yep, discovered that this weekend. Also updated to 3.4.3 and now have a 
default entry for this in the 00-defaults.properties. Now working as expected, 
thanks!

I see there are some checks for these values at VM start; my one win8x64 VM 
with a SPICE console wouldn’t start because of the new default entry. It seems odd 
that it only partially applies at VM start. I’d expect it to be all or nothing 
at that point (i.e., if it only changes at device creation, why does it check at 
other times as well?). Not a huge deal though.


Re: [ovirt-users] 3.4.3 Network problem

2014-07-26 Thread Darrell Budic
 On 21-7-2014 19:34, Dan Kenigsberg wrote:
  On Mon, Jul 21, 2014 at 06:05:45PM +0100, Dan Kenigsberg wrote:
  On Mon, Jul 21, 2014 at 09:03:58AM -0400, Maurice James wrote:
  I submitted a bug report 
  https://bugzilla.redhat.com/show_bug.cgi?id=1121643

  - Original Message -
  From: Joop jvdwege at xs4all.nl
  To: users at ovirt.org
  Sent: Monday, July 21, 2014 8:46:18 AM 
  Subject: Re: [ovirt-users] 3.4.3 Network problem 
 
  On 21-7-2014 14:38, Maurice James wrote: 
 
  I just upgraded to 3.4.3, and now it's complaining that em1 and em2 are down. 
  They are not down; I'm not sure why it thinks the interfaces are down. It's 
  doing this for all 4 of my hosts. 
  
  It is a horrible bug, due to my http://gerrit.ovirt.org/29689. I'll try
  to send a quick fix ASAP.

  Please help me verify that a removal of two lines
  http://gerrit.ovirt.org/#/c/30547/ fixes the issue.
 
 I commented out the indicated 2 lines and could activate my host, and 
 it stayed activated (1h), whereas before this patch it would turn unresponsive
 quite quickly (minutes).
 
 Joop
 

I hit this too, it was complaining about both base interfaces in my bonded 
interface being down. Commenting out the two lines resolved the problem for me 
as well.


[ovirt-users] ovirt-ha notifications flood of ovirt-hosted-engine state transition GlobalMaintenance-GlobalMaintenance messages?

2014-08-07 Thread Darrell Budic
Why do the HA brokers send this message every 15 seconds? It isn’t really a 
state transition, and it’s a little excessive for a reminder that it’s in 
Global Maintenance. This is with CentOS 6.5 hosts.
Is there anything I can do on my side to get it to send just one for the initial 
transition into maintenance mode?

  -Darrell


Re: [ovirt-users] ovirt-ha notifications flood of ovirt-hosted-engine state transition GlobalMaintenance-GlobalMaintenance messages?

2014-08-08 Thread Darrell Budic

On Aug 8, 2014, at 1:22 AM, Jiri Moskovcak jmosk...@redhat.com wrote:

 On 08/07/2014 07:08 PM, Darrell Budic wrote:
 Why do the ha brokers send this message every 15 seconds? It isn’t really a 
 state transition, and it’s a little excessive for a reminder that it’s in 
 Global Maintenance. This is with centos 6.5 hosts.
 Any thing I can do on my side to get it to send just one for the initial 
 transition into maintenance mode?
 
 - Unfortunately, with the current code it's either a message every 15 seconds or 
 never.
 - If you want to silence it, you can edit 
 /etc/ovirt-hosted-engine-ha/agent-log.conf and change
 
 [logger_root]
 level=INFO
 
 to
 
 [logger_root]
 level=ERROR
 
 --Jirka

Thanks Jirka, I may silence it :)
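
For anyone scripting this, Jirka's edit can be sketched with Python's standard configparser. This is only a sketch on an in-memory string: the two-line sample stands in for the real agent-log.conf (which will contain more sections), and set_root_level is a hypothetical helper name.

```python
import configparser
import io


def set_root_level(conf_text, level):
    """Return conf_text with [logger_root] level set to the given value."""
    # RawConfigParser avoids interpolating '%' in logging format strings.
    parser = configparser.RawConfigParser()
    parser.optionxform = str  # preserve the original key case
    parser.read_string(conf_text)
    parser.set("logger_root", "level", level)
    out = io.StringIO()
    parser.write(out, space_around_delimiters=False)
    return out.getvalue()


# Stand-in for the relevant part of agent-log.conf.
sample = "[logger_root]\nlevel=INFO\n"
updated = set_root_level(sample, "ERROR")
print(updated)
```

Note that configparser drops comments when it rewrites a file, so editing the one line by hand is probably the safer route on a real host.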

Every so often, one of my groups decides to send me a bunch of erroneous 
messages. I just figured out that if I start the ovirt-engine VM by hand, the 
ha-agent doesn’t seem to pick up that it’s running, but sends me 
“EngineDown-EngineStart” and “EngineStart-EngineUp” messages every 10 mins or so. 
It doesn’t affect the running engine, but it's more spam :) When I shut the running 
engine down and let the agent start it up, it shuts up about it, so maybe something 
happens during automatic launch that isn’t happening if it’s launched with 
--vm-start?

I’ve had it send me similar messages seemingly randomly as well, no apparent 
cause and the engine vm shows no interruptions in uptime. Then it gets quiet 
again an hour or 3 later. No idea what caused it, just thought I’d mention it 
in context.

 -Darrell


[ovirt-users] can you re-create self hosted storage pool?

2014-08-19 Thread Darrell Budic
Through a series of creative experiments, I've managed to corrupt the lock 
files for my self-hosted engine's storage pool. Is there any way to destroy and 
recreate the pool without reinstalling the self-hosted engine on all systems 
(after destroying it entirely)?

  -Darrell



Re: [ovirt-users] can you re-create self hosted storage pool?

2014-08-22 Thread Darrell Budic
I had managed to split-brain my lock files, and then screwed them up trying to 
heal them manually.

I wound up backing up my hosted engine disk, destroying the hosted engine 
storage volume, recreating the gluster volume and re-running setup to solve the 
lock issue, and copying over my hosted engine backup instead of reinstalling it 
all. Re-added the host nodes, and off it went. :)

I probably wouldn't recommend that as good practice, but it was my dev/testing 
cluster, and it shows it can be done without reinstalling from scratch.

  -Darrell

On Aug 20, 2014, at 6:41 AM, Joop jvdw...@xs4all.nl wrote:

 On 19-8-2014 17:08, Darrell Budic wrote:
 Through a series of creative experiments, I've managed to corrupt the lock 
 files for my self-hosted engine's storage pool. Is there any way to destroy 
 and recreate the pool without reinstalling the self-hosted engine on all 
 systems (after destroying it entirely)?
 
 
 Don't know if my situation reflects yours but I had a problem with
 hosted-engine too. I had a way newer version of kvm/libvirt on host01
 and decided to yum remove it. So far no problem, but it also removed
 hosted-engine, and then I had a problem. In the end I solved it by just
 reinstalling hosted-engine and running engine-deploy on host01,
 answering the question about whether this was a reinstall, and pointing to the
 answer file on host02. Once it was done, everything came up nicely.
 So it seems possible to do this in a round-robin way and
 update/reinstall your hosts.
 
 Joop
 



Re: [ovirt-users] CentOS 7 + oVirt 3.5 + OpenVPN

2014-10-18 Thread Darrell Budic
This looks like an OpenVPN config issue rather than an oVirt issue. 
192.168.124.1 is not in the same network as 192.168.124.200/25; try 
192.168.124.129.
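
A quick way to check this kind of thing is Python's standard ipaddress module. The sketch below only uses the addresses from the log that follows; gateway_reachable is a hypothetical helper name.

```python
import ipaddress

# Address assigned to tun0 in the OpenVPN log.
iface = ipaddress.ip_interface("192.168.124.200/25")
network = iface.network


def gateway_reachable(gateway, net):
    """True if the gateway address lies inside the directly connected network."""
    return ipaddress.ip_address(gateway) in net


print(network)                                         # 192.168.124.128/25
print(gateway_reachable("192.168.124.1", network))     # False
print(gateway_reachable("192.168.124.129", network))   # True
```

The /25 splits 192.168.124.0/24 in half, and .200 sits in the upper half (.128 to .255), so a gateway of .1 is not on-link and the kernel rejects the route.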

 On Oct 18, 2014, at 7:45 AM, Phil Daws ux...@splatnix.net wrote:
 
 Hello:
 
 I have installed oVirt 3.5 VDSM on a CentOS 7 node and then OpenVPN.  The 
 problem I have is that when I start OpenVPN I receive the message:
 
 Oct 18 13:29:50 kvm01 openvpn[4159]: /usr/sbin/ip link set dev tun0 up mtu 1500
 Oct 18 13:29:50 kvm01 openvpn[4159]: /usr/sbin/ip addr add dev tun0 192.168.124.200/25 broadcast 192.168.124.255
 Oct 18 13:29:50 kvm01 openvpn[4159]: /usr/sbin/ip route add 192.168.0.0/16 via 192.168.124.1
 Oct 18 13:29:50 kvm01 openvpn[4159]: ERROR: Linux route add command failed: external program exited with error status: 2
 
 and if I run the route command manually:
 
 [root@kvm01 sysconfig]# /usr/sbin/ip route add 192.168.0.0/16 via 192.168.124.1
 RTNETLINK answers: No such process
 
 It would appear the tunnel is up:
 
 [root@kvm01 sysconfig]# ip add ls
 1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN
     link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
     inet 127.0.0.1/8 scope host lo
        valid_lft forever preferred_lft forever
     inet6 ::1/128 scope host
        valid_lft forever preferred_lft forever
 2: bond0: <BROADCAST,MULTICAST,MASTER> mtu 1500 qdisc noop state DOWN
     link/ether f2:c9:ce:e5:ac:32 brd ff:ff:ff:ff:ff:ff
 3: em1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq master ovirtmgmt state UP qlen 1000
     link/ether c8:1f:66:c4:2c:76 brd ff:ff:ff:ff:ff:ff
     inet6 fe80::ca1f:66ff:fec4:2c76/64 scope link
        valid_lft forever preferred_lft forever
 4: em2: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN qlen 1000
     link/ether c8:1f:66:c4:2c:77 brd ff:ff:ff:ff:ff:ff
 6: ;vdsmdummy;: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN
     link/ether 46:af:6e:9a:1e:4b brd ff:ff:ff:ff:ff:ff
 8: ovirtmgmt: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP
     link/ether c8:1f:66:c4:2c:76 brd ff:ff:ff:ff:ff:ff
     inet XXX.XXX.XXX.XXX/23 brd 88.150.253.255 scope global ovirtmgmt
        valid_lft forever preferred_lft forever
     inet6 fe80::ca1f:66ff:fec4:2c76/64 scope link
        valid_lft forever preferred_lft forever
 10: tun0: <POINTOPOINT,MULTICAST,NOARP,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UNKNOWN qlen 100
     link/none
     inet 192.168.124.200/25 brd 192.168.124.255 scope global tun0
        valid_lft forever preferred_lft forever
 
 Any thoughts as to why the route will not work? The rationale for this approach 
 is that it's a cloud server and I wish to use a private network to reach the 
 VMs installed on that node.
 
 Thanks, Phil
 



Re: [ovirt-users] Guest VM Running 160 RCU Processes

2014-10-21 Thread Darrell Budic
I was poking at this a little to see if there was any tuning that could affect it 
and spotted some oddness with the processor counts on my VMs under oVirt 3.4. 
They seem to think they only have the proper number I set in oVirt (as shown in 
/proc/cpu), but if I look at dmidecode, there are 159 bogus processors listed. 
I’d expect maybe 16 from the -smp 1,maxcpus=16,sockets=16,cores=1,threads=1 
argument to qemu-kvm, but there are 0xa0 of them. Maybe this is a seabios or 
qemu-kvm issue causing all those extras? Anyway, the # of rcu* processes 
matches pretty well, so that’s likely where it’s coming from.
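
To spell out the arithmetic, here is a small Python sketch. The -smp string and the 0xa0 figure are taken from the paragraph above; parse_smp is a hypothetical helper, and reading the first field as the number of configured vCPUs is my interpretation of the qemu argument.

```python
def parse_smp(arg):
    """Split a qemu -smp value such as '1,maxcpus=16,...' into integers."""
    first, *rest = arg.split(",")
    opts = {"cpus": int(first)}  # leading bare number: vCPUs present at boot
    for item in rest:
        key, _, value = item.partition("=")
        opts[key] = int(value)
    return opts


smp = parse_smp("1,maxcpus=16,sockets=16,cores=1,threads=1")

dmi_entries = 0xa0                     # processor entries seen via dmidecode
bogus = dmi_entries - smp["cpus"]      # entries beyond the one real vCPU
expected = smp["maxcpus"]              # what the hot-plug ceiling would suggest

print(dmi_entries, bogus, expected)    # 160 159 16
```

So 0xa0 is 160 entries, one of them real, which lines up with the "160 RCU processes" in the subject line and is ten times the 16 that maxcpus would suggest.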

At least they shouldn’t be causing a performance issue, given their purpose as 
non-blocking work threads, but it is odd to see.

Punit, did you open a BZ I can add these details to?

BTW, this appears to be corrected with some component of oVirt 3.5 (probably 
qemu-rhev?). On VMs started after I upgraded my Engine (even on 3.4 vdsmds), 
I’m only seeing 16 “processors” in the BIOS, and thus only 16 of the various 
rcu* processes. It could have been a general CentOS 6.5 update too, since I did 
those as well, so I can’t get any finer resolution on that issue (both engine 
and host nodes, lots of CentOS 7 VMs).

If there is an RFE for this, perhaps a configurable max # of CPU sockets for hot 
add could be added, or it could be limited to the max physical CPU count of the 
biggest host in the cluster?

  -Darrell

 On Oct 20, 2014, at 7:34 AM, Doron Fediuck dfedi...@redhat.com wrote:
 
 
 
 - Original Message -
 From: Punit Dambiwal hypu...@gmail.com
 To: users@ovirt.org, Dan Kenigsberg dan...@redhat.com, Itamar Heim 
 ih...@redhat.com, aha...@redhat.com
 Sent: Monday, October 20, 2014 5:58:20 AM
 Subject: Re: [ovirt-users] Guest VM Running 160 RCU Processes
 
 Hi,
 
 Can anybody suggest a good way to handle this?
 
 On Fri, Oct 17, 2014 at 3:15 PM, Punit Dambiwal  hypu...@gmail.com  wrote:
 
 
 
 Hi,
 
 I have one oVirt cluster, and all the guest machines in it (such
 as CentOS, Ubuntu, Debian, etc.) have almost 160 RCU processes running.
 
 I searched on Google about RCU (it's the kernel's read-copy-update mechanism):
 
 http://lwn.net/Articles/518953/
 
 I want to know how I can reduce these 160 processes to 10-20, or how I can
 disable them. Is there any bad impact if I disable them?
 
 Thanks,
 Punit
 
 
 
 Hi Punit,
 We need to do a bit of research on this one.
 In order to make sure we keep track of it, do you mind opening an RFE (BZ)
 with all the relevant details, including the hardware that you're using,
 guest config, how busy the guests/hosts are, which hypervisor is used, and
 the versions?
 
 Thanks,
 Doron



Re: [ovirt-users] Guest VM Running 160 RCU Processes

2014-10-22 Thread Darrell Budic

 On Oct 22, 2014, at 6:43 AM, Michal Skrivanek michal.skriva...@redhat.com 
 wrote:
 
 On Oct 21, 2014, at 18:13 , Darrell Budic bu...@onholyground.com wrote:
 
 Was poking at this a little to see if there was any tuning that could affect 
 it and spotted some oddness with the processor counts on my VMs under Ovirt 
 3.4. They seem to think they only have the proper number I set in ovirt (as 
 shown in /proc/cpu), but if I look at dmidecode, there’s 159 bogus 
 processors listed. I’d expect maybe 16 from the -smp 
 1,maxcpus=16,sockets=16,cores=1,threads=1 argument to qemu-kvm, but there 
 are 0xa0 of them. Maybe this is a seabios or qemu-kvm issue causing all 
 those extras? Anyway, the # of rcu* processes matches pretty well, so that’s 
 likely where it’s coming from.
 
 At least they shouldn’t be causing a performance issue, given their purpose 
 as non-blocking work threads, but it is odd to see.
 
 Punit, did you open a BZ I can add these details to?
 
 BTW, this appears to be corrected with some component of Ovirt 3.5 (probably 
 qemu-rhev?). On VMs started after I upgraded my Engine (even on 3.4 vdsmds), 
 I’m only seeing 16 “processors” in the bios, and thus only 16 of the various 
 rcu* processes. Could have been a general Centos 6.5 update too, since I did 
 those as well, so I can’t get any finer resolution on that issue (both 
 engine and host nodes, lots of Centos 7 VMs).
 
 If there is a RFE for this, perhaps a configurable max # of CPU sockets for 
 hot add could be added, or it could be limited to the max physical cpu count 
 of the biggest host in the cluster?
 
 The max is 16. Roy, what can we change/not change after your latest changes?
 I suppose it's related to the maximum values we send because of hotplug 
 support.
 
 Thanks,
 michal

Yes, it definitely looks like a side effect of hot plug. The 160 rcu issues look 
like a bug somewhere else; 16 is more normal for the default. I had missed the 
engine setting for MaxNumOfVmCpus. I’ll probably set that for my smaller hosts 
just to keep it reasonable for the hardware it’s on.

 From: Punit Dambiwal hypu...@gmail.com
 To: Doron Fediuck dfedi...@redhat.com
 Sent: Wednesday, October 22, 2014 4:19:56 AM
 Subject: Re: [ovirt-users] Guest VM Running 160 RCU Processes
 
 Hi Darrell,
 
 I haven't opened a BZ for this. Please let me know how I can open one.

Punit, make sure your replies go to the list; this didn’t get to me. But you 
want to go to http://www.ovirt.org/Community and look at the “Find and File 
Bugs” section to raise a bug or file an RFE.

  -Darrell


[ovirt-users] noVNC problems after upgrading to 3.5.0

2014-11-13 Thread Darrell Budic
I had noVNC working under 3.4, but can’t seem to get it back up after updating 
to 3.5.0. VNC is working if I make direct connections, but it looks like the 
web socket proxy never tries to connect to the host server. noVNC is just 
reporting the generic 1006 error. Firefox reports it already has the right 
ca.crt installed, so it’s not that. From watching the network, it looks like it 
never gets authenticated properly to the web proxy, and never tries to connect 
on from there.

Any way to get some debugging info for the web socket proxy? Not locating any 
in the usual log files when I try this…

Anyone else seeing a similar problem?

  -Darrell


Re: [ovirt-users] noVNC problems after upgrading to 3.5.0

2014-11-14 Thread Darrell Budic
That was it, thanks!

  -Darrell

 On Nov 14, 2014, at 7:09 AM, Finstrle, Ludek lfinst...@netsuite.com wrote:
 
 
 Hi Darrel,
 
 have you accepted the certificate for the web socket proxy?
 ( usually https://your engine:6100/ )
 
 Cheers,
 
 Luf
 
 Darrell Budic wrote on Thu, 13 Nov 2014 at 14:25 -0600:
  I had noVNC working under 3.4, but can’t seem to get it back up after 
  updating to 3.5.0. VNC is working if I make direct connections, but it 
  looks like the web socket proxy never tries to connect to the host server. 
  noVNC is just reporting the generic 1006 error. Firefox reports it already 
  has the right ca.crt installed, so it’s not that. From watching the 
  network, it looks like it never gets authenticated properly to the web 
  proxy, and never tries to connect on from there.
  
  Any way to get some debugging info for the web socket proxy? Not locating 
  any in the usual log files when I try this…
  
  Anyone else seeing a similar problem?
  
  -Darrell
 
 
 NOTICE: This email and any attachments may contain confidential and 
 proprietary information of NetSuite Inc. and is for the sole use of the 
 intended recipient for the stated purpose. Any improper use or distribution 
 is prohibited. If you are not the intended recipient, please notify the 
 sender; do not review, copy or distribute; and promptly delete or destroy all 
 transmitted information. Please note that all communications and information 
 transmitted through this email system may be monitored and retained by 
 NetSuite or its agents and that all incoming email is automatically scanned 
 by a third party spam and filtering service which may result in deletion of a 
 legitimate e-mail before it is read by the intended recipient.



Re: [ovirt-users] Multiple NICs on hosted engine?

2014-11-14 Thread Darrell Budic
It’s probably unsupported, but I have found you can edit the 
/etc/ovirt-hosted-engine/vm.conf file to add a nic.

I duplicated the existing “devices={nicModel:pv,macAddr” line and modified the 
macAddr and deviceId entries.

Be sure you modify this file on ALL of your HA engine hosts. Then shut down your 
running engine, let ha-agent restart it (hosted-engine --vm-start works fine 
too) and you’ve got two NICs.

 -Darrell

 On Nov 14, 2014, at 9:11 AM, Chris Adams c...@cmadams.net wrote:
 
 I have installed the first node of a new oVirt 3.5 setup with a hosted
 engine VM.  I have multiple networks: one public-accessible and one
 private (with storage, iDRAC/IPMI, etc.).  I set the engine VM up on the
 public LAN, but now realize that it can't access the power control.  I
 tried to add a second NIC to the engine VM through the web interface,
 but of course that doesn't work (because it isn't really managed there).
 
 How can I add a second NIC to the hosted engine VM?
 
 -- 
 Chris Adams c...@cmadams.net



Re: [ovirt-users] Multiple NICs on hosted engine?

2014-11-14 Thread Darrell Budic
Ah, I did this back under 3.2 or something. Glad to hear it’s supported now! I 
suggest ignoring my hack.

 On Nov 14, 2014, at 11:17 AM, Darrell Budic bu...@onholyground.com wrote:
 
 It’s probably unsupported, but I have found you can edit the 
 /etc/ovirt-hosted-engine/vm.conf file to add a nic.
 
 I duplicated the existing devices={nicModel:pv,macAddr” line and modified 
 the macAddr and deviceId entries.
 
 Be sure you modify this file on ALL of your ha engine hosts. Then shutdown 
 your running engine, let ha-agent restart it (hosted-engine —vm-start works 
 fine too) and you’ve got two nics.
 
 -Darrell
 
 On Nov 14, 2014, at 9:11 AM, Chris Adams c...@cmadams.net wrote:
 
 I have installed the first node of a new oVirt 3.5 setup with a hosted
 engine VM.  I have multiple networks: one public-accessible and one
 private (with storage, iDRAC/IPMI, etc.).  I set the engine VM up on the
 public LAN, but now realize that it can't access the power control.  I
 tried to add a second NIC to the engine VM through the web interface,
 but of course that doesn't work (because it isn't really managed there).
 
 How can I add a second NIC to the hosted engine VM?
 
 -- 
 Chris Adams c...@cmadams.net
 



Re: [ovirt-users] noVNC problems after upgrading to 3.5.0

2014-11-14 Thread Darrell Budic
That’s what I thought too, but Firefox tells me “This certificate is already 
installed as a certificate authority.” It looks like the cert for :6100 
is the proper one, too. Maybe it's a Firefox change with their new signing rules 
(I’ve disabled security.use_mozillapkix_verification due to problems with other 
internal certs)? I tried removing all the old web site certs and still had to 
add the https://engine:6100 cert to get the noVNC console to connect.


 On Nov 14, 2014, at 11:25 AM, Simone Tiraboschi stira...@redhat.com wrote:
 
 
 
 
 
 - Original Message -
 From: Darrell Budic bu...@onholyground.com
 To: Ludek Finstrle lfinst...@netsuite.com
 Cc: users users@ovirt.org
 Sent: Friday, November 14, 2014 5:31:57 PM
 Subject: Re: [ovirt-users] noVNC problems after upgrading to 3.5.0
 
 That was it, thanks!
 
 If you are correctly trusting your oVirt internal CA you don't need to 
 explicitly trust every single certificate signed by that CA. Please double 
 check it.
 
 -Darrell
 
 
 
 
 On Nov 14, 2014, at 7:09 AM, Finstrle, Ludek  lfinst...@netsuite.com 
 wrote:
 
 
 Hi Darrel,
 
 have you accepted the certificate for the web socket proxy?
 ( usually https://your engine:6100/ )
 
 Cheers,
 
 Luf
 
 Darrell Budic wrote on Thu, 13 Nov 2014 at 14:25 -0600:
 I had noVNC working under 3.4, but can’t seem to get it back up after
 updating to 3.5.0. VNC is working if I make direct connections, but it
 looks like the web socket proxy never tries to connect to the host server.
 noVNC is just reporting the generic 1006 error. Firefox reports it
 already has the right ca.crt installed, so it’s not that. From watching
 the network, it looks like it never gets authenticated properly to the web
 proxy, and never tries to connect on from there.
 
 Any way to get some debugging info for the web socket proxy? Not locating
 any in the usual log files when I try this…
 
 Anyone else seeing a similar problem?
 
 -Darrell



Re: [ovirt-users] Multiple NICs on hosted engine?

2014-11-14 Thread Darrell Budic
Glad it worked. Make sure you add it to the vm.conf file on all your HA hosts; 
otherwise you’ll lose the NIC if ha-agent restarts the VM, as opposed to migrating it. 
It wasn’t clear if you’d done that or not.

 On Nov 14, 2014, at 1:17 PM, Chris Adams c...@cmadams.net wrote:
 
 Once upon a time, Simone Tiraboschi stira...@redhat.com said:
 Sorry, I forgot you cannot add that nic on the engine VM from the engine UI.
 Please try what I explained plus Darrel's trick.
 
 It worked.  I added the network in the UI, added it to the host (so it
 got the bridge set up on that interface) in the UI, and then edited the
 vm.conf file on the host.  Migrated back and forth and all appears well.
 
 Thanks.
 -- 
 Chris Adams c...@cmadams.net



Re: [ovirt-users] Status libgfapi support in oVirt

2014-11-19 Thread Darrell Budic
Is there an el7 build of this available too?


 On Nov 19, 2014, at 9:50 AM, Federico Simoncelli fsimo...@redhat.com wrote:
 
 - Original Message -
 From: noc n...@nieuwland.nl
 To: users@ovirt.org
 Sent: Wednesday, November 19, 2014 9:36:28 AM
 Subject: Re: [ovirt-users] Status libgfapi support in oVirt
 
 On 18-11-2014 20:57, Christopher Young wrote:
 
 I'm replying to 'up' this as well as I'm most interested in this. I actually
 thought this was implemented and working too.
 
 On Mon, Nov 17, 2014 at 10:01 AM, Daniel Helgenberger 
 daniel.helgenber...@m-box.de  wrote:
 
 
 Hello Joop,
 
 thanks for raising the issue as it is one of the things I assumed are
 already implemented and working.
 
 Sadly I cannot provide any answer ...
 
 On 17.11.2014 09:39, Joop wrote:
 I have been trying to use libgfapi glusterfs support in oVirt but can't
 get it to work. After talks on IRC it seems I should apply a patch
 ( http://gerrit.ovirt.org/33768 ) to enable libgf BUT I can't get it to
 work. Systems used:
 - hosts CentOS 7 or Fedora 20 (so up-to-date qemu/libvirt/oVirt (3.5))
 - glusterfs-3.6.1
 - vdsm-4.16.0-524.gitbc618a4.el7.x86_64 (snapshot master 14-nov)
 - vdsm-4.16.7-1.gitdb83943.el7.x86_64 (official ovirt-3.5 vdsm, seems
 newer than master snapshot?? )
 
 Just adding the patch to vdsm-4.16.7-1.gitdb83943.el7.x86_64 doesn't
 work, vdsm doesn't start anymore due to an error in virt/vm.py.
 
 Q1: what is the exact status of libgfapi and oVirt?
 Q2: how do I test that patch?
 
 I experimented a little more and found that if I create a VM in oVirt on a
 glusterfs storage domain and start it, it won't use libgfapi, BUT if I use
 virsh on the host where the VM runs and then add a disk the libgfapi way, the
 VM will see the disk and can use it. So the underlying infra is capable of
 using libgfapi, but oVirt isn't using it. That's where the patch comes in, I think,
 but I can't get it to work.
 
 Correct. oVirt up until now didn't use libgfapi because of missing features
 (e.g. live snapshot). It seems that now all those gaps have been fixed and
 we're trying to re-enable libgfapi.
 
 I just mentioned that I uploaded an el6 build here:
 
 https://fsimonce.fedorapeople.org/vdsm-libgfapi/
 
 and sources here (to rebuild on fedora):
 
 https://fsimonce.fedorapeople.org/vdsm-libgfapi/source/
 
 Let me know if most of you are using Fedora and I'll make a build on
 Fedora as well.
 
 Please let me know how it goes. Thanks,
 -- 
 Federico



Re: [Users] What do you want to see in oVirt next?

2013-01-03 Thread Darrell Budic

On Jan 3, 2013, at 10:25 AM, Patrick Hurrelmann wrote:

 On 03.01.2013 17:08, Itamar Heim wrote:
 Hi Everyone,
 
 as we wrap oVirt 3.2, I wanted to check with oVirt users on what they 
 find good/useful in oVirt, and what they would like to see 
 improved/added in coming versions?
 
 Thanks,
Itamar
 
 For me, I'd like to see official rpms for RHEL6/CentOS6. According to
 the traffic on this list quite a lot are using Dreyou's packages.

I'm going to second this strongly! Official support would be very much 
appreciated. Bonus points for supporting a migration from the dreyou packages. 
No offense to dreyou, of course; I'd just rather be better supported by the 
official line on CentOS 6.x.


Better support/integration of windows based SPICE clients would also be much 
appreciated, I have many end users on Windows, and it's been a chore to keep it 
working so far. This includes the client drivers for windows VMs to support the 
SPICE display for multiple displays. More of a client side thing, I know, but a 
desired feature in my environment.

Thanks for the continued progress and support as well!

-
Darrell Budic
Zenfire







Re: [Users] What do you want to see in oVirt next?

2013-01-03 Thread Darrell Budic

On Jan 3, 2013, at 12:06 PM, Itamar Heim wrote:

 On 01/03/2013 07:42 PM, Darrell Budic wrote:
 
 On Jan 3, 2013, at 10:25 AM, Patrick Hurrelmann wrote:
 
 On 03.01.2013 17:08, Itamar Heim wrote:
 Hi Everyone,
 
 as we wrap oVirt 3.2, I wanted to check with oVirt users on what they
 find good/useful in oVirt, and what they would like to see
 improved/added in coming versions?
 
 
 Better support/integration of windows based SPICE clients would also be
 much appreciated, I have many end users on Windows, and it's been a
 chore to keep it working so far. This includes the client drivers for
 windows VMs to support the SPICE display for multiple displays. More of
 a client side thing, I know, but a desired feature in my environment.
 
 these would be guest drivers for spice, rather than client ones, right?

Yes, I should have said guest drivers for SPICE; I was mixing my thoughts on SPICE 
clients with the guest VMs themselves. The best way I can think of to describe it 
would be the oVirt equivalent of rhev-guest-tools-iso-3.0-37.noarch.rpm or 
whatever version they are on now.



Re: [Users] update SpiceX.cab for win client?

2013-06-11 Thread Darrell Budic
Don't have the nightlies running anywhere at the moment, but I'll see about 
upgrading my test cluster and give it a try.

My primary cluster is still at 3.1, any recommendations for that?


On Jun 11, 2013, at 2:57 AM, Itamar Heim ih...@redhat.com wrote:

 On 06/10/2013 09:54 PM, Darrell Budic wrote:
 I must have missed the info on the mime based approach, can you point me at 
 it and I'll give it a try?
 
 I tried just 0.5.6 with the old instructions, no go there, not that I really 
 expected that work.
 
 you'll need an engine later than 3.2.2 (i.e. nightly)
 
 http://www.ovirt.org/Features/Non_plugin_console_invocation
 http://www.ovirt.org/Supporting_Chrome
 
 
 
   -Darrell
 
 On Jun 10, 2013, at 1:03 PM, Itamar Heim ih...@redhat.com wrote:
 
 On 06/07/2013 09:22 PM, Darrell Budic wrote:
 The spice.cab from elmarco that is referred to on 
 http://wiki.ovirt.org/How_to_Connect_to_SPICE_Console_With_Portal doesn't 
 exist any more, and I see that spice-space.org has a virt-viewer 0.5.6 for 
 windows out. Any chance of someone packaging it up in a compatible .cab? 
 Or if there's a resource on how to do it, a pointer would be appreciated 
 and I can tackle it.
 
 Thanks!
 
 
   -Darrell
 
 
 have you tried using it without the .cab with the new mime based launch 
 approach?
 
 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] Multiple Engines, one web Interface?

2013-12-17 Thread Darrell Budic
Now that we’ve got a self hosted engine, has anyone given any thought to 
allowing one engine to remotely control another?

The scenario I’m imagining has one cluster at a DC with 4 nodes, and another 
cluster at a different DC with 3 nodes. Connectivity is normally pretty good, 
but it’s been known to drop for a bit every year. And I wouldn’t want to have 
the engine polling across it all the time as it’s a lower bandwidth link (100M 
vs 10G between nodes, so a bit). But what I would like is to have the one I’m 
sitting next to be able to control the remote one, and monitor its status, 
with each having its own self hosted engine so they are independent and 
migrations, etc, will continue to happen even if the two clusters get 
disconnected. This keeps all the polling local and fast, and avoids engine or 
cluster freak outs if/when the link is down between the two physically diverse 
clusters. 

Cool bonus feature would be to enable a direct transfer between the two 
clusters' storage facilities. Maybe with a “transport VM” button that suspends 
it to disk and moves it before restarting it (assuming you built your networks 
right, of course, complex but presumably possible), but I’d settle for a simple 
way to copy a shutdown VM over without going through an export/import process.

I could see this working with a separate manager engine, or just as an add in 
to the standard engine. Maybe a “cluster w/ local engine” type thing showing in 
the GUI.

Obviously two web interfaces on two separate engines aren’t a terribly hard 
thing to manage, but I just imagined the simple way and figured I’d throw it 
out there and see how hard or easy people thought it might be.

  -Darrell


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] web documentation suggestion

2013-12-17 Thread Darrell Budic
Should read “I was still running 3.2.3” there…

On Dec 17, 2013, at 10:21 PM, Darrell Budic darrell.bu...@zenfire.com wrote:

 Would it be possible to start adding the Ovirt version a feature became 
 available in to the various Features pages?
 
 This one as an example: 
 http://www.ovirt.org/Features/Non_plugin_console_invocation I was going to 
 test this and it took me a few moments to remember it was added in 3.3 and I 
 was still running 3.3 when the listed commands returned errors.

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Creation of preallocated disk with Gluster replication

2014-01-08 Thread Darrell Budic
Grégoire-

I think this is expected behavior. Well, at least the high glusterfsd CPU use 
during disk creation, anyway. I tried creating a 10 G disk on my test 
environment and observed similar high CPU usage by glusterfsd. I did the creation 
on the i5 system; it showed 95%-105% CPU for glusterfsd during creation, with 
the core2 system running ~35-65% glusterfsd utilization during the creation. 
Minor disk wait was observed on both systems, < 10% peak and generally < 5%. I 
imagine my ZFS cached backends helped a lot here. It took about 3 minutes, roughly 
what I’d expect for the i5’s disk system. Network usage was about 45% of the 1G 
link. No errors or messages logged to /var/log/messages.
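
As a rough cross-check of the figures above, the arithmetic can be sketched as follows. It assumes "10 G" means 10 decimal gigabytes and that the whole image crosses the wire once to the replica; protocol overhead is ignored, so this is only a ballpark.

```python
# Rough sanity check: does a 10 G disk written in ~3 minutes match ~45% of a 1G link?
DISK_BYTES = 10 * 10**9        # assumption: "10 G" read as 10 decimal gigabytes
DURATION_S = 3 * 60            # "about 3 minutes" from the message
LINK_BITS_PER_S = 1 * 10**9    # 1 Gbit/s ethernet


def link_utilization(num_bytes: int, seconds: float, link_bps: float) -> float:
    """Return the average fraction of the link used to move num_bytes in seconds."""
    return (num_bytes * 8 / seconds) / link_bps


utilization = link_utilization(DISK_BYTES, DURATION_S, LINK_BITS_PER_S)
print(f"average link utilization: {utilization:.0%}")
```

The result lands in the mid-40% range, which is consistent with the observed network usage and suggests the disks, not the link, were the limit in this test.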

Depending on what your test setup looks like, I’d check my network for packet 
loss or errors first. Then look at my storage setup and test pure throughput on 
the disks to see what you’ve got, maybe see what else is running. Did you use 
an NFS cluster or a PosixFS cluster for this?

My test setup, running a version of the nightly self-hosted setup w/ gluster 
distributed/replicated disks as shared storage, in a NFS cluster:

Core i5 3570K @ 3.4GHz, 16G RAM
Boot disks: 2x 32G SATA SSDs in raid-1
Storage system: 4x500G Seagate RE3s in a ZFS raid-10 w/ 1GB ZIL & ~22G L2ARC 
caching from boot drives
1 1G ethernet
2 VMs running

Core2 Duo E8500 @ 3.16GHz, 8G RAM
Boot disks: 2x 32G SATA SSDs in raid-1
Storage system: 2x1500G WD Green drives in a ZFS Raid w/ 1GB ZIL & ~22G L2ARC 
cache from boot drives
1 1G ethernet

They are connected through a Netgear Prosafe+ workgroup style switch, not much 
going on between them.

  -Darrell

On Jan 8, 2014, at 7:49 AM, gregoire.le...@retenodus.net wrote:

 Hello,
 
 Do you need more informations about this issue ? Do you think this problem is 
 likely to show up in other cases ? I mean, is that an expected behaviour with 
 my environment, or is it unexpected ?
 
 Is there a way to limit the bandwidth usage for creation of pre-allocated 
 disk so that it doesn't impact production ?
 
 Thank you,
 Regards,
 Grégoire
 
 Le 2014-01-02 17:42, Vijay Bellur a écrit :
 Adding gluster-users.
 On 01/02/2014 08:50 PM, gregoire.le...@retenodus.net wrote:
 Hello,
 I have a Gluster volume in distributed/replicated mode. I have 2 hosts.
 When I try to create a VM with a preallocated disk, it uses 100% of the
 available CPU and bandwidth (I have 1 Gigabit network card).
 The result is I can't even create a preallocated disk because the engine
 detects a network failure.
 I get that kind of messages in /var/log/messages :
 
 Jan  2 14:13:54 localhost sanlock[3811]: 2014-01-02 14:13:54+0100 167737
 [3811]: s4 kill 21114 sig 15 count 1
 Jan  2 14:13:54 localhost wdmd[3800]: test failed rem 51 now 167737 ping
 167718 close 167728 renewal 167657 expire 167737 client 3811
 sanlock_ef4978d6-5711-4e01-a0ec-7ffbd9 cdbe5d:1
 
 And that in the Ovirt Gui :
 
 2014-janv.-02, 15:35 Operation Add-Disk failed to complete.
 2014-janv.-02, 15:35 Storage Pool Manager runs on Host HOST2 (Address:
 X.X.X.X).
 2014-janv.-02, 15:35 Invalid status on Data Center GlusterSewan. Setting
 Data Center status to Non Responsive (On host HOST2, Error: done).
 2014-janv.-02, 15:35 State was set to Up for host HOST2.
 2014-janv.-02, 15:33 Used Network resources of host HOST2 [98%] exceeded
 defined threshold [95%].
 2014-janv.-02, 15:33 Add-Disk operation of test_Disk1 was initiated on
 VM test by admin@internal.
 I understand that the creation of a 10 Go disk image generates a lot of
 traffic, but is there a way to limit it so that it doesn't have an
 impact on the production ? Furthermore, Why does it use so much CPU
 ressources ? I can see on my monitoring graph a big peak of CPU usage
 when I launched the operation (probably until 100%).
 Do you happen to notice what is consuming CPU? Since the same cluster
 does both virtualization and storage, a GigE network might get
 saturated very quickly. Is it possible to separate out the management
 and data/gluster traffic in this setup?
 Regards,
 Vijay

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] Creation of preallocated disk with Gluster replication

2014-01-08 Thread Darrell Budic

On Jan 8, 2014, at 11:55 AM, Karli Sjöberg karli.sjob...@slu.se wrote:

 
 
 Skickat från min iPhone
 
 8 jan 2014 kl. 18:47 skrev Darrell Budic darrell.bu...@zenfire.com:
 
 Grégoire-
 
 My test setup, running a version of the nightly self-hosted setup w/ gluster 
 distributed/replicated disks as shared storage, in a NFS cluster:
 
 Core i5 3570K @ 3.4Ghz, 16G Ram
 Boot disks: 2x 32G SATA SSDs in raid-1
 Storage system: 4x500G Seagate RE3s in a ZFS raid-10 w/ 1GB ZIL  ~22G L2ARC 
 caching from boot drives
 1 1G ethernet
 2 VMs running
 
 Core2 Duo E8500 @ 3.16GHz, 8G Ram
 Boot disks: 2x 32G SATA SSDS in raid-1
 Storage system: 2x1500G WD Green drives in a  ZFS Raid w/ 1GB ZIL  ~22G 
 L2ARC cache from boot drives
 1 1G ethernet
 
 Just curious, are you doing ZFS in Linux?
 
 /K

Yes, forgot to mention those are freshly built Centos 6.5 systems with zfs 
0.6.2, and glusterfs-3.4.1-3.el6.x86_64, vdsm-gluster-4.13.2-1.el6.noarch for 
testing/experimenting. Bought some cheap SSDs and just grabbed systems and 
platters I had around for it. Testbedding and getting some experience with the 
self hosted engine, since I’d like to move to it once it’s released. Also 
looking forward to testing native gluster on this setup.

I have a production ovirt cluster with a linux zfs based NFS storage server; 
the backend has been very stable since I got rid of Nexenta and went to linux. 
Sounds odd, I know, but I couldn’t get good support for a community Nexenta 
server I inherited. I was having driver level box lockup issues with 
openSolaris that I couldn’t resolve. So I rebuilt it with linux, imported the 
pool, and haven’t looked back or had a storage failure since.

  -Darrell

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] SSD Caching

2014-01-08 Thread Darrell Budic
Stick your bricks on ZFS and let it do it for you. Works well, although I 
haven’t done much benchmarking of it. My test setup is described in the thread 
under [Users] Creation of preallocated disk with Gluster replication. I’ve seen 
some blog posts here and there about gluster on ZFS for this reason too.

 -Darrell

On Jan 7, 2014, at 9:56 PM, Russell Purinton r...@sonicbx.com wrote:

 [20:42] sonicrose is anybody out there using a good RAM+SSD caching system 
 ahead of gluster storage?
 [20:42] sonicrose sorry if that came through twice
 [20:44] sonicrose im thinking about making the SSD one giant swap file then 
 creating a very large ramdisk in virtual memory and using that as a block 
 level cache for parts and pieces of virtual machine disk images
 [20:44] sonicrose then i think the memory managers would inherently play 
 the role of storage tiering ie: keeping the hottest data in memory and the 
 coldest data on swap
 [20:45] sonicrose everything i have seen today has been setup as 
 consumer === network === SSD cache === real disks
 [20:45] sonicrose but i'd like to actually do consumer === RAM+SSD cache 
 ===  network === real disks
 [20:46] sonicrose i realize doing a virtual memory disk means the cache 
 will be cleared on every reboot, and I'm ok with that
 [20:47] sonicrose i know this can be done with NFS and 
 cachefilesd(fscache), but how could something be integrated into the native 
 gluster clients?
 [20:47] sonicrose i'd prefer not to have to access gluster via NFS
 [20:49] sonicrose any feedback from this room is greatly appreciated, 
 getting someone started to build managed HA cloud hosting

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] VSDM´s logrotate makes Hosts fill up var eventually

2014-01-09 Thread Darrell Budic

On Jan 9, 2014, at 10:33 AM, Dan Kenigsberg dan...@redhat.com wrote:

 The question of how much logging we should keep is a tough one. I, as a
 developer, would like to have as much as possible. For long-running busy
 systems, it has happened to me that the core bug was spotted in
 vdsm.log.67 or so.
 
 However, I understand that verbosity has its price. To understand
 whether we are stable enough to change the defaults, I need volunteers:
 people who are willing to change their log level to INFO or WARNING, and
 see if they miss useful information from their logs.
 
 When you make you log level higher, you can lower the number of kept
 log files, as they would not be filled as quick.
 
 Would you, users@, help me with hard data?

I played with turning the verbosity down and keeping fewer logs, but in the end 
I wound up adjusting the drive layout on my nodes (running CentOS with vdsm 
added, not “pure” ovirt nodes). If a VM crashes and leaves cores, I can run 
into problems, but monitoring alerts me and I can take action.

It does seem a bit much though; maybe an easy switch in the GUI to allow 
setting it to DEBUG when needed, and start off at WARNING for my production 
nodes? I think this feeling is in part because it takes a bit of work for me, an 
experienced power admin but not a developer, to dig stuff out of the logs myself, 
and most of it is not useful to me. So the volume is a bit much. Of course, 
like all logs, they’re invaluable sometimes, and you never know when you’ll 
need them, so…
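
The trade-off discussed here (higher log level, fewer kept files) can be illustrated with a small sketch. This uses only Python's standard logging module and is not vdsm's actual logging configuration; the file name, sizes, message mix, and backup counts are all made-up values for illustration.

```python
import logging
import logging.handlers
import os
import tempfile


def build_logger(path: str, level: int, max_bytes: int, backups: int) -> logging.Logger:
    """Create a size-rotated logger; names and sizes here are illustrative only."""
    logger = logging.getLogger(f"demo.{os.path.basename(path)}")
    logger.setLevel(level)
    logger.propagate = False
    handler = logging.handlers.RotatingFileHandler(
        path, maxBytes=max_bytes, backupCount=backups
    )
    logger.addHandler(handler)
    return logger


def disk_usage(directory: str) -> int:
    """Total bytes used by all files in directory."""
    return sum(
        os.path.getsize(os.path.join(directory, name))
        for name in os.listdir(directory)
    )


def simulate(level: int, backups: int) -> int:
    """Emit the same mix of messages and report how much disk the logs take."""
    with tempfile.TemporaryDirectory() as tmp:
        logger = build_logger(os.path.join(tmp, "vdsm-like.log"), level, 4096, backups)
        for i in range(500):
            logger.debug("periodic poll %d", i)           # chatty, frequent
            if i % 50 == 0:
                logger.warning("something odd at %d", i)  # rare
        # Close handlers before the temporary directory is removed.
        for handler in list(logger.handlers):
            handler.close()
            logger.removeHandler(handler)
        return disk_usage(tmp)


debug_bytes = simulate(logging.DEBUG, backups=10)
warning_bytes = simulate(logging.WARNING, backups=3)
print(f"DEBUG: {debug_bytes} bytes on disk, WARNING: {warning_bytes} bytes on disk")
```

With the same workload, the WARNING run writes only the rare messages, so far fewer rotated files are needed to cover the same time span. What it cannot show is whether the dropped DEBUG lines would have been the ones needed later, which is the hard data being asked for.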

  -Darrell
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Using 10gb vNIC/vbridge into VM is possible?

2014-12-19 Thread Darrell Budic
I tried a quick iperf test a while back and got 3-4Gb/sec between a pair of VMs 
on separate hosts with a 10G infrastructure, no real tuning and no SRIOV. That 
met my needs so I didn’t try anything further. If you were aiming for 10G for 
all, you’d want to work on SRIOV I imagine, but they get pretty good 
performance even without it.


 On Dec 19, 2014, at 4:49 AM, Kalil de A. Carvalho kali...@gmail.com wrote:
 
 Dear Amador.
 
 No, unfortunately not.
 
 This was just a friend question, because he need VM with 10GB NIC's.
 
 Today he is using Xenserver, this solution does not attend his expedition and 
 he is research another solution.
 
 I told him about oVirt/KVM but this is a prerequisite to use.
 
 I will plan with him to try make a project to test a take the resolts.
 
 Best regards. 
 
 On Fri, Dec 19, 2014 at 1:18 AM, Amador Segundo asegu...@redhat.com 
 mailto:asegu...@redhat.com wrote:
 Virtio devices does not support speed, so we fake their speeds showing 
 1000mbps in Admin. Portal. If your boxes have 10gbps devices then your vms 
 are already taking advantage of that. Did you test it? Could you share some 
 results? 
 
 
 -Original Message- 
 From: Kalil de A. Carvalho [kali...@gmail.com mailto:kali...@gmail.com] 
 Received: Thursday, 18 Dec 2014, 22:59 
 To: users@ovirt.org mailto:users@ovirt.org 
 Subject: [ovirt-users] Using 10gb vNIC/vbridge into VM is possible? 
 
 
 Hello all.
 
 Today a follow work ask me if is possible to use vNIC or vbridge in a VM 
 managed by ovirt.
 
 What he wants is to have a virtual 10gb network to some machines.
 
 All hosts NIC's are 10gb.
 
 Is this possible?
 
 If yes how can I do it?
 
 Best regards.
 
 
 
 
 -- 
 Atenciosamente,
 Kalil de A. Carvalho
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-02-05 Thread Darrell Budic
I wound up manually changing the network configs on my hosts during other 
updates, one at a time. Then I migrated off the lower MTU host and repeated. 
There seemed to be a minor extra bit of pause after migration completed, then 
things were fine (and I manually updated MTU settings on the VMs at that time, 
without rebooting/shutdowns). I did have to reboot one VM that went to 100% cpu 
and didn’t recover, but it has done that before and I think it’s a different 
problem with that one in particular, not the process. Then I repeated on the 
other host. Both thought the networks in question were not synchronized, but 
came up fine anyway. Once that was all done, I updated the MTU values via the 
database, and now ovirt thinks everything is happy.

So the manual process Lior describes for 3.4 works, and works for 3.5 if you 
manage it properly. I’m still set with my vdsm forced to ifcfg persistence due 
to troubles with bonded interfaces in 3.5, so this was easy for me.
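
For the manual ifcfg step, a minimal sketch of the text edit involved might look like this. It only rewrites the MTU= line in ifcfg-style text; the sample file content and device name are hypothetical, and it does not touch the engine database or restart any service.

```python
def set_ifcfg_mtu(ifcfg_text: str, mtu: int) -> str:
    """Return ifcfg-style text with its MTU= line set to mtu (added if missing)."""
    lines = ifcfg_text.splitlines()
    new_line = f"MTU={mtu}"
    replaced = False
    for index, line in enumerate(lines):
        if line.strip().startswith("MTU="):
            lines[index] = new_line
            replaced = True
    if not replaced:
        lines.append(new_line)
    return "\n".join(lines) + "\n"


# Hypothetical file content, for illustration only.
sample = "DEVICE=ovirtmgmt\nTYPE=Bridge\nONBOOT=yes\nMTU=1448\n"
print(set_ifcfg_mtu(sample, 1500))
```

The same change has to be applied to every interface in the chain (NIC, bond, bridge) on a host, one host at a time, as described above.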

  -Darrell

 On Jan 28, 2015, at 11:37 AM, Donny Davis do...@cloudspin.me wrote:
 
 So adding a new network was the way you went??
 
 On Jan 28, 2015 9:22 AM, Darrell Budic bu...@onholyground.com wrote:
 
 3.5.1. It’s really a “clean and tidy” thing now that I’ve got the network 
 cleaned up, so not urgent but nice to have. Thanks for the info!
 
 On Jan 28, 2015, at 4:05 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Wed, Jan 28, 2015 at 11:03:14AM +0200, Lior Vernia wrote:
 Hi Darrell!
 
 There's currently no clean way to do this - we'll be looking to fix this
 in 3.6 (https://bugzilla.redhat.com/show_bug.cgi?id=1055454).
 
 You haven't mentioned which version of oVirt you're running - if it's
 3.4 or lower, I think it would suffice to change ifcfg files on your
 hypervisors and restart the network service. The network will then
 appear as out-of-sync in the GUI, but should be fully functional with
 MTU 1500. You'd also want the network configuration to be saved in case
 of future rollbacks - Dan, how would that be done?
 
 up to 3.4, nothing else should be done (assuming the network config was
 already declared safe and survived reboot). ifcfg is the only persistent 
 copy.
 
 
 If you're running 3.5, I think you need to run some vdsm shell commands
 on the hypervisor as we've added an abstraction layer for configuration
 persistence above ifcfg files - again I'll ask Dan to chime in.
 
 In 3.5 we've added a Vdsm-side persistent copy of the network,
 which sits under /var/lib/vdsm/netconf/nets. You should edit the json
 definitions therein in order to ensure proper network startup after
 boot.
 
 
 As for getting the engine network configuration to MTU 1500 (for future
 hypervisor configuration and for networks to not appear as out-of-sync
 on existing ones) without taking down all the VMs (or hot-unplugging
 NICs) at one point - I don't think there's currently a way other than
 hacking the DB... Just leaving the network out-of-sync on the hosts
 could result in inconveniences later on when configuring host networking.
 
 Is it worth the trouble of getting MTU 1500 instead of 1448? I presume
 the difference in performance would be negligible. Or are you
 experiencing incoming frames being dropped due to having 1500 bytes
 instead of 1448?... Either way, as you mentioned taking down the VMs can
 be a last resort, or wait for 3.6 where it should be simpler :)
 
 Yours, Lior.
 
 On 27/01/15 19:43, Darrell Budic wrote:
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up 
 to MTU 1500. I see the GUI won’t let me do that while the network is in 
 use. Any way around this, clean or otherwise? Restarting VMs to update 
 them is ok, just trying to avoid having to take everything down at the 
 same time.
 
 -Darrell

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Unable to run noVNC console un recent browsers

2015-02-19 Thread Darrell Budic
Try reimporting the ca.cert for noVNC by connecting directly to the webproxy 
address at port 6100. Do this by trying to connect to a console and then, once 
the 1006 error shows up, just strip off everything after :6100/ . I've found 
that somewhere in or after 3.5, restarting the webproxy causes it to generate 
its own new ca.cert even though it shouldn't.
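
The "strip off everything after :6100/" step amounts to keeping only the scheme, host and port of the console URL. A small sketch, assuming a made-up console URL (the host name, path and query string below are hypothetical):

```python
from urllib.parse import urlsplit, urlunsplit


def proxy_root(console_url: str) -> str:
    """Keep only scheme, host and port of console_url, ending in '/'."""
    parts = urlsplit(console_url)
    return urlunsplit((parts.scheme, parts.netloc, "/", "", ""))


# Hypothetical console URL; the host name and path are made up for illustration.
example = "https://engine.example.com:6100/some/console/path?token=abc"
print(proxy_root(example))
```

Opening the resulting address in the browser is what prompts for the certificate, after which the console can be retried.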

  -Darrell

 On Feb 19, 2015, at 4:09 PM, Stefano Danzi s.da...@hawai.it wrote:
 
 Hello,
 
 I can't make work noVNC console on recent browsers (Chrome 40, Firefox 35 and 
 IE 11).
 
 The error that I have is already explained here:
 
 https://forge.univention.org/bugzilla/show_bug.cgi?id=33587
 
 I tried to change websocket like suggested 
 (http://errata.univention.de/ucs/3.2/31.html) but this not helped.
 
 Someone know a workaround?

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
Except you can’t change the network MTU setting in the first place on the 
network in the GUI. I’ve thought about doing it in the database, with a 
migration as you mention. Just checking first for better options :)


 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:
 
 Migrate the vms between hosts, and when a host gets to no running VMS you can 
 sync that network. 
 
 Donny D
 cloudspin.me
 
 On Jan 27, 2015 10:43 AM, Darrell Budic bu...@onholyground.com wrote:
 
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up to 
 MTU 1500. I see the GUI won’t let me do that while the network is in use. 
 Any way around this, clean or otherwise? Restarting VMs to update them is 
 ok, just trying to avoid having to take everything down at the same time.
 
 -Darrell

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
Try changing that custom MTU and hitting OK. I get:


on 3.5 and 3.5.1. What version are you running?

 On Jan 27, 2015, at 2:04 PM, Donny Davis do...@cloudspin.me wrote:
 
 Maybe I missed your question, but I can change the MTU from the gui without 
 any problems. As long as you make sure there are not any vm's on the host you 
 are trying to sync, I have experienced no issues. 
 
 Donny
 
 -Original Message-
 From: Darrell Budic [mailto:bu...@onholyground.com] 
 Sent: Tuesday, January 27, 2015 12:30 PM
 To: Donny Davis
 Cc: users@ovirt.org
 Subject: Re: [ovirt-users] change network MTU settings without taking all the 
 VMs down?
 
 Except you can’t change the network MTU setting in the first place on the 
 network in the GUI. I’ve thought about doing it in the database, with a 
 migration as you mention. Just checking first for better options :)
 
 
 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:
 
 Migrate the vms between hosts, and when a host gets to no running VMS you 
 can sync that network. 
 
 Donny D
 cloudspin.me
 
 On Jan 27, 2015 10:43 AM, Darrell Budic bu...@onholyground.com wrote:
 
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up 
 to MTU 1500. I see the GUI won’t let me do that while the network is in 
 use. Any way around this, clean or otherwise? Restarting VMs to update them 
 is ok, just trying to avoid having to take everything down at the same time.
 
 -Darrell
 
 Capture.PNG

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
Ah, good idea. That will be cleaner than the hack I was considering, I’ll see 
if I can work that somehow. Thanks!

 On Jan 27, 2015, at 2:59 PM, Donny Davis do...@cloudspin.me wrote:
 
 I'm on the same.. And I see the issue. Why don't you create a new network 
 with the correct parameters and then move the network the VM is attached to 
 
 On Jan 27, 2015 1:39 PM, Darrell Budic bu...@onholyground.com wrote:
 
 Try changing that custom MTU and hitting OK. I get:
 
 on 3.5 and 3.5.1. What version are you running?
 
 On Jan 27, 2015, at 2:04 PM, Donny Davis do...@cloudspin.me wrote:
 
 Maybe I missed your question, but I can change the MTU from the gui without 
 any problems. As long as you make sure there are not any vm's on the host 
 you are trying to sync, I have experienced no issues. 
 
 Donny
 
 -Original Message-
 From: Darrell Budic [mailto:bu...@onholyground.com] 
 Sent: Tuesday, January 27, 2015 12:30 PM
 To: Donny Davis
 Cc: users@ovirt.org
 Subject: Re: [ovirt-users] change network MTU settings without taking all 
 the VMs down?
 
 Except you can’t change the network MTU setting in the first place on the 
 network in the GUI. I’ve thought about doing it in the database, with a 
 migration as you mention. Just checking first for better options :)
 
 
 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:
 
 Migrate the vms between hosts, and when a host gets to no running VMS you 
 can sync that network. 
 
 Donny D
 cloudspin.me
 
 On Jan 27, 2015 10:43 AM, Darrell Budic bu...@onholyground.com wrote:
 
 
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up 
 to MTU 1500. I see the GUI won’t let me do that while the network is in 
 use. Any way around this, clean or otherwise? Restarting VMs to update 
 them is ok, just trying to avoid having to take everything down at the 
 same time.
 
 -Darrell
 
 
 Capture.PNG
 
 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
I finally got a couple of networks out from behind a wan based layer 2 bridge 
that required me to run at MTU 1448, and would like to get back up to MTU 1500. 
I see the GUI won’t let me do that while the network is in use. Any way around 
this, clean or otherwise? Restarting VMs to update them is ok, just trying to 
avoid having to take everything down at the same time.

 -Darrell
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-28 Thread Darrell Budic
3.5.1. It’s really a “clean and tidy” thing now that I’ve got the network 
cleaned up, so not urgent but nice to have. Thanks for the info!

 On Jan 28, 2015, at 4:05 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Wed, Jan 28, 2015 at 11:03:14AM +0200, Lior Vernia wrote:
 Hi Darrell!
 
 There's currently no clean way to do this - we'll be looking to fix this
 in 3.6 (https://bugzilla.redhat.com/show_bug.cgi?id=1055454).
 
 You haven't mentioned which version of oVirt you're running - if it's
 3.4 or lower, I think it would suffice to change ifcfg files on your
 hypervisors and restart the network service. The network will then
 appear as out-of-sync in the GUI, but should be fully functional with
 MTU 1500. You'd also want the network configuration to be saved in case
 of future rollbacks - Dan, how would that be done?
 
 up to 3.4, nothing else should be done (assuming the network config was
 already declared safe and survived reboot). ifcfg is the only persistent 
 copy.
 
 
 If you're running 3.5, I think you need to run some vdsm shell commands
 on the hypervisor as we've added an abstraction layer for configuration
 persistence above ifcfg files - again I'll ask Dan to chime in.
 
 In 3.5 we've added a Vdsm-side persistent copy of the network,
 which sits under /var/lib/vdsm/netconf/nets. You should edit the json
 definitions therein in order to ensure proper network startup after
 boot.
 
 
 As for getting the engine network configuration to MTU 1500 (for future
 hypervisor configuration and for networks to not appear as out-of-sync
 on existing ones) without taking down all the VMs (or hot-unplugging
 NICs) at one point - I don't think there's currently a way other than
 hacking the DB... Just leaving the network out-of-sync on the hosts
 could result in inconveniences later on when configuring host networking.
 
 Is it worth the trouble of getting MTU 1500 instead of 1448? I presume
 the difference in performance would be negligible. Or are you
 experiencing incoming frames being dropped due to having 1500 bytes
 instead of 1448?... Either way, as you mentioned taking down the VMs can
 be a last resort, or wait for 3.6 where it should be simpler :)
 
 Yours, Lior.
 
 On 27/01/15 19:43, Darrell Budic wrote:
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up 
 to MTU 1500. I see the GUI won’t let me do that while the network is in 
 use. Any way around this, clean or otherwise? Restarting VMs to update them 
 is ok, just trying to avoid having to take everything down at the same time.
 
 -Darrell
 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Status libgfapi support in oVirt

2015-01-09 Thread Darrell Budic
Any chance of an update for the 4.16.10 build from ybrohoni yesterday?

Or has this made it into the nightlies yet?


 On Nov 21, 2014, at 4:25 PM, Federico Simoncelli fsimo...@redhat.com wrote:
 
 I updated the packages (rebasing on a newer master) and I provided an
 el7 build as well:
 
 https://fsimonce.fedorapeople.org/vdsm-libgfapi/
 
 These rpms are less tested than the previous ones but the rebase was
 straight forward.
 
 -- 
 Federico

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] How long do your migrations last?

2015-02-13 Thread Darrell Budic
I’m under the impression it depends more on the VM’s memory assignment than 
disk size. libvirt has to synchronize that over your networking setup. Your 
times sound like mine over 1G ethernet with a 9000 MTU; most of my machines are 
1-4GB ram. I’ve another setup with a 10G backend that can migrate larger 
machines much faster. Things that do a lot of memory access (databases, say) or 
use more of their allocated memory tend to take longer to migrate as it’s more 
work for libvirt to get it synchronized.

A 10G+ backend is the best way to speed this up, and there are libvirt 
variables you can tweak to allocate more bandwidth to a migration (and the # of 
simultaneous migrations you allow). I think the defaults are 3 simultaneous 
migrations at a max of 30% of your available bandwidth. I don’t think this 
takes bonds into account, so if you have bonded connections, you may be able to 
allocate more % or allow more simultaneous migrations. Keep in mind that if 
you’re sharing bandwidth/media with iSCSI, some bandwidth will be needed there 
as well; how much depends on your storage load. A dedicated NIC could 
definitely help, especially if you’re trying to tune libvirt for this.
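
To put rough numbers on this, a back-of-the-envelope estimate of the minimum time to copy a guest's memory once can be sketched as below. The 30% share mirrors the figure recalled above and is an assumption, not a verified default; re-sent dirty pages and protocol overhead are ignored, so real migrations take longer.

```python
def min_migration_seconds(ram_gib: float, link_gbit: float, share: float = 0.30) -> float:
    """Lower bound on time to copy ram_gib of guest memory once over the link.

    share is the fraction of link bandwidth given to one migration; 0.30 is an
    assumption taken from the message, not a verified default.
    """
    if not 0 < share <= 1:
        raise ValueError("share must be in (0, 1]")
    bytes_to_copy = ram_gib * 1024**3
    usable_bytes_per_s = link_gbit * 1e9 / 8 * share
    return bytes_to_copy / usable_bytes_per_s


for ram in (1, 4, 16):
    one_g = min_migration_seconds(ram, link_gbit=1)
    ten_g = min_migration_seconds(ram, link_gbit=10)
    print(f"{ram:>2} GiB RAM: >= {one_g:6.1f} s on 1G, >= {ten_g:6.1f} s on 10G")
```

Under these assumptions a 1 GiB guest needs roughly half a minute on a 1G link and a 4 GiB guest roughly two minutes, which is in the same range as the times reported in the question.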

  -Darrell

 On Feb 13, 2015, at 8:53 AM, Nicolas Ecarnot nico...@ecarnot.net wrote:
 
 Hello list,
 
 Our storage domains are iSCSI on dedicated network, and when migrating VMs, 
 the duration varies according to the size of the vDisks.
 
 The smallest VMs are migrated in about 20 seconds, while the biggest one may 
 take more than 5 or 10 minutes.
 The average duration is 90 seconds.
 
 Questions :
 
 1- Though I understood that the task of migration is carried out by the 
 SPM, I don't know what it actually does (which bytes go where).
 
 2- Do our times sound OK, or do they look improvable?
 
 3- What bottleneck should I investigate? I'm thinking about the dedicated 
 hardware NICs setup of the hosts, the SAN, the MTU has already been setup at 
 9000...
 
 Any ideas welcomed.
 
 -- 
 Nicolas Ecarnot

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Ovirt Manager Problem

2015-02-12 Thread Darrell Budic
Also check your auth backend, especially if you’re using the old AAA. I’ve 
found that with freeipa, for example, if one of my ipa hosts is down 
(specifically the “first” one for the ovirt engine machine), the engine GUI 
lags badly waiting for auth before failing over to the second ipa host. It does 
this for pretty much everything, unfortunately.

  -Darrell

 On Feb 12, 2015, at 9:31 AM, Martin Pavlík mpav...@redhat.com wrote:
 
 Hello Massimo,
 
 just a wild guess. Is it possible that engine machine has its RAM full and is 
 forced to use SWAP? Or some other resources on it are exhausted?
 
 HTH
 
 M.
 
 
 On 11 Feb 2015, at 17:30, Dan Yasny dya...@gmail.com wrote:
 
 What are you using to access the manager UI on the client (OS and browser)?
 
 On Wed, Feb 11, 2015 at 11:25 AM, Massimo Mad mad196...@gmail.com wrote:
 Hi, I have a strange problem: my manager is very slow, and with the GUI I 
 cannot do anything on the VMs and the hosts, but the VMs are OK.
 I tried to look at the log, but I have not seen anything that would explain 
 the problem.
 Regards
 Massimo
 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Self hosted engine iusses

2015-02-05 Thread Darrell Budic
You can also add “find_multipaths 1” to /etc/multipath.conf. This keeps 
multipathd from treating non-multipath devices as multipath devices, which avoids 
the error message and keeps multipathd from binding your normal devices. I find 
it simpler than blacklisting, and it should work if you also have real multipath 
devices.

defaults {
    find_multipaths yes
    polling_interval 5
    …


 On Feb 5, 2015, at 1:04 PM, George Skorup geo...@mwcomm.com wrote:
 
 I ran into this same problem after setting up my cluster on EL7. As has been 
 pointed out, the hosted-engine installer modifies /etc/multipath.conf.
 
 I appended:
 
 blacklist {
     devnode "*"
 }
 
 to the end of the modified multipath.conf, which is what was there before the 
 engine installer, and the errors stopped.
 
 I think I was getting 253:3 trying to map which don't exist on my systems. I 
 have a similar setup, md raid1 and LVM+XFS for gluster.

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Unable to run noVNC console un recent browsers

2015-02-20 Thread Darrell Budic
Yeah, any x.509 cert not ultimately signed by a major trust anchor was 
problematic. I noticed it on some internal sites signed by my freeipa ca myself.

 On Feb 20, 2015, at 11:50 AM, Simone Tiraboschi stira...@redhat.com wrote:
 
 
 
 - Original Message -
 From: Darrell Budic bu...@onholyground.com
 To: Simone Tiraboschi stira...@redhat.com
 Cc: users users@ovirt.org
 Sent: Friday, February 20, 2015 5:57:10 PM
 Subject: Re: [ovirt-users] Unable to run noVNC console un recent browsers
 
 I had some trouble with self-signed certs in Firefox when they switched to the
 new pkix stuff recently, have you tried setting
 security.use_mozillapkix_verification to false?
 
 The websocket proxy cert is not self-signed: it's normally signed by the 
 internal oVirt CA. 
 
 On Feb 20, 2015, at 8:56 AM, Simone Tiraboschi stira...@redhat.com wrote:
 
 
 
 - Original Message -
 From: Donny Davis do...@cloudspin.me
 To: Simone Tiraboschi stira...@redhat.com
 Cc: users@ovirt.org
 Sent: Friday, February 20, 2015 3:53:04 PM
 Subject: RE: [ovirt-users] Unable to run noVNC console un recent browsers
 
 No, I made my life easy and used nginx to proxy for the websocket. I was
 then
 able to use my commercial ssl cert to avoid all of these issues. Using a
 proxy for a proxy has been working out quite well for cloudspin, because I
 don't have to mess with anything internal to the engine and noVNC works
 without issue.
 
 Yes, using the oVirt internal CA is just the low-profile out of the box
 solution.
 
 DonnyD
 
 -Original Message-
 From: Simone Tiraboschi [mailto:stira...@redhat.com]
 Sent: Friday, February 20, 2015 7:03 AM
 To: Donny Davis
 Subject: Re: [ovirt-users] Unable to run noVNC console un recent browsers
 
 
 
 - Original Message -
 From: Donny Davis do...@cloudspin.me
 To: Simone Tiraboschi stira...@redhat.com
 Sent: Friday, February 20, 2015 2:23:56 PM
 Subject: RE: [ovirt-users] Unable to run noVNC console un recent
 browsers
 
 Is your websocket proxy on the same machine as your engine? I also get
 the CA error when the time is off. The proxy throws the error to
 /var/log/messages
 
 Hi Donny,
 I'm using the proxy on the same machine where the engine runs.
 No error till now on my side.
 
 I also trusted oVirt internal CA to sign other certs in my browser. Did
 you?
 You can find it at https://{engine}/ca.crt
 
 You should download it and add to the list of trusted certification
 authorities in your browser.
 
 -Original Message-
 From: users-boun...@ovirt.org [mailto:users-boun...@ovirt.org] On
 Behalf Of Simone Tiraboschi
 Sent: Friday, February 20, 2015 5:57 AM
 To: Stefano Danzi
 Cc: users@ovirt.org
 Subject: Re: [ovirt-users] Unable to run noVNC console un recent
 browsers
 
 
 
 - Original Message -
 From: Stefano Danzi s.da...@hawai.it
 To: Darrell Budic bu...@onholyground.com
 Cc: users@ovirt.org
 Sent: Friday, February 20, 2015 9:07:51 AM
 Subject: Re: [ovirt-users] Unable to run noVNC console un recent
 browsers
 
 Hello!
 Already done but this didn't help.
 
 I downloaded a portable version of Firefox 17 and noVNC work as
 expected.
 
 Il 20/02/2015 5.18, Darrell Budic ha scritto:
 
 
 
 Try reimporting the ca.cert for noVNC by connecting directly to the
 webproxy address at port 6100. Do this by trying to connect to a
 console and then, once the 1006 error shows up, just strip off
 everything after :6100/ . I've found that somewhere in or after 3.5,
 restarting the webproxy causes it to generate its own new ca.cert
 even though it shouldn't.
 
 -Darrell
 
 
 
 On Feb 19, 2015, at 4:09 PM, Stefano Danzi s.da...@hawai.it wrote:
 
 Hello,
 
 I can't get the noVNC console to work on recent browsers (Chrome 40,
 Firefox 35 and IE 11).
 
 The error that I get is already explained here:
 https://forge.univention.org/bugzilla/show_bug.cgi?id=33587 I tried
 to change the websocket as suggested (
 http://errata.univention.de/ucs/3.2/31.html ) but this did not help.
 
 noVNC 0.5.1 should be soon released in EPEL6/EPEL7 as for [1].
 noVNC 0.5.1 should also improve compatibility with recent browsers.
 
 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1193454#c3
 
 
 Does anyone know a workaround?

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5.1 net config persistence

2015-03-20 Thread Darrell Budic
I’ve encountered these issues on systems new and upgraded with bonding 
connections. The new system seems especially bad with bonds, and I’ve taken to 
immediately switching my hosts to the ifcfg persistence methods. Centos 6 and 7 
hosts.

If it matters, I’m good with setting up my own network config, and sometimes I 
REALLY DO NOT WANT ovirt to change it, especially with vlans and gluster 
coexistence. I can see the goal, but it seems pretty far from it right now, so 
I’m very happy that there’s a way to switch back to “system” control of those 
things.
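
Since the thread quoted below reports vdsm.conf being replaced by a short file without the persistence setting, a quick check like the following could tell whether the switch is still in place. This is only a sketch: it assumes the option lives in a [vars] section, and it treats "unified" as the fallback mode when the option is absent. Both are assumptions to verify against your own vdsm.conf. The helper name and the sample text are made up for illustration.

```python
import configparser

# Hypothetical sample content; on a real host you would read /etc/vdsm/vdsm.conf.
SAMPLE_VDSM_CONF = """
[vars]
net_persistence = ifcfg
"""


def net_persistence_mode(conf_text, default="unified"):
    """Return the configured net_persistence value, or `default` if unset.

    The [vars] section name and the 'unified' default are assumptions.
    """
    parser = configparser.ConfigParser()
    parser.read_string(conf_text)
    return parser.get("vars", "net_persistence", fallback=default)


if __name__ == "__main__":
    print(net_persistence_mode(SAMPLE_VDSM_CONF))
```

Running this against the file after a package upgrade or host reinstall would show right away if the setting had been dropped.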


 On Mar 20, 2015, at 10:41 AM, Nicolas Ecarnot nico...@ecarnot.net wrote:
 
 Le 20/03/2015 14:40, Dan Kenigsberg a écrit :
 On Fri, Mar 20, 2015 at 10:14:54AM +0100, Nicolas Ecarnot wrote:
 Hello,
 
 There are a number of bugs [1] reported these days about the issue around
 the network config of hosts, when dealing with manually configured
 interfaces, with bonding and VLANs.
 These /etc/sysconfig/network-scripts/ifcfg.* files are wiped by vdsm after
 rebooting.
 
 I see that there are people at Redhat working on these, and some cases were
 reproduced in lab conditions - and some were not.
 
 I upgraded 3 DC from 3.4.? to 3.5.1, and faced this issue (loss of all
 network files) inconsistently.
 
 I finally thought I coped with this problem by adding
 net_persistence = ifcfg
 to /etc/vdsm/vdsm.conf
 and indeed, when restarting vdsmd and the network, files were conserved.
 
 It was before I observed that some action [2] led to /etc/vdsm/vdsm.conf
 being renamed into /etc/vdsm/vdsm.conf.some_timestamp and the original one
 replaced by a very short file with no netcfg persistence at all.
 
 I haven't identified [2]. That could be :
 - some actions made by me through the Web UI ?
 - service vdsmd restart ?
 - reboots ?
 
 I'm sure that some Redhat people know what could be responsible for renaming
 /etc/vdsm/vdsm.conf into /etc/vdsm/vdsm.conf.some_timestamp, and I wish they
 are working closely with Dan Kenigsberg and Michael Burman who helped a lot
 on these issues (or maybe, THEY are the coders responsible for this ?)
 
 [1] :
 - https://bugzilla.redhat.com/show_bug.cgi?id=1154399
 - https://bugzilla.redhat.com/show_bug.cgi?id=1188251
 - and more or less related :
 https://bugzilla.redhat.com/show_bug.cgi?id=1134346
 
 Thanks for reporting this issue. We are well aware of it, and working
 hard to fix it. Unfortunately, there were several bugs on the process of
 upgrading ifcfg-based network configuration to vdsm's own unified
 persistence that sits under /var/lib/vdsm/persistence/netconf.
 
 Would you share which platform are you using? el6? el7? ovirt-node, or
 plain install?
 
 We are using centos 6.6 on all our hosts, minimal install.
 Idem on the manager, bare metal stand alone, not hosted.
 
 There is a recent report that ovirt-node may be restarting networking
 while vdsm starts up, which may well explain the problem and its
 unpredictability. Is this the case with you?
 
 We have not been using ovirt-node for 3 years, for some reasons.
 
 Regarding /etc/vdsm/vdsm.conf: vdsm never renames it. Could it be rpm's
 new behavior (replacing vdsm.conf.rpmsave) ? Or could it be the node,
 Fabian?
 
 Let us stay prudent : I indeed did some yum upgrade, BUT :
 - I made every step in a very modular way : first upgrade the manager
 - then put one host in maintenance
 - add the 3.5.1 repo on the host
 - then web-gui-reinstall it (upgrading the useful packages)
 - then put it up, migrate some VM on it, well, test it.
 - then put it back into maintenance
 - then yum upgrade it
 - then reboot it
 - then blah blah blah
 
 well you see, I won't explain every step, but I did that in a very cautious 
 way, taking time for each of them, and repeating this whole process more than 
 20 times.
 
 I don't get why it is working like a charm on most of them, and facing the 
 issues mentioned above on a portion of them.
 
 
 To answer to the renaming comment : yes Dan, some package upgrade renamed 
 vdsm.conf into rpmsave, BUT I was explicitly talking about an additional 
 renaming into vdsm.conf.201503191220 something, and I never saw a package 
 upgrade do that.
 
 
 Just a final word : though I sound grumpy and find this issue a real pain, I 
 am actually absolutely amazed by all the work done by all the oVirt community 
 and the Redhat people :)
 
 -- 
 Nicolas Ecarnot

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] VDSM memory consumption

2015-03-09 Thread Darrell Budic
 On Mar 9, 2015, at 4:51 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Fri, Mar 06, 2015 at 10:58:53AM -0600, Darrell Budic wrote:
 I believe the supervdsm leak was fixed, but 3.5.1 versions of vdsmd still 
 leak slowly, ~300k/hr, yes.
 
 https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 
 
 On Mar 6, 2015, at 10:23 AM, Chris Adams c...@cmadams.net wrote:
 
 Once upon a time, Federico Alberto Sayd fs...@uncu.edu.ar said:
 I am experiencing troubles with VDSM memory consumption.
 
 I am running
 
 Engine: ovirt 3.5.1
 
 Nodes:
 
 Centos 6.6
 VDSM 4.16.10-8
 Libvirt: libvirt-0.10.2-46
 Kernel: 2.6.32
 
 When the host boots, memory consumption is normal, but after 2 or 3
 days running, VDSM memory consumption grows and it consumes more
 memory than all the VMs running on the host. If I restart the vdsm
 service, memory consumption normalizes, but then it starts growing
 again.
 
 I have seen some BZ about vdsm and supervdsm about memory leaks, but
 I don't know if VDSM 4.16.10-8 is still affected by a related bug.
 
 Can't help, but I see the same thing with CentOS 7 nodes and the same
 version of vdsm.
 -- 
 Chris Adams c...@cmadams.net
 
 I'm afraid that we are yet to find a solution for this issue, which is
 completely different from the horrible leak of supervdsm  4.16.7.
 
 Could you corroborate the claim of
Bug 1147148 - M2Crypto usage in vdsm leaks memory
 ? Does the leak disappear once you start using plaintext transport?
 
 Regards,
 Dan.

I don’t think this is crypto related, but I could try that if you still need 
some confirmation (and point me at a quick doc on switching to plaintext?).

This is from #ovirt around November 18th I think, Saggi thought he’d found 
something related:

9:58:43 AM saggi: YamakasY: Found the leak
9:58:48 AM saggi: YamakasY: Or at least the flow
9:58:57 AM saggi: YamakasY: The good news is that I can reproduce
9:59:20 AM YamakasY: saggi: that's kewl!
9:59:25 AM YamakasY: saggi: what happens ?
9:59:41 AM YamakasY: I know from Telsin (ping ping!) that he sees it going 
faster on gluster usage
tdosek left the room (quit: Ping timeout: 480 seconds). (10:00:02 AM)
djasa left the room (quit: Quit: Leaving). (10:00:24 AM)
mlipchuk left the room (quit: Quit: Leaving.). (10:00:29 AM)
laravot left the room (quit: Quit: Leaving.). (10:01:19 AM)
10:01:54 AM saggi: YamakasY: it's in getCapabilities(). Here is the RSS graph. 
The flatlines are when I stopped calling it and called other verbs. 
http://i.imgur.com/CLm0Q75.png
movciari left the room (quit: Ping timeout: 480 seconds). (10:02:34 AM)
10:02:46 AM saggi: YamakasY: horizontal is time since epoch and vertical is RSS 
in bytes
bobdrad left the room (quit: Quit: Leaving.). (10:03:25 AM)
10:03:52 AM YamakasY: saggi: I have seen that line s much!
10:04:11 AM YamakasY: I think I even made a mailing about it
10:04:18 AM YamakasY: at least asked here
10:04:32 AM YamakasY: no-one knew, but those lines are almost blowing you away
10:04:35 AM YamakasY: can we patch it ?
10:04:59 AM YamakasY: wow, nice one to catch
10:05:28 AM saggi: YamakasY: I now have a smaller part of the code to scan 
through and a way to reproduce so hopefully I'll have a patch soon

was that ever followed up on?


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] VDSM memory consumption

2015-03-06 Thread Darrell Budic
I believe the supervdsm leak was fixed, but 3.5.1 versions of vdsmd still leak 
slowly, ~300k/hr, yes.

https://bugzilla.redhat.com/show_bug.cgi?id=1158108


 On Mar 6, 2015, at 10:23 AM, Chris Adams c...@cmadams.net wrote:
 
 Once upon a time, Federico Alberto Sayd fs...@uncu.edu.ar said:
 I am experiencing troubles with VDSM memory consumption.
 
 I am running
 
 Engine: ovirt 3.5.1
 
 Nodes:
 
 Centos 6.6
 VDSM 4.16.10-8
 Libvirt: libvirt-0.10.2-46
 Kernel: 2.6.32
 
 When the host boots, memory consumption is normal, but after 2 or 3
 days running, VDSM memory consumption grows and it consumes more
 memory than all the VMs running on the host. If I restart the vdsm
 service, memory consumption normalizes, but then it starts growing
 again.
 
 I have seen some BZ about vdsm and supervdsm about memory leaks, but
 I don't know if VDSM 4.16.10-8 is still affected by a related bug.
 
 Can't help, but I see the same thing with CentOS 7 nodes and the same
 version of vdsm.
 -- 
 Chris Adams c...@cmadams.net

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5.1 net config persistence

2015-03-24 Thread Darrell Budic

 On Mar 23, 2015, at 12:35 PM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Fri, Mar 20, 2015 at 02:01:25PM -0500, Darrell Budic wrote:
 I’ve encountered these issues on systems new and upgraded with bonding
 connections. The new system seems especially bad with bonds, and I’ve
 taken to immediately switching my hosts to the ifcfg persistence
 methods. Centos 6 and 7 hosts.
 
 There have been multiple issues regarding net config upgrade. We might
 have nailed an important one regarding ovirt-node.
 
 However, I'd like to learn more about your report regarding new systems.
 Your report sounds similar to
 
Bug 1203422 - vdsm should restore networks much earlier, to let
net-dependent services start

Caveat: I don’t have systems available to recreate at this time, so this is 
from memory of what I go through on a new host setup.

I haven’t filed bugs because I’ve seen several that look like mine, and until 
recently, I couldn’t be sure my problems weren’t being caused by upgrades from 
older systems. Whenever I experience issues, it’s related to installing onto a 
new host system and creating the bonds either in or outside of ovirt; the next 
time I reboot that host, the bonds do not get created, so none of the networks 
come up and I need to get on a console to fix things.

 If it matters, I’m good with setting up my own network config, and
 sometimes I REALLY DO NOT WANT ovirt to change them, especially with
 vlans and gluster coexistence. I can see the goal, but it seems
 pretty far from it right now, so I’m very happy that there’s a way to
 switch back to “system” control of those things.
 
 Besides Vdsm slowness to start the network, what are the reasons for
 your not wanting ovirt to touch your ifcfg? BTW, even today ovirt
 overwrites ifcfg files, but only on network def time, not on every boot.

I don’t actually notice the slowness, but my mgmt, access, and gluster storage 
networks depend on the bonded network config to function. I’d like to have them 
up at boot and not wait for vdsmd to bring them up. Similar to Bug 1203422, but 
my problem is that the bonds don’t get created at boot, so no other networks 
that depend on them can come up.

Also, I have set up my gluster backend to use specific interfaces and ip 
addresses, and I’d like it if Ovirt didn’t mess with them. 

These are all things I can work around with ifcfg files, so I prefer them. I’ve 
taken to saving my ifcfg-* files so I can easily replace them if ovirt does 
things to them I don’t like (like setting ONBOOT=no). I did catch that it only 
alters them when defining a network, which does mean I can easily adjust things 
as needed.
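
As a sketch of how the "did it flip ONBOOT on me" check could be automated against saved copies of the ifcfg files: the helper names here are made up, and the parser only handles simple KEY=value lines, which is an assumption about how the files are written.

```python
def parse_ifcfg(text):
    """Parse simple KEY=value lines from ifcfg-style text into a dict.

    Blank lines, comments and lines without '=' are ignored; surrounding
    quotes on values are dropped. Shell constructs are not interpreted.
    """
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip().strip("\"'")
    return settings


def onboot_disabled(text):
    """True if the ifcfg text explicitly sets ONBOOT to 'no'."""
    return parse_ifcfg(text).get("ONBOOT", "yes").lower() == "no"


if __name__ == "__main__":
    sample = "DEVICE=bond0\nBONDING_OPTS=\"mode=4 miimon=100\"\nONBOOT=no\n"
    print(onboot_disabled(sample))
```

Comparing the parsed dict of a saved copy against the live file would also show any other keys that were changed.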
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5.1 net config persistence

2015-03-24 Thread Darrell Budic
I should add that I do one thing that may be considered unusual. I have a bunch 
of systems with 2 1Gb links on them, and I’m building them on one link, then 
manually converting them to bonded links before configuring them as ovirt host 
nodes. Since I have no other dedicated interfaces, all of my networking depends 
on the bonded interface for connectivity.


 On Mar 24, 2015, at 11:40 AM, Darrell Budic bu...@onholyground.com wrote:
 
 
 On Mar 23, 2015, at 12:35 PM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Fri, Mar 20, 2015 at 02:01:25PM -0500, Darrell Budic wrote:
 I’ve encountered these issues on systems new and upgraded with bonding
 connections. The new system seems especially bad with bonds, and I’ve
 taken to immediately switching my hosts to the ifcfg persistence
 methods. Centos 6 and 7 hosts.
 
 There have been multiple issues regarding net config upgrade. We might
 have nailed an important one regarding ovirt-node.
 
 However, I'd like to learn more about your report regarding new systems.
 Your report sounds similar to
 
   Bug 1203422 - vdsm should restore networks much earlier, to let
   net-dependent services start
 
 Caveat: I don’t have systems available to recreate at this time, so this is 
 from memory of what I go through on a new host setup.
 
 I haven’t filed bugs because I’ve seen several that look like mine, and until 
 recently, I couldn’t be sure my problems weren’t being caused by upgrades 
 from older systems. Whenever I experience issues, it’s related to installing 
 onto a new host system, creating the bonds either in or outside of ovirt, and 
 the next time I reboot that host, the bonds do not get created so none of the 
 networks come up and I need to get on a console to fix things.
 
 If it matters, I’m good with setting up my own network config, and
 sometimes I REALLY DO NOT WANT ovirt to change them, especially with
 vlans and gluster coexistence. I can see the goal, but it seems
 pretty far from it right now, so I’m very happy that there’s a way to
 switch back to “system” control of those things.
 
 Besides Vdsm slowness to start the network, what are the reasons for
 your not wanting ovirt to touch your ifcfg? BTW, even today ovirt
 overwrites ifcfg files, but only on network def time, not on every boot.
 
 I don’t actually notice the slowness, but my mgmt, access, and gluster 
 storage networks depend on the bonded network config to function. I’d like to 
 have them up at boot and not wait for vdsmd to bring them up. Similar to Bug 
 1203422, but my problem is that the bonds don’t get created at boot, so no 
 other networks that depend on them can come up.
 
 Also, I have setup my gluster backend to use specific interfaces and ip 
 addresses, and I’d like it if Ovirt didn’t mess with them. 
 
 These are all things I can work around with ifcfg files, so I prefer them. 
 I’ve taken to saving my ifcfg-* files so I can easily replace them if ovirt 
 does things to them I don’t like (like setting ONBOOT=no). I did catch that 
 it only alters them when defining a network, it does mean I can easily adjust 
 things as needed.

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] VDSM memory consumption

2015-03-25 Thread Darrell Budic

 On Mar 25, 2015, at 5:34 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Tue, Mar 24, 2015 at 02:01:40PM -0500, Darrell Budic wrote:
 
 On Mar 24, 2015, at 4:33 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Mon, Mar 23, 2015 at 04:00:14PM -0400, John Taylor wrote:
 Chris Adams c...@cmadams.net writes:
 
 Once upon a time, Sven Kieske s.kie...@mittwald.de said:
 On 13/03/15 12:29, Kapetanakis Giannis wrote:
 We also face this problem since 3.5 in two different installations...
 Hope it's fixed soon
 
 Nothing will get fixed if no one bothers to
 open BZs and send relevant log files to help
 track down the problems.
 
 There's already an open BZ:
 
 https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 
 I'm not sure if that is exactly the same problem I'm seeing or not; my
 vdsm process seems to be growing faster (RSS grew 952K in a 5 minute
 period just now; VSZ didn't change).
 
 For those following this I've added a comment on the bz [1], although in
 my case the memory leak is, like Chris Adams, a lot more than the 300KiB/h
 in the original bug report by Daniel Helgenberger .
 
 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 
 That's interesting (and worrying).
 Could you check your suggestion by editing sampling.py so that
 _get_interfaces_and_samples() returns the empty dict immediately?
 Would this make the leak disappear?
 
 Looks like you’ve got something there. Just a quick test for now, watching 
 RSS in top. I’ll let it go this way for a while and see what it looks like in 
 a few hours.
 
 System 1: 13 VMs w/ 24 interfaces between them
 
 11:47 killed a vdsm @ 9.116G RSS (after maybe a week and a half running)
 
 11:47: 97xxx
 11:57 135544 and climbing
 12:00 136400
 
 restarted with sampling.py modified to just return empty set:
 
 def _get_interfaces_and_samples():
     links_and_samples = {}
     return links_and_samples
 
 Thanks for the input. Just to be a little more certain that the culprit
 is _get_interfaces_and_samples() per se, would you please decorate it
 with memoized, and add a log line in the end
 
 @utils.memoized   # add this line
 def _get_interfaces_and_samples():
     ...
     logging.debug('LINKS %s', links_and_samples)  ## and this line
     return links_and_samples
 
 I'd like to see what happens when the function is run only once, and
 returns a non-empty reasonable dictionary of links and samples.
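
For readers unfamiliar with the trick: memoizing a zero-argument function means its body runs once and every later call returns the cached result, so if the leak is inside the body, growth should stop after the first call. The decorator below is a generic sketch of that idea, not vdsm's actual utils.memoized implementation, and the sample function body is a placeholder.

```python
import functools


def memoized(func):
    """Cache results per argument tuple so the body runs once per distinct call."""
    cache = {}

    @functools.wraps(func)
    def wrapper(*args):
        if args not in cache:
            cache[args] = func(*args)
        return cache[args]

    return wrapper


calls = []  # records how many times the real body executed


@memoized
def get_interfaces_and_samples():
    # Placeholder body standing in for the real sampling work.
    calls.append(1)
    return {"bond0": object()}


if __name__ == "__main__":
    first = get_interfaces_and_samples()
    second = get_interfaces_and_samples()
    print(first is second, len(calls))
```

Note the side effect for this experiment: the samples go stale after the first call, so this is a diagnostic aid only, not a fix.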

Looks similar, I modified my second server for this test:

12:25, still growing from yesterday: 544512

restarted with mods for logging and memoize:
stabilized @ 12:32: 114284
1:23: 115300
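
To compare runs, the readings can be turned into a growth rate. The sketch below assumes the RSS figures from top are in KiB and uses only the first and last sample of each window; the function name and the sample lists are my own framing of the numbers posted in this thread.

```python
def growth_kib_per_hour(samples):
    """Average RSS growth between the first and last sample.

    `samples` is a list of (minutes_since_midnight, rss_kib) tuples, oldest
    first. Treating top's RSS column as KiB is an assumption.
    """
    (t0, rss0), (t1, rss1) = samples[0], samples[-1]
    return (rss1 - rss0) * 60.0 / (t1 - t0)


# Unmodified vdsm on the first server: 11:57 -> 12:00
unpatched = [(11 * 60 + 57, 135544), (12 * 60, 136400)]
# Memoized sampling on the second server: 12:32 -> 13:23
memoized_run = [(12 * 60 + 32, 114284), (13 * 60 + 23, 115300)]

if __name__ == "__main__":
    print("unpatched: %.0f KiB/h" % growth_kib_per_hour(unpatched))
    print("memoized:  %.0f KiB/h" % growth_kib_per_hour(memoized_run))
```

These are short windows on two different hosts, so the absolute rates should be taken loosely; the point is the order-of-magnitude gap between the two runs.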

Thread-12::DEBUG::2015-03-25 
12:28:08,080::sampling::243::root::(_get_interfaces_and_samples) LINKS 
{'vnet18': virt.sampling.InterfaceSample instance at 0x7f38c03e85f0, 
'vnet19': virt.sampling.InterfaceSample instance at 0x7f38b42cbcf8, 'bond0': 
virt.sampling.InterfaceSample instance at 0x7f38b429afc8, 'vnet13': 
virt.sampling.InterfaceSample instance at 0x7f38b42c8680, 'vnet16': 
virt.sampling.InterfaceSample instance at 0x7f38b42cb368, 'private': 
virt.sampling.InterfaceSample instance at 0x7f38b42b8bd8, 'bond0.100': 
virt.sampling.InterfaceSample instance at 0x7f38b42bdd88, 'vnet0': 
virt.sampling.InterfaceSample instance at 0x7f38b42c1f80, 'enp3s0': 
virt.sampling.InterfaceSample instance at 0x7f38b429cef0, 'vnet2': 
virt.sampling.InterfaceSample instance at 0x7f38b42bbbd8, 'vnet3': 
virt.sampling.InterfaceSample instance at 0x7f38b42c37e8, 'vnet4': 
virt.sampling.InterfaceSample instance at 0x7f38b42c5518, 'vnet5': 
virt.sampling.InterfaceSample instance at 0x7f38b42c6ab8, 'vnet6': 
virt.sampling.InterfaceSample instance at 0x7f38b42c7248, 'vnet7': 
virt.sampling.InterfaceSample instance at 0x7f38c03e7a28, 'vnet8': 
virt.sampling.InterfaceSample instance at 0x7f38b42c7c20, 'bond0.1100': 
virt.sampling.InterfaceSample instance at 0x7f38b42be710, 'bond0.1103': 
virt.sampling.InterfaceSample instance at 0x7f38b429dc68, 'ovirtmgmt': 
virt.sampling.InterfaceSample instance at 0x7f38b42b16c8, 'lo': 
virt.sampling.InterfaceSample instance at 0x7f38b429a8c0, 'vnet22': 
virt.sampling.InterfaceSample instance at 0x7f38c03e7128, 'vnet21': 
virt.sampling.InterfaceSample instance at 0x7f38b42cd368, 'vnet20': 
virt.sampling.InterfaceSample instance at 0x7f38b42cc7a0, 'internet': 
virt.sampling.InterfaceSample instance at 0x7f38b42aa098, 'bond0.1203': 
virt.sampling.InterfaceSample instance at 0x7f38b42aa8c0, 'bond0.1223': 
virt.sampling.InterfaceSample instance at 0x7f38b42bb128, ‘XXX': 
virt.sampling.InterfaceSample instance at 0x7f38b42bee60, ‘XXX': 
virt.sampling.InterfaceSample instance at 0x7f38b42beef0, ';vdsmdummy;': 
virt.sampling.InterfaceSample instance at 0x7f38b42bdc20, 'vnet14': 
virt.sampling.InterfaceSample instance at 0x7f38b42ca050, 'mgmt': 
virt.sampling.InterfaceSample instance at 0x7f38b42be248, 'vnet15': 
virt.sampling.InterfaceSample instance at 0x7f38b42cab00, 'enp2s0': 
virt.sampling.InterfaceSample instance at 0x7f38b429c200, 'bond0.1110

Re: [ovirt-users] VDSM memory consumption

2015-03-31 Thread Darrell Budic
Finally got a chance to implement this, so I’m testing it on my centos7 hosts, 
and it looks good. I’ll keep an eye on it for a couple of days, but after a 
couple of hours, there’s no evidence of any leakage.


 On Mar 30, 2015, at 4:14 PM, John Taylor jtt77...@yahoo.com wrote:
 
 Dan Kenigsberg dan...@redhat.com writes:
 
 On Sat, Mar 28, 2015 at 10:20:25AM -0400, John Taylor wrote:
 Daniel Helgenberger daniel.helgenber...@m-box.de writes:
 
 Hello Everyone,
 
 I created the original BZ on this. In the meantime, the lab system I
 used has been dismantled and the production system is yet to be deployed.
 
 As I wrote in BZ1147148 [1], I experienced two different issues: one
 big mem leak of about 15MiB/h, and a smaller one of ~300KiB/h. These seem
 unrelated.
 
 The larger leak was indeed related to SSL in some way; not necessarily
 M2Crypto. However, after disabling SSL this was gone leaving the smaller
 leak.
 
 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1147148
 
 
 I think there are, at least for the purpose of this discussion, 3 leaks:
 1. the M2Crypto leak
 2. a slower leak 
 3. a large leak that's not M2Crypto related that's part of sampling
 
 My efforts have been around finding the source of my larger leak, which
 I think is #3.  I had disabled ssl, so I knew that M2Crypto
 isn't/shouldn't be the problem as in bz1147148, and ssl is beside the
 point as it happens with a deactivated host. It's part of sampling, which
 always runs.
 
 What I've found, after trying to get the smallest reproducer, is that
 it's not the netlink.iter_links that I commented on in [1] that is the
 problem. Rather, in the _get_interfaces_and_samples loop there is the call to
 create an InterfaceSample, and that has getLinkSpeed() which, for vlans,
 ends up calling ipwrapper.getLink, and that in turn calls
 netlink.get_link(name)
 
 netlink.get_link(name) *is* the source of my big leak. This is vdsm
 4.16.10, so it is [2] and it's been changed in master for the removal of
 support for libnl v1 so it might not be a problem anymore. 
 
 def get_link(name):
     """Returns the information dictionary of the name specified link."""
     with _pool.socket() as sock:
         with _nl_link_cache(sock) as cache:
             link = _rtnl_link_get_by_name(cache, name)
             if not link:
                 raise IOError(errno.ENODEV, '%s is not present in the system' %
                               name)
             return _link_info(cache, link)
 
 
 The libnl documentation note at [3] says, for the rtnl_link_get_by_name 
 function:
 
     Attention: The reference counter of the returned link object will be
     incremented. Use rtnl_link_put() to release the reference.
 
 So I took that hint, and made a change that does the rtnl_link_put() in
 get_link(name) and it looks like it works for me.
 
 diff oldnetlink.py netlink.py
 67d66
 <             return _link_info(cache, link)
 68a68,70
 >             li = _link_info(cache, link)
 >             _rtnl_link_put(link)
 >             return li
 333a336,337
 >
 > _rtnl_link_put  = _none_proto(('rtnl_link_put', LIBNL_ROUTE))
 
 Hope that helps. And if someone else could confirm that would be great.
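
The get/put pairing described above can be illustrated without libnl at all. The sketch below uses stand-in classes (FakeLink, FakeCache and link_info are hypothetical names, not vdsm or libnl APIs) to show a lookup that takes a reference and a try/finally that always gives it back, so the count returns to zero however many times the lookup runs.

```python
import errno


class FakeLink:
    """Stand-in for a libnl link object; it only tracks a reference count."""

    def __init__(self, name):
        self.name = name
        self.refs = 0


class FakeCache:
    """Stand-in for a link cache whose lookups take a reference."""

    def __init__(self, names):
        self._links = {name: FakeLink(name) for name in names}

    def get_by_name(self, name):
        link = self._links.get(name)
        if link is not None:
            link.refs += 1  # mirrors "the reference counter ... will be incremented"
        return link

    @staticmethod
    def put(link):
        link.refs -= 1  # mirrors releasing the reference


def link_info(cache, name):
    """Look up a link, build its info dict, and always release the reference."""
    link = cache.get_by_name(name)
    if link is None:
        raise IOError(errno.ENODEV, '%s is not present in the system' % name)
    try:
        return {'name': link.name}
    finally:
        cache.put(link)


if __name__ == "__main__":
    cache = FakeCache(['bond0'])
    for _ in range(1000):
        link_info(cache, 'bond0')
    print(cache.get_by_name('bond0').refs)
```

Using try/finally rather than a plain call before the return also covers the case where building the info dictionary raises, which the three-line patch in the diff would not.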
 
 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 [2]
 https://gerrit.ovirt.org/gitweb?p=vdsm.git;a=blob;f=lib/vdsm/netlink.py;h=afae5cecb5ce701d00fb8f019ec92b3331a39036;hb=5608cfdf43db9186dabac4b2a779f9557e798968
 [3] 
 http://www.infradead.org/~tgr/libnl/doc/api/group__link.html#ga1d583e4f0b43c89d854e5e681a529fad
 
 Thanks, John, for great detective work.
 
 I'm afraid that even on the master branch we keep calling
 rtnl_link_get_link() and rtnl_link_get_by_name() without releasing the
 reference, so a fix is due there, too.
 
 Would you consider posting a fully-fledged fix to gerrit? I still need
 to understand what is the use of that refcount, so that we do not
 release it too early.
 
 Regards,
 Dan.
 
 Dan,
 
 I'm happy to [1], although I've probably gotten something wrong with how
 it's supposed to be done :) It's for the version I'm using so it's for
 branch ovirt-3.5.
 
 [1] https://gerrit.ovirt.org/#/c/39372/
 
 Thanks,
 -John


Re: [ovirt-users] VDSM memory consumption

2015-03-26 Thread Darrell Budic

 On Mar 26, 2015, at 6:42 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Wed, Mar 25, 2015 at 01:29:25PM -0500, Darrell Budic wrote:
 
 On Mar 25, 2015, at 5:34 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Tue, Mar 24, 2015 at 02:01:40PM -0500, Darrell Budic wrote:
 
 On Mar 24, 2015, at 4:33 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Mon, Mar 23, 2015 at 04:00:14PM -0400, John Taylor wrote:
 Chris Adams c...@cmadams.net writes:
 
 Once upon a time, Sven Kieske s.kie...@mittwald.de said:
 On 13/03/15 12:29, Kapetanakis Giannis wrote:
 We also face this problem since 3.5 in two different installations...
 Hope it's fixed soon
 
 Nothing will get fixed if no one bothers to
 open BZs and send relevant log files to help
 track down the problems.
 
 There's already an open BZ:
 
 https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 
 I'm not sure if that is exactly the same problem I'm seeing or not; my
 vdsm process seems to be growing faster (RSS grew 952K in a 5 minute
 period just now; VSZ didn't change).
 
 For those following this I've added a comment on the bz [1], although in
 my case the memory leak is, like Chris Adams', a lot more than the
 300KiB/h in the original bug report by Daniel Helgenberger.
 
 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 
 That's interesting (and worrying).
 Could you check your suggestion by editing sampling.py so that
 _get_interfaces_and_samples() returns the empty dict immediately?
 Would this make the leak disappear?
 
 Looks like you’ve got something there. Just a quick test for now, watching 
 RSS in top. I’ll let it go this way for a while and see what it looks like in a 
 few hours.
 
 System 1: 13 VMs w/ 24 interfaces between them
 
 11:47 killed a vdsm @ 9.116G RSS (after maybe a week and a half running)
 
 11:47: 97xxx
 11:57 135544 and climbing
 12:00 136400
 
 restarted with sampling.py modified to just return an empty dict:
 
 def _get_interfaces_and_samples():
   links_and_samples = {}
   return links_and_samples
 
 Thanks for the input. Just to be a little more certain that the culprit
 is _get_interfaces_and_samples() per se, would you please decorate it
 with memoized, and add a log line in the end
 
 @utils.memoized   # add this line
 def _get_interfaces_and_samples():
   ...
   logging.debug('LINKS %s', links_and_samples)  ## and this line
   return links_and_samples
 
 I'd like to see what happens when the function is run only once, and
 returns a non-empty reasonable dictionary of links and samples.
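
 For readers unfamiliar with the idea behind that test, here is a minimal sketch of what a memoizing decorator does. The memoized function below is a hypothetical stand-in written for this illustration, not vdsm's utils.memoized, and get_samples is a made-up placeholder for the real sampling function. The point is only that the wrapped body runs once and later calls reuse the stored result, so any leak inside the body can happen at most once.

```python
import functools


def memoized(func):
    """Cache results keyed by positional arguments (illustrative stand-in)."""
    cache = {}

    @functools.wraps(func)
    def wrapper(*args):
        if args not in cache:
            cache[args] = func(*args)
        return cache[args]

    return wrapper


calls = []


@memoized
def get_samples():
    calls.append(1)  # records each time the real body actually runs
    return {'eth0': 'sample'}


for _ in range(5):
    result = get_samples()
```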
 
 Looks similar, I modified my second server for this test:
 
 Thanks again. Would you be kind enough to dig further?
 Does the following script leak anything on your host, when placed in your
 /usr/share/vdsm:
 
 #!/usr/bin/python
 
 from time import sleep
 from virt.sampling import _get_interfaces_and_samples
 
 while True:
     _get_interfaces_and_samples()
     sleep(0.2)
 
 Something that can be a bit harder would be to:
 # service vdsmd stop
 # su - vdsm -s /bin/bash
 # cd /usr/share/vdsm
 # valgrind --leak-check=full --log-file=/tmp/your.log vdsm
 
 as suggested by Thomas on
 https://bugzilla.redhat.com/show_bug.cgi?id=1158108#c6

Yes, this script leaks quickly. It started out at an RSS of 21000ish, was already at 
26744 a minute in, and about 5 minutes later it’s at 39384 and climbing.
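
The readings above were taken by hand. As a rough sketch of how the same check could be scripted on Linux, the helper below reads VmRSS from /proc and reports how much the current process grew while calling a function repeatedly. All names here (parse_vmrss_kib, read_rss_kib, rss_growth_kib) are made up for this example, and it assumes the kernel reports VmRSS in kB in /proc/<pid>/status.

```python
def parse_vmrss_kib(status_text):
    """Return the VmRSS value in KiB from /proc/<pid>/status text, or None."""
    for line in status_text.splitlines():
        if line.startswith('VmRSS:'):
            return int(line.split()[1])
    return None


def read_rss_kib(pid='self'):
    """Read the current resident set size of a process (Linux only)."""
    with open('/proc/%s/status' % pid) as status_file:
        return parse_vmrss_kib(status_file.read())


def rss_growth_kib(func, iterations):
    """Call func repeatedly and return how much this process's RSS grew."""
    before = read_rss_kib()
    for _ in range(iterations):
        func()
    return read_rss_kib() - before
```

Something like rss_growth_kib(_get_interfaces_and_samples, 1000), run from /usr/share/vdsm, would give a single number to compare before and after a patch; small fluctuations are normal, so only steady growth over many iterations is meaningful.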

I’ve been abusing a production server for those simple tests, but didn’t want to run 
valgrind against it right this minute. I did run it against the test.py script 
above though, and got this (fpaste.org didn’t like it, too long maybe?): 
http://tower.onholyground.com/valgrind-test.log

To comment on some other posts in this thread, I also see leaks on my test 
system which is running CentOS 6.6, but it only has 3 VMs across 2 servers and 
3 configured networks and it leaks MUCH slower. I suspect people don’t notice 
this on test systems because they don’t have a lot of VMs/interfaces running, 
and don’t leave them up for weeks at a time. That’s why I was running these 
tests on my production box, to have more VMs up.






Re: [ovirt-users] Host/storage OS upgrade: VM migrate?

2015-03-05 Thread Darrell Budic
In theory you can do this, but it takes a bit of work.

I migrated a cent 6 system to a cent 7 and kept my gluster bricks intact by 
backing up and restoring /etc/glusterfs and /var/lib/glusterd, as well as the 
bricks themselves. Your mileage may vary. I was also working on a multi-server 
system and was in a position to rebuild the bricks from the running systems if 
needed. And I needed to for one set, so it wasn’t perfect. If you go this 
route, make sure you back up your engine and restore it as well; a simple copy 
of your storage domain is not enough to keep a VM in ovirt (although you can 
probably import the volume as an existing domain and get the disks back).

It might be safer to set up an export domain on an external drive and export your 
VMs to it, then you can import them to a clean new system after your upgrade. 
Way less to go wrong with this approach, so I’d probably recommend it.

  -Darrell


 On Mar 5, 2015, at 4:59 AM, Alan Murrell li...@murrell.ca wrote:
 
 Hello,
 
 I currently run oVirt 3.5 on CentOS6.  It is on a single host with a 
 self-hosted engine.  Not an officially supported setup, but it is just a home 
 lab.
 
 When the next release of oVirt comes out (3.6), I am thinking I may want to 
 upgrade to CentOS7 on both host and engine to take advantage of the newer 
 libraries and features of Centos7.
 
 On the host, my storage is GlusterFS and is on the same physical HDD as the 
 OS, but on a different LVM partition.
 
 If I were to do a fresh install on the OS partition, and run through the 
 initial steps to install oVirt and GlusterFS packages, when I get to 
 configuring GlusterFS, will it be able to pick up my existing bricks and thus 
 allow me to import that existing storage into oVirt?
 
 Alternatively, is there a way to export my VMs to an external HDD, do a 
 completely fresh install, then import the VMs back in?  I suspect I would 
 probably need to resort to a cloning tool like CloneZilla?
 
 Downtime of the VMs is not an issue, since this is just a lab and there is 
 nothing production-wise running on it.
 
 Thanks for your advice!
 
 Regards,
 
 Alan


Re: [ovirt-users] Unable to run noVNC console un recent browsers

2015-02-20 Thread Darrell Budic
I had some trouble with self-signed certs in Firefox when they switched to the 
new pkix stuff recently. Have you tried setting 
security.use_mozillapkix_verification to false?


 On Feb 20, 2015, at 8:56 AM, Simone Tiraboschi stira...@redhat.com wrote:
 
 
 
 - Original Message -
 From: Donny Davis do...@cloudspin.me
 To: Simone Tiraboschi stira...@redhat.com
 Cc: users@ovirt.org
 Sent: Friday, February 20, 2015 3:53:04 PM
 Subject: RE: [ovirt-users] Unable to run noVNC console un recent browsers
 
 No, I made my life easy and used nginx to proxy for the websocket. I was then
 able to use my commercial ssl cert to avoid all of these issues. Using a
 proxy for a proxy has been working out quite well for cloudspin, because I
 don't have to mess with anything internal to the engine and noVNC works
 without issue.
 
 Yes, using the oVirt internal CA is just the low-profile out of the box 
 solution.
 
 DonnyD
 
 -Original Message-
 From: Simone Tiraboschi [mailto:stira...@redhat.com]
 Sent: Friday, February 20, 2015 7:03 AM
 To: Donny Davis
 Subject: Re: [ovirt-users] Unable to run noVNC console un recent browsers
 
 
 
 - Original Message -
 From: Donny Davis do...@cloudspin.me
 To: Simone Tiraboschi stira...@redhat.com
 Sent: Friday, February 20, 2015 2:23:56 PM
 Subject: RE: [ovirt-users] Unable to run noVNC console un recent
 browsers
 
 Is your websocket proxy on the same machine as your engine? I also get
 the CA error when the time is off. The proxy throws the error to
 /var/log/messages
 
 Hi Donny,
 I'm using the proxy on the same machine where the engine runs.
 No errors so far on my side.
 
 I also trusted oVirt internal CA to sign other certs in my browser. Did you?
 You can find it at https://{engine}/ca.crt
 
 You should download it and add to the list of trusted certification
 authorities in your browser.
 
 -Original Message-
 From: users-boun...@ovirt.org [mailto:users-boun...@ovirt.org] On
 Behalf Of Simone Tiraboschi
 Sent: Friday, February 20, 2015 5:57 AM
 To: Stefano Danzi
 Cc: users@ovirt.org
 Subject: Re: [ovirt-users] Unable to run noVNC console un recent
 browsers
 
 
 
 - Original Message -
 From: Stefano Danzi s.da...@hawai.it
 To: Darrell Budic bu...@onholyground.com
 Cc: users@ovirt.org
 Sent: Friday, February 20, 2015 9:07:51 AM
 Subject: Re: [ovirt-users] Unable to run noVNC console un recent
 browsers
 
 Hello!
 Already done but this didn't help.
 
 I downloaded a portable version of Firefox 17 and noVNC works as expected.
 
 Il 20/02/2015 5.18, Darrell Budic ha scritto:
 
 
 
 Try reimporting the ca.cert for noVNC by connecting directly to the
 webproxy address at port 6100. Do this by trying to connect to a
 console and then, once the 1006 error shows up, just strip off
 everything after :6100/ . I've found that somewhere in or after 3.5,
 restarting the webproxy causes it to generate its own new ca.cert
 even though it shouldn't.
 
  -Darrell
 
 
 
 On Feb 19, 2015, at 4:09 PM, Stefano Danzi s.da...@hawai.it wrote:
 
 Hello,
 
 I can't get the noVNC console to work on recent browsers (Chrome 40,
 Firefox 35 and IE 11).
 
 The error that I get is already explained here:
 https://forge.univention.org/bugzilla/show_bug.cgi?id=33587 I tried
 to change websocket as suggested (
 http://errata.univention.de/ucs/3.2/31.html ) but this did not help.
 
 noVNC 0.5.1 should be soon released in EPEL6/EPEL7 as for [1].
 noVNC 0.5.1 should also improve compatibility with recent browsers.
 
 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1193454#c3
 
 
 Does anyone know a workaround?


Re: [ovirt-users] VDSM memory consumption

2015-03-24 Thread Darrell Budic

 On Mar 24, 2015, at 4:33 AM, Dan Kenigsberg dan...@redhat.com wrote:
 
 On Mon, Mar 23, 2015 at 04:00:14PM -0400, John Taylor wrote:
 Chris Adams c...@cmadams.net writes:
 
 Once upon a time, Sven Kieske s.kie...@mittwald.de said:
 On 13/03/15 12:29, Kapetanakis Giannis wrote:
 We also face this problem since 3.5 in two different installations...
 Hope it's fixed soon
 
 Nothing will get fixed if no one bothers to
 open BZs and send relevant log files to help
 track down the problems.
 
 There's already an open BZ:
 
 https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 
 I'm not sure if that is exactly the same problem I'm seeing or not; my
 vdsm process seems to be growing faster (RSS grew 952K in a 5 minute
 period just now; VSZ didn't change).
 
 For those following this I've added a comment on the bz [1], although in
 my case the memory leak is, like Chris Adams, a lot more than the 300KiB/h
 in the original bug report by Daniel Helgenberger .
 
 [1] https://bugzilla.redhat.com/show_bug.cgi?id=1158108
 
 That's interesting (and worrying).
 Could you check your suggestion by editing sampling.py so that
 _get_interfaces_and_samples() returns the empty dict immediately?
 Would this make the leak disappear?

Looks like you’ve got something there. Just a quick test for now, watching RSS 
in top. I’ll let it go this way for a while and see what it looks like in a few 
hours.

System 1: 13 VMs w/ 24 interfaces between them

11:47 killed a vdsm @ 9.116G RSS (after maybe a week and a half running)

11:47: 97xxx
11:57 135544 and climbing
12:00 136400

restarted with sampling.py modified to just return an empty dict:

def _get_interfaces_and_samples():
    links_and_samples = {}
    return links_and_samples

12:02 quickly grew to 127694
12:13: 133352
12:20: 132476
12:31: 132732
12:40: 132656
12:50: 132800
1:30: 133928
1:40: 133136
1:50: 133116
2:00: 133128

Interestingly, it looks like overall system load dropped significantly (from 
~40-45% to 10% reported). That is mostly ksmd getting out of the way after freeing 9G, 
but it feels like more than that. (This is a 6 core system; I usually saw ksmd using 
~80% of a single cpu, roughly 15% of the total available.)


Second system, 10 VMs w/ 17 interfaces

vdsmd @ 5.027G RSS (slightly less uptime than the previous host). Freeing this ram 
caused a ~16% utilization drop as ksmd stopped running as hard.

restarted at 12:10

12:10: 106224
12:20: 111220
12:31: 114616
12:40: 117500
12:50: 120504
1:30: 133040
1:40: 136140
1:50: 139032
2:00: 142292
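
To put the two sets of readings side by side, here is a small worked calculation. It is a sketch under two assumptions that are not stated in the message: that the numbers are KiB as top normally displays RSS, and that the times from 1:30 onward are in the afternoon. The helper names are made up for the example. Only the first and last reading of each run after it settled are used.

```python
def to_minutes(clock, afternoon=False):
    """Convert 'H:MM' to minutes since midnight; afternoon=True adds 12 hours."""
    hours, minutes = clock.split(':')
    return (int(hours) + (12 if afternoon else 0)) * 60 + int(minutes)


def growth_kib_per_min(samples):
    """Average RSS growth between the first and last (minutes, KiB) samples."""
    (t0, rss0), (t1, rss1) = samples[0], samples[-1]
    return (rss1 - rss0) / float(t1 - t0)


# System 1, sampling stubbed out: 12:13 -> 2:00
patched = [(to_minutes('12:13'), 133352),
           (to_minutes('2:00', afternoon=True), 133128)]

# System 2, unmodified vdsm: 12:10 -> 2:00
unpatched = [(to_minutes('12:10'), 106224),
             (to_minutes('2:00', afternoon=True), 142292)]

patched_rate = growth_kib_per_min(patched)
unpatched_rate = growth_kib_per_min(unpatched)
```

Under those assumptions the unmodified host grows by roughly 328 KiB per minute (about 19 MiB per hour), while the host with sampling stubbed out is essentially flat at about -2 KiB per minute.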





Re: [ovirt-users] ReferenceError: WebUtil is not defined = novnc console broken after yum update (on centos 6.6?)

2015-05-18 Thread Darrell Budic
yum downgrade novnc

on your ovirt engine works to get around it. Doesn’t seem to have any 
dependency problems on my cent 6.6 host, just have to keep remembering to redo 
it after engine updates.

  -Darrell

 On May 18, 2015, at 7:07 AM, m...@ohnewald.net wrote:
 
 Hello,
 
 I think I ran into this bug:
 https://bugzilla.redhat.com/show_bug.cgi?format=multiple&id=1202356
 
 I can not use my novnc console anymore.
 
 Does anyone have a fix for this (on centos 6.6?)
 
 Thanks,
 Mario


Re: [ovirt-users] VMs freezing during heals

2015-04-06 Thread Darrell Budic
I haven’t revisited it yet, but it is possible to use cgroups to limit 
glusterfs’s cpu usage, which might help you out.

Andrew Klau has a blog post about it: 
http://www.andrewklau.com/controlling-glusterfsd-cpu-outbreaks-with-cgroups/

Be careful about how far you throttle it down: if it’s your VM’s disk that it’s 
rebuilding, I’d expect you’ll pause the VM anyway.

 On Apr 4, 2015, at 8:57 AM, Jorick Astrego j.astr...@netbulae.eu wrote:
 
 
 
 On 04/03/2015 10:04 PM, Alastair Neil wrote:
 Any follow up on this?
 
  Are there known issues using a replica 3 gluster datastore with lvm thin 
 provisioned bricks?
 
 On 20 March 2015 at 15:22, Alastair Neil ajneil.t...@gmail.com wrote:
 CentOS 6.6
  
  vdsm-4.16.10-8.gitc937927.el6
 glusterfs-3.6.2-1.el6
 2.6.32 - 504.8.1.el6.x86_64
 
 moved to 3.6 specifically to get the snapshotting feature, hence my desire 
 to migrate to thinly provisioned lvm bricks.
 
 
 Well on the glusterfs mailinglist there have been discussions:
 
 
 3.6.2 is a major release and introduces some new features in cluster wide 
 concept. Additionally it is not stable yet.
 
 
 
 
 
 
 On 20 March 2015 at 14:57, Darrell Budic bu...@onholyground.com wrote:
 What version of gluster are you running on these?
 
 I’ve seen high load during heals bounce my hosted engine around due to 
 overall system load, but never pause anything else. Cent 7 combo 
 storage/host systems, gluster 3.5.2.
 
 
 On Mar 20, 2015, at 9:57 AM, Alastair Neil ajneil.t...@gmail.com wrote:
 
 Pranith
 
 I have run a pretty straightforward test.  I created a two brick 50 G 
 replica volume with normal lvm bricks, and installed two servers, one 
 centos 6.6 and one centos 7.0.  I kicked off bonnie++ on both to generate 
 some file system activity and then made the volume replica 3.  I saw no 
 issues on the servers.   
 
 Not clear if this is a sufficiently rigorous test and the Volume I have had 
 issues on is a 3TB volume  with about 2TB used.
 
 -Alastair
 
 
 On 19 March 2015 at 12:30, Alastair Neil ajneil.t...@gmail.com wrote:
 I don't think I have the resources to test it meaningfully.  I have about 
 50 vms on my primary storage domain.  I might be able to set up a small 50 
 GB volume and provision 2 or 3 vms running test loads but I'm not sure it 
 would be comparable.  I'll give it a try and let you know if I see similar 
 behaviour.
 
 On 19 March 2015 at 11:34, Pranith Kumar Karampuri pkara...@redhat.com wrote:
 Without thinly provisioned lvm.
 
 Pranith
 
 On 03/19/2015 08:01 PM, Alastair Neil wrote:
 do you mean raw partitions as bricks or simply with out thin provisioned 
 lvm?
 
 
 
 On 19 March 2015 at 00:32, Pranith Kumar Karampuri pkara...@redhat.com wrote:
 Could you let me know if you see this problem without lvm as well?
 
 Pranith
 
 On 03/18/2015 08:25 PM, Alastair Neil wrote:
 I am in the process of replacing the bricks with thinly provisioned lvs 
 yes.
 
 
 
 On 18 March 2015 at 09:35, Pranith Kumar Karampuri pkara...@redhat.com wrote:
 hi,
   Are you using thin-lvm based backend on which the bricks are 
 created?
 
 Pranith
 
 On 03/18/2015 02:05 AM, Alastair Neil wrote:
 I have an oVirt cluster with 6 VM hosts and 4 gluster nodes. There are 
 two virtualisation clusters, one with two Nehalem nodes and one with 
 four Sandy Bridge nodes. My master storage domain is a GlusterFS domain backed 
 by a replica 3 gluster volume from 3 of the gluster nodes.  The engine 
 is a hosted engine 3.5.1 on 3 of the Sandy Bridge nodes, with storage 
 provided by nfs from a different gluster volume.  All the hosts are 
 CentOS 6.6.
 
  vdsm-4.16.10-8.gitc937927.el6
 glusterfs-3.6.2-1.el6
 2.6.32 - 504.8.1.el6.x86_64
 
 Problems happen when I try to add a new brick or replace a brick: 
 eventually the self heal will kill the VMs. In the VMs' logs I see 
 kernel hung task messages.
 
 Mar 12 23:05:16 static1 kernel: INFO: task nginx:1736 blocked for more 
 than 120 seconds.
 Mar 12 23:05:16 static1 kernel:  Not tainted 
 2.6.32-504.3.3.el6.x86_64 #1
 Mar 12 23:05:16 static1 kernel: "echo 0 > 
 /proc/sys/kernel/hung_task_timeout_secs" disables this message.
 Mar 12 23:05:16 static1 kernel: nginx D 0001 0  
 1736   1735 0x0080
 Mar 12 23:05:16 static1 kernel: 8800778b17a8 0082 
  000126c0
 Mar 12 23:05:16 static1 kernel: 88007e5c6500 880037170080 
 0006ce5c85bd9185 88007e5c64d0
 Mar 12 23:05:16 static1 kernel: 88007a614ae0 0001722b64ba 
 88007a615098 8800778b1fd8
 Mar 12 23:05:16 static1 kernel: Call Trace:
 Mar 12 23:05:16 static1 kernel: [8152a885] 
 schedule_timeout+0x215/0x2e0
 Mar 12 23:05:16 static1 kernel: [8152a503] 
 wait_for_common+0x123/0x180
 Mar 12 23:05:16 static1 kernel: [81064b90

[ovirt-users] gentoo guest support?

2015-05-20 Thread Darrell Budic
Just checking to see if anyone’s gotten ovirt-guest-agent to work on a gentoo 
based guest?

  -Darrell



Re: [ovirt-users] Confused / Hosted-Engine won't start

2015-07-02 Thread Darrell Budic
Looks normal, hosted engine uses some extra files/leases to track some of its 
stuff.

Looks like you might have a gluster problem though, that IO error appears to be 
on your hosted engine’s disk image. Check for split brains and try to initiate 
a heal on the files, see what you get.

  -Darrell

 On Jul 2, 2015, at 11:33 AM, Christopher Young mexigaba...@gmail.com wrote:
 
 If anyone has an experience of the various IDs in hosted-engine.conf
 and vm.conf for the Hosted Engine, I believe I need to just verify
 everything.  I tried a couple of changes, but I feel like I'm just
 making this worse, so I've reverted them.
 
 One thing I do not understand well is how a gluster-based (NFS) storage
 domain for the hosted-engine has so many entries:
 
 -
 
 [root@orldc-dev-vnode02 ovirt-gluster-nfs:_engine]# find . -type f | xargs ls -lah
 ls: cannot access ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/5ead7b5d-50e8-4d6c-a0e5-bbe6d93dd836/eeb2d821-a432-4df6-8856-fdb14df260e3: Input/output error
 -rw-rw. 1 vdsm kvm  1.0M Jul  2 11:20 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/dom_md/ids
 -rw-rw. 1 vdsm kvm   16M Jul  1 19:54 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/dom_md/inbox
 -rw-rw. 1 vdsm kvm  2.0M Jul  1 19:50 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/dom_md/leases
 -rw-r--r--. 1 vdsm kvm   482 Jul  1 19:49 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/dom_md/metadata
 -rw-rw. 1 vdsm kvm   16M Jul  1 19:49 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/dom_md/outbox
 -rw-rw. 1 vdsm kvm  1.0M Jul  2 11:32 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/1d80a60c-8f26-4448-9460-2c7b00ff75bf/d9bacbf6-c2f4-4f74-a91f-3a3a52f255bf
 -rw-rw. 1 vdsm kvm  1.0M Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/1d80a60c-8f26-4448-9460-2c7b00ff75bf/d9bacbf6-c2f4-4f74-a91f-3a3a52f255bf.lease
 -rw-r--r--. 1 vdsm kvm   284 Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/1d80a60c-8f26-4448-9460-2c7b00ff75bf/d9bacbf6-c2f4-4f74-a91f-3a3a52f255bf.meta
 -rw-rw. 1 vdsm kvm  1.0M Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/23ac8897-b0c7-41d6-a7de-19f46ed78400/dd9f373c-d161-4fa0-aab1-3cb52305dba7.lease
 -rw-r--r--. 1 vdsm kvm   283 Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/23ac8897-b0c7-41d6-a7de-19f46ed78400/dd9f373c-d161-4fa0-aab1-3cb52305dba7.meta
 -rw-rw. 1 vdsm kvm   25G Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/3278c444-d92a-4cb9-87d6-9669c6e4993e/1a4b6a5d-9c8f-4d54-91a7-3dd87377c362
 -rw-rw. 1 vdsm kvm  1.0M Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/3278c444-d92a-4cb9-87d6-9669c6e4993e/1a4b6a5d-9c8f-4d54-91a7-3dd87377c362.lease
 -rw-r--r--. 1 vdsm kvm   278 Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/3278c444-d92a-4cb9-87d6-9669c6e4993e/1a4b6a5d-9c8f-4d54-91a7-3dd87377c362.meta
 -rw-rw. 1 vdsm kvm  1.0M Jul  1 17:50 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/5ead7b5d-50e8-4d6c-a0e5-bbe6d93dd836/eeb2d821-a432-4df6-8856-fdb14df260e3.lease
 -rw-r--r--. 1 vdsm kvm   278 Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/5ead7b5d-50e8-4d6c-a0e5-bbe6d93dd836/eeb2d821-a432-4df6-8856-fdb14df260e3.meta
 -rw-rw. 1 vdsm kvm  1.0M Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/6064179f-2720-4db9-a7c4-a97e044c2238/05afaa26-95af-4226-9a00-5383d8937a83
 -rw-rw. 1 vdsm kvm  1.0M Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/6064179f-2720-4db9-a7c4-a97e044c2238/05afaa26-95af-4226-9a00-5383d8937a83.lease
 -rw-r--r--. 1 vdsm kvm   284 Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/6064179f-2720-4db9-a7c4-a97e044c2238/05afaa26-95af-4226-9a00-5383d8937a83.meta
 -rw-rw. 1 vdsm kvm 1004K Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/bb9d9a37-4f91-4973-ba9e-72ee81aed0b6/5acb27b3-62c5-46ac-8978-576a8a4a0399
 -rw-rw. 1 vdsm kvm  1.0M Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/bb9d9a37-4f91-4973-ba9e-72ee81aed0b6/5acb27b3-62c5-46ac-8978-576a8a4a0399.lease
 -rw-r--r--. 1 vdsm kvm   283 Dec 23  2014 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/images/bb9d9a37-4f91-4973-ba9e-72ee81aed0b6/5acb27b3-62c5-46ac-8978-576a8a4a0399.meta
 -rw-r--r--. 1 vdsm kvm   384 Jul  1 19:49 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/master/tasks/fef13299-0e7f-4c7a-a399-092a1235faab/fef13299-0e7f-4c7a-a399-092a1235faab.job.0
 -rw-r--r--. 1 vdsm kvm   277 Jul  1 19:49 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/master/tasks/fef13299-0e7f-4c7a-a399-092a1235faab/fef13299-0e7f-4c7a-a399-092a1235faab.recover.0
 -rw-r--r--. 1 vdsm kvm   417 Jul  1 19:49 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/master/tasks/fef13299-0e7f-4c7a-a399-092a1235faab/fef13299-0e7f-4c7a-a399-092a1235faab.recover.1
 -rw-r--r--. 1 vdsm kvm   107 Jul  1 19:49 ./4e3017eb-d062-4ad1-9df8-7057fcee412c/master/tasks/fef13299-0e7f-4c7a-a399-092a1235faab/fef13299-0e7f-4c7a-a399-092a1235faab.result
 -rw-r--r--. 1 vdsm kvm   

Re: [ovirt-users] Any way to correlate a VM disk (e.g. /dev/vda) to the vdsm ovirt disk?

2015-10-07 Thread Darrell Budic
You can also find the disk under the “Disks” tab in the web gui, selecting it 
will yield the uuid of the virtual disk as ID in the disk description panel. 

> On Oct 7, 2015, at 12:43 AM, Raz Tamir  wrote:
> 
> Hi ccox,
> you can see the disk id mapping to device if you execute 'ls -l 
> /dev/disk/by-id/' .
> Second way, and easier, is to make sure you have guest-agent installed on 
> your guest virtual machine and using rest API you can run GET command:
> GET on .../api/vms/{vm_id}/disks
> 
> You will see an attribute called "" .
> I hope that helps
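> 
> The by-id lookup mentioned above can also be scripted inside the guest. This is only a sketch: map_devices_to_ids is a name made up for the example, and it assumes a Linux guest where udev populates /dev/disk/by-id with symlinks. On virtio disks those link names usually start with "virtio-" followed by the beginning of the disk serial, which is likely a truncated form of the oVirt disk ID, but treat that as an assumption to verify on your own guest.

```python
import os


def map_devices_to_ids(by_id_dir='/dev/disk/by-id'):
    """Return {device path: [by-id link names]} by resolving symlinks."""
    mapping = {}
    for name in sorted(os.listdir(by_id_dir)):
        link = os.path.join(by_id_dir, name)
        if not os.path.islink(link):
            continue
        device = os.path.realpath(link)
        mapping.setdefault(device, []).append(name)
    return mapping
```

> Printing the items of map_devices_to_ids() inside the guest then lists each device such as /dev/vda next to its by-id names, which can be compared with the ID shown for the disk in the web gui.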
> 
> 
> 
> Thanks,
> Raz Tamir
> Red Hat Israel
> 
> On Tue, Oct 6, 2015 at 11:07 PM,  > wrote:
> I want to correlate virtual disks back to their originating storage under
> ovirt. Is there any way to do this?
> 
> e.g. (made up example)
> 
> /dev/vda
> 
> maps to ovirt disk
> 
> disk1_vm serial 978e00a3-b4c9-4962-bc4f-ffc9267acdd8
> 
> ___
> Users mailing list
> Users@ovirt.org 
> http://lists.ovirt.org/mailman/listinfo/users 
> 
> 
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Change IP Address of Ovirt Engine

2015-09-04 Thread Darrell Budic
I’m not the networking team, but I did change the IP of my engine and kept the 
same FQDN, and it seems to be fine :)


> On Sep 3, 2015, at 9:42 AM, Sandro Bonazzola  wrote:
> 
> 
> 
> On Thu, Aug 20, 2015 at 7:42 AM, Phil Gersekowski  > wrote:
> We have an operational ovirt cluster where all nodes are on 1 IP Network, and the 
> oVirt Engine is on another IP Network, and we want to change the IP Address of 
> the host of the ovirt engine so that it is on the same network as the nodes 
> that are managed.
> 
> I have not been able to find a definitive answer, but since I am NOT 
> changing the name of the ovirt engine host, from what I have read it seems 
> that all I will need to do is alter the IP Address in the DNS of the hostname 
> for the ovirt engine host (apart from plumbing an address on the new network 
> into the ovirt engine host of course).
> 
> Is this correct, or is there some configuration file on either the engine 
> host or the nodes that needs to be updated to reflect the new IP Address of 
> the engine host ?
> 
> 
> Hi, if you configured everything using FQDN only and you're changing IP 
> preserving FQDN everything should continue working.
> BTW, I suggest to wait for someone from network team to confirm
> 
>  
> 
> 
> 
> 
> -- 
> Regards,
> Phil Gersekowski
> IT Director
> http://www.aspedia.net 
> 
>  | ph...@aspedia.net 
>  
> 
> Phone: 1800 677 656
> Mobile: 0447 546 890
> Suite 1, 1 Clunies Ross Court, Eight Mile Plains QLD 4113 | Map 
> 
> 
> 
> 
> 
> 
> -- 
> Sandro Bonazzola
> Better technology. Faster innovation. Powered by community collaboration.
> See how it works at redhat.com 


Re: [ovirt-users] vdsm high mem usage

2015-09-10 Thread Darrell Budic
If you’re using nfs mounts (even if they are gluster based), it’s safe to 
restart vdsmd, you’ll see it change status in ovirt, but your VMs will continue 
running. If you’re mounting gluster based storage as glusterfs shares directly 
(not over nfs), there’s another issue that will cause all your VMs to pause and 
the only way to recover is to stop them and restart them, but that’s going to 
happen to them anyway when vdsmd runs out of ram and crashes… Best solution is 
to migrate them yourself in this case, then restart and migrate back. Or live 
migrate them to NFS mounted storage so when vdsm crashes they don’t lock up, 
and clean up after you’ve had an opportunity to upgrade or patch.

Upgrade to 3.5.3 or later at your earliest opportunity, the mem leak is 
resolved there. Sounds like you already found the patch you can apply if 
upgrading isn’t an option, but it will still require you to restart your vdsms.

  -Darrell

> On Sep 10, 2015, at 1:45 PM, Michael Kleinpaste 
>  wrote:
> 
> Hi everybody.
> 
> So I ran into that high mem usage thing. The problem I have with patching is 
> that this is a live system so I can't do it mid day.  Can anybody tell me if 
> it is possible to just restart the vdsm service or does the host have to be 
> in "maintenance mode" before restarting it?  It is using gluster storage, if 
> that makes a difference as well.
> 
> Thanks,
> 
> -- 
> Michael Kleinpaste
> Senior Systems Administrator
> SharperLending, LLC.
> www.SharperLending.com <>
> michael.kleinpa...@sharperlending.com
> (509) 324-1230   Fax: (509) 324-1234


Re: [ovirt-users] Not able to resume a VM which was paused because of gluster quorum issue

2015-09-23 Thread Darrell Budic
This is a known issue in oVirt 3.5.x and below. It’s been solved in the 
upcoming oVirt 3.6.

Related to https://bugzilla.redhat.com/show_bug.cgi?id=1172905, the fix 
involved setting up a special cgroup for the mount, but I can’t find the exact 
details atm.


> On Sep 23, 2015, at 7:38 AM, Ramesh Nachimuthu  wrote:
> 
> 
> 
> On 09/22/2015 05:57 PM, Alastair Neil wrote:
>> You need to set the cluster.server-quorum-ratio to 51%
>> 
> 
> I did that, but I am still facing the same issue. The VM gets paused when I do 
> some I/O using fio on some disks backed by gluster. I am not able to resume 
> the VM after this. Now the only way is to bring down the VM and run it again. It 
> runs successfully on the same host without any issue.
> 
> Regards,
> Ramesh
> 
>> On 22 September 2015 at 08:25, Ramesh Nachimuthu > > wrote:
>> 
>> 
>> On 09/22/2015 05:43 PM, Alastair Neil wrote:
>>> what are the gluster-quorum-type and gluster.server-quorum-ratio  settings 
>>> on the volume?
>>> 
>> 
>> cluster.server-quorum-type:server
>> cluster.quorum-type:auto
>> gluster.server-quorum-ratio is not set.
>> 
>> One brick process is purposefully killed  but remaining two bricks are up 
>> and running.
>> 
>> Regards,
>> Ramesh
>> 
>>> On 22 September 2015 at 06:24, Ramesh Nachimuthu < 
>>> rnach...@redhat.com 
>>> > wrote:
>>> Hi,
>>> 
>>>I am not able to resume a VM which was paused because of gluster client 
>>> quorum issue. Here is what happened in my setup. 
>>> 
>>> 1. Created a gluster storage domain which is backed by gluster volume with 
>>> replica 3. 
>>> 2. Killed one brick process. So only two bricks are running in replica 3 
>>> setup.
>>> 3. Created two VMs
>>> 4. Started some IO using fio on both of the VMs
>>> 5. After some time got the following error in gluster mount and VMs moved 
>>> to paused state.
>>>  " server 10.70.45.17:49217  has not 
>>> responded in the last 42 seconds, disconnecting."
>>>   "vmstore-replicate-0: e16d1e40-2b6e-4f19-977d-e099f465dfc6: Failing 
>>> WRITE as quorum is not met"
>>>   more gluster mount logs at  
>>> http://pastebin.com/UmiUQq0F 
>>> 
>>> 6. After some time gluster quorum is active and I am able to write the the 
>>> gluster file system.
>>> 7. When I try to resume the VM it doesn't work and I got following error in 
>>> vdsm log.
>>>   http://pastebin.com/aXiamY15 
>>> 
>>> 
>>> Regards,
>>> Ramesh
>>> 
>>> 
>>> ___
>>> Users mailing list
>>> Users@ovirt.org 
>>> http://lists.ovirt.org/mailman/listinfo/users 
>>> 
>>> 
>>> 
>> 
>> 
> 


Re: [ovirt-users] Managing two DCs with ovirt

2015-10-06 Thread Darrell Budic
I use method 1. One thing to consider is that the engine manages HA VMs, 
migrations, etc. It doesn’t need much bandwidth, but if it can’t talk to nodes, 
no migrations can happen, either for load balancing or in case of a node or 
storage failure.

If you have very solid networking, it’s probably fine, but I find it works 
better in my situation to run a self-hosted engine for each cluster.

  -Darrell

> On Oct 5, 2015, at 2:17 PM, wodel youchi  wrote:
> 
> Hi,
> 
> I need some help to decide which is better / feasible with ovirt to manage 
> two or more distant DCs.
> 
> Let say that we have two distant DCs to virtualize with ovirt.
> 
> we have two options to manage them:
> 
> 1- install two engines, one on each DC, the good side is, if one DC is down, 
> we can still manage the other one. the down side we will have two consoles to 
> manage.
> 
> 2- install one engine to manage the two DCs, the good side is the use of one 
> console to rule them all :-) the down side is if the DC containing the engine 
> become down, there is way to manage the other one.
> 
> is there (will be there in the future) a way for example to create a slave 
> engine in the second DC which can takeover and let the admin to manage the 
> second DC?
> 
> thanks in advance


Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to 6.2.6-1.el7

2016-02-04 Thread Darrell Budic
I suspect that’s my problem, my database isn’t owned by engine:

engine=# \l
                             List of databases
  Name  |  Owner   | Encoding |  Collation  |    Ctype    | Access privileges
--------+----------+----------+-------------+-------------+-------------------
 engine | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 |

This is almost certainly my fault though, one too many upgrades from old 
versions and/or a slightly botched restore process. Would I be better off 
setting “engine” as the owner of the engine database then (thinking forward 
here)?

  -Darrell


> On Feb 4, 2016, at 10:22 AM, Martin Perina <mper...@redhat.com> wrote:
> 
> 
> 
> - Original Message -
>> From: "Eli Mesika" <emes...@redhat.com>
>> To: "Martin Perina" <mper...@redhat.com>
>> Cc: "Marcelo Leandro" <marcelol...@gmail.com>, "Darrell Budic" 
>> <bu...@onholyground.com>, "users" <users@ovirt.org>
>> Sent: Thursday, February 4, 2016 5:17:24 PM
>> Subject: Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to 6.2.6-1.el7
>> 
>> 
>> 
>> - Original Message -
>>> From: "Martin Perina" <mper...@redhat.com>
>>> To: "Marcelo Leandro" <marcelol...@gmail.com>
>>> Cc: "Darrell Budic" <bu...@onholyground.com>, "Eli Mesika"
>>> <emes...@redhat.com>, "users" <users@ovirt.org>
>>> Sent: Thursday, February 4, 2016 6:12:34 PM
>>> Subject: Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to
>>> 6.2.6-1.el7
>>> 
>>> Hi,
>>> 
>>> so it seems, that for some strange reason, user 'engine' cannot create
>>> schema in 'engine' database although it should be an owner of this db.
>>> I double checked that on all our testing databases this works fine and
>>> also if you created engine db according to doc (either automatically
>>> by engine-setup or manually in case of remote db) this should work.
>>> 
>>> So you can try to execute following command which adds proper
>>> permissions to 'engine' db user (please adapt according to you setup):
>>> 
>>> 
>>> su - postgres -c "psql --command=\"GRANT ALL ON DATABASE
>>> @ENGINE_DB_DATABASE@ TO @ENGINE_DB_USER@;\""
>>> 
>>> The correct output should be:
>>> 
>>> GRANT
>>> 
>>> If it's successful, please execute engine-setup again to verify that
>>> above step fixed the issue.
>> 
>> Martin, should that be added to engine-setup ?
> 
> I don't this so, because for all other users I know of this works fine
> (if engine db is created according to doc, engine db user is an owner
> of db so he can create new schemas).
> 
> But let's wait for reply ...
> 
>> 
>>> 
>>> Thanks
>>> 
>>> Martin Perina
>>> 
>>> - Original Message -
>>>> From: "Marcelo Leandro" <marcelol...@gmail.com>
>>>> To: "Martin Perina" <mper...@redhat.com>
>>>> Sent: Thursday, February 4, 2016 1:40:36 PM
>>>> Subject: Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to
>>>> 6.2.6-1.el7
>>>> 
>>>> Hello Martin,
>>>> 
>>>> Indeed there´s an issue here. Runnig your command, I got this error:
>>>> ERROR:  permission denied for database engine ,  I'm investigating
>>>> deeper
>>>> 
>>>> Thanks
>>>> 
>>>> 2016-02-04 6:18 GMT-03:00 Martin Perina <mper...@redhat.com>:
>>>>> Hi,
>>>>> 
>>>>> I just verified that upgrades on both Centos 6.7 and Centos 7.2 works
>>>>> fine,
>>>>> so there's something bad with psql on you machines :-(
>>>>> 
>>>>> Could you please execute following steps and send me result?
>>>>> 
>>>>> 1. Please take a look at your engine db configuration in
>>>>>   /etc/ovirt-engine/engine.conf.d/10-setup-database.conf
>>>>> 
>>>>> 2. Please replace variables (@VAR@) with real values according to your
>>>>>   db setup and execute following command on engine host (or VM in
>>>>>   case of hosted engine):
>>>>> 
>>>>>   LC_ALL="C" PGPASSWORD="@ENGINE_DB_PASSWORD@" psql -w
>>>>>   --pset=tuples_only=on --host="@ENGINE_DB_HOST@"
>>>>>   --po

Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to 6.2.6-1.el7

2016-02-04 Thread Darrell Budic
After changing the owner of engine to “engine”, I was able to upgrade normally, 
so that looks like it was my problem. 

Thanks for the pointers!
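
For anyone landing here with the same symptom, a minimal sketch of the check
and one way to apply the fix, not necessarily the exact steps used here. It
assumes the database and the role are both named engine, as in this thread;
the helper names and the sample text are made up for illustration. ALTER
DATABASE ... OWNER TO is standard PostgreSQL and has to be run as a superuser
such as postgres.

```python
def parse_db_owner(psql_list_output, db_name):
    """Return the owner of db_name from `psql -l` style output, or None."""
    for line in psql_list_output.splitlines():
        cells = [c.strip() for c in line.split("|")]
        if len(cells) >= 2 and cells[0] == db_name:
            return cells[1]
    return None


def owner_fix_sql(db_name, role):
    # Standard PostgreSQL statement; run it as a superuser (e.g. postgres).
    return 'ALTER DATABASE "{0}" OWNER TO "{1}";'.format(db_name, role)


# Illustrative listing shaped like the one posted in this thread.
SAMPLE = """
  Name  |  Owner   | Encoding |  Collation  |    Ctype    | Access privileges
--------+----------+----------+-------------+-------------+-------------------
 engine | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 |
"""

owner = parse_db_owner(SAMPLE, "engine")
if owner != "engine":
    # Only print the statement; applying it is left to the admin.
    print(owner_fix_sql("engine", "engine"))
```

The sketch only prints the statement, so it can be reviewed before being fed
to psql as the postgres user; re-run engine-setup afterwards.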


> On Feb 4, 2016, at 3:40 PM, Darrell Budic <bu...@onholyground.com> wrote:
> 
> I suspect that’s my problem, my database isn’t owned by engine:
> 
> engine=# \l
>                              List of databases
>   Name  |  Owner   | Encoding |  Collation  |    Ctype    | Access privileges
> --------+----------+----------+-------------+-------------+-------------------
>  engine | postgres | UTF8     | en_US.UTF-8 | en_US.UTF-8 |
> 
> This is almost certainly my fault though, one too many upgrades from old 
> versions and/or a slightly botched restore process. Would I be better off 
> setting “engine” as the owner of the engine database then (thinking forward 
> here)?
> 
>  -Darrell
> 
> 
>> On Feb 4, 2016, at 10:22 AM, Martin Perina <mper...@redhat.com> wrote:
>> 
>> 
>> 
>> - Original Message -
>>> From: "Eli Mesika" <emes...@redhat.com>
>>> To: "Martin Perina" <mper...@redhat.com>
>>> Cc: "Marcelo Leandro" <marcelol...@gmail.com>, "Darrell Budic" 
>>> <bu...@onholyground.com>, "users" <users@ovirt.org>
>>> Sent: Thursday, February 4, 2016 5:17:24 PM
>>> Subject: Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to 6.2.6-1.el7
>>> 
>>> 
>>> 
>>> - Original Message -
>>>> From: "Martin Perina" <mper...@redhat.com>
>>>> To: "Marcelo Leandro" <marcelol...@gmail.com>
>>>> Cc: "Darrell Budic" <bu...@onholyground.com>, "Eli Mesika"
>>>> <emes...@redhat.com>, "users" <users@ovirt.org>
>>>> Sent: Thursday, February 4, 2016 6:12:34 PM
>>>> Subject: Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to
>>>> 6.2.6-1.el7
>>>> 
>>>> Hi,
>>>> 
>>>> so it seems, that for some strange reason, user 'engine' cannot create
>>>> schema in 'engine' database although it should be an owner of this db.
>>>> I double checked that on all our testing databases this works fine and
>>>> also if you created engine db according to doc (either automatically
>>>> by engine-setup or manually in case of remote db) this should work.
>>>> 
>>>> So you can try to execute following command which adds proper
>>>> permissions to 'engine' db user (please adapt according to you setup):
>>>> 
>>>> 
>>>> su - postgres -c "psql --command=\"GRANT ALL ON DATABASE
>>>> @ENGINE_DB_DATABASE@ TO @ENGINE_DB_USER@;\""
>>>> 
>>>> The correct output should be:
>>>> 
>>>> GRANT
>>>> 
>>>> If it's successful, please execute engine-setup again to verify that
>>>> above step fixed the issue.
>>> 
>>> Martin, should that be added to engine-setup ?
>> 
>> I don't this so, because for all other users I know of this works fine
>> (if engine db is created according to doc, engine db user is an owner
>> of db so he can create new schemas).
>> 
>> But let's wait for reply ...
>> 
>>> 
>>>> 
>>>> Thanks
>>>> 
>>>> Martin Perina
>>>> 
>>>> - Original Message -
>>>>> From: "Marcelo Leandro" <marcelol...@gmail.com>
>>>>> To: "Martin Perina" <mper...@redhat.com>
>>>>> Sent: Thursday, February 4, 2016 1:40:36 PM
>>>>> Subject: Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to
>>>>> 6.2.6-1.el7
>>>>> 
>>>>> Hello Martin,
>>>>> 
>>>>> Indeed there´s an issue here. Runnig your command, I got this error:
>>>>> ERROR:  permission denied for database engine ,  I'm investigating
>>>>> deeper
>>>>> 
>>>>> Thanks
>>>>> 
>>>>> 2016-02-04 6:18 GMT-03:00 Martin Perina <mper...@redhat.com>:
>>>>>> Hi,
>>>>>> 
>>>>>> I just verified that upgrades on both Centos 6.7 and Centos 7.2 works
>>>>>> fine,
>>>>>> so there's something bad with psql on you machines :-(
>>>>>> 
>>>>>> Could you please execute following steps and send me result?
>>>&g

Re: [ovirt-users] Dumb question: exclamation mark next to VM?

2016-02-05 Thread Darrell Budic
After upgrading to 3.6.2, I’ve got a couple that are doing this too (no actual 
tooltip for the exclamation point). One Windows, two Linux; funny thing is they 
are all down at the moment and still have this warning…

  -Darrell

> On Feb 5, 2016, at 4:58 PM, Chris Adams  wrote:
> 
> Once upon a time, Joe DiTommasso  said:
>> If you mouse over the exclamation mark, you should get a tooltip that tells
>> you what it's complaining about. I've got it on pretty much all my VMs,
>> it's an issue with the timezone for me.
> 
> I get nothing for the exclamation mark.  I go straight from the "Up" tip
> to the "Server" tip.  The ! is in the first column with the status icon
> (if you widen the columns it stay next to the up arror).
> 
> -- 
> Chris Adams 


Re: [ovirt-users] Problem update ovirt 3.5.6.2-1.el7 to 6.2.6-1.el7

2016-02-03 Thread Darrell Budic
I just hit a similar problem, but on Centos 6.7, postgres (PostgreSQL) 8.4.20. 
Looks like a permission error for me?

2016-02-03 22:52:42 DEBUG otopi.context context._executeMethod:142 Stage misc ME
THOD otopi.plugins.ovirt_engine_setup.ovirt_engine.config.aaajdbc.Plugin._misc
2016-02-03 22:52:42 INFO otopi.plugins.ovirt_engine_setup.ovirt_engine.config.aa
ajdbc aaajdbc._setupSchema:97 Creating/refreshing Engine 'internal' domain datab
ase schema
2016-02-03 22:52:42 DEBUG otopi.plugins.ovirt_engine_setup.ovirt_engine.config.a
aajdbc plugin.executeRaw:828 execute: ['/usr/share/ovirt-engine-extension-aaa-jd
bc/dbscripts/schema.sh', '-s', 'localhost', '-p', '5432', '-u', 'engine', '-d', 
'engine', '-e', 'aaa_jdbc', '-l', '/var/log/ovirt-engine/setup/ovirt-engine-setu
p-20160203224629-uy91s0.log', '-c', 'apply'], executable='None', cwd='None', env
={'LESSOPEN': '||/usr/bin/lesspipe.sh %s', 'LOGNAME': 'root', 'USER': 'root', 'P
ATH': '/sbin:/bin:/usr/sbin:/usr/bin', 'HOME': '/root', 'LANG': 'en_US.UTF-8', '
TERM': 'xterm-color', 'SHELL': '/bin/bash', 'SHLVL': '1', 'G_BROKEN_FILENAMES': 
'1', 'HISTSIZE': '1000', 'SUDO_USER': 'telsin', 'USERNAME': 'root', 'PYTHONPATH'
: '/usr/share/ovirt-engine/setup/bin/..::', 'SUDO_UID': '69961', 'KRB5CCNAME
': 'FILE:/tmp/krb5cc_69961_5oRWEw', 'MAIL': '/var/spool/mail/telsin', 'DBFUN
C_DB_PGPASSFILE': '/tmp/tmpqzrQIw', 'SUDO_COMMAND': '/bin/bash', 'SUDO_GID': '69
961', 'HOSTNAME': 'ovirt.int.ohgnetworks.com', 'PWD': '/home/telsin', 'OTOPI
_LOGFILE': '/var/log/ovirt-engine/setup/ovirt-engine-setup-20160203224629-uy91s0
.log', 'LS_COLORS': 'rs=0:di=01;34:ln=01;36:mh=00:pi=40;33:so=01;35:do=01;35:bd=
40;33;01:cd=40;33;01:or=40;31;01:mi=01;05;37;41:su=37;41:sg=30;43:ca=30;41:tw=30
;42:ow=34;42:st=37;44:ex=01;32:*.tar=01;31:*.tgz=01;31:*.arj=01;31:*.taz=01;31:*
.lzh=01;31:*.lzma=01;31:*.tlz=01;31:*.txz=01;31:*.zip=01;31:*.z=01;31:*.Z=01;31:
*.dz=01;31:*.gz=01;31:*.lz=01;31:*.xz=01;31:*.bz2=01;31:*.tbz=01;31:*.tbz2=01;31
:*.bz=01;31:*.tz=01;31:*.deb=01;31:*.rpm=01;31:*.jar=01;31:*.rar=01;31:*.ace=01;
31:*.zoo=01;31:*.cpio=01;31:*.7z=01;31:*.rz=01;31:*.jpg=01;35:*.jpeg=01;35:*.gif
=01;35:*.bmp=01;35:*.pbm=01;35:*.pgm=01;35:*.ppm=01;35:*.tga=01;35:*.xbm=01;35:*
.xpm=01;35:*.tif=01;35:*.tiff=01;35:*.png=01;35:*.svg=01;35:*.svgz=01;35:*.mng=0
1;35:*.pcx=01;35:*.mov=01;35:*.mpg=01;35:*.mpeg=01;35:*.m2v=01;35:*.mkv=01;35:*.
ogm=01;35:*.mp4=01;35:*.m4v=01;35:*.mp4v=01;35:*.vob=01;35:*.qt=01;35:*.nuv=01;3
5:*.wmv=01;35:*.asf=01;35:*.rm=01;35:*.rmvb=01;35:*.flc=01;35:*.avi=01;35:*.fli=
01;35:*.flv=01;35:*.gl=01;35:*.dl=01;35:*.xcf=01;35:*.xwd=01;35:*.yuv=01;35:*.cg
m=01;35:*.emf=01;35:*.axv=01;35:*.anx=01;35:*.ogv=01;35:*.ogx=01;35:*.aac=01;36:
*.au=01;36:*.flac=01;36:*.mid=01;36:*.midi=01;36:*.mka=01;36:*.mp3=01;36:*.mpc=0
1;36:*.ogg=01;36:*.ra=01;36:*.wav=01;36:*.axa=01;36:*.oga=01;36:*.spx=01;36:*.xs
pf=01;36:', 'OTOPI_EXECDIR': '/home/telsin'}
* QUERY **
select 1;
**

1

* QUERY **
copy (
SELECT COUNT(schema_name)
FROM information_schema.schemata
WHERE schema_name = 'aaa_jdbc'
) to stdout with delimiter as '|';
**

* QUERY **
CREATE SCHEMA aaa_jdbc AUTHORIZATION engine;
**

2016-02-03 22:52:43 DEBUG otopi.plugins.ovirt_engine_setup.ovirt_engine.config.a
aajdbc plugin.executeRaw:878 execute-result: ['/usr/share/ovirt-engine-extension
-aaa-jdbc/dbscripts/schema.sh', '-s', 'localhost', '-p', '5432', '-u', 'engine',
 '-d', 'engine', '-e', 'aaa_jdbc', '-l', '/var/log/ovirt-engine/setup/ovirt-engi
ne-setup-20160203224629-uy91s0.log', '-c', 'apply'], rc=1
2016-02-03 22:52:43 DEBUG otopi.plugins.ovirt_engine_setup.ovirt_engine.config.a
aajdbc plugin.execute:936 execute-output: ['/usr/share/ovirt-engine-extension-aa
a-jdbc/dbscripts/schema.sh', '-s', 'localhost', '-p', '5432', '-u', 'engine', '-
d', 'engine', '-e', 'aaa_jdbc', '-l', '/var/log/ovirt-engine/setup/ovirt-engine-
setup-20160203224629-uy91s0.log', '-c', 'apply'] stdout:
Creating custom schema aaa_jdbc ...

2016-02-03 22:52:43 DEBUG otopi.plugins.ovirt_engine_setup.ovirt_engine.config.a
aajdbc plugin.execute:941 execute-output: ['/usr/share/ovirt-engine-extension-aa
a-jdbc/dbscripts/schema.sh', '-s', 'localhost', '-p', '5432', '-u', 'engine', '-
d', 'engine', '-e', 'aaa_jdbc', '-l', '/var/log/ovirt-engine/setup/ovirt-engine-
setup-20160203224629-uy91s0.log', '-c', 'apply'] stderr:
ERROR:  permission denied for database engine
FATAL: Cannot execute sql command: --no-psqlrc --command=CREATE SCHEMA aaa_jdbc 
AUTHORIZATION engine;

2016-02-03 22:52:43 DEBUG otopi.context context._executeMethod:156 method except
ion
Traceback (most recent call last):
  File "/usr/lib/python2.6/site-packages/otopi/context.py", line 146, in _execut
eMethod
method['method']()
  File 

[ovirt-users] upgrading from 3.6 -> 4.1, vm restarts at 4.0 mandatory?

2017-02-24 Thread Darrell Budic
I’m upgrading my main cluster from 3.6 to 4.1, and I’m currently at 4.0. I’ve 
upgraded the cluster and datacenter compatibility versions to 4.0, and now all 
my VMs are pending restart to update their configs to 4.0.

My question is “Do I need to do this here, or can I go ahead and update the 
engine and host nodes to 4.1, update compatibility to 4.1, and then restart all 
the VMs to get them on 4.1”? Or is that unsafe, will I screw them up if I go to 
4.1 compatibility in this state?

Thanks,

  -Darrell



Re: [ovirt-users] gpu passthrough

2017-02-21 Thread Darrell Budic
I think you need to click the “down arrow” under the top box, should move the 
GPU into the “Host Devices to be attached” box, then hit Ok to make the 
assignment.

> On Feb 21, 2017, at 3:26 AM, qinglong.d...@horebdata.cn wrote:
> 
> Hi, all:
> I want to assign the gpu card of one host to the vm which is running 
> in the host.
> 
> After I click "OK", I got nothing. Anyone can help? Thanks!


Re: [ovirt-users] upgrading from 3.6 -> 4.1, vm restarts at 4.0 mandatory?

2017-02-26 Thread Darrell Budic
Not really, not when upgrading to 4.0 at any rate. Take chapter 4 in that guide:

http://www.ovirt.org/documentation/upgrade-guide/chap-Post-Upgrade_Tasks/ 

When you perform these steps, you are advised that you now need to restart the 
VMs to enable the new 4.0 compatible configurations.

As there is no 4.1 upgrade guide, I’m left to interpret Chapter 1 to imply that 
you need to be at 4.0 to go to 4.1. But does that mean just having the Cluster 
and DC levels at 4.0, or also having your VMs running at 4.0? This wasn’t 
applicable to 3.6, so there’s no parallel here. Basically, I’m concerned that a 
similar procedure for setting Cluster & DC compatibility to 4.1 will not 
properly handle VMs still running with 3.6 configs, and hoping someone has 
concrete knowledge of this step that can chime in.

FYI, the breadcrumbs on the web site don’t actually work, they give you a page 
with the right number of list items for what you’re reading, but no actual 
links to the chapters (the href is present, but no link text exists).

  -Darrell

> On Feb 26, 2017, at 1:10 AM, Fred Rolland <froll...@redhat.com> wrote:
> 
> Hi,
> 
> Restart the VMs is not part of the upgrade procedure.
> You can check the upgrade guide :
> http://www.ovirt.org/documentation/upgrade-guide/chap-Updating_the_oVirt_Environment/
>  
> <http://www.ovirt.org/documentation/upgrade-guide/chap-Updating_the_oVirt_Environment/>
> 
> Regards,
> Fred
> 
> On Fri, Feb 24, 2017 at 10:44 PM, Darrell Budic <bu...@onholyground.com 
> <mailto:bu...@onholyground.com>> wrote:
> I’m upgrading my main cluster from 3.6 to 4.1, and I’m currently at 4.0. I’ve 
> upgraded the cluster and datacenter compatibility versions to 4.0, and now 
> all my VMs are pending restart to update their configs to 4.0.
> 
> My question is “Do I need to do this here, or can I go ahead and update the 
> engine and host nodes to 4.1, update compatibility to 4.1, and then restart 
> all the VMs to get them on 4.1”? Or is that unsafe, will I screw them up if I 
> go to 4.1 compatibility in this state?
> 
> Thanks,
> 
>   -Darrell
> 


Re: [ovirt-users] ovirtmgmt manual bridge cannot be used in ovirt 4.0

2016-11-28 Thread Darrell Budic
I’m having trouble with that plan and I’m not even trying to put the ovirtmgmt 
bridge on an existing bond. I have a pre-existing gluster setup with gluster 
running on a bonded interface. The ovirtmgmt should go on a different 
interface, but even that fails on the bond with the “torn down manually” 
statement. Haven’t had much time to do more troubleshooting, but it’s rather 
annoying. 

I’d really prefer it if the installer would just deal with existing network 
setups; I (presumably) know what I want there, especially in the case of 
pre-existing gluster setups.

  -Darrell
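
The traceback quoted further down ends at the line "iface, = host_ports" in
netinfo/qos.py, which unpacks exactly one element. A minimal reproduction of
that pattern, assuming host_ports is the list of NICs attached directly to the
bridge (the function name and NIC names here are hypothetical, this is not the
vdsm code itself):

```python
def single_port(host_ports):
    # Same pattern as the quoted qos.py line: unpack exactly one element.
    iface, = host_ports
    return iface


# One NIC (or one bond) under the bridge: the unpack succeeds.
print(single_port(["eth0"]))

# Two NICs directly under the bridge: the unpack raises ValueError.
try:
    single_port(["eth0", "eth1"])
except ValueError as exc:
    print("ValueError:", exc)
```

If that reading is right, a bridge over a single bond device would pass this
check while a bridge with two NICs enslaved directly would not, which matches
the bridge-or-bond question asked in the thread.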

> On Nov 28, 2016, at 9:11 AM, Charles Kozler  wrote:
> 
> What happens when you configure the bond and then build the bridge manually 
> over the bond? oVirt installer should skip over it and not do anything. Just 
> make sure you have DEFROUTE set or routes configuration file as you expect 
> (this is what used to screw me up)
> 
> On Mon, Nov 28, 2016 at 10:06 AM,  > wrote:
> Thanks for your responses but the ui is not an option for me as i am dealing 
> with loads of systems.
> in 3.5 ovirt used to just accept the bridge as it was and incorporate it, i 
> am just wondering if i am facing a bug or a feature at the moment.
> 
> 
> Charles Kozler schreef op 2016-11-28 15:48:
> Thats what I used to do as well then on oVirt 4 it started screwing
> with the the bond as well so I ended up just dumbing it down and
> figured using the UI after the fact would be OK. I cant remember
> exactly what would happen but it would be stupid little things like
> routing would break or something. 
> 
> On Mon, Nov 28, 2016 at 9:43 AM, Simone Tiraboschi
>  [8]> wrote:
> 
> On Mon, Nov 28, 2016 at 3:42 PM, Charles Kozler
>  [7]> wrote:
> 
> What Ive been doing since oVirt 4 is just configuring one NIC
> manually when I provision the server (eg: eth0, em1, etc) and then
> let oVirt do the bridge setup. Once the engine is up I login to
> the UI and I use it to bond the NICs in whatever fashion I need
> (LACP or active-backup). Any time I tried to configure ovirtmgmt
> manually it seemed to "annoy" the hosted-engine --deploy script
> 
> This is fine.
> Another thing you could do is manually creating the bond and then
> having hosted-engine-setup creating the management bridge over your
> bond.
> 
>  
> 
> On Mon, Nov 28, 2016 at 9:33 AM, Simone Tiraboschi
>  [6]> wrote:
> 
> On Mon, Nov 28, 2016 at 12:24 PM,   [3]>
> wrote:
> 
> Hi All,
> 
> In our ovirt 3.5 setup. i have always setup the ovirtmgmt
> bridge manually .
> The bridge consisted of 2 nics
> 
> Id have /etc/vdsm/vdsm.conf list net_persist = ifcfg
> 
> 
> When i then deployed the host from the ovirt ui or api it
> would install and would display the network setup correctly in
> the ui.
> 
> On ovirt 4. (vdsm-4.18.15.3-1.el7.centos.x86_64)
> I seem unable to follow the same approach.
> 
> In the engine logs i get among other things
> 
> If the interface ovirtmgmt is a bridge, it should be
> torn-down manually.
> 
> the interface is indeed a bridge with two nics which i would
> like to keep this way.
> 
> On the host vdsm.log i get limited info,
> 
> when start a python terminal to obtain netinfo i get this
> 
> from vdsm.tool import unified_persistence
> unified_persistence.netswitch.netinfo()
> Traceback (most recent call last):
>   File "", line 1, in 
>   File
> "/usr/lib/python2.7/site-packages/vdsm/network/netswitch.py",
> line 298, in netinfo
> _netinfo = netinfo_get(compatibility=compatibility)
>   File
> 
> 
> "/usr/lib/python2.7/site-packages/vdsm/network/netinfo/cache.py",
> line 109, in get
> return _get(vdsmnets)
>   File
> 
> 
> "/usr/lib/python2.7/site-packages/vdsm/network/netinfo/cache.py",
> line 101, in _get
> report_network_qos(networking)
>   File
> 
> "/usr/lib/python2.7/site-packages/vdsm/network/netinfo/qos.py",
> line 46, in report_network_qos
> iface, = host_ports
> ValueError: too many values to unpack
> 
> As it appears the line in question does not like to deal with
> a list of nics i think.
> but either way.
> 
> Is in ovirt 4 the ability to use the ovirtmgmt bridge with
> multiple nics removed?
> 
> But do you need a bridge or a bond?
>  
> 
> If so what can i do to stick to what we have done in the past.
> 
> Thanks.
> 
> ___
> Users mailing list
> Users@ovirt.org  [1]
> http://lists.ovirt.org/mailman/listinfo/users 
>  [2]
> 
> ___
> Users mailing list
> Users@ovirt.org  [4]
> http://lists.ovirt.org/mailman/listinfo/users 
>  [5]
> 
> 
> 
> 

Re: [ovirt-users] Request for feedback on your db vacuum status

2016-12-13 Thread Darrell Budic
Whups, bad reply-to, no problem copying the list.

It's a self-hosted system, currently with two clusters and 9 active hosts. 74 
VMs, yes. It’s had a few more clusters over time, and a few more hosts, 
including some removed and re-added when upgrading from CentOS 6 to 7. But only 
9 at the moment. One cluster of 6 with most of the VMs, using external Gluster 
servers not managed by oVirt, and one cluster of 3 which are also Gluster 
servers as well as hypervisors. 7 VMs currently on the smaller cluster.
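
For the "review it before pasting" step in the request quoted below, a rough
sketch that boils a vacuum verbose log down to the tables touched and the
number of index row versions removed. The line patterns are taken from the
sample snippet in that request; the function name is made up, and real logs
may contain other DETAIL lines this does not count.

```python
import re

# Matches lines such as: INFO:  vacuuming "pg_catalog.pg_ts_template"
VACUUM_RE = re.compile(r'^INFO:\s+vacuuming "([^"]+)"')
# Matches lines such as: DETAIL:  0 index row versions were removed.
REMOVED_RE = re.compile(r'^DETAIL:\s+(\d+) index row versions were removed')


def summarize(log_text):
    """Return (tables vacuumed, total index row versions removed)."""
    tables, removed = [], 0
    for line in log_text.splitlines():
        m = VACUUM_RE.match(line)
        if m:
            tables.append(m.group(1))
            continue
        m = REMOVED_RE.match(line)
        if m:
            removed += int(m.group(1))
    return tables, removed


# The snippet from the original request, used as sample input.
SAMPLE = '''INFO:  vacuuming "pg_catalog.pg_ts_template"
INFO:  index "pg_ts_template_tmplname_index" now contains 5 row versions in 2 pages
DETAIL:  0 index row versions were removed.
0 index pages have been deleted, 0 are currently reusable.
CPU 0.00s/0.00u sec elapsed 0.00 sec.
'''

print(summarize(SAMPLE))
```

To run it against a real log, read /tmp/vacuum.log into a string and pass it
to summarize() in place of SAMPLE.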


> On Dec 13, 2016, at 3:36 AM, Roy Golan <rgo...@redhat.com> wrote:
> On 12 December 2016 at 20:31, Darrell Budic <bu...@onholyground.com 
> <mailto:bu...@onholyground.com>> wrote:
> Here’s mine: http://paste.fedoraproject.org/505443/14815674/ 
> <http://paste.fedoraproject.org/505443/14815674/>
> 
> This engine has been around since at 3.3, maybe 3.2, currently on 3.6 because 
> I haven’t had time to arrange the OS upgrade from centos 6 to 7 for the 
> engine host yet.
> 
> 
> Thank you very much Darrell! your vacuum seems boring (good!) and the db 
> seems healthy.  Can you reply to the list that you sent the feedback so 
> everyone will have a chance to look at it? also how big is your setup? 
> judging by the output is it 74 vms and 42 hosts?  
> 
>> On Dec 8, 2016, at 8:18 AM, Roy Golan <rgo...@redhat.com 
>> <mailto:rgo...@redhat.com>> wrote:
>> 
>> Hi all,
>> 
>> Following the thread about vacuum tool [1] I would like to gather some 
>> feedback about your deployment's db vacuum status The info is completely 
>> anonymous and function running it is a read only reporting one and should 
>> have little or no effect on the db.
>> 
>> The result can be pretty verbose  but again will not disclose sensitive 
>> info. Anyway review it before pasting it. It should look something like 
>> that(a snippet of one table):
>> 
>> INFO:  vacuuming "pg_catalog.pg_ts_template"
>> INFO:  index "pg_ts_template_tmplname_index" now contains 5 row versions in 
>> 2 pages
>> DETAIL:  0 index row versions were removed.
>> 0 index pages have been deleted, 0 are currently reusable.
>> CPU 0.00s/0.00u sec elapsed 0.00 sec.
>> 
>> 
>> 1. sudo su - postgres  -c "psql engine -c 'vacuum verbose'" &> 
>> /tmp/vacuum.log
>> 
>> 2. review the /tmp/vacuum.log
>> 
>> 3. paste it to http://paste.fedoraproject.org/ 
>> <http://paste.fedoraproject.org/> and reply with the link here
>> 
>> 
>> [1] http://lists.ovirt.org/pipermail/devel/2016-December/014484.html 
>> <http://lists.ovirt.org/pipermail/devel/2016-December/014484.html>
>> 
>> 
>> Thanks,
>> Roy


Re: [ovirt-users] Lost our HostedEngineVM

2017-03-22 Thread Darrell Budic
From a hosted engine host shell, it’s:

hosted-engine --vm-start

hosted-engine --vm-status
is also useful. If you restored your storage (including the hosted engine 
storage) after rebooting that host, you should try restarting ovirt-ha-agent & 
ovirt-ha-broker, or just restart the machine and see if it mounts it properly.
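
A small sketch of the sequence above as a dry run. The order (restart the HA
services, check status, then start the VM by hand) is an assumption, as is the
use of systemctl, which presumes an EL7-style systemd host; the function names
are made up. By default it only prints the commands.

```python
import shutil
import subprocess


def recovery_commands():
    """Commands named in the reply above; the ordering is an assumption."""
    return [
        ["systemctl", "restart", "ovirt-ha-broker"],
        ["systemctl", "restart", "ovirt-ha-agent"],
        ["hosted-engine", "--vm-status"],
        ["hosted-engine", "--vm-start"],
    ]


def run_all(dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in recovery_commands():
        print(" ".join(cmd))
        # Skip execution if the binary is not installed on this machine.
        if not dry_run and shutil.which(cmd[0]):
            subprocess.call(cmd)


run_all()  # dry run: only prints the commands
```

Check the --vm-status output before forcing a start; once the agent and broker
are healthy again they may bring the engine VM up on their own.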

 
> On Mar 21, 2017, at 4:58 PM, Matt Emma  wrote:
> 
> We’re in a bit of a panic mode, so excuse any shortness. 
>  
> We had a storage failure. We rebooted a VMHost that had the hostedengine VM - 
> The HostedENgine did not try to move to the other hosts. We’ve since restored 
> storage and we are able to successfully restart the paused VMs. We know the 
> HostedEngine’s VM ID is there a way we can force load it from the mounted 
> storage? 
>  
> -Matt 


Re: [ovirt-users] Replicated Glusterfs on top of ZFS

2017-03-03 Thread Darrell Budic
Why are you using an arbiter if all your HW configs are identical? I’d use a 
true replica 3 in this case.

Also in my experience with gluster and VM hosting, the ZIL/SLOG degrades write 
performance unless it’s a truly dedicated disk. But I have 8 spinners backing 
my ZFS volumes, so trying to share a SATA disk wasn’t a good ZIL. If yours is 
dedicated SAS, keep it; if it’s SATA, try testing without it.

You don’t have compression enabled on your ZFS volume, and I’d recommend 
enabling relatime on it. Depending on the amount of RAM in these boxes, you 
probably want to limit your ZFS ARC size to 8G or so (1/4 total RAM or less). 
Gluster just works volumes hard during a rebuild; what’s the problem you’re 
seeing? If it’s affecting your VMs, using sharding and tuning client & server 
threads can help avoid interruptions to your VMs while repairs are running. If 
you really need to limit it, you can use cgroups to keep it from hogging all 
the CPU, but it takes longer to heal, of course. There are a couple older posts 
and blogs about it, if you go back a while.
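
To put a number on the ARC suggestion, a minimal sketch of the arithmetic,
assuming the rule of thumb above (a quarter of RAM, capped at about 8G). The
function names and the example RAM sizes are made up; zfs_arc_max is the
ZFS-on-Linux module parameter for the ARC ceiling and takes a value in bytes.

```python
GIB = 1024 ** 3


def arc_max_bytes(total_ram_gib, fraction=0.25, cap_gib=8):
    """Pick an ARC limit: a fraction of RAM, capped at cap_gib, in bytes."""
    limit_gib = min(total_ram_gib * fraction, cap_gib)
    return int(limit_gib * GIB)


def modprobe_line(limit_bytes):
    # Line in the form used for ZFS module options under /etc/modprobe.d/.
    return "options zfs zfs_arc_max={0}".format(limit_bytes)


print(modprobe_line(arc_max_bytes(64)))  # 64 GiB host: capped at 8 GiB
print(modprobe_line(arc_max_bytes(16)))  # 16 GiB host: a quarter is 4 GiB
```

The resulting line would typically go in a file such as
/etc/modprobe.d/zfs.conf and take effect when the module is next loaded.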


> On Mar 3, 2017, at 9:02 AM, Arman Khalatyan  wrote:
> 
> The problem itself is not the streaming data performance., and also dd zero 
> does not help much in the production zfs running with compression.
> the main problem comes when the gluster is starting to do something with 
> that, it is using xattrs, probably accessing extended attributes inside the 
> zfs is slower than XFS.
> Also primitive find file or ls -l in the (dot)gluster folders takes ages: 
> 
> now I can see that arbiter host has almost 100% cache miss during the 
> rebuild, which is actually natural while he is reading always the new 
> datasets:
> [root@clei26 ~]# arcstat.py 1
>     time  read  miss  miss%  dmis  dm%  pmis  pm%  mmis  mm%  arcsz     c
> 15:57:31    29    29    100    29  100     0    0    29  100   685M   31G
> 15:57:32   530   476     89   476   89     0    0   457   89   685M   31G
> 15:57:33   480   467     97   467   97     0    0   463   97   685M   31G
> 15:57:34   452   443     98   443   98     0    0   435   97   685M   31G
> 15:57:35   582   547     93   547   93     0    0   536   94   685M   31G
> 15:57:36   439   417     94   417   94     0    0   393   94   685M   31G
> 15:57:38   435   392     90   392   90     0    0   374   89   685M   31G
> 15:57:39   364   352     96   352   96     0    0   352   96   685M   31G
> 15:57:40   408   375     91   375   91     0    0   360   91   685M   31G
> 15:57:41   552   539     97   539   97     0    0   539   97   685M   31G
> 
> It looks like we cannot have in the same system performance and reliability :(
> Simply final conclusion is with the single disk+ssd even zfs doesnot help to 
> speedup the glusterfs healing.
> I will stop here:)
> 
> 
> 
> 
> On Fri, Mar 3, 2017 at 3:35 PM, Juan Pablo wrote:
> cd into the pool path,
> then run dd if=/dev/zero of=test.tt bs=1M,
> leave it running for 5 to 10 minutes,
> then press ctrl+c and paste the result here.
> etc.
> 
> 2017-03-03 11:30 GMT-03:00 Arman Khalatyan:
> No, I have one pool made of the one disk and ssd as a cache and log device.
> I have 3 Glusterfs bricks- separate 3 hosts:Volume type Replicate (Arbiter)= 
> replica 2+1!
> That how much you can push into compute nodes(they have only 3 disk slots).
> 
> 
> On Fri, Mar 3, 2017 at 3:19 PM, Juan Pablo wrote:
> ok, you have 3 pools, zclei22, logs and cache, thats wrong. you should have 1 
> pool, with zlog+cache if you are looking for performance.
> also, dont mix drives. 
> whats the performance issue you are facing? 
> 
> 
> regards,
> 
> 2017-03-03 11:00 GMT-03:00 Arman Khalatyan:
> This is CentOS 7.3 ZoL version 0.6.5.9-1
> 
> [root@clei22 ~]# lsscsi
> [2:0:0:0]    disk    ATA      INTEL SSDSC2CW24 400i  /dev/sda
> [3:0:0:0]    disk    ATA      HGST HUS724040AL AA70  /dev/sdb
> [4:0:0:0]    disk    ATA      WDC WD2002FYPS-0 1G01  /dev/sdc
> 
> [root@clei22 ~]# pvs ;vgs;lvs
>   PV                                                 VG            Fmt  Attr PSize   PFree
>   /dev/mapper/INTEL_SSDSC2CW240A3_CVCV306302RP240CGN vg_cache      lvm2 a--  223.57g      0
>   /dev/sdc2                                          centos_clei22 lvm2 a--    1.82t 64.00m
> 
>   VG            #PV #LV #SN Attr   VSize   VFree
>   centos_clei22   1   3   0 wz--n-   1.82t 64.00m
>   vg_cache        1   2   0 wz--n- 223.57g      0
> 
>   LV   VG            Attr   LSize   Pool Origin Data%  Meta%  Move Log Cpy%Sync Convert
>   home centos_clei22 -wi-ao   1.74t
>   root centos_clei22 -wi-ao  50.00g

Re: [ovirt-users] oVirt 4 and 10GbE NICs

2017-03-14 Thread Darrell Budic

> On Mar 14, 2017, at 7:54 AM, FERNANDO FREDIANI  
> wrote:
> 
> Isn't the traffic shown on the dashboard based in 1Gbps always, even if the 
> hosts have 10Gb interfaces ?
> 

Yep, all virtio interfaces show as 1Gb.

> Is there anywhere in oVirt config files or Database that you can tell to the 
> dashboard to consider 10Gb instead of 1Gb for those cases ?
> 
> 

Not that I know of, but it doesn’t affect the available performance; it only 
changes what’s displayed.

I’ve gotten ~3.5Gbps out of iperf with no appreciable tuning to a VM, so it’s 
definitely possible to get more speed out of them.

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] ovirt-hosted-engine state transition messages

2017-07-18 Thread Darrell Budic
I had some of this going on recently under 4.1.2, started with one or two 
warning messages, then a flood of them. Did the upgrade to 4.1.3 and haven’t 
seen it yet, but it’s only been a few days so far. A java process was consuming 
much CPU, and the DataWarehouse appears to not be collecting data (evidenced by 
a blank dashboard). My DWH has since recovered as well.

I forgot to check, but suspect I was low/out of memory on my engine VM, it’s an 
old one with only 6G allocated currently. Watching for this to happen again, 
and will confirm RAM utilization and bump up appropriately if it looks like 
it’s starved for RAM.


> On Jul 18, 2017, at 5:45 AM, Christophe TREFOIS  
> wrote:
> 
> I have the same as you on 4.1.0
> 
> EngineBadHealth-EngineUp 1 minute later. Sometimes 20 times per day, mostly 
> on weekends.
> 
> Cheers,
> -- 
> 
> Dr Christophe Trefois, Dipl.-Ing.  
> Technical Specialist / Post-Doc
> 
> UNIVERSITÉ DU LUXEMBOURG
> 
> LUXEMBOURG CENTRE FOR SYSTEMS BIOMEDICINE
> Campus Belval | House of Biomedicine  
> 6, avenue du Swing 
> L-4367 Belvaux  
> T: +352 46 66 44 6124 
> F: +352 46 66 44 6949  
> http://www.uni.lu/lcsb 
>        
>    
>    
> 
> 
> This message is confidential and may contain privileged information. 
> It is intended for the named recipient only. 
> If you receive it in error please notify me and permanently delete the 
> original message and any copies. 
> 
>   
> 
>> On 17 Jul 2017, at 17:35, Jim Kusznir wrote:
>> 
>> Ok, I've been ignoring this for a long time as the logs were so verbose and 
>> didn't show anything I could identify as usable debug info.  Recently one of 
>> my ovirt hosts (currently NOT running the main engine, but a candidate) was 
>> cycling as much as 40 times a day between "EngineUpBadHealth and EngineUp".  
>> Here's the log snippit.  I included some time before and after if that's 
>> helpful.  In this case, I got an email about bad health at 8:15 and a 
>> restore (engine up) at 8:16.  I see where the messages are sent, but I don't 
>> see any explanation as to why / what the problem is.
>> 
>> BTW: 192.168.8.11 is this computer's physical IP; 192.168.8.12 is the 
>> computer currently running the engine.  Both are also hosting the gluster 
>> store (eg, I have 3 hosts, all are participating in the gluster replica 
>> 2+arbitrator).
>> 
>> I'd appreciate it if someone could shed some light on why this keeps 
>> happening!
>> 
>> --Jim
>> 
>> 
>> MainThread::INFO::2017-07-17 
>> 08:12:06,230::config::485::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(refresh_vm_conf)
>>  Reloading vm.conf from the shared storage domain
>> MainThread::INFO::2017-07-17 
>> 08:12:06,230::config::412::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
>>  Trying to get a fresher copy of vm configuration from the OVF_STORE
>> MainThread::INFO::2017-07-17 
>> 08:12:08,877::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
>>  Found OVF_STORE: imgUUID:e10c90a5-4d9c-4e18-b6f7-ae8f0cdf4f57, 
>> volUUID:a9754d40-eda1-44d7-ac92-76a228f9f1ac
>> MainThread::INFO::2017-07-17 
>> 08:12:09,432::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
>>  Found OVF_STORE: imgUUID:f22829ab-9fd5-415a-9a8f-809d3f7887d4, 
>> volUUID:9f4760ee-119c-412a-a1e8-49e73e6ba929
>> MainThread::INFO::2017-07-17 
>> 08:12:09,925::ovf_store::112::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>>  Extracting Engine VM OVF from the OVF_STORE
>> MainThread::INFO::2017-07-17 
>> 08:12:10,324::ovf_store::119::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(getEngineVMOVF)
>>  OVF_STORE volume path: 
>> /rhev/data-center/mnt/glusterSD/192.168.8.11:_engine/c0acdefb-7d16-48ec-9d76-659b8fe33e2a/images/f22829ab-9fd5-415a-9a8f-809d3f7887d4/9f4760ee-119c-412a-a1e8-49e73e6ba929
>>  
>> MainThread::INFO::2017-07-17 
>> 08:12:10,696::config::431::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
>>  Found an OVF for HE VM, trying to convert
>> MainThread::INFO::2017-07-17 
>> 08:12:10,704::config::436::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
>>  Got vm.conf from OVF_STORE
>> MainThread::INFO::2017-07-17 
>> 08:12:10,705::states::426::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(consume)
>>  Engine vm running on localhost
>> MainThread::INFO::2017-07-17 
>> 08:12:10,714::hosted_engine::604::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine::(_initialize_vdsm)
>>  Initializing VDSM
>> MainThread::INFO::2017-07-17 
>> 

Re: [ovirt-users] ovirt-hosted-engine state transition messages

2017-07-23 Thread Darrell Budic
This happened to me again, started last night so it was almost a week from the 
last restart. System was not out of memory, a bit low, and it may have been 
churning buffers or java GC, I’m on vacation and didn’t dig into it very far. 
Restarted the engine and it’s happy. DWH was still working, but web interface 
was a bit slow before the restart. This was 4.1.3 now. Added some ram to the 
Hosted Engine, but looks like I need to restart it and will probably wait until 
I’m back for that.


> On Jul 18, 2017, at 9:22 AM, Darrell Budic <bu...@onholyground.com> wrote:
> 
> I had some of this going on recently under 4.1.2, started with one or two 
> warning messages, then a flood of them. Did the upgrade to 4.1.3 and haven’t 
> seen it yet, but it’s only been a few days so far. A java process was 
> consuming much CPU, and the DataWarehouse appears to not be collecting data 
> (evidenced by a blank dashboard). My DWH has since recovered as well.
> 
> I forgot to check, but suspect I was low/out of memory on my engine VM, it’s 
> an old one with only 6G allocated currently. Watching for this to happen 
> again, and will confirm RAM utilization and bump up appropriately if it looks 
> like it’s starved for RAM.
> 
> 
>> On Jul 18, 2017, at 5:45 AM, Christophe TREFOIS <christophe.tref...@uni.lu 
>> <mailto:christophe.tref...@uni.lu>> wrote:
>> 
>> I have the same as you on 4.1.0
>> 
>> EngineBadHealth-EngineUp 1 minute later. Sometimes 20 times per day, mostly 
>> on weekends.
>> 
>> Cheers,
>> -- 
>> 
>> Dr Christophe Trefois, Dipl.-Ing.  
>> Technical Specialist / Post-Doc
>> 
>> UNIVERSITÉ DU LUXEMBOURG
>> 
>> LUXEMBOURG CENTRE FOR SYSTEMS BIOMEDICINE
>> Campus Belval | House of Biomedicine  
>> 6, avenue du Swing 
>> L-4367 Belvaux  
>> T: +352 46 66 44 6124 
>> F: +352 46 66 44 6949  
>> http://www.uni.lu/lcsb <http://www.uni.lu/lcsb>
>>  <https://www.facebook.com/trefex>   <https://twitter.com/Trefex>   
>> <https://plus.google.com/+ChristopheTrefois/>   
>> <https://www.linkedin.com/in/trefoischristophe>   <http://skype:Trefex?call>
>> 
>> 
>> This message is confidential and may contain privileged information. 
>> It is intended for the named recipient only. 
>> If you receive it in error please notify me and permanently delete the 
>> original message and any copies. 
>> 
>>   
>> 
>>> On 17 Jul 2017, at 17:35, Jim Kusznir <j...@palousetech.com 
>>> <mailto:j...@palousetech.com>> wrote:
>>> 
>>> Ok, I've been ignoring this for a long time as the logs were so verbose and 
>>> didn't show anything I could identify as usable debug info.  Recently one 
>>> of my ovirt hosts (currently NOT running the main engine, but a candidate) 
>>> was cycling as much as 40 times a day between "EngineUpBadHealth and 
>>> EngineUp".  Here's the log snippit.  I included some time before and after 
>>> if that's helpful.  In this case, I got an email about bad health at 8:15 
>>> and a restore (engine up) at 8:16.  I see where the messages are sent, but 
>>> I don't see any explanation as to why / what the problem is.
>>> 
>>> BTW: 192.168.8.11 is this computer's physical IP; 192.168.8.12 is the 
>>> computer currently running the engine.  Both are also hosting the gluster 
>>> store (eg, I have 3 hosts, all are participating in the gluster replica 
>>> 2+arbitrator).
>>> 
>>> I'd appreciate it if someone could shed some light on why this keeps 
>>> happening!
>>> 
>>> --Jim
>>> 
>>> 
>>> MainThread::INFO::2017-07-17 
>>> 08:12:06,230::config::485::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(refresh_vm_conf)
>>>  Reloading vm.conf from the shared storage domain
>>> MainThread::INFO::2017-07-17 
>>> 08:12:06,230::config::412::ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine.config::(_get_vm_conf_content_from_ovf_store)
>>>  Trying to get a fresher copy of vm configuration from the OVF_STORE
>>> MainThread::INFO::2017-07-17 
>>> 08:12:08,877::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
>>>  Found OVF_STORE: imgUUID:e10c90a5-4d9c-4e18-b6f7-ae8f0cdf4f57, 
>>> volUUID:a9754d40-eda1-44d7-ac92-76a228f9f1ac
>>> MainThread::INFO::2017-07-17 
>>> 08:12:09,432::ovf_store::103::ovirt_hosted_engine_ha.lib.ovf.ovf_store.OVFStore::(scan)
>>>  Found OVF_STORE: imgUUID:f22829ab-9fd5-415a-9a8f-809d3f

[ovirt-users] vdsm changing disk scheduler when starting, configurable?

2017-07-02 Thread Darrell Budic
It seems vdsmd under 4.1.x (or something under its control) changes the disk 
schedulers when it starts or a host node is activated, and I’d like to avoid 
this. Is it preventable? Or configurable anywhere? This was probably happening 
under earlier versions, but I just noticed it while upgrading some converged 
boxes today.

It likes to set deadline, which I understand is the RHEL default for CentOS 7 
on non-SATA disks. But I’d rather have NOOP on my SSDs because SSDs, and NOOP 
on my SATA spinning platters because ZFS does its own scheduling, and running 
anything other than NOOP can cause increased CPU utilization for no gain. It’s 
also fighting ZFS, which tries to set NOOP on whole disks it controls, and my 
kernel command line setting.
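
For anyone wanting to check what their own hosts end up with after vdsmd starts, here is a rough Python sketch. It assumes the usual Linux sysfs layout, where /sys/block/<dev>/queue/scheduler lists the available schedulers and marks the active one in square brackets (for example "noop [deadline] cfq"). The function names are mine, not anything from vdsm.

```python
import re
from pathlib import Path


def active_scheduler(text):
    """Return the scheduler name shown in [brackets], or None if none is marked."""
    match = re.search(r"\[([^\]]+)\]", text)
    return match.group(1) if match else None


def read_schedulers(sys_block="/sys/block"):
    """Map each block device name to its active scheduler by reading sysfs."""
    result = {}
    for path in sorted(Path(sys_block).glob("*/queue/scheduler")):
        # path is /sys/block/<dev>/queue/scheduler, so <dev> is two levels up
        result[path.parent.parent.name] = active_scheduler(path.read_text())
    return result


if __name__ == "__main__":
    for dev, sched in read_schedulers().items():
        print(f"{dev}: {sched}")
```

Running it before and after activating a node should show whether the scheduler was changed underneath you.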

Thanks,

  -Darrell
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] vdsm (4.1) restarts glusterd when activating a node, even if it's already running

2017-07-02 Thread Darrell Budic
Upgrading some nodes today, and noticed that vdsmd restarts glusterd on a node 
when it activates it. This is causing a short break in healing when the shd 
gets disconnected, forcing some extra healing when the healing process reports 
“Transport Endpoint Disconnected” (N/A in the ovirt gui).

This is on a converged cluster (3 nodes, gluster replica volume across all 3, 
ovirt-engine running elsewhere). CentOS 7 install, just upgraded to oVirt 
4.1.2, running gluster 3.10 from the CentOS SIG.

The process I’m observing:

Place a node into maintenance via GUI
Update node from command line
Reboot node (kernel update)
Watch gluster heal itself after reboot
Activate node in GUI
gluster is completely stopped on this node
gluster is started on this node
healing begins again, but isn’t working
“gluster vol heal  info” reports this node’s information not available 
because “Transport endpoint not connected”.
This clears up in 5-10 minutes, then volume heals normally
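
If someone wants to watch for the same symptom while testing, a small Python sketch like the one below could flag bricks that heal info reports as disconnected. It assumes the heal info output is laid out as "Brick ..." / "Status: ..." / "Number of entries: ..." blocks; the sample text and brick names are made up for illustration, so adjust to whatever your gluster version actually prints.

```python
def bricks_not_connected(heal_info_text):
    """Return (brick, status) pairs for bricks whose Status is not 'Connected'."""
    problems = []
    current = None
    for line in heal_info_text.splitlines():
        line = line.strip()
        if line.startswith("Brick "):
            current = line[len("Brick "):]
        elif line.startswith("Status:") and current is not None:
            status = line.split(":", 1)[1].strip()
            if status != "Connected":
                problems.append((current, status))
    return problems


# Illustrative sample only; brick names and layout are assumptions.
SAMPLE = """\
Brick node1:/gluster/brick
Status: Connected
Number of entries: 0

Brick node2:/gluster/brick
Status: Transport endpoint is not connected
Number of entries: -
"""

if __name__ == "__main__":
    for brick, status in bricks_not_connected(SAMPLE):
        print(f"{brick}: {status}")
```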

Someone with a similar setup want to check this and see if it’s something 
specific to my nodes, or just a general problem with the way it’s restarting 
gluster? Looking for a little confirmation before I file a bug report on it.

Or a dev want to comment on why it stops and starts gluster, instead of a 
restart which would presumably leave the brick processes and shd running and 
not causing this effect?

Thanks,

  -Darrell
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Very poor GlusterFS performance

2017-06-19 Thread Darrell Budic
Chris-

You probably need to head over to gluster-us...@gluster.org 
 for help with performance issues.

That said, what kind of performance are you getting, via some form of testing 
like bonnie++ or even dd runs? Comparing raw brick performance against gluster 
performance is useful to determine what you’re actually getting.

Beyond that, I’d recommend dropping the arbiter bricks and re-adding them as 
full replicas; they can’t serve distributed data in this configuration and may 
be slowing things down on you. If you’ve got a storage network set up, make sure 
it’s using the largest MTU it can, and consider adding/testing these settings 
that I use on my main storage volume:

performance.io-thread-count: 32
client.event-threads: 8
server.event-threads: 3
performance.stat-prefetch: on
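
As a convenience, the settings above could be turned into "gluster volume set" commands with something like this Python sketch. It only prints the commands rather than running them, and the volume name vmssd is taken from the configuration quoted below; substitute your own.

```python
import shlex

# The four options listed above, as option -> value
SETTINGS = {
    "performance.io-thread-count": "32",
    "client.event-threads": "8",
    "server.event-threads": "3",
    "performance.stat-prefetch": "on",
}


def volume_set_commands(volume, settings):
    """Build one 'gluster volume set VOLUME KEY VALUE' command line per option."""
    return [
        " ".join(shlex.quote(part)
                 for part in ("gluster", "volume", "set", volume, key, value))
        for key, value in settings.items()
    ]


if __name__ == "__main__":
    for cmd in volume_set_commands("vmssd", SETTINGS):
        print(cmd)
```

Printing first makes it easy to apply one option at a time and test in between.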

Good luck,

  -Darrell


> On Jun 19, 2017, at 9:46 AM, Chris Boot  wrote:
> 
> Hi folks,
> 
> I have 3x servers in a "hyper-converged" oVirt 4.1.2 + GlusterFS 3.10
> configuration. My VMs run off a replica 3 arbiter 1 volume comprised of
> 6 bricks, which themselves live on two SSDs in each of the servers (one
> brick per SSD). The bricks are XFS on LVM thin volumes straight onto the
> SSDs. Connectivity is 10G Ethernet.
> 
> Performance within the VMs is pretty terrible. I experience very low
> throughput and random IO is really bad: it feels like a latency issue.
> On my oVirt nodes the SSDs are not generally very busy. The 10G network
> seems to run without errors (iperf3 gives bandwidth measurements of >=
> 9.20 Gbits/sec between the three servers).
> 
> To put this into perspective: I was getting better behaviour from NFS4
> on a gigabit connection than I am with GlusterFS on 10G: that doesn't
> feel right at all.
> 
> My volume configuration looks like this:
> 
> Volume Name: vmssd
> Type: Distributed-Replicate
> Volume ID: d5a5ddd1-a140-4e0d-b514-701cfe464853
> Status: Started
> Snapshot Count: 0
> Number of Bricks: 2 x (2 + 1) = 6
> Transport-type: tcp
> Bricks:
> Brick1: ovirt3:/gluster/ssd0_vmssd/brick
> Brick2: ovirt1:/gluster/ssd0_vmssd/brick
> Brick3: ovirt2:/gluster/ssd0_vmssd/brick (arbiter)
> Brick4: ovirt3:/gluster/ssd1_vmssd/brick
> Brick5: ovirt1:/gluster/ssd1_vmssd/brick
> Brick6: ovirt2:/gluster/ssd1_vmssd/brick (arbiter)
> Options Reconfigured:
> nfs.disable: on
> transport.address-family: inet6
> performance.quick-read: off
> performance.read-ahead: off
> performance.io-cache: off
> performance.stat-prefetch: off
> performance.low-prio-threads: 32
> network.remote-dio: off
> cluster.eager-lock: enable
> cluster.quorum-type: auto
> cluster.server-quorum-type: server
> cluster.data-self-heal-algorithm: full
> cluster.locking-scheme: granular
> cluster.shd-max-threads: 8
> cluster.shd-wait-qlength: 1
> features.shard: on
> user.cifs: off
> storage.owner-uid: 36
> storage.owner-gid: 36
> features.shard-block-size: 128MB
> performance.strict-o-direct: on
> network.ping-timeout: 30
> cluster.granular-entry-heal: enable
> 
> I would really appreciate some guidance on this to try to improve things
> because at this rate I will need to reconsider using GlusterFS altogether.
> 
> Cheers,
> Chris
> 
> -- 
> Chris Boot
> bo...@bootc.net
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Debugging warning messages about bonding mode 4

2017-10-06 Thread Darrell Budic
That looks like the normal state for a LACP bond, but it does record some churn 
(bond renegotiations, I believe). So it probably bounced once or twice coming 
up. Maybe a slow switch, maybe a switch relying on dynamic bonding instead of 
static bonds, and taking longer to establish. 

For the ones with a down link, and this one too, you could ask the network guys 
whether they statically configured the bond, or whether they could; that might 
make it quicker to bring up.

I don’t think anything updates when the host is in maintenance, you could take 
it out and see what happens :) The bond is lower level though, should come up 
if it’s configured properly, and you should be able to see that on the host.

  -Darrell

a bond on one of mine:

# cat /proc/net/bonding/bond0 
Ethernet Channel Bonding Driver: v3.7.1 (April 27, 2011)

Bonding Mode: IEEE 802.3ad Dynamic link aggregation
Transmit Hash Policy: layer2 (0)
MII Status: up
MII Polling Interval (ms): 100
Up Delay (ms): 0
Down Delay (ms): 0

802.3ad info
LACP rate: slow
Min links: 0
Aggregator selection policy (ad_select): stable
System priority: 65535
System MAC address: 00:0f:53:08:4b:ac
Active Aggregator Info:
Aggregator ID: 1
Number of ports: 2
Actor Key: 13
Partner Key: 14
Partner Mac Address: 64:64:9b:5e:9b:00

Slave Interface: p1p1
MII Status: up
Speed: 1 Mbps
Duplex: full
Link Failure Count: 0
Permanent HW addr: 00:0f:53:08:4b:ac
Slave queue ID: 0
Aggregator ID: 1
Actor Churn State: none
Partner Churn State: none
Actor Churned Count: 0
Partner Churned Count: 0
details actor lacp pdu:
system priority: 65535
system mac address: 00:0f:53:08:4b:ac
port key: 13
port priority: 255
port number: 1
port state: 61
details partner lacp pdu:
system priority: 127
system mac address: 64:64:9b:5e:9b:00
oper key: 14
port priority: 127
port number: 8
port state: 63

Slave Interface: p1p2
MII Status: up
Speed: 1 Mbps
Duplex: full
Link Failure Count: 0
Permanent HW addr: 00:0f:53:08:4b:ad
Slave queue ID: 0
Aggregator ID: 1
Actor Churn State: none
Partner Churn State: none
Actor Churned Count: 0
Partner Churned Count: 0
details actor lacp pdu:
system priority: 65535
system mac address: 00:0f:53:08:4b:ac
port key: 13
port priority: 255
port number: 2
port state: 61
details partner lacp pdu:
system priority: 127
system mac address: 64:64:9b:5e:9b:00
oper key: 14
port priority: 127
port number: 7
port state: 63
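
To compare several bonds quickly, a Python sketch along these lines could pull the partner MAC and the churn/failure counters out of a /proc/net/bonding/bondN dump. It assumes the "Key: value" layout shown above and treats an all-zero partner MAC as "no partner reported"; the function names are mine, and the sample is a trimmed excerpt of the bond2 output quoted below.

```python
COUNTER_KEYS = ("Link Failure Count", "Actor Churned Count", "Partner Churned Count")


def summarize_bond(text):
    """Collect the bond-level partner MAC and per-slave counters."""
    summary = {"partner_mac": None, "slaves": {}}
    slave = None
    for raw in text.splitlines():
        key, sep, value = raw.strip().partition(":")
        if not sep:
            continue
        key, value = key.strip(), value.strip()
        if key == "Partner Mac Address":
            summary["partner_mac"] = value
        elif key == "Slave Interface":
            slave = value
            summary["slaves"][slave] = {}
        elif key in COUNTER_KEYS and slave is not None:
            summary["slaves"][slave][key] = int(value)
    return summary


def has_lacp_partner(summary):
    """True when a non-zero partner MAC was reported for the bond."""
    mac = summary["partner_mac"]
    return mac is not None and mac != "00:00:00:00:00:00"


# Trimmed excerpt of the bond2 output from the original question.
SAMPLE = """\
Partner Mac Address: b8:38:61:9c:75:80

Slave Interface: ens2f2
MII Status: up
Link Failure Count: 2
Actor Churned Count: 2
Partner Churned Count: 3
"""

if __name__ == "__main__":
    info = summarize_bond(SAMPLE)
    print("partner present:", has_lacp_partner(info))
    print(info["slaves"])
```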


> From: Gianluca Cecchi 
> Subject: [ovirt-users] Debugging warning messages about bonding mode 4
> Date: October 6, 2017 at 6:28:16 AM CDT
> To: users
> 
> Hello,
> on a 2 nodes cluster in 4.1.6 I have this situation.
> Every node has 3 bonds, each one composed by 2 network adapters and each one 
> of type mode=4
> (actually in setup networks I have configured custom and then the value: 
> "mode=4 miimon=100"
> )
> 
> At this moment only one of the servers has access to FC storage, while the 
> other is currently on maintenance.
> 
> On 2 of the 3 bonds of the active server I get an exclamation point in 
> "Network Interfaces" subtab with this mouseover popup
> 
> Bond is in link aggregation mode (mode 4), but no partner mac has been 
> reported for it
> 
> What is the exact meaning of this message? Do I have to care about (I think 
> so..)?
> What should I report to network guys?
> Eg, one of these two warning bonds status is:
> 
> # cat /proc/net/bonding/bond2
> Ethernet Channel Bonding Driver: v3.7.1 (April 27, 2011)
> 
> Bonding Mode: IEEE 802.3ad Dynamic link aggregation
> Transmit Hash Policy: layer2 (0)
> MII Status: up
> MII Polling Interval (ms): 100
> Up Delay (ms): 0
> Down Delay (ms): 0
> 
> 802.3ad info
> LACP rate: slow
> Min links: 0
> Aggregator selection policy (ad_select): stable
> System priority: 65535
> System MAC address: 48:df:37:0c:7f:5a
> Active Aggregator Info:
> Aggregator ID: 5
> Number of ports: 2
> Actor Key: 9
> Partner Key: 6
> Partner Mac Address: b8:38:61:9c:75:80
> 
> Slave Interface: ens2f2
> MII Status: up
> Speed: 1000 Mbps
> Duplex: full
> Link Failure Count: 2
> Permanent HW addr: 48:df:37:0c:7f:5a
> Slave queue ID: 0
> Aggregator ID: 5
> Actor Churn State: none
> Partner Churn State: none
> Actor Churned Count: 2
> Partner Churned Count: 3
> details actor lacp pdu:
> system priority: 65535
> system mac address: 48:df:37:0c:7f:5a
> port key: 9
> port priority: 255
> port number: 1
> port state: 61
> details partner lacp pdu:
> system priority: 32768
> system mac address: b8:38:61:9c:75:80
> oper key: 6
> port priority: 32768
> port number: 293
> port state: 61
> 
> Slave Interface: ens2f3
> MII Status: up
> Speed: 1000 Mbps
> Duplex: full
> Link Failure Count: 2
> Permanent HW addr: 48:df:37:0c:7f:5b
> Slave queue ID: 0
> Aggregator ID: 5
> Actor Churn State: none
> Partner 

Re: [ovirt-users] More than one mgmt network possible?

2017-09-11 Thread Darrell Budic
From personal experience, if you want it in the same Cluster as other servers, 
it needs to be on the same mgmt network. If you put it in its own cluster, it 
can have its own mgmt network. The engine needs IP connectivity, obviously.

I have a DC running with 3 clusters, 2 in the same interconnected vlan 100 on 
opposite sides of Chicago, and one with mgmt on vlan 40 in Amsterdam.

 -Darrell

> On Sep 11, 2017, at 4:47 AM, Gianluca Cecchi  
> wrote:
> 
> Hello,
> in site1 I have 2 oVirt hosts with ovirtmgmt configured on vlan167.
> Now I want to add a server that is in site2 where this vlan doesn't arrive.
> I have here a vlan 169 that does routing with the vlan 167 of site1.
> Can I add the host into the same cluster or the only way is to "transport" 
> vlan167 into site2 too?
> 
> Thanks,
> Gianluca
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Non-responsive host, VM's are still running - how to resolve?

2017-11-14 Thread Darrell Budic
Try restarting vdsmd from the shell, “systemctl restart vdsmd”.
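
Since the engine log quoted below shows "No route to host", it may also be worth checking from the engine VM whether the host's vdsm port is reachable at all. A minimal Python sketch, assuming vdsm is listening on its default TCP port 54321 (the function name is just for illustration):

```python
import socket

VDSM_PORT = 54321  # default vdsm port; an assumption if yours was changed


def can_connect(host, port=VDSM_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    import sys
    target = sys.argv[1] if len(sys.argv) > 1 else "localhost"
    print(target, "reachable" if can_connect(target) else "NOT reachable")
```

If this fails from the engine but vdsmd is running on the host, the problem is more likely routing or firewall than vdsm itself.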


> From: Artem Tambovskiy 
> Subject: [ovirt-users] Non-responsive host, VM's are still running - how to 
> resolve?
> Date: November 14, 2017 at 11:23:32 AM CST
> To: users
> 
> Apparently, I lost the host which was running hosted-engine and another 4 
> VMs exactly during migration of the second host from bare-metal to second host 
> in the cluster. For some reason the first host entered the "Non responsive" 
> state. The interesting thing is that hosted-engine and all other VMs are up 
> and running, so it's like a communication problem between hosted-engine and 
> host. 
> The engine.log at hosted-engine is full of following messages:
> 
> 2017-11-14 17:06:43,158Z INFO  
> [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] 
> Connecting to ovirt2/80.239.162.106 
> 2017-11-14 17:06:43,159Z ERROR 
> [org.ovirt.engine.core.vdsbroker.vdsbroker.GetAllVmStatsVDSCommand] 
> (DefaultQuartzScheduler9) [50938c3] Command 'GetAllVmStatsVDSCommand(HostName 
> = ovirt2.telia.ru , 
> VdsIdVDSCommandParametersBase:{runAsync='true', 
> hostId='3970247c-69eb-4bd8-b263-9100703a8243'})' execution failed: 
> java.net.NoRouteToHostException: No route to host
> 2017-11-14 17:06:43,159Z INFO  
> [org.ovirt.engine.core.vdsbroker.monitoring.PollVmStatsRefresher] 
> (DefaultQuartzScheduler9) [50938c3] Failed to fetch vms info for host 
> 'ovirt2.telia.ru ' - skipping VMs monitoring.
> 2017-11-14 17:06:45,929Z INFO  
> [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] 
> Connecting to ovirt2/80.239.162.106 
> 2017-11-14 17:06:45,930Z ERROR 
> [org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesVDSCommand] 
> (DefaultQuartzScheduler2) [6080f1cc] Command 
> 'GetCapabilitiesVDSCommand(HostName = ovirt2.telia.ru 
> , 
> VdsIdAndVdsVDSCommandParametersBase:{runAsync='true', 
> hostId='3970247c-69eb-4bd8-b263-9100703a8243', vds='Host[ovirt2.telia.ru 
> ,3970247c-69eb-4bd8-b263-9100703a8243]'})' execution 
> failed: java.net.NoRouteToHostException: No route to host
> 2017-11-14 17:06:45,930Z ERROR 
> [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] 
> (DefaultQuartzScheduler2) [6080f1cc] Failure to refresh host 'ovirt2.telia.ru 
> ' runtime info: java.net.NoRouteToHostException: No 
> route to host
> 2017-11-14 17:06:48,933Z INFO  
> [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] 
> Connecting to ovirt2/80.239.162.106 
> 2017-11-14 17:06:48,934Z ERROR 
> [org.ovirt.engine.core.vdsbroker.vdsbroker.GetCapabilitiesVDSCommand] 
> (DefaultQuartzScheduler6) [1a64dfea] Command 
> 'GetCapabilitiesVDSCommand(HostName = ovirt2.telia.ru 
> , 
> VdsIdAndVdsVDSCommandParametersBase:{runAsync='true', 
> hostId='3970247c-69eb-4bd8-b263-9100703a8243', vds='Host[ovirt2.telia.ru 
> ,3970247c-69eb-4bd8-b263-9100703a8243]'})' execution 
> failed: java.net.NoRouteToHostException: No route to host
> 2017-11-14 17:06:48,934Z ERROR 
> [org.ovirt.engine.core.vdsbroker.monitoring.HostMonitoring] 
> (DefaultQuartzScheduler6) [1a64dfea] Failure to refresh host 'ovirt2.telia.ru 
> ' runtime info: java.net.NoRouteToHostException: No 
> route to host
> 2017-11-14 17:06:50,931Z INFO  
> [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] 
> Connecting to ovirt2/80.239.162.106 
> 2017-11-14 17:06:50,932Z ERROR 
> [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStatusVDSCommand] 
> (DefaultQuartzScheduler4) [6b19d168] Command 'SpmStatusVDSCommand(HostName = 
> ovirt2.telia.ru , 
> SpmStatusVDSCommandParameters:{runAsync='true', 
> hostId='3970247c-69eb-4bd8-b263-9100703a8243', 
> storagePoolId='5a044257-02ec-0382-0243-01f2'})' execution failed: 
> java.net.NoRouteToHostException: No route to host
> 2017-11-14 17:06:50,939Z INFO  
> [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] 
> Connecting to ovirt2/80.239.162.106 
> 2017-11-14 17:06:50,940Z ERROR 
> [org.ovirt.engine.core.vdsbroker.irsbroker.IrsBrokerCommand] 
> (DefaultQuartzScheduler4) [6b19d168] IrsBroker::Failed::GetStoragePoolInfoVDS
> 2017-11-14 17:06:50,940Z ERROR 
> [org.ovirt.engine.core.vdsbroker.irsbroker.GetStoragePoolInfoVDSCommand] 
> (DefaultQuartzScheduler4) [6b19d168] Command 'GetStoragePoolInfoVDSCommand( 
> GetStoragePoolInfoVDSCommandParameters:{runAsync='true', 
> storagePoolId='5a044257-02ec-0382-0243-01f2', 
> ignoreFailoverLimit='true'})' execution failed: IRSProtocolException: 
> 2017-11-14 17:06:51,937Z INFO  
> [org.ovirt.vdsm.jsonrpc.client.reactors.ReactorClient] (SSL Stomp Reactor) [] 
> Connecting 

Re: [ovirt-users] [ANN] oVirt 4.2.0 First Beta Release is now available for testing

2017-11-01 Thread Darrell Budic
> From: Greg Sheremeta
> Subject: Re: [ovirt-users] [ANN] oVirt 4.2.0 First Beta Release is now 
> available for testing
> Date: November 1, 2017 at 11:21:52 AM CDT
> To: Robert Story
> Cc: FERNANDO FREDIANI; users
> 
> 
> I'd argue
> that oVirt, particularly the admin portal, is for a much more
> technical audience. I think right-click should stay for admin portal.
> 
> What are people's opinions on an "actions" button on the far right of the 
> tables?
> See #6 here:
> http://www.patternfly.org/pattern-library/content-views/table-view/#/design 
> 
> 
> Would that be an adequate substitute for right-clicking?

Mostly, but I’d prefer the options to be available on the left by the name of 
the VM, as that’s what I’m likely sorting on and where I’m going to find it 
easier to be sure I’ve got the right one when I select an action on a row that 
might not be highlighted yet.

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] How to best view remote console via macosX

2017-11-03 Thread Darrell Budic
https://www.ovirt.org/develop/release-management/features/virt/novnc-console/
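
For the manual route mentioned earlier in this thread (pulling the IP and port out of console.vv for a regular VNC client), a Python sketch like this might help. It assumes the file is INI-style with a [virt-viewer] section holding type, host and port keys; the sample values are invented.

```python
import configparser


def vnc_endpoint(vv_text):
    """Return (type, host, port) from the [virt-viewer] section of a .vv file."""
    # interpolation=None so a literal '%' in any value is left alone
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(vv_text)
    section = parser["virt-viewer"]
    return section.get("type"), section.get("host"), section.getint("port")


# Invented sample; a real console.vv carries more keys (password, title, ...).
SAMPLE = """\
[virt-viewer]
type=vnc
host=192.0.2.10
port=5900
"""

if __name__ == "__main__":
    kind, host, port = vnc_endpoint(SAMPLE)
    print(f"{kind} console at {host}:{port}")
```

Keep in mind the console ticket in the file expires quickly, so the values need to be used right after downloading it.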

> From: Jayme Fall <ja...@silverorange.com>
> Subject: Re: [ovirt-users] How to best view remote console via macosX
> Date: November 3, 2017 at 5:24:24 PM CDT
> To: Darrell Budic
> 
> How is web based VNC supported?  Do you need to install any specific 
> components, when I launch console from ovirt admin it just prompts to 
> download VV file. 
> 
>> On Nov 3, 2017, at 5:54 PM, Darrell Budic <bu...@onholyground.com 
>> <mailto:bu...@onholyground.com>> wrote:
>> 
>> I find using the web based VNC is the simplest from my mac. You can extract 
>> data from a console.vv file and open it with any VNC software on a mac, even 
>> Screen Sharing, but you have to enter the IP & port manually.  I’m not aware 
>> of any spice solutions at this moment, but i haven’t looked for one in a 
>> while.
>> 
>>> From: Jayme Fall <ja...@silverorange.com <mailto:ja...@silverorange.com>>
>>> Subject: [ovirt-users] How to best view remote console via macosX
>>> Date: November 3, 2017 at 3:06:29 PM CDT
>>> To: users@ovirt.org <mailto:users@ovirt.org>
>>> 
>>> I’m wondering what the best method is to get oVirt console support working 
>>> from a macOS device.  I have tried opening console.vv files using vnc 
>>> client as well as remote viewer and have not had any luck thus far.  oVirt 
>>> is version 4.1.4
>>> 
>>> Thanks!
>>> ___
>>> Users mailing list
>>> Users@ovirt.org <mailto:Users@ovirt.org>
>>> http://lists.ovirt.org/mailman/listinfo/users 
>>> <http://lists.ovirt.org/mailman/listinfo/users>
>> 
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Enabling libgfapi disk access with oVirt 4.2

2017-11-09 Thread Darrell Budic
You do need to stop the VMs and restart them, not just issue a reboot. I haven’t 
tried under 4.2 yet, but it works in 4.1.6 that way for me.
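
One way to confirm whether a restarted VM is really on gfapi is to look at its libvirt domain XML (for example from "virsh -r dumpxml <vm>" on the host). The Python sketch below assumes a gfapi disk shows up as a network disk with protocol gluster, while a fuse disk is a plain file under the glusterSD mount; the sample XML is invented for illustration.

```python
import xml.etree.ElementTree as ET


def disk_access_modes(domain_xml):
    """Map each disk target (vda, vdb, ...) to 'gfapi', 'fuse' or 'other'."""
    modes = {}
    root = ET.fromstring(domain_xml)
    for disk in root.findall("./devices/disk"):
        target = disk.find("target")
        source = disk.find("source")
        name = target.get("dev") if target is not None else "?"
        if (disk.get("type") == "network" and source is not None
                and source.get("protocol") == "gluster"):
            modes[name] = "gfapi"
        elif source is not None and "glusterSD" in (source.get("file") or ""):
            modes[name] = "fuse"
        else:
            modes[name] = "other"
    return modes


# Invented sample with one disk of each kind.
SAMPLE = """\
<domain type='kvm'>
  <devices>
    <disk type='network' device='disk'>
      <source protocol='gluster' name='vmssd/images/disk1'>
        <host name='ovirt1' port='24007'/>
      </source>
      <target dev='vda' bus='virtio'/>
    </disk>
    <disk type='file' device='disk'>
      <source file='/rhev/data-center/mnt/glusterSD/ovirt1:_vmssd/images/disk2'/>
      <target dev='vdb' bus='virtio'/>
    </disk>
  </devices>
</domain>
"""

if __name__ == "__main__":
    print(disk_access_modes(SAMPLE))
```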

> From: Alessandro De Salvo 
> Subject: Re: [ovirt-users] Enabling libgfapi disk access with oVirt 4.2
> Date: November 9, 2017 at 2:35:01 AM CST
> To: users@ovirt.org
> 
> Hi again,
> 
> OK, tried to stop all the vms, except the engine, set engine-config -s 
> LibgfApiSupported=true (for 4.2 only) and restarted the engine.
> 
> When I tried restarting the VMs they are still not using gfapi, so it does 
> not seem to help.
> 
> Cheers,
> 
> 
> 
> Alessandro
> 
> 
> 
> 
> On 09/11/17 09:12, Alessandro De Salvo wrote:
>> Hi,
>> where should I enable gfapi via the UI?
>> The only command I tried was engine-config -s LibgfApiSupported=true but the 
>> result is what is shown in my output below, so it’s set to true for v4.2. Is 
>> it enough?
>> I’ll try restarting the engine. Is it really necessary to stop all the VMs 
>> and restart them all? Of course this is a test setup and I can do it, but for 
>> production clusters in the future it may be a problem.
>> Thanks,
>> 
>>Alessandro
>> 
>> On 09 Nov 2017, at 07:23, Kasturi Narra wrote:
>> 
>>> Hi ,
>>> 
>>> The procedure to enable gfapi is below.
>>> 
>>> 1) Stop all the running VMs.
>>> 2) Enable gfapi via the UI or using the engine-config command.
>>> 3) Restart the ovirt-engine service.
>>> 4) Start the VMs.
>>> 
>>> Hope you have not missed any of these!
>>> 
>>> Thanks
>>> kasturi 
>>> 
>>> On Wed, Nov 8, 2017 at 11:58 PM, Alessandro De Salvo wrote:
>>> Hi,
>>> 
>>> I'm using the latest 4.2 beta release and want to try the gfapi access, but 
>>> I'm currently failing to use it.
>>> 
>>> My test setup has an external glusterfs cluster v3.12, not managed by oVirt.
>>> 
>>> The compatibility flag is correctly showing gfapi should be enabled with 
>>> 4.2:
>>> 
>>> # engine-config -g LibgfApiSupported
>>> LibgfApiSupported: false version: 3.6
>>> LibgfApiSupported: false version: 4.0
>>> LibgfApiSupported: false version: 4.1
>>> LibgfApiSupported: true version: 4.2
>>> 
>>> The data center and cluster have the 4.2 compatibility flags as well.
>>> 
>>> However, when starting a VM with a disk on gluster I can still see the disk 
>>> is mounted via fuse.
>>> 
>>> Any clue of what I'm still missing?
>>> 
>>> Thanks,
>>> 
>>> 
>>>Alessandro
>>> 


Re: [ovirt-users] How to best view remote console via macosX

2017-11-03 Thread Darrell Budic
I find using the web-based VNC is the simplest from my Mac. You can extract 
the connection data from a console.vv file and open it with any VNC software on 
a Mac, even Screen Sharing, but you have to enter the IP & port manually.  I’m 
not aware of any SPICE solutions at the moment, but I haven’t looked for one in 
a while.
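
For pulling the IP and port out of a console.vv file, here is a rough Python sketch. It assumes the file is INI-style with a [virt-viewer] section carrying type, host and port keys, which is what the ones I have looked at contain; the sample contents, address and function name below are invented for illustration.

```python
import configparser


def vnc_target(vv_text):
    """Return (type, host, port) from the [virt-viewer] section of a .vv file."""
    # interpolation=None so a '%' in a password or certificate is left alone;
    # strict=False so an unexpected repeated key does not abort parsing.
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(vv_text)
    section = parser["virt-viewer"]
    return section.get("type"), section.get("host"), section.getint("port")


# Invented sample; a real file has more keys and a short-lived password ticket.
SAMPLE_VV = """\
[virt-viewer]
type=vnc
host=192.0.2.10
port=5900
password=example-ticket
delete-this-file=1
"""

if __name__ == "__main__":
    kind, host, port = vnc_target(SAMPLE_VV)
    print(f"{kind}: connect to {host}:{port}")
```

On a Mac, the resulting host and port can be given to Screen Sharing or any other VNC client. The password in the file is a ticket that, as far as I know, expires quickly, so connect soon after downloading it.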

> From: Jayme Fall 
> Subject: [ovirt-users] How to best view remote console via macosX
> Date: November 3, 2017 at 3:06:29 PM CDT
> To: users@ovirt.org
> 
> I’m wondering what the best method is to get oVirt console support working 
> from a macOS device.  I have tried opening console.vv files using a VNC client 
> as well as Remote Viewer and have not had any luck thus far.  oVirt is 
> version 4.1.4
> 
> Thanks!


Re: [ovirt-users] VM resource allocation and IO Threads

2017-10-30 Thread Darrell Budic
The best explanation I’ve found is:
https://wiki.mikejung.biz/KVM_/_Xen#virtio-blk_iothreads_.28x-data-plane.29
If you google a bit, you’ll find some more under QEMU topics. I also saw some 
discussion of threads and queues in virtio-scsi, but that seems to be a 
slightly different thing from this setting.

In short, having at least 1 IO thread offers advantages for all your VM’s 
disks, and if you want to be optimal (at the possible expense of extra CPU for 
IO), use one per attached drive. There is (currently) no benefit to having more 
than 1 thread per drive. From what I can tell, if you have more drives than 
threads, the drives share the threads evenly and each is statically assigned to 
one thread. The setting seems to take effect at QEMU start, so you have to 
change it with the VM down, or stop and start the VM again.
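
To make the drive-to-thread sharing concrete, here is a small Python sketch of the behaviour as I understand it. The round-robin order is my assumption about how "evenly" plays out, not something taken from the QEMU or oVirt source, and the function name and disk names are made up.

```python
def assign_iothreads(disks, num_iothreads):
    """Statically map each disk to an IO thread ID (1-based), round-robin.

    Assumption: disks are spread over the threads in order and never move
    to another thread while the VM is running.
    """
    if num_iothreads < 1:
        raise ValueError("need at least one IO thread")
    return {disk: (i % num_iothreads) + 1 for i, disk in enumerate(disks)}


if __name__ == "__main__":
    disks = ["vda", "vdb", "vdc"]
    # Fewer threads than drives: two drives end up sharing thread 1.
    print(assign_iothreads(disks, 2))
    # More threads than drives: threads 4 to 6 would simply have no drive.
    print(assign_iothreads(disks, 6))
```

Under that assumption, 6 IO threads for a VM with 3 disks leaves three threads with nothing to do, which matches there being no benefit beyond 1 thread per drive.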

I currently enable it on all VMs and assign 1 thread per drive on my systems.

> From: Gianluca Cecchi 
> Subject: [ovirt-users] VM resource allocation and IO Threads
> Date: October 27, 2017 at 9:26:59 AM CDT
> To: users
> 
> Hello,
> can anyone point me to more detailed information on the subject, i.e. the 
> value of the "Num Of IO Threads" setting, best practices, and the 
> improvements to expect?
> 
> I read also here:
> https://access.redhat.com/documentation/en-us/red_hat_virtualization/4.1/html-single/virtual_machine_management_guide/#Editing_IO_Threads
>  
> 
> 
> but in some points it seems not so clear to me:
> 
> eg:
> 
> If a virtual machine has more than one disk, you can enable or change the 
> number of IO threads to improve performance.
> 
> but also
> 
> Red Hat recommends using the default number of IO threads, which is 1.
> 
> There is also a note about deactivating and activating disks: does it mean 
> that even if I power off the VM and change its config, I still have to do this 
> step afterwards?
> 
> Has anyone run benchmarks?
> Does it make sense to configure, for example, 6 IO threads if my VM has 3 disks?
> Do IO threads map to SCSI controllers inside the guest, or to something else?
> 
> Thanks in advance,
> Gianluca

