Re: [ovirt-users] Network config

2015-01-27 Thread Ondřej Svoboda

Jorick,

just what comes to my mind: do you have NetworkManager running, by any 
chance? VDSM and NetworkManager don't mix well currently so please try


# systemctl stop NetworkManager
# systemctl disable NetworkManager

Regards,
Ondra Svoboda

On 27.1.2015 14:10, Jorick Astrego wrote:


On 01/27/2015 01:56 PM, Koen Vanoppen wrote:

Hello everybody,

Just wanted to know if anybody else has the problem, that when you 
restart your network configuration of a hypervisor, all the settings 
concerning the bonding are gone...

Do I file a bug report for this or is this already a known issue?

Kind regards,

Koen


Hmm,

I cannot restart the network service from the host for some reason:

Jan 27 14:00:27 test2 systemd: network.service: control process
exited, code=exited status=1
Jan 27 14:00:27 test2 systemd: Failed to start LSB: Bring up/down
networking.


systemctl status network.service
network.service - LSB: Bring up/down networking
   Loaded: loaded (/etc/rc.d/init.d/network)
   Active: failed (Result: exit-code) since Tue 2015-01-27
14:00:27 CET; 1min 56s ago
  Process: 4113 ExecStart=/etc/rc.d/init.d/network start
(code=exited, status=1/FAILURE)

Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test systemd[1]: network.service:
control process exited, code=exited status=1
Jan 27 14:00:27 test2.netbulae.test systemd[1]: Failed to start
LSB: Bring up/down networking.
Jan 27 14:00:27 test2.netbulae.test systemd[1]: Unit
network.service entered failed state.

-- Subject: Unit network.service has failed
-- Defined-By: systemd
-- Support:
http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- 
-- Unit network.service has failed.
-- 
-- The result is failed.

Jan 27 14:00:27 test2.netbulae.test systemd[1]: Unit
network.service entered failed state.


In supervdsm I find lots and lots of these:


sourceRoute::DEBUG::2015-01-27
14:02:40,802::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:02:40,806::utils::759::root::(execCmd) FAILED: err =
'RTNETLINK answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:02:40,806::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:02:40,807::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed
sourceRoute::DEBUG::2015-01-27
14:07:09,047::sourceroutethread::39::root::(process_IN_CLOSE_WRITE_filePath)
Responding to DHCP response in /var/run/vdsm/sourceRoutes/1422364029
sourceRoute::INFO::2015-01-27
14:07:09,052::sourceroute::78::root::(configure) Configuring
gateway - ip: **.***.**.***, network: **.***.**.**0/24, subnet:
255.255.255.0, gateway: **.***.**.**, table: 174351414, device: bond0
sourceRoute::DEBUG::2015-01-27
14:07:09,052::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:07:09,057::utils::759::root::(execCmd) FAILED: err =
'RTNETLINK answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:07:09,057::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:07:09,057::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed


The hosts are all Centos 7 freshly provisioned. With these vdsm packages:

vdsm-python-zombiereaper-4.16.10-8.gitc937927.el7.noarch
vdsm-python-4.16.10-8.gitc937927.el7.noarch
vdsm-yajsonrpc-4.16.10-8.gitc937927.el7.noarch
vdsm-4.16.10-8.gitc937927.el7.x86_64
vdsm-xmlrpc-4.16.10-8.gitc937927.el7.noarch
vdsm-jsonrpc-4.16.10-8.gitc937927.el7.noarch
vdsm-cli-4.16.10-8.gitc937927.el7.noarch





Met vriendelijke groet, With kind regards,

Jorick Astrego*

Netbulae Virtualization Experts *

Tel: 053 20 30 270  i...@netbulae.euStaalsteden 4-3AKvK 
08198180
Fax: 053 20 30 271  www.netbulae.eu 7547 TA 

Re: [ovirt-users] Network config

2015-01-27 Thread Sven Kieske


On 27/01/15 14:20, Ondřej Svoboda wrote:
 just what comes to my mind: do you have NetworkManager running, by any
 chance? VDSM and NetworkManager don't mix well currently so please try

Doesn't vdsm / ovirt take care of that during installation?
on EL7 NetworkManager is active by default!

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Network config

2015-01-27 Thread Jorick Astrego
Ah yes, I forgot to disable it as I switched to foreman provisioning.

Maybe it can be auto detected and disabled in the install host scripts
when ovirt installs vdsm and sets up the networking. For people who
forget to disable it.

But now I get this:

sourceRoute::DEBUG::2015-01-27
14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via ***.***.***.*** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:33:57,420::utils::759::root::(execCmd) FAILED: err = 'RTNETLINK
answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:33:57,420::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:33:57,420::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed

Jan 27 14:38:19 test3.netbulae.test network[3042]: RTNETLINK
answers: File exists
Jan 27 14:38:19 test3.netbulae.test systemd[1]: network.service:
control process exited, code=exited status=1
Jan 27 14:38:19 test3.netbulae.test systemd[1]: Failed to start LSB:
Bring up/down networking.
-- Subject: Unit network.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- 
-- Unit network.service has failed.
-- 
-- The result is failed.
Jan 27 14:38:19 test3.netbulae.test systemd[1]: Unit network.service
entered failed state.



Bond0 is still there though

sourceRoute::DEBUG::2015-01-27
14:33:57,410::sourceroutethread::39::root::(process_IN_CLOSE_WRITE_filePath)
Responding to DHCP response in /var/run/vdsm/sourceRoutes/1422365637
sourceRoute::INFO::2015-01-27
14:33:57,416::sourceroute::78::root::(configure) Configuring gateway
- ip: **.***.**.**, network: 10.100.100.0/24, subnet: 255.255.255.0,
gateway: **.***.**.**, table: 174351414, device: bond0
sourceRoute::DEBUG::2015-01-27
14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:33:57,420::utils::759::root::(execCmd) FAILED: err = 'RTNETLINK
answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:33:57,420::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:33:57,420::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed
sourceRoute::DEBUG::2015-01-27
14:37:52,085::sourceroutethread::39::root::(process_IN_CLOSE_WRITE_filePath)
Responding to DHCP response in /var/run/vdsm/sourceRoutes/1422365872
sourceRoute::INFO::2015-01-27
14:37:52,090::sourceroute::78::root::(configure) Configuring gateway
- ip: **.***.**.**, network: 10.100.100.0/24, subnet: 255.255.255.0,
gateway: **.***.**.**, table: 174351414, device: bond0
sourceRoute::DEBUG::2015-01-27
14:37:52,090::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:37:52,094::utils::759::root::(execCmd) FAILED: err = 'RTNETLINK
answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:37:52,094::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:37:52,094::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed


If you need more info, let me know.


On 01/27/2015 02:20 PM, Ondřej Svoboda wrote:
 Jorick,

 just what comes to my mind: do you have NetworkManager running, by any
 chance? VDSM and NetworkManager don't mix well currently so please try

 # systemctl stop NetworkManager
 # systemctl disable NetworkManager

 Regards,
 Ondra Svoboda

 On 27.1.2015 14:10, Jorick Astrego wrote:

 On 01/27/2015 01:56 PM, Koen Vanoppen wrote:
 Hello everybody,

 Just wanted to know if anybody else has the problem, that when you
 restart your network configuration of a hypervisor, all the settings
 concerning the bonding are gone...
 Do I file a bug report for this or is this already a known issue?

 Kind regards,

 Koen

 Hmm,

 I cannot restart the network service from the host for some reason:

 Jan 27 14:00:27 test2 systemd: network.service: control process
 exited, code=exited status=1
 Jan 27 14:00:27 test2 systemd: Failed to start LSB: Bring up/down
 networking.


 systemctl status network.service
 network.service - LSB: Bring up/down networking
Loaded: loaded (/etc/rc.d/init.d/network)
Active: failed (Result: exit-code) since Tue 2015-01-27
 14:00:27 CET; 1min 56s ago
   Process: 4113 ExecStart=/etc/rc.d/init.d/network start
 

Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
Except you can’t change the network MTU setting in the first place on the 
network in the GUI. I’ve thought about doing it in the database, with a 
migration as you mention. Just checking first for better options :)


 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:
 
 Migrate the vms between hosts, and when a host gets to no running VMS you can 
 sync that network. 
 
 Donny D
 cloudspin.meOn Jan 27, 2015 10:43 AM, Darrell Budic bu...@onholyground.com 
 wrote:
 
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up to 
 MTU 1500. I see the GUI won’t let me do that while the network is in use. 
 Any way around this, clean or otherwise? Restarting VMs to update them is 
 ok, just trying to avoid having to take everything down at the same time.
 
 -Darrell
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Host remains Non-Responsive after reboot

2015-01-27 Thread Piotr Kliczewski
Looking at the logs I can see that connection was lost at 2015-01-26
09:24:43,213
and I can see good number of reconnection attempts which end up with
timeout or 'no route to host'.
The connection was recovered at 2015-01-26 09:28:56,292.

Vdsm.log do not contain above connection loss (it starts at 2015-01-26
10:01:02,208).

It was lost again at 2015-01-26 11:54:58,741 and it was recovered at
2015-01-26 12:01:47,752.

I checked vdsm logs and I can see really weird lack of logs:

JsonRpc (StompReactor)::DEBUG::2015-01-26
11:52:35,893::stompReactor::98::Broker.StompAdapter::(handle_frame)
Handling message StompFMainThread::INFO::2015-01-26
12:01:45,183::vdsm::131::vds::(run) (PID: 7021) I am the actual vdsm
4.16.10-8.gitc937927.el6 love005.ovt.visionamics.com
(2.6.32-504.3.3.el6.x86_64)
MainThread::DEBUG::2015-01-26
12:01:45,184::resourceManager::421::Storage.ResourceManager::(registerNamespace)
Registering namespace 'Storage'

which covers having no connection from the engine perspective.

Usually when there are connectivity issues we see timeouts in the logs
but here there are 'no route to host' as well
which suggest networking issues.

@Dan - Do you know what caused lack of logs in vdsm?
@ILanit - What vdsm version do you use?

On Tue, Jan 27, 2015 at 4:57 PM, Piotr Kliczewski pklic...@redhat.com wrote:




 - Original Message -
 From: Eli Mesika emes...@redhat.com
 To: Piotr Kliczewski pklic...@redhat.com
 Cc: Artyom Lukianov aluki...@redhat.com, users@ovirt.org, 
 rabsh...@citytwist.net, ILanit Stein
 ist...@redhat.com
 Sent: Tuesday, January 27, 2015 4:39:26 PM
 Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot



 - Original Message -
  From: ILanit Stein ist...@redhat.com
  To: Artyom Lukianov aluki...@redhat.com, Eli Mesika
  emes...@redhat.com
  Cc: users@ovirt.org, rabsh...@citytwist.net
  Sent: Tuesday, January 27, 2015 5:19:12 PM
  Subject: Fwd: [ovirt-users] Host remains Non-Responsive after reboot
 
 
  Hi Guys,
 
  Can you please look into this please?

 Hi
 From the logs I can see clearly that host is turned on in 2015-01-26
 11:56:51,191
 However, there is a stomp exception in 2015-01-26 11:56:53,544 and a
 connection timeout in 2015-01-26 11:56:53,553 that might be related

 Piotr, can you please have a look ?


 Sure. Can you please send me the logs?


 
  Thanks,
  Ilanit.
  - Forwarded Message -
  From: Rob Abshear rabsh...@citytwist.net
  To: ILanit Stein ist...@redhat.com
  Sent: Tuesday, January 27, 2015 3:05:56 PM
  Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
 
  Here are the logs. you requested.  The shutdown of the node was at 11:53
  and vdsmd was manually restarted at 12:01 to get the node back online.
 
  On Tue, Jan 27, 2015 at 2:05 AM, ILanit Stein ist...@redhat.com wrote:
 
   It might be a bug,
   Would you please attach the logs, I mentioned bellow,
   that can bring more details on the failure?
   Adding Eli, that may want to give some input on this issue.
  
   Thanks,
   Ilanit.
  
   - Original Message -
   From: Rob Abshear rabsh...@citytwist.net
   To: ILanit Stein ist...@redhat.com
   Cc: users@ovirt.org
   Sent: Monday, January 26, 2015 9:43:14 PM
   Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
  
   I have done a bit more investigating on this matter.  If I restart the
   node
   from within oVirt using the power management option restart, then the
   node restarts and vdsmd DOES NOT start.  If I go into the DRAC and issue
   the command to power cycle the machine, then the machine restarts and
   vdsmd
   DOES start.  I can run the following command from another node in the
   cluster:
   fence_drac5 -a 192.168.200.105 -l root -p password -x -o reboot
   and the node restarts and vdsmd DOES start.
  
   On Sun, Jan 25, 2015 at 1:56 AM, ILanit Stein ist...@redhat.com wrote:
  
Hi Rob,
   
Thanks for this report.
   
Would you please provide these logs, at the time frame, the host
failure
occur:
1. oVirt Engine: /var/log/ovirt-engine/engine.log
2. host: /var/log/vdsm/vdsm.log
   
If it is reproducible, please add this info as well.
   
You can also check vdsm service status, on host, while host reported as
Non responsive,
by running on host 'service vdsmd status'
There might some problem, that might have prevented from vdsm service
to
come up, on host.
   
Ilanit.
   
- Original Message -
From: Rob Abshear rabsh...@citytwist.net
To: users@ovirt.org
Sent: Friday, January 23, 2015 9:22:42 PM
Subject: [ovirt-users] Host remains Non-Responsive after reboot
   
   
I am running oVirt Engine Version 3.5.0.1-1.el6. I have 4 hosts in the
cluster. Each host has a drac5 and it is configured and working. I am
trying to simulate a node failure. I am running one HA VM on one of the
hosts for testing. I simulate the failure by powering off the host with
   the
VM running.
   
 

Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Donny Davis
Maybe I missed your question, but I can change the MTU from the gui without any 
problems. As long as you make sure there are not any vm's on the host you are 
trying to sync, I have experienced no issues. 

Donny

-Original Message-
From: Darrell Budic [mailto:bu...@onholyground.com] 
Sent: Tuesday, January 27, 2015 12:30 PM
To: Donny Davis
Cc: users@ovirt.org
Subject: Re: [ovirt-users] change network MTU settings without taking all the 
VMs down?

Except you can’t change the network MTU setting in the first place on the 
network in the GUI. I’ve thought about doing it in the database, with a 
migration as you mention. Just checking first for better options :)


 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:
 
 Migrate the vms between hosts, and when a host gets to no running VMS you can 
 sync that network. 
 
 Donny D
 cloudspin.meOn Jan 27, 2015 10:43 AM, Darrell Budic bu...@onholyground.com 
 wrote:
 
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up to 
 MTU 1500. I see the GUI won’t let me do that while the network is in use. 
 Any way around this, clean or otherwise? Restarting VMs to update them is 
 ok, just trying to avoid having to take everything down at the same time.
 
 -Darrell
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Host remains Non-Responsive after reboot

2015-01-27 Thread Rob Abshear
Yeah.  There would have been a lot of connection issues because I was doing
a lot of testing and reconfiguring.The only part that's really
applicable for this issue is the period you mentioned from 11:54 to 12:01.
I did use the service vdsm status command after the host came back up and
the service was not running.  I start the service manually and it comes up
without error and the node comes back online.  Is it normal operation for
the host to automatically recover if it can, including starting vdsmd?  One
of my colleagues thinks that, perhaps that we are experiencing normal
operation.  But I can't imagine that the host wouldn't come back completely
if it's able.

On Tue, Jan 27, 2015 at 3:05 PM, Piotr Kliczewski 
piotr.kliczew...@gmail.com wrote:

 Looking at the logs I can see that connection was lost at 2015-01-26
 09:24:43,213
 and I can see good number of reconnection attempts which end up with
 timeout or 'no route to host'.
 The connection was recovered at 2015-01-26 09:28:56,292.

 Vdsm.log do not contain above connection loss (it starts at 2015-01-26
 10:01:02,208).

 It was lost again at 2015-01-26 11:54:58,741 and it was recovered at
 2015-01-26 12:01:47,752.

 I checked vdsm logs and I can see really weird lack of logs:

 JsonRpc (StompReactor)::DEBUG::2015-01-26
 11:52:35,893::stompReactor::98::Broker.StompAdapter::(handle_frame)
 Handling message StompFMainThread::INFO::2015-01-26
 12:01:45,183::vdsm::131::vds::(run) (PID: 7021) I am the actual vdsm
 4.16.10-8.gitc937927.el6 love005.ovt.visionamics.com
 (2.6.32-504.3.3.el6.x86_64)
 MainThread::DEBUG::2015-01-26

 12:01:45,184::resourceManager::421::Storage.ResourceManager::(registerNamespace)
 Registering namespace 'Storage'

 which covers having no connection from the engine perspective.

 Usually when there are connectivity issues we see timeouts in the logs
 but here there are 'no route to host' as well
 which suggest networking issues.

 @Dan - Do you know what caused lack of logs in vdsm?
 @ILanit - What vdsm version do you use?

 On Tue, Jan 27, 2015 at 4:57 PM, Piotr Kliczewski pklic...@redhat.com
 wrote:
 
 
 
 
  - Original Message -
  From: Eli Mesika emes...@redhat.com
  To: Piotr Kliczewski pklic...@redhat.com
  Cc: Artyom Lukianov aluki...@redhat.com, users@ovirt.org,
 rabsh...@citytwist.net, ILanit Stein
  ist...@redhat.com
  Sent: Tuesday, January 27, 2015 4:39:26 PM
  Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
 
 
 
  - Original Message -
   From: ILanit Stein ist...@redhat.com
   To: Artyom Lukianov aluki...@redhat.com, Eli Mesika
   emes...@redhat.com
   Cc: users@ovirt.org, rabsh...@citytwist.net
   Sent: Tuesday, January 27, 2015 5:19:12 PM
   Subject: Fwd: [ovirt-users] Host remains Non-Responsive after reboot
  
  
   Hi Guys,
  
   Can you please look into this please?
 
  Hi
  From the logs I can see clearly that host is turned on in 2015-01-26
  11:56:51,191
  However, there is a stomp exception in 2015-01-26 11:56:53,544 and a
  connection timeout in 2015-01-26 11:56:53,553 that might be related
 
  Piotr, can you please have a look ?
 
 
  Sure. Can you please send me the logs?
 
 
  
   Thanks,
   Ilanit.
   - Forwarded Message -
   From: Rob Abshear rabsh...@citytwist.net
   To: ILanit Stein ist...@redhat.com
   Sent: Tuesday, January 27, 2015 3:05:56 PM
   Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
  
   Here are the logs. you requested.  The shutdown of the node was at
 11:53
   and vdsmd was manually restarted at 12:01 to get the node back online.
  
   On Tue, Jan 27, 2015 at 2:05 AM, ILanit Stein ist...@redhat.com
 wrote:
  
It might be a bug,
Would you please attach the logs, I mentioned bellow,
that can bring more details on the failure?
Adding Eli, that may want to give some input on this issue.
   
Thanks,
Ilanit.
   
- Original Message -
From: Rob Abshear rabsh...@citytwist.net
To: ILanit Stein ist...@redhat.com
Cc: users@ovirt.org
Sent: Monday, January 26, 2015 9:43:14 PM
Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
   
I have done a bit more investigating on this matter.  If I restart
 the
node
from within oVirt using the power management option restart, then
 the
node restarts and vdsmd DOES NOT start.  If I go into the DRAC and
 issue
the command to power cycle the machine, then the machine restarts
 and
vdsmd
DOES start.  I can run the following command from another node in
 the
cluster:
fence_drac5 -a 192.168.200.105 -l root -p password -x -o reboot
and the node restarts and vdsmd DOES start.
   
On Sun, Jan 25, 2015 at 1:56 AM, ILanit Stein ist...@redhat.com
 wrote:
   
 Hi Rob,

 Thanks for this report.

 Would you please provide these logs, at the time frame, the host
 failure
 occur:
 1. oVirt Engine: /var/log/ovirt-engine/engine.log
 2. host: 

Re: [ovirt-users] Network config

2015-01-27 Thread Ondřej Svoboda
From the first log (journal): VDSM managed to start, only it is being 
run too early (it should be waiting patiently for its supervdsm 
component to start responding).


Are there no RTNETLINK answers: File exists errors anymore?

Second log: This is normal operation.

So the host stopped responding? It should have come up, assuming that it 
obtained the same address from DHCP (which it should, but please check).


Could you try using static network configuration first? And when it 
succeeds you can try DHCP again.


===

About the GUI: Is your host's status now Not Operational? In that case 
oVirt cannot tell anything about the status of the host's interfaces, it 
may only reflect the status before communication was lost.


Please file a bug if this is really the case I described :-) It would 
then be a matter of discussion if oVirt should display a yellow 
unknown status of the interfaces, or keep displaying the last known 
one. (The first solution would make it obvious that we really lost the 
host and go looking for the host's status to discover it is Not 
Operational. The other is informative as well, just in a different way.)


On 27.1.2015 16:31, Jorick Astrego wrote:


Done, maybe this helps:

Jan 27 16:20:26 test2 vdsmd_init_common.sh: vdsm: Running
unified_network_persistence_upgrade
Jan 27 16:20:26 test2 vdsmd_init_common.sh: vdsm: Running restore_nets
Jan 27 16:21:47 test2 systemd-logind: New session 5 of user root.
Jan 27 16:21:47 test2 systemd: Starting Session 5 of user root.
Jan 27 16:21:47 test2 systemd: Started Session 5 of user root.
Jan 27 16:21:55 test2 systemd: vdsmd.service operation timed out.
Terminating.
Jan 27 16:21:55 test2 journal: End of file while reading data:
Input/output error
Jan 27 16:21:55 test2 systemd: Failed to start Virtual Desktop
Server Manager.
Jan 27 16:21:55 test2 systemd: Unit vdsmd.service entered failed
state.
Jan 27 16:21:55 test2 systemd: vdsmd.service holdoff time over,
scheduling restart.
Jan 27 16:21:55 test2 systemd: Stopping Virtual Desktop Server
Manager...
Jan 27 16:21:55 test2 systemd: Starting Virtual Desktop Server
Manager...
Jan 27 16:21:55 test2 vdsmd_init_common.sh: vdsm: Running mkdirs

Still:

MainThread::INFO::2015-01-27
16:21:56,964::netconfpersistence::158::root::(_clearDisk) Clearing
/var/run/vdsm/netconf/nets/ and /var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
16:21:56,964::netconfpersistence::166::root::(_clearDisk) No
existent config to clear.
MainThread::INFO::2015-01-27
16:21:56,964::netconfpersistence::182::root::(save) Saved new
config RunningConfig({}, {}) to /var/run/vdsm/netconf/nets/ and
/var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
16:21:56,965::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.

And when I want to reapply setup networks in ovirt I get:


Error while executing action Commit Network changes: Network error
during communication with the Host.

Every interface and bond is still green and visible in the webgui by 
the way. That is also strange, every interface of the host is bonded 
and up while in reality the host has lost it's network config and 
doesn't respond anymore.


Shouldn't they all be red on communication failure to the host?

Cheers,

Jorick

On 01/27/2015 04:01 PM, Ondřej Svoboda wrote:
This is weird, like there were multiple dhclients on the bond. Can 
you stop VDSM (systemctl stop vdsmd supervdsmd) and maybe `killall 
-SIGKILL dhclient` and restart VDSM afterwards?


On 27.1.2015 14:43, Jorick Astrego wrote:

Ah yes, I forgot to disable it as I switched to foreman provisioning.

Maybe it can be auto detected and disabled in the install host 
scripts when ovirt installs vdsm and sets up the networking. For 
people who forget to disable it.


But now I get this:

sourceRoute::DEBUG::2015-01-27
14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via ***.***.***.*** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:33:57,420::utils::759::root::(execCmd) FAILED: err =
'RTNETLINK answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:33:57,420::sourceroute::88::root::(configure) ip binary
failed during source route configuration: ['RTNETLINK answers:
File exists']
sourceRoute::WARNING::2015-01-27
14:33:57,420::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed

Jan 27 14:38:19 test3.netbulae.test network[3042]: RTNETLINK
answers: File exists
Jan 27 14:38:19 test3.netbulae.test systemd[1]: network.service:
control process exited, code=exited status=1
Jan 27 14:38:19 test3.netbulae.test systemd[1]: Failed to start
LSB: Bring up/down networking.
-- Subject: Unit network.service has failed
-- Defined-By: systemd
-- Support:

Re: [ovirt-users] Network config

2015-01-27 Thread Ondřej Svoboda
I can see in the log that VDSM tried to obtain information about bond1 
and (bridge) Internet, both of which did not exist (yet?) in a 
procedure which reports the information back to oVirt. But this just 
means that no information about them was reported, it shows no real error.


There was no network configuration to be stored (persisted), this is why 
the directories /var/run/vdsm/netconf/nets and 
/var/run/vdsm/netconf/bonds/ are not there.


If you use a static address/gateway, does the creation of the bond and 
the network succeed in Setup host networks? If it does not I suggest 
that you attach the whole logs.


On 27.1.2015 15:59, Jorick Astrego wrote:


On 01/27/2015 01:56 PM, Koen Vanoppen wrote:

Hello everybody,

Just wanted to know if anybody else has the problem, that when you 
restart your network configuration of a hypervisor, all the settings 
concerning the bonding are gone...

Do I file a bug report for this or is this already a known issue?

Kind regards,

Koen



After a host restart the bonds are indead gone, except for bond and I 
get this output:



MainThread::DEBUG::2015-01-27
15:44:18,343::api::623::root::(setupNetworks) Validating configuration
MainThread::DEBUG::2015-01-27
15:44:18,355::api::635::setupNetworks::(setupNetworks) Applying...
MainThread::DEBUG::2015-01-27
15:44:18,355::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,356::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,356::utils::739::root::(execCmd) /sbin/ip route show to
0.0.0.0/0 table all (cwd None)
MainThread::DEBUG::2015-01-27
15:44:18,358::utils::759::root::(execCmd) SUCCESS: err = '';
rc = 0
MainThread::INFO::2015-01-27
15:44:18,366::netinfo::555::root::(_getNetInfo) Obtaining info for
net bond1.
Traceback (most recent call last):
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line
552, in _getNetInfo
'mtu': str(getMtu(iface))})
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line
182, in getMtu
mtu = open('/sys/class/net/%s/mtu' % iface).readline().rstrip()
IOError: [Errno 2] No such file or directory:
u'/sys/class/net/bond1/mtu'
MainThread::INFO::2015-01-27
15:44:18,366::netinfo::555::root::(_getNetInfo) Obtaining info for
net Internet.
Traceback (most recent call last):
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line
536, in _getNetInfo
data.update({'ports': ports(iface), 'stp':
bridge_stp_state(iface),
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line
178, in ports
return os.listdir('/sys/class/net/' + bridge + '/brif')
OSError: [Errno 2] No such file or directory:
'/sys/class/net/Internet/brif'
MainThread::DEBUG::2015-01-27
15:44:18,373::utils::739::root::(execCmd) /sbin/ip route show to
0.0.0.0/0 table all (cwd None)
MainThread::DEBUG::2015-01-27
15:44:18,375::utils::759::root::(execCmd) SUCCESS: err = '';
rc = 0
MainThread::INFO::2015-01-27
15:44:18,384::netconfpersistence::158::root::(_clearDisk) Clearing
/var/run/vdsm/netconf/nets/ and /var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
15:44:18,384::netconfpersistence::166::root::(_clearDisk) No
existent config to clear.
MainThread::INFO::2015-01-27
15:44:18,384::netconfpersistence::182::root::(save) Saved new
config RunningConfig({}, {}) to /var/run/vdsm/netconf/nets/ and
/var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
15:44:18,386::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,386::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,470::utils::739::root::(execCmd) /usr/sbin/ifdown
Internet (cwd None)
MainThread::DEBUG::2015-01-27
15:45:46,165::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:45:46,165::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.

This is what  I get  later after I try Setup host networks again:

MainThread::INFO::2015-01-27
15:53:16,739::netconfpersistence::182::root::(save) Saved new
config RunningConfig({}, {}) to /var/run/vdsm/netconf/nets/ and
/var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
15:53:16,740::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:53:16,740::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:53:16,743::utils::739::root::(execCmd) /usr/sbin/ifdown
Internet (cwd None)
MainThread::DEBUG::2015-01-27

[ovirt-users] Changing the engine HA ping address?

2015-01-27 Thread Chris Adams
I have an up-to-date hosted-engine 3.5.1 setup (CentOS 7 for the nodes,
CentOS 6 for the engine), and the engine keeps jumping between the two
nodes running the hosted-engine HA (sometimes after just 10-20 minutes,
sometimes after a day or two).  I figured out that it is failing on
pinging the gateway sometimes.

The gateway IP is a layer-3 switch, and I think sometimes it just is not
responding to ICMP echo request in a timely fashion (traffic is routing
just fine though).  How is the HA ping implemented?  How many requests
does it send (and how many responses are required to be considered
good)?

If I can't tweak the sensitivity of the ping, I'd like to ping a
different IP (on a HA load balancer setup).  The oVirt HA config refers
to it as gateway though; is it really used as a gateway in any case,
or is that just the recommended IP?

Can I just edit /etc/ovirt-hosted-engine/hosted-engine.conf on the two
nodes and restart the ovirt-ha-broker service?

-- 
Chris Adams c...@cmadams.net
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Update to 3.5.1 scrambled multipath.conf?

2015-01-27 Thread Nir Soffer
- Original Message -
 From: Gianluca Cecchi gianluca.cec...@gmail.com
 To: Nir Soffer nsof...@redhat.com
 Cc: Dan Kenigsberg dan...@redhat.com, users users@ovirt.org, Yeela 
 Kaplan ykap...@redhat.com, Benjamin
 Marzinski bmarz...@redhat.com
 Sent: Monday, January 26, 2015 8:46:52 PM
 Subject: Re: [ovirt-users] Update to 3.5.1 scrambled multipath.conf?
 
 On Mon, Jan 26, 2015 at 1:37 PM, Nir Soffer nsof...@redhat.com wrote:
 
 
Any suggestion appreciated.
   
Current multipath.conf (where I also commented out the getuid_callout
  that
is not used anymore):
   
[root@tekkaman setup]# cat /etc/multipath.conf
# RHEV REVISION 1.1
   
blacklist {
devnode ^(sda|sdb)[0-9]*
}
 
 
  I think what happened is:
 
  1. 3.5.1 had new multipath version
 
 
 what do you mean with new multipath version?

I mean new multipath.conf version:

# RHEV REVISION 1.1

When vdsm finds that its current configuration version is different from current
multipath.conf version, it replaces the current multipath.conf with a new 
version.

 I currently have device-mapper-multipath-0.4.9-56.fc20.x86_64
 The system came from f18 to f19 and then to f20 via fedup in both cases
 In my yum.log files I see this about January 2013 when I was in Fedora 18
 Jan 07 00:11:44 Installed: device-mapper-multipath-0.4.9-36.fc18.x86_64
 
 I then upgraded to f19 on 24th November 2013 and device-mapper-multipath
 was the one pushed during the fedup update:
 device-mapper-multipath-libs-0.4.9-51.fc19.x86_64
 
 Then on 12th of November 2014 I passed form f19 to f20 and fedup pushed
 device-mapper-multipath-0.4.9-56.fc20.x86_64
 that is my current version
 At that time I also passed from oVirt 3.4.4 to 3.5.0.
 And I didn't register any problem with my internal disk
 
 It was sufficient to keep inside
 blacklist {
devnode sd[a-b]
 }
 
 At the head of the file I only had:
 # RHEV REVISION 1.0

This is why vdsm replaced the file.

 
 No reference about
 # RHEV PRIVATE
 
 But right now that I'm writing I notice that my rule for blacklist after
 migration to 3.5.1 was
 
 devnode ^(sda|sdb)[0-9]*
 
 probably blacklists only partitions and not the full disks ;-)
 So I'm going to check with the old blacklist option and/or the PRIVATE
 label as suggested by Dan...
 
 Probably passing from RHEV REVISION 1.0 to 1.1 the original blacklist part
 was thrown away...
 
 2. So vdsm upgraded the local file
  3. blacklist above was removed
 (it should exists in /etc/multipath.bak)
 
 
 It seems it didn't generate any multipath.conf.bak ...
 

In 3.5.1 it should rotate multipath.conf, saving multipath.conf.1 ...

If you don't find the backup file after multipath.conf was updated, this is a 
bug.
Can you open a a bug about it?

Nir
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] ovirt 3.5 self-hosted engine and ovirtmgmt not syncronised

2015-01-27 Thread Lior Vernia


On 23/01/15 19:41, Kostyrev Aleksandr wrote:
 Donny Davis писал 2015-01-23 18:07:
 You have to move the hosted engine to another host to sync the
 networks, they cannot be synced while the host is running vm's. Bring
 your second hosted engine machine online so the engine can be
 migrated, and the host in which you want to sync the network can be
 synced.

 I only know from experience, and one of the ovirt devs may have a
 better solution. This is what worked for me.

Indeed, there's a bug open about it and we intend to get that fixed for 3.6:
https://bugzilla.redhat.com/show_bug.cgi?id=1055454

 
 Hmmm...
 I was able to sync ovirtmgmt network on two hosts out of three but now I
 have a problem starting engine up on those two nodes
 
 Jan 23 20:23:23 ovirt-3 vdsm vm.Vm ERROR
 vmId=`e3c64c89-65e0-4936-83af-de201130e789`::The vm start process
 failed#012Traceback (most recent call last):#012  File
 /usr/share/vdsm/virt/vm.py, line 2264, in _startUnderlyingVm#012   
 self._run()#012  File /usr/share/vdsm/virt/vm.py, line 3328, in
 _run#012self._connection.createXML(domxml, flags),#012  File
 /usr/lib/python2.6/site-packages/vdsm/libvirtconnection.py, line 111,
 in wrapper#012ret = f(*args, **kwargs)#012  File
 /usr/lib64/python2.6/site-packages/libvirt.py, line 2709, in
 createXML#012if ret is None:raise libvirtError('virDomainCreateXML()
 failed', conn=self)#012libvirtError: Cannot get interface MTU on
 'ovirtmgmt': No such device
 
 as I mentioned earlier file
 /etc/sysconfig/network-scripts/ifcfg-ovirtmgmt gets removed from those
 two nodes after syncing.
 

Is ovirtmgmt configured as non-VM by any chance?

 How can I recreated that ifcfg-ovirtmgmt file?
 
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Power Management config on Ovirt

2015-01-27 Thread Piotr Kliczewski
Hi Mathew,

It seems like this is quite old version (September 2014). There were good 
number of fixes in this area.

Looking at provided logs I am not able to tell which call is exactly failing. 
Do you have any steps to reproduce?

Thanks,
Piotr

- Original Message -
From: Renchu Mathew ren...@cracknell.com
To: Piotr Kliczewski piotr.kliczew...@gmail.com
Cc: Eli Mesika emes...@redhat.com, Piotr Kliczewski 
pklic...@redhat.com, users@ovirt.org
Sent: Monday, January 26, 2015 5:15:46 AM
Subject: RE: [ovirt-users] Power Management config on Ovirt

Hi Piotr,
VDSM Version is vdsm-4.16.4-0.el6

Regards

Renchu Mathew
.

From: Piotr Kliczewski [mailto:piotr.kliczew...@gmail.com]
Sent: Sunday, January 25, 2015 6:08 PM
To: Renchu Mathew
Cc: Eli Mesika; Piotr Kliczewski; users@ovirt.org
Subject: Re: [ovirt-users] Power Management config on Ovirt



On Sun, Jan 25, 2015 at 2:07 PM, Renchu Mathew 
ren...@cracknell.commailto:ren...@cracknell.com wrote:

Hi Eli,



I have 2 hypervisor host and both are installed with below iso file. Master 
data domain is glusterfs which is configured on another server. Is there any 
latest iso available?



ovirt-node-iso-3.5.0.ovirt35.20140912.el6 (Edited).

@Eli - Do we know which vdsm version is in this iso?




Please find below details.



[cid:image001.png@01D0393F.9A5CF2D0]



Thanks  Regards



Renchu Mathew



-Original Message-
From: Eli Mesika [mailto:emes...@redhat.commailto:emes...@redhat.com]
Sent: Sunday, January 25, 2015 4:46 PM
To: Renchu Mathew
Cc: Martin Perina; users@ovirt.orgmailto:users@ovirt.org; Piotr Kliczewski
Subject: Re: [ovirt-users] Power Management config on Ovirt







- Original Message -

 From: Renchu Mathew ren...@cracknell.commailto:ren...@cracknell.com

 To: Eli Mesika emes...@redhat.commailto:emes...@redhat.com

 Cc: Martin Perina mper...@redhat.commailto:mper...@redhat.com, 
 users@ovirt.orgmailto:users@ovirt.org

 Sent: Sunday, January 25, 2015 11:19:21 AM

 Subject: RE: [ovirt-users] Power Management config on Ovirt



 Hi Eli,



 Please use any details.



 Thanks for your support.



 Regards



 Renchu Mathew



Hi again



Looking in the logs I see that this is a issue we handled when we switched from 
XML RPC to JSON RPC protocol.

Which vdsm is installed in the proxy host ?

I am adding Piotr who is on charge of the JSON RPC protocol and who helped me 
in the past when I got a similar error on PM operations

It looks like marshaling issue but I am not sure whether it was already fixed 
or it is something new. I am not able to match a call when looking at 
engine.log.







 -Original Message-

 From: Eli Mesika [mailto:emes...@redhat.commailto:emes...@redhat.com]

 Sent: Sunday, January 25, 2015 12:58 PM

 To: Renchu Mathew

 Cc: Martin Perina; users@ovirt.orgmailto:users@ovirt.org

 Subject: Re: [ovirt-users] Power Management config on Ovirt



 Hi Renchu



 I am looking currently on he case and will respond later on today

 after investigating the logs.

 If I will not find anything, can I use the details for the host PM

 only for status command to try to debug and find the source of the problem ?



 Thanks



 Eli



 - Original Message -

  From: Renchu Mathew ren...@cracknell.commailto:ren...@cracknell.com

  To: Martin Perina mper...@redhat.commailto:mper...@redhat.com

  Cc: users@ovirt.orgmailto:users@ovirt.org, Eli Mesika 
  emes...@redhat.commailto:emes...@redhat.com

  Sent: Sunday, January 25, 2015 6:50:00 AM

  Subject: RE: [ovirt-users] Power Management config on Ovirt

 

  Hi Martin,

 

  I got the below output when I run the command and it works from both

  the nodes.

 

  fence_ipmilan -a 192.168.1.114 -l admin -p admin -o status -v -P

 

  Getting status of IPMI:192.168.1.114...Spawning: '/usr/bin/ipmitool

  -I lanplus -H '192.168.1.114' -U 'admin' -P '[set]' -v chassis power

  status'...

  Chassis power = On

  Done

 

  Thanks

 

  Renchu Mathew  |  Sr. IT Administrator

 

 

 

  CRACKNELL  DUBAI   |  P.O. Box 66231  |   United Arab Emirates  |  T

  +971 4

  3445417  |  F +971 4 3493675tel:%2B971%204%203493675 |  M +971 50 
  7386484tel:%2B971%2050%207386484 ABU DHABI | DUBAI

  | LONDON | MUSCAT | DOHA | JEDDAH EMAIL 
  ren...@cracknell.commailto:ren...@cracknell.com | WEB

  www.cracknell.comhttp://www.cracknell.com

 

  This email, its content and any files transmitted with it are

  intended solely for the addressee(s) and may be legally privileged

  and/or confidential. If you are not the intended recipient please

  let us know by email reply and delete it from the system. Please

  note that any views or opinions presented in this email do not

  necessarily represent those of the company. Email transmissions

  cannot be guaranteed to be secure or error-free as information could

  be intercepted, corrupted, lost, destroyed, arrive late or

  incomplete, or contain viruses. The company therefore does not

  accept liability for any errors or 

[ovirt-users] Does my Storage Domain crashed or is this iSCSI LUN's a problem?

2015-01-27 Thread shimano
Hi guys,

I'm trying to run one of my storage domains, which experienced failure.
Unfortunately, I meet a very nasty error (Storage domain does not exist).

Could someone tell me, how to try to restore this domain?

P.S.
It's an oVirt 3.4.2-1.el6

**


/var/log/messages:
Jan 26 12:48:49 node002 vdsm TaskManager.Task ERROR
Task=`10d02993-b585-448f-9a50-bd3e8cda7082`::Unexpected error#012Traceback
(most recent call last):#012  File /usr/share/vdsm/storage/task.py, line
873, in _run#012return fn(*args, **kargs)#012  File
/usr/share/vdsm/logUtils.py, line 45, in wrapper#012res = f(*args,
**kwargs)#012  File /usr/share/vdsm/storage/hsm.py, line 2959, in
getVGInfo#012return dict(info=self.__getVGsInfo([vgUUID])[0])#012  File
/usr/share/vdsm/storage/hsm.py, line 2892, in __getVGsInfo#012vgList
= [lvm.getVGbyUUID(vgUUID) for vgUUID in vgUUIDs]#012  File
/usr/share/vdsm/storage/lvm.py, line 894, in getVGbyUUID#012raise
se.VolumeGroupDoesNotExist(vg_uuid: %s %
vgUUID)#012VolumeGroupDoesNotExist: Volume Group does not exist: ('vg_uuid:
gyaCWf-6VKi-lI9W-JT6H-IZdy-rIsB-hTvZ4O',)
Jan 26 12:48:49 node002 kernel: device-mapper: table: 253:26: multipath:
error getting device
Jan 26 12:48:49 node002 kernel: device-mapper: ioctl: error adding target
to table

**

/var/log/vdsm.log:
Thread-22::ERROR::2015-01-26
12:43:03,376::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
for unfetched domain db52e9cb-7306-43fd-aff3-20831bc2bcaf
Thread-22::ERROR::2015-01-26
12:43:03,377::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
looking for domain db52e9cb-7306-43fd-aff3-20831bc2bcaf
Thread-22::DEBUG::2015-01-26
12:43:03,377::lvm::373::OperationMutex::(_reloadvgs) Operation 'lvm reload
operation' got the operation mutex
Thread-22::DEBUG::2015-01-26
12:43:03,378::lvm::296::Storage.Misc.excCmd::(cmd) u'/usr/bin/sudo -n
/sbin/lvm vgs --config  devices { preferred_names = [\\^/dev/mapper/\\]
ignore_suspended_devices=1 write_cache_state=0 disable_after_error_count=3
obtain_device_list_from_udev=0 filter = [
\'a|/dev/mapper/mpathb|/dev/mapper/mpathc|/dev/mapper/mpathd|/dev/mapper/mpathe|/dev/mapper/mpathf|\',
\'r|.*|\' ] }  global {  locking_type=1  prioritise_write_locks=1
wait_for_locks=1  use_lvmetad=0 }  backup {  retain_min = 50  retain_days =
0 }  --noheadings --units b --nosuffix --separator | -o
uuid,name,attr,size,free,extent_size,extent_count,free_count,tags,vg_mda_size,vg_mda_free,lv_count,pv_count,pv_name
db52e9cb-7306-43fd-aff3-20831bc2bcaf' (cwd None)
Thread-22::DEBUG::2015-01-26
12:43:03,462::lvm::296::Storage.Misc.excCmd::(cmd) FAILED: err = '
/dev/mapper/mpathc: Checksum error\n  /dev/mapper/mpathc: Checksum error\n
Volume group db52e9cb-7306-43fd-aff3-20831bc2bcaf not found\n  Skipping
volume group db52e9cb-7306-43fd-aff3-20831bc2bcaf\n'; rc = 5
Thread-22::WARNING::2015-01-26
12:43:03,466::lvm::378::Storage.LVM::(_reloadvgs) lvm vgs failed: 5 [] ['
/dev/mapper/mpathc: Checksum error', '  /dev/mapper/mpathc: Checksum
error', '  Volume group db52e9cb-7306-43fd-aff3-20831bc2bcaf not found',
'  Skipping volume group db52e9cb-7306-43fd-aff3-20831bc2bcaf']
Thread-22::DEBUG::2015-01-26
12:43:03,466::lvm::415::OperationMutex::(_reloadvgs) Operation 'lvm reload
operation' released the operation mutex
Thread-22::ERROR::2015-01-26
12:43:03,477::sdc::143::Storage.StorageDomainCache::(_findDomain) domain
db52e9cb-7306-43fd-aff3-20831bc2bcaf not found
Traceback (most recent call last):
  File /usr/share/vdsm/storage/sdc.py, line 141, in _findDomain
dom = findMethod(sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 171, in _findUnfetchedDomain
raise se.StorageDomainDoesNotExist(sdUUID)
StorageDomainDoesNotExist: Storage domain does not exist:
(u'db52e9cb-7306-43fd-aff3-20831bc2bcaf',)
Thread-22::ERROR::2015-01-26
12:43:03,478::domainMonitor::239::Storage.DomainMonitorThread::(_monitorDomain)
Error while collecting domain db52e9cb-7306-43fd-aff3-20831bc2bcaf
monitoring information
Traceback (most recent call last):
  File /usr/share/vdsm/storage/domainMonitor.py, line 204, in
_monitorDomain
self.domain = sdCache.produce(self.sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 98, in produce
domain.getRealDomain()
  File /usr/share/vdsm/storage/sdc.py, line 52, in getRealDomain
return self._cache._realProduce(self._sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 122, in _realProduce
domain = self._findDomain(sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 141, in _findDomain
dom = findMethod(sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 171, in _findUnfetchedDomain
raise se.StorageDomainDoesNotExist(sdUUID)
StorageDomainDoesNotExist: Storage domain does not exist:
(u'db52e9cb-7306-43fd-aff3-20831bc2bcaf',)
Thread-13::DEBUG::2015-01-26
12:43:05,102::task::595::TaskManager.Task::(_updateState)

Re: [ovirt-users] Power Management config on Ovirt

2015-01-27 Thread Renchu Mathew
Hi Piotr,

Can you please provide me the link to download the new node iso? I can see only 
ovirt live iso. 

Regards

Renchu Mathew 

-Original Message-
From: Piotr Kliczewski [mailto:pklic...@redhat.com] 
Sent: Tuesday, January 27, 2015 12:21 PM
To: Renchu Mathew
Cc: Piotr Kliczewski; Eli Mesika; users@ovirt.org
Subject: Re: [ovirt-users] Power Management config on Ovirt

Hi Mathew,

It seems like this is quite old version (September 2014). There were good 
number of fixes in this area.

Looking at provided logs I am not able to tell which call is exactly failing. 
Do you have any steps to reproduce?

Thanks,
Piotr

- Original Message -
From: Renchu Mathew ren...@cracknell.com
To: Piotr Kliczewski piotr.kliczew...@gmail.com
Cc: Eli Mesika emes...@redhat.com, Piotr Kliczewski 
pklic...@redhat.com, users@ovirt.org
Sent: Monday, January 26, 2015 5:15:46 AM
Subject: RE: [ovirt-users] Power Management config on Ovirt

Hi Piotr,
VDSM Version is vdsm-4.16.4-0.el6

Regards

Renchu Mathew
.

From: Piotr Kliczewski [mailto:piotr.kliczew...@gmail.com]
Sent: Sunday, January 25, 2015 6:08 PM
To: Renchu Mathew
Cc: Eli Mesika; Piotr Kliczewski; users@ovirt.org
Subject: Re: [ovirt-users] Power Management config on Ovirt



On Sun, Jan 25, 2015 at 2:07 PM, Renchu Mathew 
ren...@cracknell.commailto:ren...@cracknell.com wrote:

Hi Eli,



I have 2 hypervisor host and both are installed with below iso file. Master 
data domain is glusterfs which is configured on another server. Is there any 
latest iso available?



ovirt-node-iso-3.5.0.ovirt35.20140912.el6 (Edited).

@Eli - Do we know which vdsm version is in this iso?




Please find below details.



[cid:image001.png@01D0393F.9A5CF2D0]



Thanks  Regards



Renchu Mathew



-Original Message-
From: Eli Mesika [mailto:emes...@redhat.commailto:emes...@redhat.com]
Sent: Sunday, January 25, 2015 4:46 PM
To: Renchu Mathew
Cc: Martin Perina; users@ovirt.orgmailto:users@ovirt.org; Piotr Kliczewski
Subject: Re: [ovirt-users] Power Management config on Ovirt







- Original Message -

 From: Renchu Mathew ren...@cracknell.commailto:ren...@cracknell.com

 To: Eli Mesika emes...@redhat.commailto:emes...@redhat.com

 Cc: Martin Perina mper...@redhat.commailto:mper...@redhat.com, 
 users@ovirt.orgmailto:users@ovirt.org

 Sent: Sunday, January 25, 2015 11:19:21 AM

 Subject: RE: [ovirt-users] Power Management config on Ovirt



 Hi Eli,



 Please use any details.



 Thanks for your support.



 Regards



 Renchu Mathew



Hi again



Looking in the logs I see that this is a issue we handled when we switched from 
XML RPC to JSON RPC protocol.

Which vdsm is installed in the proxy host ?

I am adding Piotr who is on charge of the JSON RPC protocol and who helped me 
in the past when I got a similar error on PM operations

It looks like marshaling issue but I am not sure whether it was already fixed 
or it is something new. I am not able to match a call when looking at 
engine.log.







 -Original Message-

 From: Eli Mesika [mailto:emes...@redhat.commailto:emes...@redhat.com]

 Sent: Sunday, January 25, 2015 12:58 PM

 To: Renchu Mathew

 Cc: Martin Perina; users@ovirt.orgmailto:users@ovirt.org

 Subject: Re: [ovirt-users] Power Management config on Ovirt



 Hi Renchu



 I am looking currently on he case and will respond later on today

 after investigating the logs.

 If I will not find anything, can I use the details for the host PM

 only for status command to try to debug and find the source of the problem ?



 Thanks



 Eli



 - Original Message -

  From: Renchu Mathew ren...@cracknell.commailto:ren...@cracknell.com

  To: Martin Perina mper...@redhat.commailto:mper...@redhat.com

  Cc: users@ovirt.orgmailto:users@ovirt.org, Eli Mesika 
  emes...@redhat.commailto:emes...@redhat.com

  Sent: Sunday, January 25, 2015 6:50:00 AM

  Subject: RE: [ovirt-users] Power Management config on Ovirt

 

  Hi Martin,

 

  I got the below output when I run the command and it works from both

  the nodes.

 

  fence_ipmilan -a 192.168.1.114 -l admin -p admin -o status -v -P

 

  Getting status of IPMI:192.168.1.114...Spawning: '/usr/bin/ipmitool

  -I lanplus -H '192.168.1.114' -U 'admin' -P '[set]' -v chassis power

  status'...

  Chassis power = On

  Done

 

  Thanks

 

  Renchu Mathew  |  Sr. IT Administrator

 

 

 

  CRACKNELL  DUBAI   |  P.O. Box 66231  |   United Arab Emirates  |  T

  +971 4

  3445417  |  F +971 4 3493675tel:%2B971%204%203493675 |  M +971 50 
  7386484tel:%2B971%2050%207386484 ABU DHABI | DUBAI

  | LONDON | MUSCAT | DOHA | JEDDAH EMAIL 
  ren...@cracknell.commailto:ren...@cracknell.com | WEB

  www.cracknell.comhttp://www.cracknell.com

 

  This email, its content and any files transmitted with it are

  intended solely for the addressee(s) and may be legally privileged

  and/or confidential. If you are not the intended recipient please

  let us know by email reply and delete it from the system. 

Re: [ovirt-users] Power Management config on Ovirt

2015-01-27 Thread Piotr Kliczewski




- Original Message -
 From: Renchu Mathew ren...@cracknell.com
 To: Piotr Kliczewski pklic...@redhat.com
 Cc: Piotr Kliczewski piotr.kliczew...@gmail.com, Eli Mesika 
 emes...@redhat.com, users@ovirt.org
 Sent: Tuesday, January 27, 2015 10:48:12 AM
 Subject: RE: [ovirt-users] Power Management config on Ovirt
 
 Hi Piotr,
 
 Can you please provide me the link to download the new node iso? I can see
 only ovirt live iso.
 

I saw that Douglas sent following links in other thread [1] on this list:

centos7:
http://jenkins.ovirt.org/job/ovirt-node_master_create-iso-el7_merged/

Fedora20:
http://jenkins.ovirt.org/job/ovirt-node_master_create-iso-fc20_merged/

Please note that We should share the official iso this week.

[1] http://lists.ovirt.org/pipermail/users/2015-January/030846.html


 Regards
 
 Renchu Mathew
 
 -Original Message-
 From: Piotr Kliczewski [mailto:pklic...@redhat.com]
 Sent: Tuesday, January 27, 2015 12:21 PM
 To: Renchu Mathew
 Cc: Piotr Kliczewski; Eli Mesika; users@ovirt.org
 Subject: Re: [ovirt-users] Power Management config on Ovirt
 
 Hi Mathew,
 
 It seems like this is quite old version (September 2014). There were good
 number of fixes in this area.
 
 Looking at provided logs I am not able to tell which call is exactly failing.
 Do you have any steps to reproduce?
 
 Thanks,
 Piotr
 
 - Original Message -
 From: Renchu Mathew ren...@cracknell.com
 To: Piotr Kliczewski piotr.kliczew...@gmail.com
 Cc: Eli Mesika emes...@redhat.com, Piotr Kliczewski
 pklic...@redhat.com, users@ovirt.org
 Sent: Monday, January 26, 2015 5:15:46 AM
 Subject: RE: [ovirt-users] Power Management config on Ovirt
 
 Hi Piotr,
 VDSM Version is vdsm-4.16.4-0.el6
 
 Regards
 
 Renchu Mathew
 .
 
 From: Piotr Kliczewski [mailto:piotr.kliczew...@gmail.com]
 Sent: Sunday, January 25, 2015 6:08 PM
 To: Renchu Mathew
 Cc: Eli Mesika; Piotr Kliczewski; users@ovirt.org
 Subject: Re: [ovirt-users] Power Management config on Ovirt
 
 
 
 On Sun, Jan 25, 2015 at 2:07 PM, Renchu Mathew
 ren...@cracknell.commailto:ren...@cracknell.com wrote:
 
 Hi Eli,
 
 
 
 I have 2 hypervisor host and both are installed with below iso file. Master
 data domain is glusterfs which is configured on another server. Is there any
 latest iso available?
 
 
 
 ovirt-node-iso-3.5.0.ovirt35.20140912.el6 (Edited).
 
 @Eli - Do we know which vdsm version is in this iso?
 
 
 
 
 Please find below details.
 
 
 
 [cid:image001.png@01D0393F.9A5CF2D0]
 
 
 
 Thanks  Regards
 
 
 
 Renchu Mathew
 
 
 
 -Original Message-
 From: Eli Mesika [mailto:emes...@redhat.commailto:emes...@redhat.com]
 Sent: Sunday, January 25, 2015 4:46 PM
 To: Renchu Mathew
 Cc: Martin Perina; users@ovirt.orgmailto:users@ovirt.org; Piotr Kliczewski
 Subject: Re: [ovirt-users] Power Management config on Ovirt
 
 
 
 
 
 
 
 - Original Message -
 
  From: Renchu Mathew ren...@cracknell.commailto:ren...@cracknell.com
 
  To: Eli Mesika emes...@redhat.commailto:emes...@redhat.com
 
  Cc: Martin Perina mper...@redhat.commailto:mper...@redhat.com,
  users@ovirt.orgmailto:users@ovirt.org
 
  Sent: Sunday, January 25, 2015 11:19:21 AM
 
  Subject: RE: [ovirt-users] Power Management config on Ovirt
 
 
 
  Hi Eli,
 
 
 
  Please use any details.
 
 
 
  Thanks for your support.
 
 
 
  Regards
 
 
 
  Renchu Mathew
 
 
 
 Hi again
 
 
 
 Looking in the logs I see that this is a issue we handled when we switched
 from XML RPC to JSON RPC protocol.
 
 Which vdsm is installed in the proxy host ?
 
 I am adding Piotr who is on charge of the JSON RPC protocol and who helped me
 in the past when I got a similar error on PM operations
 
 It looks like marshaling issue but I am not sure whether it was already fixed
 or it is something new. I am not able to match a call when looking at
 engine.log.
 
 
 
 
 
 
 
  -Original Message-
 
  From: Eli Mesika [mailto:emes...@redhat.commailto:emes...@redhat.com]
 
  Sent: Sunday, January 25, 2015 12:58 PM
 
  To: Renchu Mathew
 
  Cc: Martin Perina; users@ovirt.orgmailto:users@ovirt.org
 
  Subject: Re: [ovirt-users] Power Management config on Ovirt
 
 
 
  Hi Renchu
 
 
 
  I am looking currently on he case and will respond later on today
 
  after investigating the logs.
 
  If I will not find anything, can I use the details for the host PM
 
  only for status command to try to debug and find the source of the problem
  ?
 
 
 
  Thanks
 
 
 
  Eli
 
 
 
  - Original Message -
 
   From: Renchu Mathew ren...@cracknell.commailto:ren...@cracknell.com
 
   To: Martin Perina mper...@redhat.commailto:mper...@redhat.com
 
   Cc: users@ovirt.orgmailto:users@ovirt.org, Eli Mesika
   emes...@redhat.commailto:emes...@redhat.com
 
   Sent: Sunday, January 25, 2015 6:50:00 AM
 
   Subject: RE: [ovirt-users] Power Management config on Ovirt
 
  
 
   Hi Martin,
 
  
 
   I got the below output when I run the command and it works from both
 
   the nodes.
 
  
 
   fence_ipmilan -a 192.168.1.114 -l admin 

Re: [ovirt-users] [ovirt-devel] oVirt 3.6 Feature: Cumulative Network Usage Statistics

2015-01-27 Thread Lior Vernia


On 26/01/15 15:45, Dan Kenigsberg wrote:
 On Mon, Dec 22, 2014 at 01:40:06PM +0200, Lior Vernia wrote:
 Hello users and developers,

 Just put up a feature page for the aforementioned feature; in summary,
 to report total RX/TX statistics for hosts and VMs in oVirt. This has
 been requested several times on the users mailing list, and is
 especially useful for accounting in VDI deployments.

 You're more than welcome to review the feature page:
 http://www.ovirt.org/Features/Cumulative_RX_TX_Statistics
 
 Sorry for the late review; I have a couple of questions/comments.
 - What do you mean by VDI use cases in the Benefit to oVirt sanpshot
   section?
   Do you refer to hosting services who would like to charge their
   customers based on actual bandwidth usage?

Indeed, as well as monitoring utilisation by non-paying users (say
inside the same organization). Changed the wording a little, as hosting
services are really the prime candidate.

 - I've added another motivation: currently-reported rxRate/txRate
   can be utterly meaningless.
 
 
 I don't see reference to nasty negative flows: what happens if a host
 disappears? Or a VM? I suppose there's always a chance that some traffic
 would go unaccounted for. But do you expect to extract this information
 somehow? Either way, it should be mentioned as a caveat on the feature
 page.
 

What do you mean by disappears? Engine loses connectivity to it?


 Note that this only deals with network usage - it'll be great if we have
 similar features for CPU and disk usage!
 
 There's a formal feature request about this:
 Bug 1172153 - [RFE] Collect CPU, IO and network accounting
 information
 
 Dan
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Donny Davis
 I'm on the same.. And I see the issue. Why don't you create a new network with 
the correct parameters and then move the network the VM is attached to 

On Jan 27, 2015 1:39 PM, Darrell Budic bu...@onholyground.com wrote:

 Try changing that custom MTU and hitting OK. I get:

 on 3.5 and 3.5.1. What version are you running?

 On Jan 27, 2015, at 2:04 PM, Donny Davis do...@cloudspin.me wrote:

 Maybe I missed your question, but I can change the MTU from the gui without 
 any problems. As long as you make sure there are not any vm's on the host 
 you are trying to sync, I have experienced no issues. 

 Donny

 -Original Message-
 From: Darrell Budic [mailto:bu...@onholyground.com] 
 Sent: Tuesday, January 27, 2015 12:30 PM
 To: Donny Davis
 Cc: users@ovirt.org
 Subject: Re: [ovirt-users] change network MTU settings without taking all 
 the VMs down?

 Except you can’t change the network MTU setting in the first place on the 
 network in the GUI. I’ve thought about doing it in the database, with a 
 migration as you mention. Just checking first for better options :)


 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:

 Migrate the vms between hosts, and when a host gets to no running VMS you 
 can sync that network. 

 Donny D
 cloudspin.meOn Jan 27, 2015 10:43 AM, Darrell Budic 
 bu...@onholyground.com wrote:


 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up 
 to MTU 1500. I see the GUI won’t let me do that while the network is in 
 use. Any way around this, clean or otherwise? Restarting VMs to update 
 them is ok, just trying to avoid having to take everything down at the 
 same time.

 -Darrell
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users


 Capture.PNG


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] MaxVmNameLengthNonWindows not set right ?

2015-01-27 Thread Matt .
Hi,

My VM doesn' t want to start because the vdsm host says it's over 45
characters or so.

This is set:

# engine-config -g MaxVmNameLengthNonWindows
MaxVmNameLengthNonWindows: 64 version: general

But when I set it to 80 it also doesn't start.

What goes wrong here ?

It's a Linux VM.

Thanks,

Matt
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] ovirt-guest-agent issue on rhel5.5

2015-01-27 Thread John Michael Mercado
Hi All,

I need your help. Anyone who encounter the below error and have the
solution? Can you help me how to fix this?

MainThread::INFO::2015-01-27
10:22:53,247::ovirt-guest-agent::57::root::Starting oVirt guest agent
MainThread::ERROR::2015-01-27
10:22:53,248::ovirt-guest-agent::138::root::Unhandled exception in oVirt
guest agent!
Traceback (most recent call last):
  File /usr/share/ovirt-guest-agent/ovirt-guest-agent.py, line 132, in ?
agent.run(daemon, pidfile)
  File /usr/share/ovirt-guest-agent/ovirt-guest-agent.py, line 63, in run
self.agent = LinuxVdsAgent(config)
  File /usr/share/ovirt-guest-agent/GuestAgentLinux2.py, line 371, in
__init__
AgentLogicBase.__init__(self, config)
  File /usr/share/ovirt-guest-agent/OVirtAgentLogic.py, line 171, in
__init__
self.vio = VirtIoChannel(config.get(virtio, device))
  File /usr/share/ovirt-guest-agent/VirtIoChannel.py, line 150, in
__init__
self._stream = VirtIoStream(vport_name)
  File /usr/share/ovirt-guest-agent/VirtIoChannel.py, line 131, in
__init__
self._vport = os.open(vport_name, os.O_RDWR)
OSError: [Errno 2] No such file or directory:
'/dev/virtio-ports/com.redhat.rhevm.vdsm'


Thanks
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
Try changing that custom MTU and hitting OK. I get:


on 3.5 and 3.5.1. What version are you running?

 On Jan 27, 2015, at 2:04 PM, Donny Davis do...@cloudspin.me wrote:
 
 Maybe I missed your question, but I can change the MTU from the gui without 
 any problems. As long as you make sure there are not any vm's on the host you 
 are trying to sync, I have experienced no issues. 
 
 Donny
 
 -Original Message-
 From: Darrell Budic [mailto:bu...@onholyground.com] 
 Sent: Tuesday, January 27, 2015 12:30 PM
 To: Donny Davis
 Cc: users@ovirt.org
 Subject: Re: [ovirt-users] change network MTU settings without taking all the 
 VMs down?
 
 Except you can’t change the network MTU setting in the first place on the 
 network in the GUI. I’ve thought about doing it in the database, with a 
 migration as you mention. Just checking first for better options :)
 
 
 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:
 
 Migrate the vms between hosts, and when a host gets to no running VMS you 
 can sync that network. 
 
 Donny D
 cloudspin.meOn Jan 27, 2015 10:43 AM, Darrell Budic bu...@onholyground.com 
 wrote:
 
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up 
 to MTU 1500. I see the GUI won’t let me do that while the network is in 
 use. Any way around this, clean or otherwise? Restarting VMs to update them 
 is ok, just trying to avoid having to take everything down at the same time.
 
 -Darrell
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
 
 Capture.PNG

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Donny Davis
Please let us know how it works out, it may be helpful to someone in the future

Cheers
Donny DOn Jan 27, 2015 2:45 PM, Darrell Budic bu...@onholyground.com wrote:

 Ah, good idea. That will be cleaner than the hack I was considering, I’ll see 
 if I can work that somehow. Thanks! 

  On Jan 27, 2015, at 2:59 PM, Donny Davis do...@cloudspin.me wrote: 
  
  I'm on the same.. And I see the issue. Why don't you create a new network 
  with the correct parameters and then move the network the VM is attached to 
  
  On Jan 27, 2015 1:39 PM, Darrell Budic bu...@onholyground.com wrote: 
  
  Try changing that custom MTU and hitting OK. I get: 
  
  on 3.5 and 3.5.1. What version are you running? 
  
  On Jan 27, 2015, at 2:04 PM, Donny Davis do...@cloudspin.me wrote: 
  
  Maybe I missed your question, but I can change the MTU from the gui 
  without any problems. As long as you make sure there are not any vm's on 
  the host you are trying to sync, I have experienced no issues. 
  
  Donny 
  
  -Original Message- 
  From: Darrell Budic [mailto:bu...@onholyground.com] 
  Sent: Tuesday, January 27, 2015 12:30 PM 
  To: Donny Davis 
  Cc: users@ovirt.org 
  Subject: Re: [ovirt-users] change network MTU settings without taking all 
  the VMs down? 
  
  Except you can’t change the network MTU setting in the first place on the 
  network in the GUI. I’ve thought about doing it in the database, with a 
  migration as you mention. Just checking first for better options :) 
  
  
  On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote: 
  
  Migrate the vms between hosts, and when a host gets to no running VMS 
  you can sync that network. 
  
  Donny D 
  cloudspin.meOn Jan 27, 2015 10:43 AM, Darrell Budic 
  bu...@onholyground.com wrote: 
  
  
  I finally got a couple of networks our from behind a wan based layer 2 
  bridge that required me to run at MTU 1448, and would like to get back 
  up to MTU 1500. I see the GUI won’t let me do that while the network is 
  in use. Any way around this, clean or otherwise? Restarting VMs to 
  update them is ok, just trying to avoid having to take everything down 
  at the same time. 
  
  -Darrell 
  ___ 
  Users mailing list 
  Users@ovirt.org 
  http://lists.ovirt.org/mailman/listinfo/users 
  
  
  Capture.PNG 
  
  

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
Ah, good idea. That will be cleaner than the hack I was considering, I’ll see 
if I can work that somehow. Thanks!

 On Jan 27, 2015, at 2:59 PM, Donny Davis do...@cloudspin.me wrote:
 
 I'm on the same.. And I see the issue. Why don't you create a new network 
 with the correct parameters and then move the network the VM is attached to 
 
 On Jan 27, 2015 1:39 PM, Darrell Budic bu...@onholyground.com wrote:
 
 Try changing that custom MTU and hitting OK. I get:
 
 on 3.5 and 3.5.1. What version are you running?
 
 On Jan 27, 2015, at 2:04 PM, Donny Davis do...@cloudspin.me wrote:
 
 Maybe I missed your question, but I can change the MTU from the gui without 
 any problems. As long as you make sure there are not any vm's on the host 
 you are trying to sync, I have experienced no issues. 
 
 Donny
 
 -Original Message-
 From: Darrell Budic [mailto:bu...@onholyground.com] 
 Sent: Tuesday, January 27, 2015 12:30 PM
 To: Donny Davis
 Cc: users@ovirt.org
 Subject: Re: [ovirt-users] change network MTU settings without taking all 
 the VMs down?
 
 Except you can’t change the network MTU setting in the first place on the 
 network in the GUI. I’ve thought about doing it in the database, with a 
 migration as you mention. Just checking first for better options :)
 
 
 On Jan 27, 2015, at 12:08 PM, Donny Davis do...@cloudspin.me wrote:
 
 Migrate the vms between hosts, and when a host gets to no running VMS you 
 can sync that network. 
 
 Donny D
 cloudspin.meOn Jan 27, 2015 10:43 AM, Darrell Budic 
 bu...@onholyground.com wrote:
 
 
 I finally got a couple of networks our from behind a wan based layer 2 
 bridge that required me to run at MTU 1448, and would like to get back up 
 to MTU 1500. I see the GUI won’t let me do that while the network is in 
 use. Any way around this, clean or otherwise? Restarting VMs to update 
 them is ok, just trying to avoid having to take everything down at the 
 same time.
 
 -Darrell
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
 
 
 Capture.PNG
 
 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Can not add data domain via nfs

2015-01-27 Thread Xiaoqiang Zhou
in addition, my  host systeme is CentOS7 and ovirt version is 3.5.1

2015-01-27 14:50 GMT+08:00 Xiaoqiang Zhou zhouxiaoqiang.mst...@gmail.com:

 Hi ALL:
  have some wrong when add data domain into a data center.
   I can not add a data domain from web panel.
   error log on vdsm server:
   Jan 27 14:49:12 localhost rpc.mountd[3020]: authenticated mount
 request from 192.168.60.38:730 for /opt/ovirt-node-nfs/data
 (/opt/ovirt-node-nfs/data)
 Jan 27 14:49:12 localhost kernel: device-mapper: table: 253:5: multipath:
 error getting device
 Jan 27 14:49:12 localhost kernel: device-mapper: ioctl: error adding
 target to table
 Jan 27 14:49:12 localhost multipathd: dm-5: remove map (uevent)
 Jan 27 14:49:12 localhost multipathd: dm-5: remove map (uevent)
 Jan 27 14:49:13 localhost avahi-daemon[1223]: Received response from host
 192.168.61.145 with invalid source port 40237 on interface 'ovirtmgmt.0'
 Jan 27 14:49:14 localhost kernel: device-mapper: table: 253:5: multipath:
 error getting device
 Jan 27 14:49:14 localhost kernel: device-mapper: ioctl: error adding
 target to table
 Jan 27 14:49:14 localhost multipathd: dm-5: remove map (uevent)
 Jan 27 14:49:14 localhost multipathd: dm-5: remove map (uevent)


   [root@localhost etc]# cat multipath.conf
 # RHEV REVISION 1.1

 defaults {
 polling_interval5
 getuid_callout  /usr/lib/udev/scsi_id --whitelisted
 --replace-whitespace --device=/dev/%n
 no_path_retry   fail
 user_friendly_names no
 flush_on_last_del   yes
 fast_io_fail_tmo5
 dev_loss_tmo30
 max_fds 4096
 }

 devices {
 device {
 vendor  HITACHI
 product DF.*
 getuid_callout  /usr/lib/udev/scsi_id --whitelisted
 --replace-whitespace --device=/dev/%n
 }
 device {
 vendor  COMPELNT
 product Compellent Vol
 no_path_retry   fail
 }
 device {
 # multipath.conf.default
 vendor  DGC
 product .*
 product_blacklist   LUNZ
 path_grouping_policygroup_by_prio
 path_checkeremc_clariion
 hardware_handler1 emc
 prioemc
 failbackimmediate
 rr_weight   uniform
 # vdsm required configuration
 getuid_callout  /usr/lib/udev/scsi_id --whitelisted
 --replace-whitespace --device=/dev/%n
 features0
 no_path_retry   fail
 }

  someone can tell me how to fix this issue, thanks

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Can not add data domain via nfs

2015-01-27 Thread Xiaoqiang Zhou
Hi ALL:
 have some wrong when add data domain into a data center.
  I can not add a data domain from web panel.
  error log on vdsm server:
  Jan 27 14:49:12 localhost rpc.mountd[3020]: authenticated mount
request from 192.168.60.38:730 for /opt/ovirt-node-nfs/data
(/opt/ovirt-node-nfs/data)
Jan 27 14:49:12 localhost kernel: device-mapper: table: 253:5: multipath:
error getting device
Jan 27 14:49:12 localhost kernel: device-mapper: ioctl: error adding target
to table
Jan 27 14:49:12 localhost multipathd: dm-5: remove map (uevent)
Jan 27 14:49:12 localhost multipathd: dm-5: remove map (uevent)
Jan 27 14:49:13 localhost avahi-daemon[1223]: Received response from host
192.168.61.145 with invalid source port 40237 on interface 'ovirtmgmt.0'
Jan 27 14:49:14 localhost kernel: device-mapper: table: 253:5: multipath:
error getting device
Jan 27 14:49:14 localhost kernel: device-mapper: ioctl: error adding target
to table
Jan 27 14:49:14 localhost multipathd: dm-5: remove map (uevent)
Jan 27 14:49:14 localhost multipathd: dm-5: remove map (uevent)


  [root@localhost etc]# cat multipath.conf
# RHEV REVISION 1.1

defaults {
polling_interval5
getuid_callout  /usr/lib/udev/scsi_id --whitelisted
--replace-whitespace --device=/dev/%n
no_path_retry   fail
user_friendly_names no
flush_on_last_del   yes
fast_io_fail_tmo5
dev_loss_tmo30
max_fds 4096
}

devices {
device {
vendor  HITACHI
product DF.*
getuid_callout  /usr/lib/udev/scsi_id --whitelisted
--replace-whitespace --device=/dev/%n
}
device {
vendor  COMPELNT
product Compellent Vol
no_path_retry   fail
}
device {
# multipath.conf.default
vendor  DGC
product .*
product_blacklist   LUNZ
path_grouping_policygroup_by_prio
path_checkeremc_clariion
hardware_handler1 emc
prioemc
failbackimmediate
rr_weight   uniform
# vdsm required configuration
getuid_callout  /usr/lib/udev/scsi_id --whitelisted
--replace-whitespace --device=/dev/%n
features0
no_path_retry   fail
}

 someone can tell me how to fix this issue, thanks
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] OVF Storage

2015-01-27 Thread Wolfgang Bucher
Hello,



i have create a new install of ovirt 3.51 on el7 with iscsi. All works fine but 
i get every hour a message : Failed to create OVF store disk for Storage Domain.



any ideas



greetings 

Wolfgang Bucher 









___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] OVF Storage

2015-01-27 Thread Allon Mureinik
Wolfgang - can you attach the exact message please? and preferably the logs? 

Liron/Gilad - isn't this something we've fixed already? 

- Original Message -

 From: Wolfgang Bucher wolfgang.buc...@netland-mn.de
 To: users@ovirt.org
 Sent: Tuesday, January 27, 2015 10:23:46 PM
 Subject: [ovirt-users] OVF Storage

 OVF Storage

 Hello,

 i have create a new install of ovirt 3.51 on el7 with iscsi. All works fine
 but i get every hour a message : Failed to create OVF store disk for Storage
 Domain.

 any ideas

 greetings

 Wolfgang Bucher

 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] oVirt 3.5.1 - VM hostedengine Failing to start

2015-01-27 Thread Mikola Rose
Hi there,


I seem to have run into a problem.  my hosted engine vm is failing to start;

vdsm.log;


Thread-20::DEBUG::2015-01-27 
16:53:37,999::fileSD::152::Storage.StorageDomain::(__init__) Reading domain in 
path 
/rhev/data-center/mnt/192.168.1.32:_Volumes_Raid1/443b4931-667f-441f-98d8-51384e67a0af
Thread-20::ERROR::2015-01-27 
16:53:38,000::domainMonitor::256::Storage.DomainMonitorThread::(_monitorDomain) 
Error while collecting domain 443b4931-667f-441f-98d8-51384e67a0af monitoring 
information
Traceback (most recent call last):
  File /usr/share/vdsm/storage/domainMonitor.py, line 221, in _monitorDomain
self.domain = sdCache.produce(self.sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 98, in produce
domain.getRealDomain()
  File /usr/share/vdsm/storage/sdc.py, line 52, in getRealDomain
return self._cache._realProduce(self._sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 122, in _realProduce
domain = self._findDomain(sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 141, in _findDomain
dom = findMethod(sdUUID)
  File /usr/share/vdsm/storage/nfsSD.py, line 122, in findDomain
return NfsStorageDomain(NfsStorageDomain.findDomainPath(sdUUID))
  File /usr/share/vdsm/storage/fileSD.py, line 159, in __init__
validateFileSystemFeatures(sdUUID, self.mountpoint)
  File /usr/share/vdsm/storage/fileSD.py, line 88, in 
validateFileSystemFeatures
oop.getProcessPool(sdUUID).directTouch(testFilePath)
  File /usr/share/vdsm/storage/outOfProcess.py, line 320, in directTouch
ioproc.touch(path, flags, mode)
  File /usr/lib/python2.6/site-packages/ioprocess/__init__.py, line 507, in 
touch
self.timeout)
  File /usr/lib/python2.6/site-packages/ioprocess/__init__.py, line 391, in 
_sendCommand
raise OSError(errcode, errstr)
OSError: [Errno 13] Permission denied

I assume this is an nfs issue so I checked to see if I could create a file in 
the mounts from the host machine,  which I could.

My test bed is usign an old OS X NFS via xraid adn the export is ;
/Volumes/Raid1 -maproot=root:wheel -network 192.168.1.0 -mask 255.255.255.0

drwxr-xr-x6 vdsm   kvm  204 Jan 27 16:30 
443b4931-667f-441f-98d8-51384e67a0af
drwxr-xr-x4 vdsm   kvm  136 Jan 27 12:32 data
drwxr-xr-x4 vdsm   kvm  136 Jan 27 00:18 iso
drwxr-xr-x3 vdsm   kvm  102 Jan 26 09:43 vm

the host machine seems to mount the shares fine

drwxr-xr-x  5 vdsm kvm 4096 Jan 27 12:28 .
drwxr-xr-x  3 vdsm kvm 4096 Jan 27 10:04 ..
drwxr-xr-x 13 vdsm kvm  544 Jan 27 00:12 192.168.1.32:_Volumes_Raid1
drwxr-xr-x  2 vdsm kvm 4096 Jan 27 00:12 192.168.1.32:_Volumes_Raid1_data
drwxr-xr-x  2 vdsm kvm 4096 Jan 27 00:24 192.168.1.32:_Volumes_Raid1_iso

and as I said above I can create files in any one of those mounts


Is there a place I can look at to find the offending file?  If that is the 
issue...  Oddly enough everything worked until i rebooted, so I must have 
either changed something or something is buggered.



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Network config

2015-01-27 Thread Jorick Astrego

On 01/27/2015 01:56 PM, Koen Vanoppen wrote:
 Hello everybody,

 Just wanted to know if anybody else has the problem, that when you
 restart your network configuration of a hypervisor, all the settings
 concerning the bonding are gone...
 Do I file a bug report for this or is this already a known issue?

 Kind regards,

 Koen


After a host restart the bonds are indead gone, except for bond and I
get this output:


MainThread::DEBUG::2015-01-27
15:44:18,343::api::623::root::(setupNetworks) Validating configuration
MainThread::DEBUG::2015-01-27
15:44:18,355::api::635::setupNetworks::(setupNetworks) Applying...
MainThread::DEBUG::2015-01-27
15:44:18,355::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,356::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,356::utils::739::root::(execCmd) /sbin/ip route show to
0.0.0.0/0 table all (cwd None)
MainThread::DEBUG::2015-01-27
15:44:18,358::utils::759::root::(execCmd) SUCCESS: err = ''; rc = 0
MainThread::INFO::2015-01-27
15:44:18,366::netinfo::555::root::(_getNetInfo) Obtaining info for
net bond1.
Traceback (most recent call last):
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line 552,
in _getNetInfo
'mtu': str(getMtu(iface))})
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line 182,
in getMtu
mtu = open('/sys/class/net/%s/mtu' % iface).readline().rstrip()
IOError: [Errno 2] No such file or directory:
u'/sys/class/net/bond1/mtu'
MainThread::INFO::2015-01-27
15:44:18,366::netinfo::555::root::(_getNetInfo) Obtaining info for
net Internet.
Traceback (most recent call last):
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line 536,
in _getNetInfo
data.update({'ports': ports(iface), 'stp': bridge_stp_state(iface),
  File /usr/lib/python2.7/site-packages/vdsm/netinfo.py, line 178,
in ports
return os.listdir('/sys/class/net/' + bridge + '/brif')
OSError: [Errno 2] No such file or directory:
'/sys/class/net/Internet/brif'
MainThread::DEBUG::2015-01-27
15:44:18,373::utils::739::root::(execCmd) /sbin/ip route show to
0.0.0.0/0 table all (cwd None)
MainThread::DEBUG::2015-01-27
15:44:18,375::utils::759::root::(execCmd) SUCCESS: err = ''; rc = 0
MainThread::INFO::2015-01-27
15:44:18,384::netconfpersistence::158::root::(_clearDisk) Clearing
/var/run/vdsm/netconf/nets/ and /var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
15:44:18,384::netconfpersistence::166::root::(_clearDisk) No
existent config to clear.
MainThread::INFO::2015-01-27
15:44:18,384::netconfpersistence::182::root::(save) Saved new config
RunningConfig({}, {}) to /var/run/vdsm/netconf/nets/ and
/var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
15:44:18,386::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,386::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:44:18,470::utils::739::root::(execCmd) /usr/sbin/ifdown Internet
(cwd None)
MainThread::DEBUG::2015-01-27
15:45:46,165::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:45:46,165::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.

This is what  I get  later after I try Setup host networks again:

MainThread::INFO::2015-01-27
15:53:16,739::netconfpersistence::182::root::(save) Saved new config
RunningConfig({}, {}) to /var/run/vdsm/netconf/nets/ and
/var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
15:53:16,740::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:53:16,740::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:53:16,743::utils::739::root::(execCmd) /usr/sbin/ifdown Internet
(cwd None)
MainThread::DEBUG::2015-01-27
15:54:46,772::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:54:46,772::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.
MainThread::DEBUG::2015-01-27
15:54:46,772::vdsm-restore-net-config::55::root::(unified_restoration)
Removing all networks ({}) and bonds ({}) in running config.
MainThread::DEBUG::2015-01-27
15:54:46,772::libvirtconnection::150::root::(get) trying to connect
libvirt
MainThread::DEBUG::2015-01-27
15:54:46,787::utils::739::root::(execCmd) /sbin/ip route show to
0.0.0.0/0 table all (cwd None)
MainThread::DEBUG::2015-01-27
15:54:46,790::utils::759::root::(execCmd) SUCCESS: err = 

Re: [ovirt-users] Network config

2015-01-27 Thread Ondřej Svoboda
This is weird, like there were multiple dhclients on the bond. Can you 
stop VDSM (systemctl stop vdsmd supervdsmd) and maybe `killall -SIGKILL 
dhclient` and restart VDSM afterwards?


On 27.1.2015 14:43, Jorick Astrego wrote:

Ah yes, I forgot to disable it as I switched to foreman provisioning.

Maybe it can be auto detected and disabled in the install host 
scripts when ovirt installs vdsm and sets up the networking. For 
people who forget to disable it.


But now I get this:

sourceRoute::DEBUG::2015-01-27
14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via ***.***.***.*** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:33:57,420::utils::759::root::(execCmd) FAILED: err =
'RTNETLINK answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:33:57,420::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:33:57,420::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed

Jan 27 14:38:19 test3.netbulae.test network[3042]: RTNETLINK
answers: File exists
Jan 27 14:38:19 test3.netbulae.test systemd[1]: network.service:
control process exited, code=exited status=1
Jan 27 14:38:19 test3.netbulae.test systemd[1]: Failed to start
LSB: Bring up/down networking.
-- Subject: Unit network.service has failed
-- Defined-By: systemd
-- Support:
http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- 
-- Unit network.service has failed.
-- 
-- The result is failed.

Jan 27 14:38:19 test3.netbulae.test systemd[1]: Unit
network.service entered failed state.



Bond0 is still there though

sourceRoute::DEBUG::2015-01-27
14:33:57,410::sourceroutethread::39::root::(process_IN_CLOSE_WRITE_filePath)
Responding to DHCP response in /var/run/vdsm/sourceRoutes/1422365637
sourceRoute::INFO::2015-01-27
14:33:57,416::sourceroute::78::root::(configure) Configuring
gateway - ip: **.***.**.**, network: 10.100.100.0/24, subnet:
255.255.255.0, gateway: **.***.**.**, table: 174351414, device: bond0
sourceRoute::DEBUG::2015-01-27
14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:33:57,420::utils::759::root::(execCmd) FAILED: err =
'RTNETLINK answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:33:57,420::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:33:57,420::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed
sourceRoute::DEBUG::2015-01-27
14:37:52,085::sourceroutethread::39::root::(process_IN_CLOSE_WRITE_filePath)
Responding to DHCP response in /var/run/vdsm/sourceRoutes/1422365872
sourceRoute::INFO::2015-01-27
14:37:52,090::sourceroute::78::root::(configure) Configuring
gateway - ip: **.***.**.**, network: 10.100.100.0/24, subnet:
255.255.255.0, gateway: **.***.**.**, table: 174351414, device: bond0
sourceRoute::DEBUG::2015-01-27
14:37:52,090::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:37:52,094::utils::759::root::(execCmd) FAILED: err =
'RTNETLINK answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:37:52,094::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:37:52,094::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed


If you need more info, let me know.


On 01/27/2015 02:20 PM, Ondřej Svoboda wrote:

Jorick,

just what comes to my mind: do you have NetworkManager running, by 
any chance? VDSM and NetworkManager don't mix well currently so 
please try


# systemctl stop NetworkManager
# systemctl disable NetworkManager

Regards,
Ondra Svoboda

On 27.1.2015 14:10, Jorick Astrego wrote:


On 01/27/2015 01:56 PM, Koen Vanoppen wrote:

Hello everybody,

Just wanted to know if anybody else has the problem, that when you 
restart your network configuration of a hypervisor, all the 
settings concerning the bonding are gone...

Do I file a bug report for this or is this already a known issue?

Kind regards,

Koen


Hmm,

I cannot restart the network service from the host for some reason:

Jan 27 14:00:27 test2 systemd: network.service: control process
exited, code=exited status=1
Jan 27 14:00:27 test2 systemd: Failed to start LSB: Bring
up/down networking.


systemctl status network.service
network.service - LSB: Bring up/down 

Re: [ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Donny Davis
Migrate the vms between hosts, and when a host gets to no running VMS you can 
sync that network. 

Donny D
cloudspin.meOn Jan 27, 2015 10:43 AM, Darrell Budic bu...@onholyground.com 
wrote:

 I finally got a couple of networks our from behind a wan based layer 2 bridge 
 that required me to run at MTU 1448, and would like to get back up to MTU 
 1500. I see the GUI won’t let me do that while the network is in use. Any way 
 around this, clean or otherwise? Restarting VMs to update them is ok, just 
 trying to avoid having to take everything down at the same time.

 -Darrell
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Network config

2015-01-27 Thread Ondřej Svoboda


On 27.1.2015 19:10, Ondřej Svoboda wrote:
From the first log (journal): VDSM managed to start, only it is being 
run too early (it should be waiting patiently for its supervdsm 
component to start responding).


Are there no RTNETLINK answers: File exists errors anymore?

Second log: This is normal operation.

So the host stopped responding? It should have come up, assuming that 
it obtained the same address from DHCP (which it should, but please 
check).


Could you try using static network configuration first? And when it 
succeeds you can try DHCP again.


===

About the GUI: Is your host's status now Not Operational? In that 
case oVirt cannot tell anything about the status of the host's 
interfaces, it may only reflect the status before communication was lost.


Please file a bug if this is really the case I described :-) It would 
then be a matter of discussion if oVirt should display a yellow 
unknown status of the interfaces, or keep displaying the last known 
one. (The first solution would make it obvious that we really lost the 
host and go looking for the host's status to discover it is Not 
Operational. The other is informative as well, just in a different way.)


I should add that the yellow, unknown state is just a suggestion, it 
need not exist yet.



On 27.1.2015 16:31, Jorick Astrego wrote:


Done, maybe this helps:

Jan 27 16:20:26 test2 vdsmd_init_common.sh: vdsm: Running
unified_network_persistence_upgrade
Jan 27 16:20:26 test2 vdsmd_init_common.sh: vdsm: Running
restore_nets
Jan 27 16:21:47 test2 systemd-logind: New session 5 of user root.
Jan 27 16:21:47 test2 systemd: Starting Session 5 of user root.
Jan 27 16:21:47 test2 systemd: Started Session 5 of user root.
Jan 27 16:21:55 test2 systemd: vdsmd.service operation timed out.
Terminating.
Jan 27 16:21:55 test2 journal: End of file while reading data:
Input/output error
Jan 27 16:21:55 test2 systemd: Failed to start Virtual Desktop
Server Manager.
Jan 27 16:21:55 test2 systemd: Unit vdsmd.service entered failed
state.
Jan 27 16:21:55 test2 systemd: vdsmd.service holdoff time over,
scheduling restart.
Jan 27 16:21:55 test2 systemd: Stopping Virtual Desktop Server
Manager...
Jan 27 16:21:55 test2 systemd: Starting Virtual Desktop Server
Manager...
Jan 27 16:21:55 test2 vdsmd_init_common.sh: vdsm: Running mkdirs

Still:

MainThread::INFO::2015-01-27
16:21:56,964::netconfpersistence::158::root::(_clearDisk)
Clearing /var/run/vdsm/netconf/nets/ and /var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
16:21:56,964::netconfpersistence::166::root::(_clearDisk) No
existent config to clear.
MainThread::INFO::2015-01-27
16:21:56,964::netconfpersistence::182::root::(save) Saved new
config RunningConfig({}, {}) to /var/run/vdsm/netconf/nets/ and
/var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
16:21:56,965::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.

And when I want to reapply setup networks in ovirt I get:


Error while executing action Commit Network changes: Network
error during communication with the Host.

Every interface and bond is still green and visible in the webgui by 
the way. That is also strange, every interface of the host is bonded 
and up while in reality the host has lost it's network config and 
doesn't respond anymore.


Shouldn't they all be red on communication failure to the host?

Cheers,

Jorick

On 01/27/2015 04:01 PM, Ondřej Svoboda wrote:
This is weird, like there were multiple dhclients on the bond. Can 
you stop VDSM (systemctl stop vdsmd supervdsmd) and maybe `killall 
-SIGKILL dhclient` and restart VDSM afterwards?


On 27.1.2015 14:43, Jorick Astrego wrote:

Ah yes, I forgot to disable it as I switched to foreman provisioning.

Maybe it can be auto detected and disabled in the install host 
scripts when ovirt installs vdsm and sets up the networking. For 
people who forget to disable it.


But now I get this:

sourceRoute::DEBUG::2015-01-27
14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via ***.***.***.*** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:33:57,420::utils::759::root::(execCmd) FAILED: err =
'RTNETLINK answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:33:57,420::sourceroute::88::root::(configure) ip binary
failed during source route configuration: ['RTNETLINK answers:
File exists']
sourceRoute::WARNING::2015-01-27
14:33:57,420::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed

Jan 27 14:38:19 test3.netbulae.test network[3042]: RTNETLINK
answers: File exists
Jan 27 14:38:19 test3.netbulae.test systemd[1]:
network.service: control process exited, code=exited status=1
Jan 27 14:38:19 test3.netbulae.test systemd[1]: Failed to start

[ovirt-users] Help understanding Gluster in oVirt

2015-01-27 Thread Groten, Ryan
I was planning on making a Gluster Data domain to test, and found some great 
information on this page: 
http://community.redhat.com/blog/2014/05/ovirt-3-4-glusterized/
The article the author uses the CTDB service for VIP failover.  Is it 
possible/recommended to not do this, and just create a gluster volume on all 
the hosts in a cluster, then create the Gluster data domain as 
localhost:gluster_vol?

Thanks,
Ryan
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] oVirt 3.5.1 - VM hostedengine Failing to start

2015-01-27 Thread Mikola Rose
Found the cause of this issue...

the __DIRECT_IO_TEST__ on the mount was inaccessible due to permisions.

removed the file and it was recreated


On Jan 27, 2015, at 5:10 PM, Mikola Rose 
mr...@power-soft.commailto:mr...@power-soft.com wrote:

Hi there,


I seem to have run into a problem.  my hosted engine vm is failing to start;

vdsm.log;


Thread-20::DEBUG::2015-01-27 
16:53:37,999::fileSD::152::Storage.StorageDomain::(__init__) Reading domain in 
path 
/rhev/data-center/mnt/192.168.1.32:_Volumes_Raid1/443b4931-667f-441f-98d8-51384e67a0af
Thread-20::ERROR::2015-01-27 
16:53:38,000::domainMonitor::256::Storage.DomainMonitorThread::(_monitorDomain) 
Error while collecting domain 443b4931-667f-441f-98d8-51384e67a0af monitoring 
information
Traceback (most recent call last):
  File /usr/share/vdsm/storage/domainMonitor.py, line 221, in _monitorDomain
self.domain = sdCache.produce(self.sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 98, in produce
domain.getRealDomain()
  File /usr/share/vdsm/storage/sdc.py, line 52, in getRealDomain
return self._cache._realProduce(self._sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 122, in _realProduce
domain = self._findDomain(sdUUID)
  File /usr/share/vdsm/storage/sdc.py, line 141, in _findDomain
dom = findMethod(sdUUID)
  File /usr/share/vdsm/storage/nfsSD.py, line 122, in findDomain
return NfsStorageDomain(NfsStorageDomain.findDomainPath(sdUUID))
  File /usr/share/vdsm/storage/fileSD.py, line 159, in __init__
validateFileSystemFeatures(sdUUID, self.mountpoint)
  File /usr/share/vdsm/storage/fileSD.py, line 88, in 
validateFileSystemFeatures
oop.getProcessPool(sdUUID).directTouch(testFilePath)
  File /usr/share/vdsm/storage/outOfProcess.py, line 320, in directTouch
ioproc.touch(path, flags, mode)
  File /usr/lib/python2.6/site-packages/ioprocess/__init__.py, line 507, in 
touch
self.timeout)
  File /usr/lib/python2.6/site-packages/ioprocess/__init__.py, line 391, in 
_sendCommand
raise OSError(errcode, errstr)
OSError: [Errno 13] Permission denied

I assume this is an nfs issue so I checked to see if I could create a file in 
the mounts from the host machine,  which I could.

My test bed is usign an old OS X NFS via xraid adn the export is ;
/Volumes/Raid1 -maproot=root:wheel -network 192.168.1.0 -mask 255.255.255.0

drwxr-xr-x6 vdsm   kvm  204 Jan 27 16:30 
443b4931-667f-441f-98d8-51384e67a0af
drwxr-xr-x4 vdsm   kvm  136 Jan 27 12:32 data
drwxr-xr-x4 vdsm   kvm  136 Jan 27 00:18 iso
drwxr-xr-x3 vdsm   kvm  102 Jan 26 09:43 vm

the host machine seems to mount the shares fine

drwxr-xr-x  5 vdsm kvm 4096 Jan 27 12:28 .
drwxr-xr-x  3 vdsm kvm 4096 Jan 27 10:04 ..
drwxr-xr-x 13 vdsm kvm  544 Jan 27 00:12 192.168.1.32:_Volumes_Raid1
drwxr-xr-x  2 vdsm kvm 4096 Jan 27 00:12 192.168.1.32:_Volumes_Raid1_data
drwxr-xr-x  2 vdsm kvm 4096 Jan 27 00:24 192.168.1.32:_Volumes_Raid1_iso

and as I said above I can create files in any one of those mounts


Is there a place I can look at to find the offending file?  If that is the 
issue...  Oddly enough everything worked until i rebooted, so I must have 
either changed something or something is buggered.



___
Users mailing list
Users@ovirt.orgmailto:Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users

Mik Rose | Manager, IT - Support Services | PowerSoft Development Corp
1 (250) 642-0295 x23   http://www.power-soft.comhttp://www.power-soft.com/   
Live 
Supporthttps://secure.logmeinrescue.com/Customer/Download.aspx?EntryID=15095831

This e-mail may be privileged and/or confidential, and the sender does not waive
any related rights and obligations. Any distribution, use or copying of this 
e-mail or the information
it contains by other than an intended recipient is unauthorized.
If you received this e-mail in error, please advise me (by return e-mail or 
otherwise) immediately.







___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Host remains Non-Responsive after reboot

2015-01-27 Thread Piotr Kliczewski




- Original Message -
 From: Eli Mesika emes...@redhat.com
 To: Piotr Kliczewski pklic...@redhat.com
 Cc: Artyom Lukianov aluki...@redhat.com, users@ovirt.org, 
 rabsh...@citytwist.net, ILanit Stein
 ist...@redhat.com
 Sent: Tuesday, January 27, 2015 4:39:26 PM
 Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
 
 
 
 - Original Message -
  From: ILanit Stein ist...@redhat.com
  To: Artyom Lukianov aluki...@redhat.com, Eli Mesika
  emes...@redhat.com
  Cc: users@ovirt.org, rabsh...@citytwist.net
  Sent: Tuesday, January 27, 2015 5:19:12 PM
  Subject: Fwd: [ovirt-users] Host remains Non-Responsive after reboot
  
  
  Hi Guys,
  
  Can you please look into this please?
 
 Hi
 From the logs I can see clearly that host is turned on in 2015-01-26
 11:56:51,191
 However, there is a stomp exception in 2015-01-26 11:56:53,544 and a
 connection timeout in 2015-01-26 11:56:53,553 that might be related
 
 Piotr, can you please have a look ?
 

Sure. Can you please send me the logs?

 
  
  Thanks,
  Ilanit.
  - Forwarded Message -
  From: Rob Abshear rabsh...@citytwist.net
  To: ILanit Stein ist...@redhat.com
  Sent: Tuesday, January 27, 2015 3:05:56 PM
  Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
  
  Here are the logs. you requested.  The shutdown of the node was at 11:53
  and vdsmd was manually restarted at 12:01 to get the node back online.
  
  On Tue, Jan 27, 2015 at 2:05 AM, ILanit Stein ist...@redhat.com wrote:
  
   It might be a bug,
   Would you please attach the logs, I mentioned bellow,
   that can bring more details on the failure?
   Adding Eli, that may want to give some input on this issue.
  
   Thanks,
   Ilanit.
  
   - Original Message -
   From: Rob Abshear rabsh...@citytwist.net
   To: ILanit Stein ist...@redhat.com
   Cc: users@ovirt.org
   Sent: Monday, January 26, 2015 9:43:14 PM
   Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
  
   I have done a bit more investigating on this matter.  If I restart the
   node
   from within oVirt using the power management option restart, then the
   node restarts and vdsmd DOES NOT start.  If I go into the DRAC and issue
   the command to power cycle the machine, then the machine restarts and
   vdsmd
   DOES start.  I can run the following command from another node in the
   cluster:
   fence_drac5 -a 192.168.200.105 -l root -p password -x -o reboot
   and the node restarts and vdsmd DOES start.
  
   On Sun, Jan 25, 2015 at 1:56 AM, ILanit Stein ist...@redhat.com wrote:
  
Hi Rob,
   
Thanks for this report.
   
Would you please provide these logs, at the time frame, the host
failure
occur:
1. oVirt Engine: /var/log/ovirt-engine/engine.log
2. host: /var/log/vdsm/vdsm.log
   
If it is reproducible, please add this info as well.
   
You can also check vdsm service status, on host, while host reported as
Non responsive,
by running on host 'service vdsmd status'
There might some problem, that might have prevented from vdsm service
to
come up, on host.
   
Ilanit.
   
- Original Message -
From: Rob Abshear rabsh...@citytwist.net
To: users@ovirt.org
Sent: Friday, January 23, 2015 9:22:42 PM
Subject: [ovirt-users] Host remains Non-Responsive after reboot
   
   
I am running oVirt Engine Version 3.5.0.1-1.el6. I have 4 hosts in the
cluster. Each host has a drac5 and it is configured and working. I am
trying to simulate a node failure. I am running one HA VM on one of the
hosts for testing. I simulate the failure by powering off the host with
   the
VM running.
   
Here is what is happening.
   
   
* Host is powered off
* ~4 minutes pass and the host is recognized as not responding
* Automatic fence runs and the VM migrates. Another host in the
node
is chosen as a proxy to execute Status command on the host.
* Same host is chosen as proxy to execute Start command on the
host.
* Same host is chosen as proxy to execute Status command on the
host.
* The host DOES physically start.
* The host never shows status of UP.
* I select “confirm host has been rebooted” and I see a manual
fence
start.
* Host stays non-responsive.
* I put the host in maintenance and then activate it.
* Host still non-responsive
* I put the host in maintenance and do a reinstall
* Reinstall finishes and host becomes UP
   
So, everything seems to go fine with the HA functionality, but the host
never recovers without being reinstalled. Please let me know which logs
   you
need to look at to help me out with this.
   
Thanks
   
   
Sent with Mixmax
   
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
   
  
  
 

[ovirt-users] gluster only hosts managed by oVirt?

2015-01-27 Thread Jorick Astrego
Hi,

We are currently testing a setup with dedicated gluster servers and
dedicated compute nodes without disks.

For both we provision some custom os install and configuration.
Currently I am only able to manage and view gluster volumes from ovirt
when the Enable Gluster Service is enabled for the Cluster. Right?

There is a Enable Virt Service flag but the options is selected and
greyed out. So I'm not able to disable it.

Can I have gluster only hosts managed through the oVirt admin interface,
without all the virtualization stuff installed on them?






Met vriendelijke groet, With kind regards,

Jorick Astrego

Netbulae Virtualization Experts 



Tel: 053 20 30 270  i...@netbulae.euStaalsteden 4-3A
KvK 08198180
Fax: 053 20 30 271  www.netbulae.eu 7547 TA Enschede
BTW NL821234584B01



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Network config

2015-01-27 Thread Koen Vanoppen
Hello everybody,

Just wanted to know if anybody else has the problem, that when you restart
your network configuration of a hypervisor, all the settings concerning the
bonding are gone...
Do I file a bug report for this or is this already a known issue?

Kind regards,

Koen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Failed to add storage domain

2015-01-27 Thread Koen Vanoppen
Solved...

2015-01-21 13:46 GMT+01:00 Koen Vanoppen vanoppen.k...@gmail.com:

 Ok, after rebooting all the hypervisors I'm left with the following errors:
 Thread-16::ERROR::2015-01-21
 13:39:42,644::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain 6cf8c48e-fbed-4b68-b376-57eab3039878
 Thread-16::ERROR::2015-01-21
 13:39:42,644::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain 6cf8c48e-fbed-4b68-b376-57eab3039878
 Thread-18::ERROR::2015-01-21
 13:39:42,696::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain dc662b6f-00d4-4b9a-a320-0f5ecf5da45e
 Thread-18::ERROR::2015-01-21
 13:39:42,698::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain dc662b6f-00d4-4b9a-a320-0f5ecf5da45e
 Thread-15::ERROR::2015-01-21
 13:39:44,349::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain e1bbe4ab-3717-4a9a-ada2-9d175ab38262
 Thread-15::ERROR::2015-01-21
 13:39:44,350::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain e1bbe4ab-3717-4a9a-ada2-9d175ab38262
 Thread-17::ERROR::2015-01-21
 13:39:44,477::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain e33afd52-812e-45f7-b45b-8b049fc0f31c
 Thread-17::ERROR::2015-01-21
 13:39:44,479::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain e33afd52-812e-45f7-b45b-8b049fc0f31c
 Thread-18::ERROR::2015-01-21
 13:44:44,407::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain dc662b6f-00d4-4b9a-a320-0f5ecf5da45e
 Thread-18::ERROR::2015-01-21
 13:44:44,408::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain dc662b6f-00d4-4b9a-a320-0f5ecf5da45e
 Thread-16::ERROR::2015-01-21
 13:44:44,795::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain 6cf8c48e-fbed-4b68-b376-57eab3039878
 Thread-16::ERROR::2015-01-21
 13:44:44,796::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain 6cf8c48e-fbed-4b68-b376-57eab3039878
 Thread-15::ERROR::2015-01-21
 13:44:45,900::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain e1bbe4ab-3717-4a9a-ada2-9d175ab38262
 Thread-15::ERROR::2015-01-21
 13:44:45,901::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain e1bbe4ab-3717-4a9a-ada2-9d175ab38262
 Thread-17::ERROR::2015-01-21
 13:44:45,934::sdc::137::Storage.StorageDomainCache::(_findDomain) looking
 for unfetched domain e33afd52-812e-45f7-b45b-8b049fc0f31c
 Thread-17::ERROR::2015-01-21
 13:44:45,942::sdc::154::Storage.StorageDomainCache::(_findUnfetchedDomain)
 looking for domain e33afd52-812e-45f7-b45b-8b049fc0f31

 Kind regards,

 Koen

 2015-01-21 11:20 GMT+01:00 Koen Vanoppen vanoppen.k...@gmail.com:

 Added the direct lun, so all hypervisor can see it, no effect.. Still the
 same error

 2015-01-21 10:39 GMT+01:00 Koen Vanoppen vanoppen.k...@gmail.com:

 I noticed that there was a difference in the number of attached LUN's
 between the hypervisors. This is because we have a VM with direct LUN's. Do
 these Lun's on this particular vm also be attached to the other hypervisors
 or to 1 hypervisor only?

 2015-01-21 8:29 GMT+01:00 Koen Vanoppen vanoppen.k...@gmail.com:

 Dear all,

 We have a small issue on our ovirt environment. When I try to add a
 fibre storage pool,
 I get the following error:
 2015-01-21 08:24:48,705 WARN
 [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData]
 (org.ovirt.thread.pool-8-thread-22) Storage domain
 e5d59e58-6408-4f80-911e-a30d0e7ca1fe:BuranIsoDomain is not visible to one
 or more hosts. Since the domains type is ISO, hosts status will not be
 changed to non-operational
 2015-01-21 08:24:54,764 ERROR
 [org.ovirt.engine.core.vdsbroker.vdsbroker.CreateStorageDomainVDSCommand]
 (ajp--127.0.0.1-8702-3) [2c666536] Command
 CreateStorageDomainVDSCommand(HostName = saturnus1, HostId =
 1180a1f6-635e-47f6-bba1-871d8c432de0,
 storageDomain=StorageDomainStatic[StoragePoolOracle01,
 fd6c6779-8353-42f6-b2ff-0c670e4b8a73],
 args=qEZ3pE-03I3-5w9M-1XFN-ArBH-2d2e-fSVRF3) execution failed. Exception:
 VDSNetworkException: java.util.concurrent.TimeoutException
 2015-01-21 08:24:54,769 ERROR
 [org.ovirt.engine.core.bll.storage.AddSANStorageDomainCommand]
 (ajp--127.0.0.1-8702-3) [2c666536] Command
 org.ovirt.engine.core.bll.storage.AddSANStorageDomainCommand throw Vdc Bll
 exception. With error message VdcBLLException:
 org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException:
 java.util.concurrent.TimeoutException (Failed with error VDS_NETWORK_ERROR
 and code 5022)
 2015-01-21 08:24:54,810 ERROR
 [org.ovirt.engine.core.bll.storage.AddSANStorageDomainCommand]
 (ajp--127.0.0.1-8702-3) [2c666536] Transaction rolled-back for command:
 org.ovirt.engine.core.bll.storage.AddSANStorageDomainCommand.
 2015-01-21 08:24:54,839 ERROR
 

Re: [ovirt-users] Network config

2015-01-27 Thread Jorick Astrego

On 01/27/2015 01:56 PM, Koen Vanoppen wrote:
 Hello everybody,

 Just wanted to know if anybody else has the problem, that when you
 restart your network configuration of a hypervisor, all the settings
 concerning the bonding are gone...
 Do I file a bug report for this or is this already a known issue?

 Kind regards,

 Koen

Hmm,

I cannot restart the network service from the host for some reason:

Jan 27 14:00:27 test2 systemd: network.service: control process
exited, code=exited status=1
Jan 27 14:00:27 test2 systemd: Failed to start LSB: Bring up/down
networking.


systemctl status network.service
network.service - LSB: Bring up/down networking
   Loaded: loaded (/etc/rc.d/init.d/network)
   Active: failed (Result: exit-code) since Tue 2015-01-27 14:00:27
CET; 1min 56s ago
  Process: 4113 ExecStart=/etc/rc.d/init.d/network start
(code=exited, status=1/FAILURE)

Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test network[4113]: RTNETLINK
answers: File exists
Jan 27 14:00:27 test2.netbulae.test systemd[1]: network.service:
control process exited, code=exited status=1
Jan 27 14:00:27 test2.netbulae.test systemd[1]: Failed to start LSB:
Bring up/down networking.
Jan 27 14:00:27 test2.netbulae.test systemd[1]: Unit network.service
entered failed state.

-- Subject: Unit network.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- 
-- Unit network.service has failed.
-- 
-- The result is failed.
Jan 27 14:00:27 test2.netbulae.test systemd[1]: Unit network.service
entered failed state.


In supervdsm I find lots and lots of these:


sourceRoute::DEBUG::2015-01-27
14:02:40,802::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:02:40,806::utils::759::root::(execCmd) FAILED: err = 'RTNETLINK
answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:02:40,806::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:02:40,807::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed
sourceRoute::DEBUG::2015-01-27
14:07:09,047::sourceroutethread::39::root::(process_IN_CLOSE_WRITE_filePath)
Responding to DHCP response in /var/run/vdsm/sourceRoutes/1422364029
sourceRoute::INFO::2015-01-27
14:07:09,052::sourceroute::78::root::(configure) Configuring gateway
- ip: **.***.**.***, network: **.***.**.**0/24, subnet:
255.255.255.0, gateway: **.***.**.**, table: 174351414, device: bond0
sourceRoute::DEBUG::2015-01-27
14:07:09,052::utils::739::root::(execCmd) /sbin/ip -4 route add
0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
sourceRoute::DEBUG::2015-01-27
14:07:09,057::utils::759::root::(execCmd) FAILED: err = 'RTNETLINK
answers: File exists\n'; rc = 2
sourceRoute::ERROR::2015-01-27
14:07:09,057::sourceroute::88::root::(configure) ip binary failed
during source route configuration: ['RTNETLINK answers: File exists']
sourceRoute::WARNING::2015-01-27
14:07:09,057::utils::129::root::(rmFile) File:
/var/run/vdsm/trackedInterfaces/bond0 already removed


The hosts are all Centos 7 freshly provisioned. With these vdsm packages:

vdsm-python-zombiereaper-4.16.10-8.gitc937927.el7.noarch
vdsm-python-4.16.10-8.gitc937927.el7.noarch
vdsm-yajsonrpc-4.16.10-8.gitc937927.el7.noarch
vdsm-4.16.10-8.gitc937927.el7.x86_64
vdsm-xmlrpc-4.16.10-8.gitc937927.el7.noarch
vdsm-jsonrpc-4.16.10-8.gitc937927.el7.noarch
vdsm-cli-4.16.10-8.gitc937927.el7.noarch





Met vriendelijke groet, With kind regards,

Jorick Astrego

Netbulae Virtualization Experts 



Tel: 053 20 30 270  i...@netbulae.euStaalsteden 4-3A
KvK 08198180
Fax: 053 20 30 271  www.netbulae.eu 7547 TA Enschede
BTW NL821234584B01



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] gluster only hosts managed by oVirt?

2015-01-27 Thread Sven Kieske


On 27/01/15 13:56, Jorick Astrego wrote:
 For both we provision some custom os install and configuration.
 Currently I am only able to manage and view gluster volumes from ovirt
 when the Enable Gluster Service is enabled for the Cluster. Right?

AFAIK correct.

 There is a Enable Virt Service flag but the options is selected and
 greyed out. So I'm not able to disable it.

Mhm this is sad. I'm not aware of any bug tracking this, will you open
one? It's also possible I missed it, there are too many bugs to
remember them all.

-- 
Mit freundlichen Grüßen / Regards

Sven Kieske

Systemadministrator
Mittwald CM Service GmbH  Co. KG
Königsberger Straße 6
32339 Espelkamp
T: +49-5772-293-100
F: +49-5772-293-333
https://www.mittwald.de
Geschäftsführer: Robert Meyer
St.Nr.: 331/5721/1033, USt-IdNr.: DE814773217, HRA 6640, AG Bad Oeynhausen
Komplementärin: Robert Meyer Verwaltungs GmbH, HRB 13260, AG Bad Oeynhausen
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Network config

2015-01-27 Thread Ondřej Svoboda

Hello Koen,

please share parts of your /var/log/vdsm/vdsm.log and 
/var/log/vdsm/supervdsm.log and maybe also `journalctl` output that do 
not pose any security risk to you.


Regards,
Ondra

On 27.1.2015 13:56, Koen Vanoppen wrote:

Hello everybody,

Just wanted to know if anybody else has the problem, that when you 
restart your network configuration of a hypervisor, all the settings 
concerning the bonding are gone...

Do I file a bug report for this or is this already a known issue?

Kind regards,

Koen


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Network config

2015-01-27 Thread Jorick Astrego

Done, maybe this helps:

Jan 27 16:20:26 test2 vdsmd_init_common.sh: vdsm: Running
unified_network_persistence_upgrade
Jan 27 16:20:26 test2 vdsmd_init_common.sh: vdsm: Running restore_nets
Jan 27 16:21:47 test2 systemd-logind: New session 5 of user root.
Jan 27 16:21:47 test2 systemd: Starting Session 5 of user root.
Jan 27 16:21:47 test2 systemd: Started Session 5 of user root.
Jan 27 16:21:55 test2 systemd: vdsmd.service operation timed out.
Terminating.
Jan 27 16:21:55 test2 journal: End of file while reading data:
Input/output error
Jan 27 16:21:55 test2 systemd: Failed to start Virtual Desktop
Server Manager.
Jan 27 16:21:55 test2 systemd: Unit vdsmd.service entered failed state.
Jan 27 16:21:55 test2 systemd: vdsmd.service holdoff time over,
scheduling restart.
Jan 27 16:21:55 test2 systemd: Stopping Virtual Desktop Server
Manager...
Jan 27 16:21:55 test2 systemd: Starting Virtual Desktop Server
Manager...
Jan 27 16:21:55 test2 vdsmd_init_common.sh: vdsm: Running mkdirs

Still:

MainThread::INFO::2015-01-27
16:21:56,964::netconfpersistence::158::root::(_clearDisk) Clearing
/var/run/vdsm/netconf/nets/ and /var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
16:21:56,964::netconfpersistence::166::root::(_clearDisk) No
existent config to clear.
MainThread::INFO::2015-01-27
16:21:56,964::netconfpersistence::182::root::(save) Saved new config
RunningConfig({}, {}) to /var/run/vdsm/netconf/nets/ and
/var/run/vdsm/netconf/bonds/
MainThread::DEBUG::2015-01-27
16:21:56,965::netconfpersistence::134::root::(_getConfigs)
Non-existing config set.

And when I want to reapply setup networks in ovirt I get:


Error while executing action Commit Network changes: Network error
during communication with the Host.

Every interface and bond is still green and visible in the webgui by the
way. That is also strange, every interface of the host is bonded and up
while in reality the host has lost it's network config and doesn't
respond anymore.

Shouldn't they all be red on communication failure to the host?

Cheers,

Jorick

On 01/27/2015 04:01 PM, Ondřej Svoboda wrote:
 This is weird, like there were multiple dhclients on the bond. Can you
 stop VDSM (systemctl stop vdsmd supervdsmd) and maybe `killall
 -SIGKILL dhclient` and restart VDSM afterwards?

 On 27.1.2015 14:43, Jorick Astrego wrote:
 Ah yes, I forgot to disable it as I switched to foreman provisioning.

 Maybe it can be auto detected and disabled in the install host
 scripts when ovirt installs vdsm and sets up the networking. For
 people who forget to disable it.

 But now I get this:

 sourceRoute::DEBUG::2015-01-27
 14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
 0.0.0.0/0 via ***.***.***.*** dev bond0 table 174351414 (cwd None)
 sourceRoute::DEBUG::2015-01-27
 14:33:57,420::utils::759::root::(execCmd) FAILED: err =
 'RTNETLINK answers: File exists\n'; rc = 2
 sourceRoute::ERROR::2015-01-27
 14:33:57,420::sourceroute::88::root::(configure) ip binary failed
 during source route configuration: ['RTNETLINK answers: File exists']
 sourceRoute::WARNING::2015-01-27
 14:33:57,420::utils::129::root::(rmFile) File:
 /var/run/vdsm/trackedInterfaces/bond0 already removed

 Jan 27 14:38:19 test3.netbulae.test network[3042]: RTNETLINK
 answers: File exists
 Jan 27 14:38:19 test3.netbulae.test systemd[1]: network.service:
 control process exited, code=exited status=1
 Jan 27 14:38:19 test3.netbulae.test systemd[1]: Failed to start
 LSB: Bring up/down networking.
 -- Subject: Unit network.service has failed
 -- Defined-By: systemd
 -- Support:
 http://lists.freedesktop.org/mailman/listinfo/systemd-devel
 -- 
 -- Unit network.service has failed.
 -- 
 -- The result is failed.
 Jan 27 14:38:19 test3.netbulae.test systemd[1]: Unit
 network.service entered failed state.



 Bond0 is still there though

 sourceRoute::DEBUG::2015-01-27
 
 14:33:57,410::sourceroutethread::39::root::(process_IN_CLOSE_WRITE_filePath)
 Responding to DHCP response in /var/run/vdsm/sourceRoutes/1422365637
 sourceRoute::INFO::2015-01-27
 14:33:57,416::sourceroute::78::root::(configure) Configuring
 gateway - ip: **.***.**.**, network: 10.100.100.0/24, subnet:
 255.255.255.0, gateway: **.***.**.**, table: 174351414, device: bond0
 sourceRoute::DEBUG::2015-01-27
 14:33:57,416::utils::739::root::(execCmd) /sbin/ip -4 route add
 0.0.0.0/0 via **.***.**.** dev bond0 table 174351414 (cwd None)
 sourceRoute::DEBUG::2015-01-27
 14:33:57,420::utils::759::root::(execCmd) FAILED: err =
 'RTNETLINK answers: File exists\n'; rc = 2
 sourceRoute::ERROR::2015-01-27
 14:33:57,420::sourceroute::88::root::(configure) ip binary failed
 during source route configuration: ['RTNETLINK 

Re: [ovirt-users] Host remains Non-Responsive after reboot

2015-01-27 Thread Eli Mesika


- Original Message -
 From: ILanit Stein ist...@redhat.com
 To: Artyom Lukianov aluki...@redhat.com, Eli Mesika emes...@redhat.com
 Cc: users@ovirt.org, rabsh...@citytwist.net
 Sent: Tuesday, January 27, 2015 5:19:12 PM
 Subject: Fwd: [ovirt-users] Host remains Non-Responsive after reboot
 
 
 Hi Guys,
 
 Can you please look into this please?

Hi
From the logs I can see clearly that host is turned on in 2015-01-26 
11:56:51,191
However, there is a stomp exception in 2015-01-26 11:56:53,544 and a connection 
timeout in 2015-01-26 11:56:53,553 that might be related 

Piotr, can you please have a look ?


 
 Thanks,
 Ilanit.
 - Forwarded Message -
 From: Rob Abshear rabsh...@citytwist.net
 To: ILanit Stein ist...@redhat.com
 Sent: Tuesday, January 27, 2015 3:05:56 PM
 Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
 
 Here are the logs. you requested.  The shutdown of the node was at 11:53
 and vdsmd was manually restarted at 12:01 to get the node back online.
 
 On Tue, Jan 27, 2015 at 2:05 AM, ILanit Stein ist...@redhat.com wrote:
 
  It might be a bug,
  Would you please attach the logs, I mentioned bellow,
  that can bring more details on the failure?
  Adding Eli, that may want to give some input on this issue.
 
  Thanks,
  Ilanit.
 
  - Original Message -
  From: Rob Abshear rabsh...@citytwist.net
  To: ILanit Stein ist...@redhat.com
  Cc: users@ovirt.org
  Sent: Monday, January 26, 2015 9:43:14 PM
  Subject: Re: [ovirt-users] Host remains Non-Responsive after reboot
 
  I have done a bit more investigating on this matter.  If I restart the node
  from within oVirt using the power management option restart, then the
  node restarts and vdsmd DOES NOT start.  If I go into the DRAC and issue
  the command to power cycle the machine, then the machine restarts and vdsmd
  DOES start.  I can run the following command from another node in the
  cluster:
  fence_drac5 -a 192.168.200.105 -l root -p password -x -o reboot
  and the node restarts and vdsmd DOES start.
 
  On Sun, Jan 25, 2015 at 1:56 AM, ILanit Stein ist...@redhat.com wrote:
 
   Hi Rob,
  
   Thanks for this report.
  
   Would you please provide these logs, at the time frame, the host failure
   occur:
   1. oVirt Engine: /var/log/ovirt-engine/engine.log
   2. host: /var/log/vdsm/vdsm.log
  
   If it is reproducible, please add this info as well.
  
   You can also check vdsm service status, on host, while host reported as
   Non responsive,
   by running on host 'service vdsmd status'
   There might some problem, that might have prevented from vdsm service to
   come up, on host.
  
   Ilanit.
  
   - Original Message -
   From: Rob Abshear rabsh...@citytwist.net
   To: users@ovirt.org
   Sent: Friday, January 23, 2015 9:22:42 PM
   Subject: [ovirt-users] Host remains Non-Responsive after reboot
  
  
   I am running oVirt Engine Version 3.5.0.1-1.el6. I have 4 hosts in the
   cluster. Each host has a drac5 and it is configured and working. I am
   trying to simulate a node failure. I am running one HA VM on one of the
   hosts for testing. I simulate the failure by powering off the host with
  the
   VM running.
  
   Here is what is happening.
  
  
   * Host is powered off
   * ~4 minutes pass and the host is recognized as not responding
   * Automatic fence runs and the VM migrates. Another host in the node
   is chosen as a proxy to execute Status command on the host.
   * Same host is chosen as proxy to execute Start command on the host.
   * Same host is chosen as proxy to execute Status command on the host.
   * The host DOES physically start.
   * The host never shows status of UP.
   * I select “confirm host has been rebooted” and I see a manual fence
   start.
   * Host stays non-responsive.
   * I put the host in maintenance and then activate it.
   * Host still non-responsive
   * I put the host in maintenance and do a reinstall
   * Reinstall finishes and host becomes UP
  
   So, everything seems to go fine with the HA functionality, but the host
   never recovers without being reinstalled. Please let me know which logs
  you
   need to look at to help me out with this.
  
   Thanks
  
  
   Sent with Mixmax
  
   ___
   Users mailing list
   Users@ovirt.org
   http://lists.ovirt.org/mailman/listinfo/users
  
 
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] 3.5.0 to 3.5.1 Upgrade Steps

2015-01-27 Thread Johan Kooijman
Hey Tim,

yum update ovirt-engine-setup*
engine-setup

That's enough for the engine upgrade :) Works like a charm.

On Tue, Jan 27, 2015 at 5:35 PM, Tim Macy mac...@gmail.com wrote:

 What are the proper steps to upgrade the engine from 3.5.0.1-1.el6 to
 3.5.1-1.el6?

 engine-upgrade or engine-setup after yum update ovirt-engine-setup?



 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users




-- 
Met vriendelijke groeten / With kind regards,
Johan Kooijman
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] 3.5.0 to 3.5.1 Upgrade Steps

2015-01-27 Thread Tim Macy
What are the proper steps to upgrade the engine from 3.5.0.1-1.el6 to
3.5.1-1.el6?

engine-upgrade or engine-setup after yum update ovirt-engine-setup?
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] change network MTU settings without taking all the VMs down?

2015-01-27 Thread Darrell Budic
I finally got a couple of networks our from behind a wan based layer 2 bridge 
that required me to run at MTU 1448, and would like to get back up to MTU 1500. 
I see the GUI won’t let me do that while the network is in use. Any way around 
this, clean or otherwise? Restarting VMs to update them is ok, just trying to 
avoid having to take everything down at the same time.

 -Darrell
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] gluster only hosts managed by oVirt?

2015-01-27 Thread Kanagaraj


On 01/27/2015 06:47 PM, Sven Kieske wrote:


On 27/01/15 13:56, Jorick Astrego wrote:

For both we provision some custom os install and configuration.
Currently I am only able to manage and view gluster volumes from ovirt
when the Enable Gluster Service is enabled for the Cluster. Right?

AFAIK correct.


Right.




There is a Enable Virt Service flag but the options is selected and
greyed out. So I'm not able to disable it.

Mhm this is sad. I'm not aware of any bug tracking this, will you open
one? It's also possible I missed it, there are too many bugs to
remember them all.


This can happen if there are any VMS in the cluster you are trying to edit.

Do you have any VMS running on the cluster?

If you create cluster with just 'Enable Gluster Service' selected, you 
should be able to manage your gluster hosts. Here the hosts need not to 
have any virtualization stuff in it.



Thanks,
Kanagaraj





___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users