Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Will Dennis
I actually had the opposite problem - I (not knowing) elected to say “Yes” 
(default answer) to “iptables was detected on your computer, do you wish setup 
to configure it?” which then put in the oVirt iptables rules, which assume the 
standard Gluster TCP ports… Since I am running hyperconverged and had followed 
the instructions found at:
http://www.ovirt.org/Features/Self_Hosted_Engine_Hyper_Converged_Gluster_Support
which ends up changing the Gluster ports, then I experienced a fault with 
Gluster where it lost quorum and went read-only since the firewall on the hosts 
were blocking Gluster communications...


On Jan 6, 2016, at 11:10 AM, Sahina Bose  wrote:

Also, worth checking that glusterd ports are open on the gluster hosts (we had 
an issue where HE install overrides glusterd ports and gluster volume was 
inaccessible)

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Sahina Bose



On 01/06/2016 06:42 PM, Will Dennis wrote:

On Jan 6, 2016, at 1:39 AM, Sahina Bose 
mailto:sab...@redhat.com>> wrote:

The reason why the host is Non-operational is usually in the General sub-tab 
for the host.

Ah, did not know that… It does say at the bottom of that pane:

“Host failed to attach one of the Storage Domains attached to it.”

As previously reported to the list last evening, this is true - it has not 
mounted the data SD (which is a Gluster SD.)

Any way to troubleshoot why?



The vdsm log from the non-operational host should have some information 
regarding this.


Can you also check if there are errors in the gluster mount logs - at 
/var/log/glusterfs/rhev-data-center-mnt-glusterSD*


Also, worth checking that glusterd ports are open on the gluster hosts 
(we had an issue where HE install overrides glusterd ports and gluster 
volume was inaccessible)




___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Will Dennis
I did what Joop suggested (put -node-02 into maint, clear vdsm.log on -node-02, 
clear engine.log on HE, then activate -node-02) and — what do you know, the 
node came up into an operational state! It was able to successfully mount the 
data SD this time:

$ ansible istgroup-ovirt -f 1 -i prod -u root -m shell -a "df -h | grep 
':'"ovirt-node-01 | success | rc=0 >>
localhost:/engine   1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine
ovirt-node-01.nec-labs.com:/vmdata  3.7T   70M  3.7T   1% 
/rhev/data-center/mnt/glusterSD/ovirt-node-01.nec-labs.com:_vmdata

ovirt-node-02 | success | rc=0 >>
localhost:/engine   1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine
ovirt-node-01.nec-labs.com:/vmdata  3.7T   70M  3.7T   1% 
/rhev/data-center/mnt/glusterSD/ovirt-node-01.nec-labs.com:_vmdata

ovirt-node-03 | success | rc=0 >>
localhost:/engine   1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine
ovirt-node-01.nec-labs.com:/vmdata  3.7T   70M  3.7T   1% 
/rhev/data-center/mnt/glusterSD/ovirt-node-01.nec-labs.com:_vmdata

Wonder what the magic was? ;)  I’ll take the result anyways :)

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Will Dennis

On Jan 6, 2016, at 1:39 AM, Sahina Bose 
mailto:sab...@redhat.com>> wrote:

The reason why the host is Non-operational is usually in the General sub-tab 
for the host.

Ah, did not know that… It does say at the bottom of that pane:

“Host failed to attach one of the Storage Domains attached to it.”

As previously reported to the list last evening, this is true - it has not 
mounted the data SD (which is a Gluster SD.)

Any way to troubleshoot why?
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Will Dennis

On Jan 6, 2016, at 7:59 AM, Moti Asayag 
mailto:masa...@redhat.com>> wrote:

In order to see the configuration of 'ovirtmgmt' network please paste the 
output of the following command to be executed on the host:
vdsClient -s 0 getVdsCaps

http://fpaste.org/307742/20853451/


In addition, in order to see the reported status of the networks run and paste 
on the host:
vdsClient -s 0 getVdsStats

http://fpaste.org/307744/45208555/

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Moti Asayag
Hi Will,

The engine relies on the status reported by VDSM for the management network
'ovirtmgmt' and for its underlying nics/vlans.

In order to see the configuration of 'ovirtmgmt' network please paste the
output of the following command to be executed on the host:
vdsClient -s 0 getVdsCaps

In addition, in order to see the reported status of the networks run and
paste on the host:
vdsClient -s 0 getVdsStats

That should give the indication of which nic is reported as down for
ovirtmgmt by vdsm.

On Wed, Jan 6, 2016 at 11:15 AM, Eliraz Levi  wrote:

> Hi Will how are you?
> The log is first pointing about certifications issues:
> 2016-01-04 00:02:11,259 ERROR
> [org.ovirt.engine.core.vdsbroker.jsonrpc.JsonRpcVdsServer]
> (DefaultQuartzScheduler_Worker-81) [] Failed to get peer certification for
> host 'ovirt-node-02': SSL session is invalid
> 2016-01-04 00:02:11,259 ERROR
> [org.ovirt.engine.core.bll.CertificationValidityChecker]
> (DefaultQuartzScheduler_Worker-81) [] Failed to retrieve peer
> certifications for host 'ovirt-node-02'
>
> So first thing we should do is to try and solve this problem.
> Please try to re install the host.
> Thanks.
> Eliraz :)
>
> - Original Message -
> From: "Will Dennis" 
> To: "Eliraz Levi" , "users" 
> Sent: Tuesday, 5 January, 2016 5:46:23 AM
> Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose
> & fix?
>
> I must admit I’m getting a bit weary of fighting oVirt problems at this
> point… Before I move on to deploying any VMs onto my new infra, I’d like to
> get the base infra working…
>
> I’m still experiencing a “Non Operational” problem on my “ovirt-node-02”
> host:
>
> http://s1096.photobucket.com/user/willdennis/media/ovirt-node-02_problem.png.html
>
> I have pored thru the logs (all the engine logs, plus the syslogs from the
> engine VM + and my three hypervisor/storage hosts) and I can’t pin down why
> the one node is having a problem… Of course with how voluminous all these
> logs are, it’s kind of like looking for a needle in a haystack, and I’m not
> even sure what the needle looks like, or if it’s even a needle :-/
>
> I have also rebooted this host in past days, this also did not fix the
> problem.
>
> Note that on the screenshot I posted above, that the webadmin hosts screen
> says that -node-01 has one VM running, and the others 0… You’d think that
> would be the HE VM running on there, but it’s actually on -node-02:
>
> $ ansible istgroup-ovirt -f 1 -i prod -u root -m shell -a "hosted-engine
> --vm-status | grep -e '^Hostname' -e '^Engine'"
> ovirt-node-01 | success | rc=0 >>
> Hostname   : ovirt-node-01
> Engine status  : {"reason": "bad vm status", "health":
> "bad", "vm": "down", "detail": "down"}
> Hostname   : ovirt-node-02
> Engine status  : {"health": "good", "vm": "up",
> "detail": "up"}
> Hostname   : ovirt-node-03
> Engine status  : {"reason": "vm not running on this
> host", "health": "bad", "vm": "down", "detail": "unknown"}
>
> ovirt-node-02 | success | rc=0 >>
> Hostname   : ovirt-node-01
> Engine status  : {"reason": "bad vm status", "health":
> "bad", "vm": "down", "detail": "down"}
> Hostname   : ovirt-node-02
> Engine status  : {"health": "good", "vm": "up",
> "detail": "up"}
> Hostname   : ovirt-node-03
> Engine status  : {"reason": "vm not running on this
> host", "health": "bad", "vm": "down", "detail": "unknown"}
>
> ovirt-node-03 | success | rc=0 >>
> Hostname   : ovirt-node-01
> Engine status  : {"reason": "bad vm status", "health":
> "bad", "vm": "down", "detail": "down"}
> Hostname   : ovirt-node-02
> Engine status  : {"health": "good", "vm": "up",
> "detail": "up"}
> Hostname   : ovirt-node-03
> En

Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Will Dennis
Define “reinstall the host” - do you just mean 'yum remove ovirt* vdsm*’ then 
‘yum install ovirt* vdsm*’, or completely reinstall the OS, reset-up Gluster, 
etc.?

On Jan 6, 2016, at 4:15 AM, Eliraz Levi 
mailto:el...@redhat.com>> wrote:

Hi Will how are you?
The log is first pointing about certifications issues:
2016-01-04 00:02:11,259 ERROR 
[org.ovirt.engine.core.vdsbroker.jsonrpc.JsonRpcVdsServer] 
(DefaultQuartzScheduler_Worker-81) [] Failed to get peer certification for host 
'ovirt-node-02': SSL session is invalid
2016-01-04 00:02:11,259 ERROR 
[org.ovirt.engine.core.bll.CertificationValidityChecker] 
(DefaultQuartzScheduler_Worker-81) [] Failed to retrieve peer certifications 
for host 'ovirt-node-02'

So first thing we should do is to try and solve this problem.
Please try to re install the host.
Thanks.
Eliraz :)

- Original Message -
From: "Will Dennis" mailto:wden...@nec-labs.com>>
To: "Eliraz Levi" mailto:el...@redhat.com>>, "users" 
mailto:users@ovirt.org>>
Sent: Tuesday, 5 January, 2016 5:46:23 AM
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix?

I must admit I’m getting a bit weary of fighting oVirt problems at this point… 
Before I move on to deploying any VMs onto my new infra, I’d like to get the 
base infra working…

I’m still experiencing a “Non Operational” problem on my “ovirt-node-02” host:
http://s1096.photobucket.com/user/willdennis/media/ovirt-node-02_problem.png.html

I have pored thru the logs (all the engine logs, plus the syslogs from the 
engine VM + and my three hypervisor/storage hosts) and I can’t pin down why the 
one node is having a problem… Of course with how voluminous all these logs are, 
it’s kind of like looking for a needle in a haystack, and I’m not even sure 
what the needle looks like, or if it’s even a needle :-/

I have also rebooted this host in past days, this also did not fix the problem.

Note that on the screenshot I posted above, that the webadmin hosts screen says 
that -node-01 has one VM running, and the others 0… You’d think that would be 
the HE VM running on there, but it’s actually on -node-02:

$ ansible istgroup-ovirt -f 1 -i prod -u root -m shell -a "hosted-engine 
--vm-status | grep -e '^Hostname' -e '^Engine'"
ovirt-node-01 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown"}

ovirt-node-02 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown"}

ovirt-node-03 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown”}

So it looks like the webadmin UI is wrong as well…

It would be awesome if the UI would give a reason for the “Non Operational” 
status somehow… Or if there was a troubleshooter that could be used to analyze 
the problem… As it is, being so new to all of this, I am completely at the 
list’s mercy to figure this out.

This software has such promise, so I’ll keep working thru these issues, but it 
sure hasn’t been a smooth ride so far…


On Jan 4, 2016, at 7:54 AM, Will Dennis 
mailto:wden...@nec-labs.com><mailto:wden...@nec-labs.com>>
 wrote:

I put all of the engine logs up there now… Try 
engine.log-20160103.gzhttp://i1096.photobucket.com/albums/g330/willdennis/ovirt-node-02_problem.png

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-06 Thread Eliraz Levi
Hi Will how are you?
The log is first pointing about certifications issues:
2016-01-04 00:02:11,259 ERROR 
[org.ovirt.engine.core.vdsbroker.jsonrpc.JsonRpcVdsServer] 
(DefaultQuartzScheduler_Worker-81) [] Failed to get peer certification for host 
'ovirt-node-02': SSL session is invalid
2016-01-04 00:02:11,259 ERROR 
[org.ovirt.engine.core.bll.CertificationValidityChecker] 
(DefaultQuartzScheduler_Worker-81) [] Failed to retrieve peer certifications 
for host 'ovirt-node-02'

So first thing we should do is to try and solve this problem.
Please try to re install the host.
Thanks.
Eliraz :)

- Original Message -
From: "Will Dennis" 
To: "Eliraz Levi" , "users" 
Sent: Tuesday, 5 January, 2016 5:46:23 AM
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix?

I must admit I’m getting a bit weary of fighting oVirt problems at this point… 
Before I move on to deploying any VMs onto my new infra, I’d like to get the 
base infra working…

I’m still experiencing a “Non Operational” problem on my “ovirt-node-02” host:
http://s1096.photobucket.com/user/willdennis/media/ovirt-node-02_problem.png.html

I have pored thru the logs (all the engine logs, plus the syslogs from the 
engine VM + and my three hypervisor/storage hosts) and I can’t pin down why the 
one node is having a problem… Of course with how voluminous all these logs are, 
it’s kind of like looking for a needle in a haystack, and I’m not even sure 
what the needle looks like, or if it’s even a needle :-/

I have also rebooted this host in past days, this also did not fix the problem.

Note that on the screenshot I posted above, that the webadmin hosts screen says 
that -node-01 has one VM running, and the others 0… You’d think that would be 
the HE VM running on there, but it’s actually on -node-02:

$ ansible istgroup-ovirt -f 1 -i prod -u root -m shell -a "hosted-engine 
--vm-status | grep -e '^Hostname' -e '^Engine'"
ovirt-node-01 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown"}

ovirt-node-02 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown"}

ovirt-node-03 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown”}

So it looks like the webadmin UI is wrong as well…

It would be awesome if the UI would give a reason for the “Non Operational” 
status somehow… Or if there was a troubleshooter that could be used to analyze 
the problem… As it is, being so new to all of this, I am completely at the 
list’s mercy to figure this out.

This software has such promise, so I’ll keep working thru these issues, but it 
sure hasn’t been a smooth ride so far…


On Jan 4, 2016, at 7:54 AM, Will Dennis 
mailto:wden...@nec-labs.com>> wrote:

I put all of the engine logs up there now… Try 
engine.log-20160103.gzhttp://i1096.photobucket.com/albums/g330/willdennis/ovirt-node-02_problem.png
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-05 Thread Sahina Bose



On 01/06/2016 07:45 AM, Will Dennis wrote:

Feel like I’m in a bit of an echo chamber here… Is anyone out there? ;) Or have 
I worn out the oVirt crew?

Anyhow, not sure if this is a cause, or an effect, but I noticed tonight that 
the data storage domain (which I’m using Gluster for in a hyperconverged way) 
is not mounted on the problem hypervisor host…

$ ansible istgroup-ovirt -f 1 -i prod -u root -m shell -a "df -h | grep ':’"
ovirt-node-01 | success | rc=0 >>
localhost:/engine 1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine
ovirt-node-01.nec-labs.com:/vmdata 
   3.7T   70M  3.7T   1% 
/rhev/data-center/mnt/glusterSD/ovirt-node-01.nec-labs.com:_vmdata

ovirt-node-02 | success | rc=0 >>
localhost:/engine 1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine

ovirt-node-03 | success | rc=0 >>
localhost:/engine 1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine
ovirt-node-01.nec-labs.com:/vmdata 
3.7T   70M  3.7T   1% 
/rhev/data-center/mnt/glusterSD/ovirt-node-01.nec-labs.com:_vmdata

What causes this mount to occur, and is there a way to trigger the mount 
manually?


Activating the host from maintenance mode should ensure that the storage 
domain is mounted on the host, AFAIK.


The reason why the host is Non-operational is usually in the General 
sub-tab for the host. Were you able to trim the logs (empty log, and 
activate host) like Joop suggested?







On Jan 4, 2016, at 10:47 PM, Will Dennis 
mailto:wden...@nec-labs.com>> wrote:

I must admit I’m getting a bit weary of fighting oVirt problems at this point… 
Before I move on to deploying any VMs onto my new infra, I’d like to get the 
base infra working…

I’m still experiencing a “Non Operational” problem on my “ovirt-node-02” host:
http://s1096.photobucket.com/user/willdennis/media/ovirt-node-02_problem.png.html

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-05 Thread Will Dennis
Feel like I’m in a bit of an echo chamber here… Is anyone out there? ;) Or have 
I worn out the oVirt crew?

Anyhow, not sure if this is a cause, or an effect, but I noticed tonight that 
the data storage domain (which I’m using Gluster for in a hyperconverged way) 
is not mounted on the problem hypervisor host…

$ ansible istgroup-ovirt -f 1 -i prod -u root -m shell -a "df -h | grep ':’"
ovirt-node-01 | success | rc=0 >>
localhost:/engine 1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine
ovirt-node-01.nec-labs.com:/vmdata   
 3.7T   70M  3.7T   1% 
/rhev/data-center/mnt/glusterSD/ovirt-node-01.nec-labs.com:_vmdata

ovirt-node-02 | success | rc=0 >>
localhost:/engine 1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine

ovirt-node-03 | success | rc=0 >>
localhost:/engine 1.9T  3.0G  1.9T   1% 
/rhev/data-center/mnt/glusterSD/localhost:_engine
ovirt-node-01.nec-labs.com:/vmdata   
  3.7T   70M  3.7T   1% 
/rhev/data-center/mnt/glusterSD/ovirt-node-01.nec-labs.com:_vmdata

What causes this mount to occur, and is there a way to trigger the mount 
manually?



On Jan 4, 2016, at 10:47 PM, Will Dennis 
mailto:wden...@nec-labs.com>> wrote:

I must admit I’m getting a bit weary of fighting oVirt problems at this point… 
Before I move on to deploying any VMs onto my new infra, I’d like to get the 
base infra working…

I’m still experiencing a “Non Operational” problem on my “ovirt-node-02” host:
http://s1096.photobucket.com/user/willdennis/media/ovirt-node-02_problem.png.html

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-05 Thread Joop
On 5-1-2016 4:46, Will Dennis wrote:
> I must admit I’m getting a bit weary of fighting oVirt problems at this 
> point… Before I move on to deploying any VMs onto my new infra, I’d like to 
> get the base infra working…
>
> I’m still experiencing a “Non Operational” problem on my “ovirt-node-02” host:
> http://s1096.photobucket.com/user/willdennis/media/ovirt-node-02_problem.png.html
>
>
What you can do to get the logs down to a minimum is to do the following:
- put node-02 in maintenance
- on the host which is down (node-02) cd /var/log/vdsm; :>vdsm.log (this
truncates the log)
- activate node-02
- if it goes to non-operational: cp vdsm.log vdsm.log.error

This will give you a much smaller log and maybe the error will be more
visible.
You could do this on the engine too:
- cd /var/log/ovirt-engine
- :>engine.log
- activate
- cp engine.log engine.log.error

Regards,

Joop


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-04 Thread Will Dennis
I must admit I’m getting a bit weary of fighting oVirt problems at this point… 
Before I move on to deploying any VMs onto my new infra, I’d like to get the 
base infra working…

I’m still experiencing a “Non Operational” problem on my “ovirt-node-02” host:
http://s1096.photobucket.com/user/willdennis/media/ovirt-node-02_problem.png.html

I have pored thru the logs (all the engine logs, plus the syslogs from the 
engine VM + and my three hypervisor/storage hosts) and I can’t pin down why the 
one node is having a problem… Of course with how voluminous all these logs are, 
it’s kind of like looking for a needle in a haystack, and I’m not even sure 
what the needle looks like, or if it’s even a needle :-/

I have also rebooted this host in past days, this also did not fix the problem.

Note that on the screenshot I posted above, that the webadmin hosts screen says 
that -node-01 has one VM running, and the others 0… You’d think that would be 
the HE VM running on there, but it’s actually on -node-02:

$ ansible istgroup-ovirt -f 1 -i prod -u root -m shell -a "hosted-engine 
--vm-status | grep -e '^Hostname' -e '^Engine'"
ovirt-node-01 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown"}

ovirt-node-02 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown"}

ovirt-node-03 | success | rc=0 >>
Hostname   : ovirt-node-01
Engine status  : {"reason": "bad vm status", "health": 
"bad", "vm": "down", "detail": "down"}
Hostname   : ovirt-node-02
Engine status  : {"health": "good", "vm": "up", "detail": 
"up"}
Hostname   : ovirt-node-03
Engine status  : {"reason": "vm not running on this host", 
"health": "bad", "vm": "down", "detail": "unknown”}

So it looks like the webadmin UI is wrong as well…

It would be awesome if the UI would give a reason for the “Non Operational” 
status somehow… Or if there was a troubleshooter that could be used to analyze 
the problem… As it is, being so new to all of this, I am completely at the 
list’s mercy to figure this out.

This software has such promise, so I’ll keep working thru these issues, but it 
sure hasn’t been a smooth ride so far…


On Jan 4, 2016, at 7:54 AM, Will Dennis 
mailto:wden...@nec-labs.com>> wrote:

I put all of the engine logs up there now… Try 
engine.log-20160103.gzhttp://i1096.photobucket.com/albums/g330/willdennis/ovirt-node-02_problem.png
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-04 Thread Will Dennis
I put all of the engine logs up there now… Try engine.log-20160103.gz

> On Jan 4, 2016, at 7:48 AM, Eliraz Levi  wrote:
> 
> ok thanks :)
> It looks like you didn't press refresh capabilities.
> I can't learn a lot from this log.
> can you refresh the host's capabilities and then send the log?
> thanks :)
> Eliraz. 
> 
> - Original Message -
> From: "Will Dennis" 
> To: "Eliraz Levi" 
> Sent: Monday, 4 January, 2016 2:22:03 PM
> Subject: RE: [ovirt-users] host status "Non Operational" - how to diagnose & 
> fix?
> 
> If you try it again, should work now... Damn hackers...
> 
> 
> 
> Sent with Good (www.good.com)
> 
> 
> -Original Message-
> From: Eliraz Levi [el...@redhat.com<mailto:el...@redhat.com>]
> Sent: Monday, January 04, 2016 07:17 AM Eastern Standard Time
> To: Will Dennis
> Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
> fix?
> 
> 
> Hi Will :)
> The link is broken.
> Can you please send a valid one to the list?
> thanks :)
> Eliraz.
> 
> - Original Message -
> From: "Will Dennis" 
> To: "Eliraz Levi" 
> Sent: Sunday, 3 January, 2016 8:23:59 PM
> Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
> fix?
> 
> Digital Ocean droplet and Python SimpleHTTPServer FTW ;)
> http://c7-01.thiscant.fail
> 
> 
> 
> On Jan 3, 2016, at 9:33 AM, Eliraz Levi 
> mailto:el...@redhat.com>> wrote:
> 
> 
> 
> vdsClient output: http://fpaste.org/30/82858714/
> 
> The engine.log is very large (4496 lines) so cannot fpaste… Is there a file 
> upload service that can be used to share these sorts of things with you?
> 
> Hi Will how are you?
> Perhaps you can upload the log to some sort of a cloud? say google and share 
> the URL?
> I think it will be the fastest way around.
> Thanks :)
> please send the URL in the mailing list so everybody will be able to follow.
> Cheers!
> Eliraz :)
> 
> 

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-04 Thread Will Dennis
Sorry, link should be working again now...



Sent with Good (www.good.com)


-Original Message-
From: Eliraz Levi [el...@redhat.com<mailto:el...@redhat.com>]
Sent: Monday, January 04, 2016 07:17 AM Eastern Standard Time
To: Will Dennis
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix?


Hi Will :)
The link is broken.
Can you please send a valid one to the list?
thanks :)
Eliraz.

- Original Message -
From: "Will Dennis" 
To: "Eliraz Levi" 
Sent: Sunday, 3 January, 2016 8:23:59 PM
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix?

Digital Ocean droplet and Python SimpleHTTPServer FTW ;)
http://c7-01.thiscant.fail



On Jan 3, 2016, at 9:33 AM, Eliraz Levi 
mailto:el...@redhat.com>> wrote:



vdsClient output: http://fpaste.org/30/82858714/

The engine.log is very large (4496 lines) so cannot fpaste… Is there a file 
upload service that can be used to share these sorts of things with you?

Hi Will how are you?
Perhaps you can upload the log to some sort of a cloud? say google and share 
the URL?
I think it will be the fastest way around.
Thanks :)
please send the URL in the mailing list so everybody will be able to follow.
Cheers!
Eliraz :)



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-04 Thread Will Dennis
Link is working again...

http://c7-01.thiscant.fail



Sent with Good (www.good.com)


-Original Message-
From: Eliraz Levi [el...@redhat.com<mailto:el...@redhat.com>]
Sent: Monday, January 04, 2016 07:17 AM Eastern Standard Time
To: Will Dennis
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix?


Hi Will :)
The link is broken.
Can you please send a valid one to the list?
thanks :)
Eliraz.

- Original Message -
From: "Will Dennis" 
To: "Eliraz Levi" 
Sent: Sunday, 3 January, 2016 8:23:59 PM
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix?

Digital Ocean droplet and Python SimpleHTTPServer FTW ;)
http://c7-01.thiscant.fail



On Jan 3, 2016, at 9:33 AM, Eliraz Levi 
mailto:el...@redhat.com>> wrote:



vdsClient output: http://fpaste.org/30/82858714/

The engine.log is very large (4496 lines) so cannot fpaste… Is there a file 
upload service that can be used to share these sorts of things with you?

Hi Will how are you?
Perhaps you can upload the log to some sort of a cloud? say google and share 
the URL?
I think it will be the fastest way around.
Thanks :)
please send the URL in the mailing list so everybody will be able to follow.
Cheers!
Eliraz :)



___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-03 Thread Will Dennis
Forgot to cc: list, sorry…

On Jan 3, 2016, at 1:24 PM, Will Dennis 
mailto:wden...@nec-labs.com>> wrote:

Digital Ocean droplet and Python SimpleHTTPServer FTW ;)
http://c7-01.thiscant.fail



On Jan 3, 2016, at 9:33 AM, Eliraz Levi 
mailto:el...@redhat.com>> wrote:



vdsClient output: http://fpaste.org/30/82858714/

The engine.log is very large (4496 lines) so cannot fpaste… Is there a file 
upload service that can be used to share these sorts of things with you?

Hi Will how are you?
Perhaps you can upload the log to some sort of a cloud? say google and share 
the URL?
I think it will be the fastest way around.
Thanks :)
please send the URL in the mailing list so everybody will be able to follow.
Cheers!
Eliraz :)




___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-03 Thread Will Dennis
vdsClient output: http://fpaste.org/30/82858714/

The engine.log is very large (4496 lines) so cannot fpaste… Is there a file 
upload service that can be used to share these sorts of things with you?


On Jan 3, 2016, at 5:24 AM, Eliraz Levi 
mailto:el...@redhat.com>> wrote:

Hi how are you?

Can you please send the following information:
1. Run the following command in the host and send it's output:
  vdsClient -s 0 getVdsStats
2. engine.log

Also, please try to refresh caps.

Thanks.
BR'
Eliraz :)


From: "Karli Sjöberg" mailto:karli.sjob...@slu.se>>
To: "Will Dennis" mailto:wden...@nec-labs.com>>
Cc: "users" mailto:users@ovirt.org>>
Sent: Sunday, 3 January, 2016 9:14:04 AM
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix?




Den 3 jan. 2016 2:43 fm skrev Will Dennis 
mailto:wden...@nec-labs.com>>:

 The ‘ovirtmgmt’ network has been & is still placed on a working NIC 
(enp12s0f0)… It’s just that now, oVirt somehow doesn’t *think* it’s working…

Here's something I wrote a long time ago now, for those times when 
auto-gui-config fluff just won't do:


http://www.ovirt.org/Bonding_VLAN_Bridge

/K

 http://s1096.photobucket.com/user/willdennis/media/setup-networks.png.html

 However, as I showed you in the ‘ip link show up’ output, it is indeed up and 
working.




 On Jan 2, 2016, at 8:00 PM, Roy Golan 
mailto:rgo...@redhat.com><mailto:rgo...@redhat.com>> wrote:



 On Sun, Jan 3, 2016 at 2:46 AM, Will Dennis 
mailto:wden...@nec-labs.com><mailto:wden...@nec-labs.com>>
 wrote:
 I have had one of my hosts go into the state “Non Operational” after I 
rebooted it… I also noticed that in the oVirt webadmin UI, the NIC that’s used 
in the ‘ovirtmgmt’ network is showing “down”, but in Linux the NIC is 
operational and up, as is the ‘ovirtmgmt’ bridge…


 Hosts tab -> Network Interfaces subtab -> click "Setup networks" and make sure 
"ovirtmgmt" is placed on a working nic.

 make sure
 [root@ovirt-node-02 ~]# ip link sh up
 1: lo:  mtu 65536 qdisc noqueue state UNKNOWN mode 
DEFAULT
 link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
 2: bond0:  mtu 1500 qdisc noqueue 
state DOWN mode DEFAULT
 link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
 3: enp4s0f0:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
 link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
 4: enp4s0f1:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
 link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
 5: enp12s0f0:  mtu 1500 qdisc pfifo_fast 
master ovirtmgmt state UP mode DEFAULT qlen 1000
 link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff
 7: ovirtmgmt:  mtu 1500 qdisc noqueue state 
UP mode DEFAULT
 link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff

 What should I take a look at first?

 ___
 Users mailing list
 Users@ovirt.org<mailto:Users@ovirt.org><mailto:Users@ovirt.org>
 http://lists.ovirt.org/mailman/listinfo/users


 ___
 Users mailing list
 Users@ovirt.org<mailto:Users@ovirt.org>
 http://lists.ovirt.org/mailman/listinfo/users

___

Users mailing list
Users@ovirt.org<mailto:Users@ovirt.org>
http://lists.ovirt.org/mailman/listinfo/users

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-03 Thread Eliraz Levi
Hi how are you? 

Can you please send the following information: 
1. Run the following command in the host and send it's output: 
vdsClient -s 0 getVdsStats 
2. engine.log 

Also, please try to refresh caps. 

Thanks. 
BR' 
Eliraz :) 
- Original Message -

From: "Karli Sjöberg"  
To: "Will Dennis"  
Cc: "users"  
Sent: Sunday, 3 January, 2016 9:14:04 AM 
Subject: Re: [ovirt-users] host status "Non Operational" - how to diagnose & 
fix? 




Den 3 jan. 2016 2:43 fm skrev Will Dennis : 



The ‘ovirtmgmt’ network has been & is still placed on a working NIC 
(enp12s0f0)… It’s just that now, oVirt somehow doesn’t *think* it’s working… 

Here's something I wrote a long time ago now, for those times when 
auto-gui-config fluff just won't do: 




http://www.ovirt.org/Bonding_VLAN_Bridge 

/K 



http://s1096.photobucket.com/user/willdennis/media/setup-networks.png.html 

However, as I showed you in the ‘ip link show up’ output, it is indeed up and 
working. 




On Jan 2, 2016, at 8:00 PM, Roy Golan 
mailto:rgo...@redhat.com>> wrote: 



On Sun, Jan 3, 2016 at 2:46 AM, Will Dennis 
mailto:wden...@nec-labs.com>> wrote: 
I have had one of my hosts go into the state “Non Operational” after I rebooted 
it… I also noticed that in the oVirt webadmin UI, the NIC that’s used in the 
‘ovirtmgmt’ network is showing “down”, but in Linux the NIC is operational and 
up, as is the ‘ovirtmgmt’ bridge… 


Hosts tab -> Network Interfaces subtab -> click "Setup networks" and make sure 
"ovirtmgmt" is placed on a working nic. 

make sure 
[root@ovirt-node-02 ~]# ip link sh up 
1: lo:  mtu 65536 qdisc noqueue state UNKNOWN mode 
DEFAULT 
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 
2: bond0:  mtu 1500 qdisc noqueue 
state DOWN mode DEFAULT 
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff 
3: enp4s0f0:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000 
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff 
4: enp4s0f1:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000 
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff 
5: enp12s0f0:  mtu 1500 qdisc pfifo_fast 
master ovirtmgmt state UP mode DEFAULT qlen 1000 
link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff 
7: ovirtmgmt:  mtu 1500 qdisc noqueue state UP 
mode DEFAULT 
link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff 

What should I take a look at first? 

___ 
Users mailing list 
Users@ovirt.org<mailto:Users@ovirt.org> 
http://lists.ovirt.org/mailman/listinfo/users 


___ 
Users mailing list 
Users@ovirt.org 
http://lists.ovirt.org/mailman/listinfo/users 

___ 



Users mailing list 
Users@ovirt.org 
http://lists.ovirt.org/mailman/listinfo/users 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-02 Thread Karli Sjöberg

Den 3 jan. 2016 2:43 fm skrev Will Dennis :
>
> The ‘ovirtmgmt’ network has been & is still placed on a working NIC 
> (enp12s0f0)… It’s just that now, oVirt somehow doesn’t *think* it’s working…

Here's something I wrote a long time ago now, for those times when 
auto-gui-config fluff just won't do:

http://www.ovirt.org/Bonding_VLAN_Bridge

/K
>
> http://s1096.photobucket.com/user/willdennis/media/setup-networks.png.html
>
> However, as I showed you in the ‘ip link show up’ output, it is indeed up and 
> working.
>
>
>
>
> On Jan 2, 2016, at 8:00 PM, Roy Golan 
> mailto:rgo...@redhat.com>> wrote:
>
>
>
> On Sun, Jan 3, 2016 at 2:46 AM, Will Dennis 
> mailto:wden...@nec-labs.com>> wrote:
> I have had one of my hosts go into the state “Non Operational” after I 
> rebooted it… I also noticed that in the oVirt webadmin UI, the NIC that’s 
> used in the ‘ovirtmgmt’ network is showing “down”, but in Linux the NIC is 
> operational and up, as is the ‘ovirtmgmt’ bridge…
>
>
> Hosts tab -> Network Interfaces subtab -> click "Setup networks" and make 
> sure "ovirtmgmt" is placed on a working nic.
>
> make sure
> [root@ovirt-node-02 ~]# ip link sh up
> 1: lo:  mtu 65536 qdisc noqueue state UNKNOWN mode 
> DEFAULT
> link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
> 2: bond0:  mtu 1500 qdisc noqueue 
> state DOWN mode DEFAULT
> link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
> 3: enp4s0f0:  mtu 1500 qdisc 
> pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
> link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
> 4: enp4s0f1:  mtu 1500 qdisc 
> pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
> link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
> 5: enp12s0f0:  mtu 1500 qdisc pfifo_fast 
> master ovirtmgmt state UP mode DEFAULT qlen 1000
> link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff
> 7: ovirtmgmt:  mtu 1500 qdisc noqueue state 
> UP mode DEFAULT
> link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff
>
> What should I take a look at first?
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-02 Thread Will Dennis
The ‘ovirtmgmt’ network has been & is still placed on a working NIC 
(enp12s0f0)… It’s just that now, oVirt somehow doesn’t *think* it’s working…

http://s1096.photobucket.com/user/willdennis/media/setup-networks.png.html

However, as I showed you in the ‘ip link show up’ output, it is indeed up and 
working.




On Jan 2, 2016, at 8:00 PM, Roy Golan 
mailto:rgo...@redhat.com>> wrote:



On Sun, Jan 3, 2016 at 2:46 AM, Will Dennis 
mailto:wden...@nec-labs.com>> wrote:
I have had one of my hosts go into the state “Non Operational” after I rebooted 
it… I also noticed that in the oVirt webadmin UI, the NIC that’s used in the 
‘ovirtmgmt’ network is showing “down”, but in Linux the NIC is operational and 
up, as is the ‘ovirtmgmt’ bridge…


Hosts tab -> Network Interfaces subtab -> click "Setup networks" and make sure 
"ovirtmgmt" is placed on a working nic.

make sure
[root@ovirt-node-02 ~]# ip link sh up
1: lo:  mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: bond0:  mtu 1500 qdisc noqueue 
state DOWN mode DEFAULT
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
3: enp4s0f0:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
4: enp4s0f1:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
5: enp12s0f0:  mtu 1500 qdisc pfifo_fast 
master ovirtmgmt state UP mode DEFAULT qlen 1000
link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff
7: ovirtmgmt:  mtu 1500 qdisc noqueue state UP 
mode DEFAULT
link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff

What should I take a look at first?

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-02 Thread Roy Golan
On Sun, Jan 3, 2016 at 2:46 AM, Will Dennis  wrote:

> I have had one of my hosts go into the state “Non Operational” after I
> rebooted it… I also noticed that in the oVirt webadmin UI, the NIC that’s
> used in the ‘ovirtmgmt’ network is showing “down”, but in Linux the NIC is
> operational and up, as is the ‘ovirtmgmt’ bridge…
>
>
Hosts tab -> Network Interfaces subtab -> click "Setup networks" and make
sure "ovirtmgmt" is placed on a working nic.

make sure

> [root@ovirt-node-02 ~]# ip link sh up
> 1: lo:  mtu 65536 qdisc noqueue state UNKNOWN mode
> DEFAULT
> link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
> 2: bond0:  mtu 1500 qdisc
> noqueue state DOWN mode DEFAULT
> link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
> 3: enp4s0f0:  mtu 1500 qdisc
> pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
> link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
> 4: enp4s0f1:  mtu 1500 qdisc
> pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
> link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
> 5: enp12s0f0:  mtu 1500 qdisc pfifo_fast
> master ovirtmgmt state UP mode DEFAULT qlen 1000
> link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff
> 7: ovirtmgmt:  mtu 1500 qdisc noqueue
> state UP mode DEFAULT
> link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff
>
> What should I take a look at first?
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] host status "Non Operational" - how to diagnose & fix?

2016-01-02 Thread Will Dennis
I have had one of my hosts go into the state “Non Operational” after I rebooted 
it… I also noticed that in the oVirt webadmin UI, the NIC that’s used in the 
‘ovirtmgmt’ network is showing “down”, but in Linux the NIC is operational and 
up, as is the ‘ovirtmgmt’ bridge…

[root@ovirt-node-02 ~]# ip link sh up
1: lo:  mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
2: bond0:  mtu 1500 qdisc noqueue 
state DOWN mode DEFAULT
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
3: enp4s0f0:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
4: enp4s0f1:  mtu 1500 qdisc 
pfifo_fast master bond0 state DOWN mode DEFAULT qlen 1000
link/ether 00:15:17:7b:e9:b0 brd ff:ff:ff:ff:ff:ff
5: enp12s0f0:  mtu 1500 qdisc pfifo_fast 
master ovirtmgmt state UP mode DEFAULT qlen 1000
link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff
7: ovirtmgmt:  mtu 1500 qdisc noqueue state UP 
mode DEFAULT
link/ether 00:21:85:35:08:4c brd ff:ff:ff:ff:ff:ff

What should I take a look at first?

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users