Re: [ovirt-users] Cumulus Switch

2016-12-19 Thread Matt Wells
Thanks Yaniv; I'll have a got and share any progress.   I appreciate the
reply.

On Sun, Dec 18, 2016 at 7:28 AM Yaniv Dary  wrote:

> You can read on the external provider design in:
>
> http://www.ovirt.org/develop/release-management/features/external-network-provider/
> You can also start a project to integrate this software to oVirt with the
> refrence in:
> https://github.com/mmirecki/ovirt-provider-mock
>
> Patches are welcome!
>
> Yaniv Dary
> Technical Product Manager
> Red Hat Israel Ltd.
> 34 Jerusalem Road
> Building A, 4th floor
> Ra'anana, Israel 4350109
>
> Tel : +972 (9) 7692306 <+972%209-769-2306>
> 8272306
> Email: yd...@redhat.com
> IRC : ydary
>
>
> On Thu, Dec 15, 2016 at 10:07 PM, Matt Wells 
> wrote:
>
> I've seen some of the cool stuff coming with OVN and even a co-worker has
> done some great things with it.  However I was wondering if anyone had
> experience with Cumulus as the external provider for networks.
> It's just a "weekend project" I'm picking up and thought to ask on the
> list.  I've not found other posts on it yet but will continue to look.
> I've just made a fresh lab with the latest and greatest oVirt on CentOS 7.
> Thanks to all and a happy holiday season ( if you're into the holiday
> thing ).
> :-)
>
> ___
> Users mailing list
> Users@ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
>
>
> --
Matt Wells
Chief Systems Architect
RHCA III, RHCVA - #110-000-353
(702) 808-0424
matt.we...@mosaic451.com
 Las Vegas | Phoenix | Portland Mosaic451.com
CONFIDENTIALITY NOTICE: This transmittal is a confidential communication or
may otherwise be privileged. If you are not intended recipient, you are
hereby notified that you have received this transmittal in error and that
any review, dissemination, distribution or copying of this transmittal is
strictly prohibited. If you have received this communication in error,
please notify this office, and immediately delete this message and all its
attachments, if any.
1*
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Cumulus Switch

2016-12-15 Thread Matt Wells
I've seen some of the cool stuff coming with OVN and even a co-worker has
done some great things with it.  However I was wondering if anyone had
experience with Cumulus as the external provider for networks.
It's just a "weekend project" I'm picking up and thought to ask on the
list.  I've not found other posts on it yet but will continue to look.
I've just made a fresh lab with the latest and greatest oVirt on CentOS 7.
Thanks to all and a happy holiday season ( if you're into the holiday thing
).
:-)
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Gluster + ovirt + resize2fs

2016-06-02 Thread Matt Wells
Thanks Sahina; an item I should have added as well.

On Wed, Jun 1, 2016 at 10:58 PM Sahina Bose  wrote:

> [+gluster-users]
>
>
> On 06/01/2016 11:30 PM, Matt Wells wrote:
>
> Apologies, it's XFS so would be an xfs_growfs
>
> On Wed, Jun 1, 2016 at 10:58 AM, Matt Wells 
> wrote:
>
>> Hi everyone, I had a quick question that I really needed to bounce off
>> someone; one of those measure twice cut once moments.
>>
>> My primary datastore is on a gluster volume and the short story is I'm
>> going to grow it.  I've thought of two options
>>
>> 1 - add a brick with the new space
>> ** Was wondering from the gluster point of view if anyone had a best
>> practice for this.  I've looked around and find many people explaining
>> their stories but not a definitive best practices.
>>
>>
>> 2 - as I'm sitting atop LVMs grow the LVM.
>> ** This is the one that makes me a little nervous.  I've done many
>> resize2fs and never had issues, but I've never had gluster running atop
>> that volume and my VM's atop that.  Has anyone had any experiences they
>> could share?
>>
>> Thanks all -
>> Wells
>>
>
>
> ___
> Users mailing listUsers@ovirt.orghttp://lists.ovirt.org/mailman/listinfo/users
>
>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Gluster + ovirt + resize2fs

2016-06-01 Thread Matt Wells
Apologies, it's XFS so would be an xfs_growfs

On Wed, Jun 1, 2016 at 10:58 AM, Matt Wells 
wrote:

> Hi everyone, I had a quick question that I really needed to bounce off
> someone; one of those measure twice cut once moments.
>
> My primary datastore is on a gluster volume and the short story is I'm
> going to grow it.  I've thought of two options
>
> 1 - add a brick with the new space
> ** Was wondering from the gluster point of view if anyone had a best
> practice for this.  I've looked around and find many people explaining
> their stories but not a definitive best practices.
>
>
> 2 - as I'm sitting atop LVMs grow the LVM.
> ** This is the one that makes me a little nervous.  I've done many
> resize2fs and never had issues, but I've never had gluster running atop
> that volume and my VM's atop that.  Has anyone had any experiences they
> could share?
>
> Thanks all -
> Wells
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Gluster + ovirt + resize2fs

2016-06-01 Thread Matt Wells
Hi everyone, I had a quick question that I really needed to bounce off
someone; one of those measure twice cut once moments.

My primary datastore is on a gluster volume and the short story is I'm
going to grow it.  I've thought of two options

1 - add a brick with the new space
** Was wondering from the gluster point of view if anyone had a best
practice for this.  I've looked around and find many people explaining
their stories but not a definitive best practices.


2 - as I'm sitting atop LVMs grow the LVM.
** This is the one that makes me a little nervous.  I've done many
resize2fs and never had issues, but I've never had gluster running atop
that volume and my VM's atop that.  Has anyone had any experiences they
could share?

Thanks all -
Wells
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Migration Return Errors

2015-12-10 Thread Matt Wells
We're running oVirt 3.5 in a two node setup.  We've had some storage issues
and getting it humming with Gluster but this seems to have all of that
squared away.  Our problem is peculiar and I wanted to ask the list to see
if anyone else has had this.  In short you can migrate a VM from one host
to the other  but once there it can not return to it's original host.
We attempted a few things yesterday and with testing today I'll attach logs
shortly.

** Environment
Hosts name -
Host1.example.com
Host2.example.com

VMs name -
Server1.example.com
Server2.example.com

This is a hosted-enginge configuration but all seems to be working well.
Even the hosted-engine shows this behavior but I'm focusing on Server 1 and
2; not the engine as I believe it complicates the tests.

* Both nodes are operational with no power management setup
* All VMs reside within the vm_storage mount point
* At this time there is only one Network; it has the VMs and hosts.  This
will be broken up after this is resolved.
* Server1 migrates (by hand ) from Host1 to Host2 with no issues
* Server2 is powered on atop Host1 with no issues; leaving one VM on both
hosts
* I attempt to migrate Server1 back to Host1 and he states there is no
available hypervisor
* I can reboot Server2 with no issues, create a new VM on Host1
* A VM can only migrate onces, never able to return from whence it came
The only way to force this is to place Host1 in maintenance mode and
activate again. After this Server2 can return to Host1 but then is unable
to go back to Server2; being in the same cycle
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [ovirt-users] Windows Networking Issues -

2015-12-01 Thread Matt Wells
I wanted to follow up on this after I found my resolution.

I started to see kernel errors when I migrated all but my windows host off
a hypervisor and generated traffic.  I then took those errors and started
looking back at all of the hypervisors only to find this error was on each
of them; also it actively reporting on systems with Windows VMs.  Tracing
back the logs lead me to the bug reports below where I learned that this
issue had been re-introduced to the 2.6.32-X kernel.

ABRT info -
Nov 30 19:28:42 server2.example.com kernel: WARNING: at net/core/dev.c:1915
skb_warn_bad_offload+0x99/0xb0() (Tainted: GW  --    )
Nov 30 19:28:42 server2.example.com kernel: Hardware name: PowerEdge M620
Nov 30 19:28:42 server2.example.com kernel: : caps=(0x40c9, 0x0) len=1514
data_len=1460 ip_summed=1
Nov 30 19:28:42 server2.example.com kernel: Modules linked in: sch_prio
act_mirred cls_u32 sch_ingress ebt_arp xt_physdev ipt_REJECT
nf_conntrack_ipv4 nf_defrag_ipv4 xt_multiport iptable_filter ip_tables fuse
nfs lockd fscache auth_rpcgss nfs_acl sunrpc vfat fat bonding ebtable_nat
ebtables be2iscsi iscsi_boot_sysfs bnx2i cnic uio cxgb4i iw_cxgb4 cxgb4
cxgb3i libcxgbi iw_cxgb3 cxgb3 ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad
ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi
mpt3sas mpt2sas scsi_transport_sas raid_class mptctl mptbase dell_rbu
autofs4 bridge 8021q garp stp llc ip6t_REJECT nf_conntrack_ipv6
nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter ip6_tables ipv6
dm_round_robin dm_multipath vhost_net macvtap macvlan tun kvm_intel kvm sg
ipmi_devintf sr_mod cdrom joydev power_meter acpi_ipmi ipmi_si
ipmi_msghandler iTCO_wdt iTCO_vendor_support ixgbe dca ptp pps_core mdio
dcdbas sb_edac edac_core lpc_ich mfd_core shpchp usb_storage ext4 jbd2
mbcache sd_mod crc_t10dif megaraid_sas wmi ahci dm_mirror dm_region_hash
dm_log dm_mod [last unloaded: ip_tables]
Nov 30 19:28:42 server2.example.com kernel: Pid: 0, comm: swapper Tainted:
GW  -- 2.6.32-573.7.1.el6.x86_64 #1
Nov 30 19:28:42 server2.example.com kernel: Call Trace:
Nov 30 19:28:42 server2.example.com kernel:   [] ?
warn_slowpath_common+0x91/0xe0
Nov 30 19:28:42 server2.example.com kernel: [] ?
warn_slowpath_fmt+0x46/0x60
Nov 30 19:28:42 server2.example.com kernel: [] ?
__ratelimit+0xd5/0x120
Nov 30 19:28:42 server2.example.com kernel: [] ?
skb_warn_bad_offload+0x99/0xb0
Nov 30 19:28:42 server2.example.com kernel: [] ?
__skb_gso_segment+0x71/0xc0
Nov 30 19:28:42 server2.example.com kernel: [] ?
skb_gso_segment+0x13/0x20
Nov 30 19:28:42 server2.example.com kernel: [] ?
dev_hard_start_xmit+0x9b/0x490
Nov 30 19:28:42 server2.example.com kernel: [] ?
sch_direct_xmit+0x15a/0x1c0
Nov 30 19:28:42 server2.example.com kernel: [] ?
dev_queue_xmit+0x228/0x320
Nov 30 19:28:42 server2.example.com kernel: [] ?
__br_forward+0x0/0xd0 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
br_dev_queue_push_xmit+0x88/0xc0 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
br_forward_finish+0x58/0x60 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
__br_forward+0xaa/0xd0 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
skb_clone+0x58/0xb0
Nov 30 19:28:42 server2.example.com kernel: [] ?
deliver_clone+0x3e/0x60 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
br_forward+0x41/0x70 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
br_handle_frame_finish+0x17e/0x330 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
br_handle_frame+0x1c0/0x270 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
br_handle_frame+0x0/0x270 [bridge]
Nov 30 19:28:42 server2.example.com kernel: [] ?
__netif_receive_skb+0x1c7/0x570
Nov 30 19:28:42 server2.example.com kernel: [] ?
netif_receive_skb+0x58/0x60
Nov 30 19:28:42 server2.example.com kernel: [] ?
napi_skb_finish+0x50/0x70
Nov 30 19:28:42 server2.example.com kernel: [] ?
napi_gro_receive_gr+0x39/0x50
Nov 30 19:28:42 server2.example.com kernel: [] ?
vlan_gro_receive+0x1b/0x30
Nov 30 19:28:42 server2.example.com kernel: [] ?
ixgbe_clean_rx_irq+0x995/0xc70 [ixgbe]
Nov 30 19:28:42 server2.example.com kernel: [] ?
ixgbe_poll+0x40a/0x760 [ixgbe]
Nov 30 19:28:42 server2.example.com kernel: [] ?
net_rx_action+0x103/0x2f0
Nov 30 19:28:42 server2.example.com kernel: [] ?
ktime_get+0x6d/0x100
Nov 30 19:28:42 server2.example.com kernel: [] ?
__do_softirq+0xc1/0x1e0
Nov 30 19:28:42 server2.example.com kernel: [] ?
handle_IRQ_event+0x60/0x170
Nov 30 19:28:42 server2.example.com kernel: [] ?
call_softirq+0x1c/0x30
Nov 30 19:28:42 server2.example.com kernel: [] ?
do_softirq+0x65/0xa0
Nov 30 19:28:42 server2.example.com kernel: [] ?
irq_exit+0x85/0x90
Nov 30 19:28:42 server2.example.com kernel: [] ?
do_IRQ+0x75/0xf0
Nov 30 19:28:42 server2.example.com kernel: [] ?
ret_from_intr+0x0/0x11
Nov 30 19:28:42 server2.example.com kernel:   [] ?
intel_idle+0xfe/0x1b0
Nov 30 19:28:42 server2.example.com kernel: [] ?
intel_idle+0xe1/0x1b0
Nov 30 19:28:42 serve

Re: [ovirt-users] Windows Networking Issues -

2015-11-25 Thread Matt Wells
All networking within oVirt seems fine.  I can ping devices on other
networks, I'm able to use a squid proxy on the same network with no
issues.  I'm able to use Terminal Services from another VM in oVirt.  One
oddity is when I first boot, I'm able to hit Google and search for a Dog.
On my 2nd search it starts to timeout.

All and all ip networking seems to be configured well.  The DG lives
outside of oVirt and I'm always able to ping it; even increasing the packet
size.

MTU size is 1500 and matches the VLAN MTU in oVirt; the bond0 is set to set
to an 9000 MTU.
EM1 and EM2 are in bond0 and show no framing errors.

On Wed, Nov 25, 2015 at 1:30 AM, Yaniv Kaul  wrote:

> On Tue, Nov 24, 2015 at 11:17 PM, Matt Wells 
> wrote:
>
>> Hi all, I have a question about Windows VMs and a networking issue I'm
>> having.
>>
>> Here's the setup -
>> * oVirt - 3.5.1.1-1
>> * Hypervisors are CentOS 6.7 box with 2 NICs in bond0 'mode=4 miimon=100
>> lacp_rate=1'
>> * On bond0 I have a few networks using vlan tagging.
>> * Networks are 5,10,15,20 - All on an external switch
>>
>> Network 15 has a Windows 2012 R2 server and a CentOS 6.7 server on it.
>> The rest of the networks have a few linux.
>>
>> Every linux box on every network is happy.  However any and all Windows
>> boxes I bring online are incapable of patching or hitting the web.  I
>> pointed the Windows box to the linux box next to it as a proxy (after
>> installing squid on it)  When I do that the Windows box has no issues at
>> all; it's only when he's attempting to leave on his own.
>>
>> On my firewall I put in a 'permit any any' on the M$ box IP however all I
>> see is tcp resets in PCAPs,
>>
>
> Can you verify basic IP networking is working correctly for those VMs?
> For example, we've established that they can get to the Linux VMs - how?
> Are they on the same subnet? Or do they go through their default gateway?
> Without knowing the IP topology, if the Linux machines were on the same
> subnet as the Windows one, and the Windows machine fail to get to their
> default gateway for some reason, this may perfectly explain the issue.
> Y.
>
>
>>
>> I've been playing with for some time but can't seem to find the issue.
>> It would be one thing if everything on the 15 was bad but the linux box on
>> the network is fine. Here's the rub, I'm 99.999% sure this used to work.
>>  gggrrr...
>>
>> Any assistance anyone can offer would be amazingly appreciated.
>>
>> Thank you for taking the time to read this.
>>
>>
>> ___
>> Users mailing list
>> Users@ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/users
>>
>>
>
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] Windows Networking Issues -

2015-11-24 Thread Matt Wells
Hi all, I have a question about Windows VMs and a networking issue I'm
having.

Here's the setup -
* oVirt - 3.5.1.1-1
* Hypervisors are CentOS 6.7 box with 2 NICs in bond0 'mode=4 miimon=100
lacp_rate=1'
* On bond0 I have a few networks using vlan tagging.
* Networks are 5,10,15,20 - All on an external switch

Network 15 has a Windows 2012 R2 server and a CentOS 6.7 server on it.
The rest of the networks have a few linux.

Every linux box on every network is happy.  However any and all Windows
boxes I bring online are incapable of patching or hitting the web.  I
pointed the Windows box to the linux box next to it as a proxy (after
installing squid on it)  When I do that the Windows box has no issues at
all; it's only when he's attempting to leave on his own.

On my firewall I put in a 'permit any any' on the M$ box IP however all I
see is tcp resets in PCAPs,

I've been playing with for some time but can't seem to find the issue.  It
would be one thing if everything on the 15 was bad but the linux box on the
network is fine. Here's the rub, I'm 99.999% sure this used to work.
 gggrrr...

Any assistance anyone can offer would be amazingly appreciated.

Thank you for taking the time to read this.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] qlnic errors -

2015-04-08 Thread Matt Wells
I'm doing a product demo for one of our customers.

They've setup RHEV 3.5 on a Dell Blade center. We also used 3.4 to
start this demo. So far they've been less than impressed as we have 10
VM's that are just putting some traffic on the system. Under any load
the blades network card driver fails and the VMs are then paused.

Under ESX the same blades have no issues at all. We started with Dell
on this but were unable to find any issues with these systems related
to hardware.

Running on the 3.5 node (even setup one blade as a RHEL + VDSM) and we
continue to get the same errors.

They continue to get this on the 3.5 node image, 3.4 node image,
RHEL{6..7}+VDSM, CentOS{6..7}

Apr  7 23:09:15 POCserver2 kernel: qlcnic :01:00.1: Pause control
frames disabled on all ports
Apr  7 23:09:15 POCserver2 kernel: qlcnic :01:00.1: firmware hang detected
Apr  7 23:09:15 POCserver2 kernel: qlcnic :01:00.1: Dumping hw/fw registers
Apr  7 23:09:15 POCserver2 kernel: PEG_HALT_STATUS1: 0x40001502,
PEG_HALT_STATUS2: 0x3dd980,
Apr  7 23:09:15 POCserver2 kernel: PEG_NET_0_PC: 0x6d394, PEG_NET_1_PC: 0x6d466,
Apr  7 23:09:15 POCserver2 kernel: PEG_NET_2_PC: 0x149, PEG_NET_3_PC: 0x6e598,
Apr  7 23:09:15 POCserver2 kernel: PEG_NET_4_PC: 0x12268
Apr  7 23:09:15 POCserver2 kernel: qlcnic :01:00.0: Pause control
frames disabled on all ports
Apr  7 23:09:15 POCserver2 kernel: qlcnic :01:00.0: firmware hang detected
Apr  7 23:09:15 POCserver2 kernel: qlcnic :01:00.0: Dumping hw/fw registers
Apr  7 23:09:15 POCserver2 kernel: PEG_HALT_STATUS1: 0x40001502,
PEG_HALT_STATUS2: 0x3dd980,
Apr  7 23:09:15 POCserver2 kernel: PEG_NET_0_PC: 0x6d394, PEG_NET_1_PC: 0x6d466,
Apr  7 23:09:15 POCserver2 kernel: PEG_NET_2_PC: 0x149, PEG_NET_3_PC: 0x6e598,
Apr  7 23:09:15 POCserver2 kernel: PEG_NET_4_PC: 0x12268

I've really got limited time for this POC so I sent a support case
with Red Hat as well.  Just hoped that this community may have seen
this.  Thus far my Googlefoo has failed me.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[ovirt-users] HA of VMs

2015-02-24 Thread Matt Wells
I've been poking around for a better way to perform HA.  With VM's like IPA
or even HA web sites behind an HAProxy; how do I ensure that they are never
on the same host?

I've seen the options for watchdog and I can see where that may help a
little, but not without the initial outage to trigger the alert.  There are
many ways to do this however I see few that scale.
I'm sure I'm missing something and I really appreciate any guidance.
Thanks all
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users