Re: [PATCH V7 libmlx4 2/2] Add support for XRC QPs

2013-07-24 Thread Yishai Hadas
On 7/11/2013 8:25 PM, Jason Gunthorpe wrote: On Thu, Jul 11, 2013 at 06:07:44PM +0300, Yishai Hadas wrote: + if (sizeof(*verbs_ctx) - + offsetof(struct verbs_context, +close_xrcd) = verbs_ctx-sz) + verbs_ctx-close_xrcd = mlx4_close_xrcd;

Re: [PATCH, RFC] mlx4: Avoid that mlx4_cmd_wait() contributes to the system load

2013-07-24 Thread Or Gerlitz
On 22/07/2013 18:23, Bart Van Assche wrote: Avoid that kernel threads running mlx4_cmd_wait() contribute to the system load by setting the task state to TASK_INTERRUPTIBLE instead of TASK_UNINTERRUPTIBLE while waiting. This patch reduces the load average from about 0.5 to about 0.0 on an idle

Dual star topology

2013-07-24 Thread Gandalf Corvotempesta
Hi to all i'm probably OT but I don't know where to ask I'm searching for some advice creating a dual-star topology to get full path redundancy. I have one dual-port DDR card on each server and two switches. I'll connect one port to each switch, but should I also interconnect both switches like in

Re: [patch] mlx5: use after free in mlx5_cmd_comp_handler()

2013-07-24 Thread Eli Cohen
This race has been spotted and fixed by Moshe Lazer two weeks ago. Anyways, Acked by Eli Cohen e...@mellanox.com On Mon, Jul 22, 2013 at 11:02:01AM +0300, Dan Carpenter wrote: We can't dereference ent after passing it to free_cmd(). Signed-off-by: Dan Carpenter dan.carpen...@oracle.com

Re: [PATCH for-3.11 7/7] IB/iser: Introduce fast memory registration model (FRWR)

2013-07-24 Thread Or Gerlitz
On 23/07/2013 17:47, Bart Van Assche wrote: Sorry but I had overlooked the bounce buffer patch. Regarding page sizes: is an InfiniBand HCA required to support a page size of 512 bytes ? To me it seems like the smallest page size supported by e.g. the ocrdma driver is 4KB. From

Re: [PATCH, RFC] mlx4: Avoid that mlx4_cmd_wait() contributes to the system load

2013-07-24 Thread Bart Van Assche
On 07/24/13 17:17, Or Gerlitz wrote: On 22/07/2013 18:23, Bart Van Assche wrote: Avoid that kernel threads running mlx4_cmd_wait() contribute to the system load by setting the task state to TASK_INTERRUPTIBLE instead of TASK_UNINTERRUPTIBLE while waiting. This patch reduces the load average

Re: [PATCH, RFC] mlx4: Avoid that mlx4_cmd_wait() contributes to the system load

2013-07-24 Thread Or Gerlitz
On 24/07/2013 19:48, Bart Van Assche wrote: I saw a load of 0.5 with several different upstream kernels (3.6..3.10 at least). The only IB-related process that was running on the system was opensmd. This is definitely reproducible. It was only a month after I had noticed this phenomenon that I

Re: [PATCH, RFC] mlx4: Avoid that mlx4_cmd_wait() contributes to the system load

2013-07-24 Thread Bart Van Assche
On 07/24/13 19:06, Or Gerlitz wrote: On 24/07/2013 19:48, Bart Van Assche wrote: I saw a load of 0.5 with several different upstream kernels (3.6..3.10 at least). The only IB-related process that was running on the system was opensmd. This is definitely reproducible. It was only a month after I

Re: Dual star topology

2013-07-24 Thread Hal Rosenstock
Hi Gandalf, On 7/24/2013 11:20 AM, Gandalf Corvotempesta wrote: Hi to all i'm probably OT but I don't know where to ask I'm searching for some advice creating a dual-star topology to get full path redundancy. I have one dual-port DDR card on each server and two switches. I'll connect one

Re: [PATCH, RFC] mlx4: Avoid that mlx4_cmd_wait() contributes to the system load

2013-07-24 Thread Bart Van Assche
On 07/24/13 19:06, Or Gerlitz wrote: On 24/07/2013 19:48, Bart Van Assche wrote: I saw a load of 0.5 with several different upstream kernels (3.6..3.10 at least). The only IB-related process that was running on the system was opensmd. This is definitely reproducible. It was only a month after I

Fwd: Dual star topology

2013-07-24 Thread Gandalf Corvotempesta
in this way all services should be dual stack with at least two difrent addresses, one for each infiniband subnets. I have to configure ceph on these subnets and ceph doesn't allow to set multiple addresses for each service. Il giorno 24/lug/2013 20:04, Hal Rosenstock h...@dev.mellanox.co.il ha

Re: Dual star topology

2013-07-24 Thread Gandalf Corvotempesta
2013/7/24 Gandalf Corvotempesta gandalf.corvotempe...@gmail.com: I have to configure ceph on these subnets and ceph doesn't allow to set multiple addresses for each service. Let me try to explain in a better way. I would like to create a ceph cluster over an infiniband network. Each server has

Re: Dual star topology

2013-07-24 Thread Hal Rosenstock
On 7/24/2013 4:56 PM, Gandalf Corvotempesta wrote: 2013/7/24 Gandalf Corvotempesta gandalf.corvotempe...@gmail.com: I have to configure ceph on these subnets and ceph doesn't allow to set multiple addresses for each service. Let me try to explain in a better way. I would like to create a

[PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Ira Weiny
The specification does not indicate that Device Mgmt is an RMPP class. Signed-off-by: Ira Weiny ira.we...@intel.com --- drivers/infiniband/core/mad.c |1 - 1 files changed, 0 insertions(+), 1 deletions(-) diff --git a/drivers/infiniband/core/mad.c b/drivers/infiniband/core/mad.c index

Re: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Hal Rosenstock
On 7/24/2013 5:44 PM, Ira Weiny wrote: The specification does not indicate that Device Mgmt is an RMPP class. See Annex 8; DevMgt class version 2 rather than 1 is currently supported. -- Hal Signed-off-by: Ira Weiny ira.we...@intel.com --- drivers/infiniband/core/mad.c |1 - 1 files

[PATCH for-3.11 3/3] rdma/cm: Only call cma_save_ib_info for CM REQs

2013-07-24 Thread sean . hefty
From: Sean Hefty sean.he...@intel.com Calling cma_save_ib_info for CM SIDR REQs results in a crash accessing an invalid path record pointer. Signed-off-by: Sean Hefty sean.he...@intel.com --- drivers/infiniband/core/cma.c |3 ++- 1 files changed, 2 insertions(+), 1 deletions(-) diff --git

[PATCH for-3.11 2/3] rdma/cm: Fix accessing invalid private data for UD

2013-07-24 Thread sean . hefty
From: Sean Hefty sean.he...@intel.com If a application is using AF_IB with a UD QP, but does not provide any private data, we will end up accessing invalid memory. Check for this case and handle it appropriately. Signed-off-by: Sean Hefty sean.he...@intel.com --- drivers/infiniband/core/cma.c

[PATCH for-3.11 1/3] [v2] RDMA/cma: silence GCC warning

2013-07-24 Thread sean . hefty
From: Paul Bolle pebo...@tiscali.nl Building cma.o triggers this GCC warning: drivers/infiniband/core/cma.c: In function ‘rdma_resolve_addr’: drivers/infiniband/core/cma.c:465:23: warning: ‘port’ may be used uninitialized in this function [-Wmaybe-uninitialized]

[PATCH for-3.11 0/3] rdma/cm: Fixes for AF_IB patch series

2013-07-24 Thread sean . hefty
From: Sean Hefty sean.he...@intel.com Additional testing of the latest AF_IB patch series showed that bugs were introduced when mixing AF_IB with UD QPs. Full support for AF_IB with UD QPs is limited, but we should at least avoid crashing the kernel! Please apply two fixes for crashes to 3.11,

RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Weiny, Ira
-Original Message- From: Hal Rosenstock [mailto:h...@dev.mellanox.co.il] Subject: Re: [PATCH] ib/mad: remove Device Mgmt from RMPP class list On 7/24/2013 5:44 PM, Ira Weiny wrote: The specification does not indicate that Device Mgmt is an RMPP class. See Annex 8; DevMgt class

Re: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Hal Rosenstock
On 7/24/2013 6:10 PM, Weiny, Ira wrote: -Original Message- From: Hal Rosenstock [mailto:h...@dev.mellanox.co.il] Subject: Re: [PATCH] ib/mad: remove Device Mgmt from RMPP class list On 7/24/2013 5:44 PM, Ira Weiny wrote: The specification does not indicate that Device Mgmt is an RMPP

RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Hefty, Sean
See Annex 8; DevMgt class version 2 rather than 1 is currently supported. Could older devices still return version 1? If so the kernel should allow DevMgt without RMPP, correct? This check has been this way since 2.6.17. I think it's reasonable to say that there aren't any devices using

RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Weiny, Ira
Roland, Please skip this patch. Sean, see below. -Original Message- From: Hefty, Sean Sent: Wednesday, July 24, 2013 3:23 PM To: Weiny, Ira; Hal Rosenstock Cc: linux-rdma@vger.kernel.org; rol...@kernel.org Subject: RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

Re: [patch] mlx5: use after free in mlx5_cmd_comp_handler()

2013-07-24 Thread David Miller
From: Eli Cohen e...@dev.mellanox.co.il Date: Wed, 24 Jul 2013 19:35:44 +0300 This race has been spotted and fixed by Moshe Lazer two weeks ago. Anyways, Acked by Eli Cohen e...@mellanox.com Applied. -- To unsubscribe from this list: send the line unsubscribe linux-rdma in the body of a

RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Weiny, Ira
-Original Message- From: Hefty, Sean Subject: RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list See Annex 8; DevMgt class version 2 rather than 1 is currently supported. Could older devices still return version 1? If so the kernel should allow DevMgt without RMPP,

RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Hefty, Sean
See Annex 8; DevMgt class version 2 rather than 1 is currently supported. Could older devices still return version 1? If so the kernel should allow DevMgt without RMPP, correct? This check has been this way since 2.6.17. I think it's reasonable to say that there aren't

RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list

2013-07-24 Thread Weiny, Ira
-Original Message- From: Hefty, Sean Subject: RE: [PATCH] ib/mad: remove Device Mgmt from RMPP class list See Annex 8; DevMgt class version 2 rather than 1 is currently supported. Could older devices still return version 1? If so the kernel should allow DevMgt

[no subject]

2013-07-24 Thread xrg . dev
Hi list, I am moving my work environment on P/Linux, and I am having trouble register large amount of memories. The device I have are ConnectX-3 EN (RoCE). i am trying to register 16GB of memory, mmap and mlock succeeded, but ibv_reg_mr returns ENOMEM. Concurrently, the kernel log reports the