Re: [ofa-general] Installing SDP on existing OFED 1.3.1 install - DRBD SDP/Infiniband Support

2009-09-02 Thread Jack Morgenstein
On Tuesday 01 September 2009 16:24, Robert Dunklewrote: Hi Jack, Thanks for the tip. (*Embarassed*) I think a Kernel upgrade since might have broken the source RPM. Is there any way for me to fix this? (I have an identical hardware server that was originally installed with SDP, details

[ofa-general] Re: [PATCHv2] opensm: Parallelize (Stripe) MFT sets across switches

2009-09-02 Thread Sasha Khapyorsky
On 09:39 Mon 31 Aug , Hal Rosenstock wrote: Similar to previous patch to Parallelize (Stripe) LFT sets across switches. Currently, MADs are pipelined to a single switch first which effectively serializes these requests. This patch pipelines the MFT set MADs across switches first (before

[ofa-general] ofa_1_5_kernel 20090902-0200 daily build status

2009-09-02 Thread Vladimir Sokolovsky (Mellanox)
=KBUILD_STR(rds) -c -o /home/vlad/tmp/ofa_1_5_kernel-20090902-0200_linux-2.6.16.60-0.21-smp_x86_64_check/net/rds/.tmp_cong.o /home/vlad/tmp/ofa_1_5_kernel-20090902-0200_linux-2.6.16.60-0.21-smp_x86_64_check/net/rds/cong.c /home/vlad/tmp/ofa_1_5_kernel-20090902-0200_linux-2.6.16.60-0.21

[ofa-general] Re: [PATCH] opensm/osm_base.h: Add new SA ClassPortInfo:CapabilityMask2 bit allocations

2009-09-02 Thread Sasha Khapyorsky
On 10:42 Tue 01 Sep , Hal Rosenstock wrote: Per published MgtWG errata: RefID 4626 - reverse path PKey support in PathRecord responses RefID 4635 - multicast FDB top support RefID 4644 - hierarchy support Signed-off-by: Hal Rosenstock hal.rosenst...@gmail.com Applied. Thanks. Sasha

Re: [Scst-devel] [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Vladislav Bolkhovitin
Chris Worley, on 09/01/2009 11:24 PM wrote: On Tue, Sep 1, 2009 at 1:02 PM, Vladislav Bolkhovitinv...@vlnb.net wrote: I'd suggest you to enable lockdep on the target. Google for more details how to do it. Also you should additional enable mgmt_minor SCST core trace level and only it. Don't

[ofa-general] [PATCH 1/6] ibutils/ibdm: Fix 'invalid conversion from const char* to char*' build error

2009-09-02 Thread sebastien dugue
This occurs under FC11 with gcc 4.4.0-4. Signed-off-by: Sebastien Dugue sebastien.du...@bull.net --- ibdm/ibdm/SysDef.cpp|2 +- ibdm/ibdm/TopoMatch.cpp |2 +- 2 files changed, 2 insertions(+), 2 deletions(-) diff --git a/ibdm/ibdm/SysDef.cpp b/ibdm/ibdm/SysDef.cpp index

[ofa-general] [PATCH 2/6] ibutils/ibdm: Add -fPIC to libreplace build

2009-09-02 Thread sebastien dugue
This allows to build under FC11. Otherwise, building shared libraries using libreplace results in the following error: .../ibutils/ibdm/replace/libreplace.a(regex.o): relocation R_X86_64_32S against `a local symbol' can not be used when making a shared object; recompile with -fPIC

[ofa-general] [PATCH 0/6] ibutils: Build fixes for FC11

2009-09-02 Thread sebastien dugue
Hi, here are some fixes I had to apply in order to be able to build under FC11 due to some changes in the toolchain. Sebastien. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

[ofa-general] [PATCH 5/6] ibutils: Allow parallel build

2009-09-02 Thread sebastien dugue
Signed-off-by: Sebastien Dugue sebastien.du...@bull.net --- ibdm/src/Makefile.am |4 ++-- ibmgtsim/src/Makefile.am |4 +++- 2 files changed, 5 insertions(+), 3 deletions(-) diff --git a/ibdm/src/Makefile.am b/ibdm/src/Makefile.am index b763387..682fb80 100644 ---

[ofa-general] [PATCH 6/6] ibutils: Fix libibsysapi build for old autotools

2009-09-02 Thread sebastien dugue
Signed-off-by: Sebastien Dugue sebastien.du...@bull.net --- ibdm/src/Makefile.am |1 - 1 files changed, 0 insertions(+), 1 deletions(-) diff --git a/ibdm/src/Makefile.am b/ibdm/src/Makefile.am index 682fb80..ec9cfbd 100644 --- a/ibdm/src/Makefile.am +++ b/ibdm/src/Makefile.am @@ -43,7 +43,6

[ofa-general] [PATCH 4/6] ibutils: Add libibsysapi.so to the spec file

2009-09-02 Thread sebastien dugue
Signed-off-by: Sebastien Dugue sebastien.du...@bull.net --- ibutils.spec.in |1 + 1 files changed, 1 insertions(+), 0 deletions(-) diff --git a/ibutils.spec.in b/ibutils.spec.in index abc54dd..e27dbaf 100644 --- a/ibutils.spec.in +++ b/ibutils.spec.in @@ -130,6 +130,7 @@ rm -rf

[ofa-general] [PATCH 3/6] ibutils/ibdm: Fix libibsysapi build

2009-09-02 Thread sebastien dugue
Add libibdmcom linker path to allow build under FC11. Signed-off-by: Sebastien Dugue sebastien.du...@bull.net --- ibdm/src/Makefile.am |2 +- 1 files changed, 1 insertions(+), 1 deletions(-) diff --git a/ibdm/src/Makefile.am b/ibdm/src/Makefile.am index 8b2f9ba..b763387 100644 ---

[ofa-general] [PATCH] libibmad/dump.c: In mad_dump_portcapmask, decode new capabilities

2009-09-02 Thread Hal Rosenstock
Per published MgtWG errata RefID 4484 - vendor specific MADs table support RefID 4626 - reverse path PKey support in PathRecord responses RefID 4635 - multicast FDB top support RefID 4644 - hierarchy support Signed-off-by: Hal Rosenstock hal.rosenst...@gmail.com --- diff --git

[ofa-general] [PATCH] opensm: Add support for MulticastFDBTop

2009-09-02 Thread Hal Rosenstock
Add support for SwitchInfo:MulticastFDBTop Added by MgtWG errata #4505-4508 Also, per MgtWG RefID #4640, MulticastFDBTop value of 0xbfff means no entries In osm_mcast_mgr.c:mcast_mgr_set_mftables call new routine mcast_mgr_set_mfttop to set MulticastFDBTop in SwitchInfo based on

[ofa-general] [PATCH] libibmad/mad.h: Add a couple of SM class attribute IDs

2009-09-02 Thread Hal Rosenstock
VendorSpecificMadsTable added by MgtWG errata RefID 4482 Signed-off-by: Hal Rosenstock hal.rosenst...@gmail.com --- diff --git a/libibmad/include/infiniband/mad.h b/libibmad/include/infiniband/mad.h index 5f3b52b..94b64cf 100644 --- a/libibmad/include/infiniband/mad.h +++

Re: [ofa-general] [PATCHv2 RESEND] IB/IPoIB: Don't let a bad muticast address in the join list stop subsequent joins

2009-09-02 Thread Moni Shoua
Roland Dreier wrote: Illegal multicast address can be handed for IPoIB from userspace. For example the command ip maddr add 33:33:00:00:00:01 dev ib0 injects an illegal muticast address to IPoIB that will start a join task for this address. However, whenever an illegal multicast

[ofa-general] Bad interaction between ofed, NFS and Gaussian

2009-09-02 Thread BOYRIE Fabrice
Hello Hoping I'm in the good mailing list. I've a problem with ofed 1.4.2 on Centos 5.3. We have a new cluster with QDR infiniband. I've installed ofed from source using the install.pl script with the default values. I've used default kernel from Centos (2.6.18-128.7.1.el5) When a node

[ofa-general] Connection timeout on localhost (using libsdp)

2009-09-02 Thread Chris Tilt
Hello, I am very hopeful of getting libsdp working with an existing application. Specifically, I am trying to port Erlang to use SDP for it's distributed Erlang mechanism. With LD_PRELOAD, this may be very easy. However, I am having trouble with one of it's daemon processes (a port map deamon

Re: [Scst-devel] [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Chris Worley
On Wed, Sep 2, 2009 at 4:01 AM, Vladislav Bolkhovitinv...@vlnb.net wrote: Chris Worley, on 09/01/2009 11:24 PM wrote: On Tue, Sep 1, 2009 at 1:02 PM, Vladislav Bolkhovitinv...@vlnb.net wrote: I'd suggest you to enable lockdep on the target. Google for more details how to do it. Also you

Re: [ofa-general] [PATCH] IPoIB: check multicast address format (V2)

2009-09-02 Thread Roland Dreier
thanks for updating, applied. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general To unsubscribe, please visit http://openib.org/mailman/listinfo/openib-general

[ofa-general] Re: [PATCHv4] IB/mad: Allow tuning of QP0 and QP1 sizes

2009-09-02 Thread Roland Dreier
applied -- would be nice to have a way to do this automatically instead of yet another tunable to sysadmins to worry about, but oh well. ___ general mailing list general@lists.openfabrics.org http://lists.openfabrics.org/cgi-bin/mailman/listinfo/general

Re: [Scst-devel] [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Chris Worley
On Wed, Sep 2, 2009 at 11:27 AM, Chris Worleyworl...@gmail.com wrote: On Wed, Sep 2, 2009 at 4:01 AM, Vladislav Bolkhovitinv...@vlnb.net wrote: Chris Worley, on 09/01/2009 11:24 PM wrote: On Tue, Sep 1, 2009 at 1:02 PM, Vladislav Bolkhovitinv...@vlnb.net wrote: I'd suggest you to enable

Re: [Scst-devel] [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Bart Van Assche
On Wed, Sep 2, 2009 at 7:27 PM, Chris Worley worl...@gmail.com wrote: Then, the majority of messages were the pair: Sep  2 17:12:22 nameme kernel: [408676.552666] [0]: scst: __scst_init_cmd:3361:Too many pending commands (50) in session, returning BUSY to initiator

Re: [Scst-devel] [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Chris Worley
On Wed, Sep 2, 2009 at 12:05 PM, Bart Van Asschebart.vanass...@gmail.com wrote: On Wed, Sep 2, 2009 at 7:27 PM, Chris Worley worl...@gmail.com wrote: Then, the majority of messages were the pair: Sep  2 17:12:22 nameme kernel: [408676.552666] [0]: scst: __scst_init_cmd:3361:Too many pending

Re: [Scst-devel] [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Chris Worley
On Wed, Sep 2, 2009 at 12:17 PM, Chris Worleyworl...@gmail.com wrote: On Wed, Sep 2, 2009 at 12:05 PM, Bart Van Asschebart.vanass...@gmail.com wrote: On Wed, Sep 2, 2009 at 7:27 PM, Chris Worley worl...@gmail.com wrote: Then, the majority of messages were the pair: Sep  2 17:12:22 nameme

Re: [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Bart Van Assche
On Tue, Sep 1, 2009 at 1:04 AM, Chris Worleyworl...@gmail.com wrote: [ ... ] I've found a good kernel/scst mix to easily repeat this; I can get it to repeatedly hang w/ 8K block transfers running Ubuntu 9.04 w/ the 2.6.27-14-server kernel on _both_ target and initiator (i.e. no WinOF or OFED

Re: [ofa-general] Bad interaction between ofed, NFS and Gaussian

2009-09-02 Thread Tziporet Koren
BOYRIE Fabrice wrote: Hello Hoping I'm in the good mailing list. I've a problem with ofed 1.4.2 on Centos 5.3. We have a new cluster with QDR infiniband. I've installed ofed from source using the install.pl script with the default values. I've used default kernel from Centos

Re: [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Chris Worley
On Wed, Sep 2, 2009 at 1:31 PM, Bart Van Asschebart.vanass...@gmail.com wrote: On Tue, Sep 1, 2009 at 1:04 AM, Chris Worleyworl...@gmail.com wrote: [ ... ] I've found a good kernel/scst mix to easily repeat this; I can get it to repeatedly hang w/ 8K block transfers running Ubuntu 9.04 w/ the

[ofa-general] performance to call ibv_poll_cq() vs. call select() on completion channel

2009-09-02 Thread Tang, Changqing
Roland or Mellanox Engineers: We setup completion channel for a completion queue. We want to check if there is any event available, and suppose there is NO event on both completion channel and completion queue. What we can do is: 1. call select() on completion channel

Re: [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Bart Van Assche
On Wed, Sep 2, 2009 at 9:53 PM, Chris Worleyworl...@gmail.com wrote: On Wed, Sep 2, 2009 at 1:31 PM, Bart Van Asschebart.vanass...@gmail.com wrote: On Tue, Sep 1, 2009 at 1:04 AM, Chris Worleyworl...@gmail.com wrote: [ ... ] I've found a good kernel/scst mix to easily repeat this; I can get

Re: [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Chris Worley
On Wed, Sep 2, 2009 at 1:53 PM, Chris Worleyworl...@gmail.com wrote: On Wed, Sep 2, 2009 at 1:31 PM, Bart Van Asschebart.vanass...@gmail.com wrote: On Tue, Sep 1, 2009 at 1:04 AM, Chris Worleyworl...@gmail.com wrote: [ ... ] I've found a good kernel/scst mix to easily repeat this; I can get

Re: [ofa-general] performance to call ibv_poll_cq() vs. call select() on completion channel

2009-09-02 Thread Tziporet Koren
Tang, Changqing wrote: Roland or Mellanox Engineers: We setup completion channel for a completion queue. We want to check if there is any event available, and suppose there is NO event on both completion channel and completion queue. What we can do is: 1. call

Re: [ofa-general] Bad interaction between ofed, NFS and Gaussian

2009-09-02 Thread Jeff Becker
Tziporet Koren wrote: BOYRIE Fabrice wrote: Hello Hoping I'm in the good mailing list. I've a problem with ofed 1.4.2 on Centos 5.3. Salut Fabrice! Does it also happen with OFED 1.5 alpha? Thanks. -jeff We have a new cluster with QDR infiniband. I've installed ofed from

RE: [ofa-general] performance to call ibv_poll_cq() vs. call select() on completion channel

2009-09-02 Thread Tang, Changqing
1. call select() on completion channel with zero timeout and return 0. 2. call ibv_poll_cq() directly and return 0. Question: Which way has lower overhead ? ibv_poll_cq We know select() has to switch to kernel mode, does ibv_poll_cq() switch to kernel mode

RE: [ofa-general] performance to call ibv_poll_cq() vs. call select() on completion channel

2009-09-02 Thread Sean Hefty
But I just check the source code, ibv_poll_cq() is actually ibv_cmd_poll_cq(), and ibv_cmd_poll_cq() calls write() system call on the IB device. Doesn't this write() system call switch to kernel mode and possiblely casuse a context switch ? See verbs.h: static inline int ibv_poll_cq(struct

RE: [ofa-general] performance to call ibv_poll_cq() vs. call select() on completion channel

2009-09-02 Thread Tang, Changqing
Sean: I understand that ops.poll_cq is actually ibv_cmd_poll_cq(), right ? Do you mean during ibv_poll_cq() call, there is no system call involved ? --CQ -Original Message- From: Sean Hefty [mailto:sean.he...@intel.com] Sent: Wednesday, September 02, 2009 5:25 PM

Re: [ofa-general] performance to call ibv_poll_cq() vs. call select() on completion channel

2009-09-02 Thread Roland Dreier
I understand that ops.poll_cq is actually ibv_cmd_poll_cq(), right ? No, not for most devices. Look at libmthca, etc to see what the poll_cq method is set to. Do you mean during ibv_poll_cq() call, there is no system call involved ? Right, for most devices poll_cq can be done

[ofa-general] InfiniBand/RDMA merge plans for 2.6.32

2009-09-02 Thread Roland Dreier
Since 2.6.31-rc8 has been out more than a week already, it's probably a good time to talk about 2.6.32 merge plans. All the pending things that I'm aware of are listed below. Boilerplate: If something isn't already in my tree and it isn't listed below, I probably missed it or dropped it

RE: [ofa-general] performance to call ibv_poll_cq() vs. call select() on completion channel

2009-09-02 Thread Tang, Changqing
I did not understand the relation between ops.poll_cq() and ibv_cmd_poll_cq() correctly. It is clear now. Thank you. --CQ -Original Message- From: Roland Dreier [mailto:rdre...@cisco.com] Sent: Wednesday, September 02, 2009 6:09 PM To: Tang, Changqing Cc: Sean Hefty;

Re: [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Chris Worley
On Wed, Sep 2, 2009 at 2:58 PM, Chris Worleyworl...@gmail.com wrote: On Wed, Sep 2, 2009 at 2:00 PM, Bart Van Asschebart.vanass...@gmail.com wrote: On Wed, Sep 2, 2009 at 9:53 PM, Chris Worleyworl...@gmail.com wrote: On Wed, Sep 2, 2009 at 1:31 PM, Bart Van Asschebart.vanass...@gmail.com

Re: [ofa-general] WinOF_2_0_5/SRP initiator: slow reads and eventually hangs

2009-09-02 Thread Bart Van Assche
On Thu, Sep 3, 2009 at 6:08 AM, Chris Worleyworl...@gmail.com wrote: In trying to build a kernel w/ lockdep support as Vlad requested, my lack of Debian knowledge shone through, and, although I believe I followed all the instructions correctly, I'm not sure if I have a 2.6.28-15 or 2.6.28-10