[OMPI devel] asm/byteorder.h needed in infiniband/cm.h

2008-09-21 Thread Jon Mason
is the patch in question. Thanks, Jon Signed-Off-By: Jon Mason <j...@opengridcomputing.com> --- /usr/include/infiniband/cm.h.orig 2008-09-21 15:36:46.0 -0700 +++ /usr/include/infiniband/cm.h2008-09-21 14:17:43.0 -0700 @@ -38,6 +38,7 @@ #include #include +#i

Re: [OMPI devel] RDMA CM CPC HG ready again

2008-10-01 Thread Jon Mason
On Wed, Oct 01, 2008 at 08:08:48AM -0400, Jeff Squyres wrote: > Per the call yesterday, I'll merge this into the trunk once I get it > working with Brad on PPC. > > Brad and I discovered a missing htonl/ntohl somewhere in the code last > night right before I had to go offline (i.e., we can see

[OMPI devel] Branch for iWARP uDAPL enablement

2007-11-29 Thread Jon Mason
I created a public branch to make available the patch which gets OPMI uDAPL to kinda work on iWARP. The branch can be found at: http://svn.open-mpi.org/svn/ompi/tmp-public/iwarp-ompi-v1.2/ The branch contains an updated version of the patch Steve Wise sent out some time ago. Below is the patch

[OMPI devel] [PATCH] openib btl: remove excess ompi_btl_openib_connect_base_open call

2007-12-05 Thread Jon Mason
There is a double call to ompi_btl_openib_connect_base_open in mca_btl_openib_mca_setup_qps(). It looks like someone just forgot to clean-up the previous call when they added the check for the return code. I ran a quick IMB test over IB to verify everything is still working. Thanks, Jon

[OMPI devel] [PATCH] openib btl: correct help message error

2007-12-10 Thread Jon Mason
Slight word usage and grammar error in the openib btl help test. I believe the change below is the intended meaning. Thanks, Jon Index: ompi/mca/btl/openib/help-mpi-btl-openib.txt === --- ompi/mca/btl/openib/help-mpi-btl-openib.txt

Re: [OMPI devel] [PATCH] openib btl: correct help message error

2007-12-10 Thread Jon Mason
ing (especially for trivial fixes like this :-) ). Sorry, I was just trying to err on the side of caution and openness. Do you have a rule of thumb for what should be sent on the list versus simply committed? Thanks, Jon > > > On Dec 10, 2007, at 4:05 PM, Jon Mason wrote: > > > Sl

Re: [OMPI devel] [PATCH] openib: clean-up connect to allow for new cm's

2007-12-12 Thread Jon Mason
Ok, glad I got this conversation started :) So, we need a slight redesign to determine the cm method (unless forced via commandline arg). This can be determined by calling all the individual open routines, and having them return a priority based on their ability to function. For example, the

[OMPI devel] [PATCH] openib btl: extensable cpc selection enablement

2008-01-09 Thread Jon Mason
The new cpc selection framework is now in place. The patch below allows for dynamic selection of cpc methods based on what is available. It also allows for inclusion/exclusions of methods. It even futher allows for modifying the priorities of certain cpc methods to better determine the optimal

Re: [OMPI devel] [PATCH] openib btl: extensable cpc selection enablement

2008-01-10 Thread Jon Mason
On Thu, Jan 10, 2008 at 11:17:48AM +0200, Pavel Shamis (Pasha) wrote: > Jon Mason wrote: > > The new cpc selection framework is now in place. The patch below allows > > for dynamic selection of cpc methods based on what is available. It > > also allows for inclusion/

[OMPI devel] Setting CQ depth

2008-02-25 Thread Jon Mason
A quick sanity check. When setting the cq depth in the openib btl, it checks the calculated depth against the maxmium cq depth allowed and sets the minimum of those two. However, I think it is checking the wrong variable. If I understand correctly, ib_dev_attr.max_cq represents the maximum

[OMPI devel] OMPI OpenIB Credit Schema breaks Chelsio HW

2008-03-09 Thread Jon Mason
After discussing this issue with Jeff via private e-mails. I would like to open the issue to the group for futher discussion. Issue (as described by Steve Wise): Currently OMPI uses qp 0 for all credit updates (by design). This breaks when running over the chelsio rnic due to a race condition

[OMPI devel] Ssh tunnelling broken in trunk?

2008-04-02 Thread Jon Mason
I am noticing that ssh seems to be broken on trunk (and my cpc branch, as it is based on trunk). When I try to use xterm and gdb to debug, I only successfully get 1 xterm. I have tried this on 2 different setups. I can successfully get the xterm's on the 1.2 svn branch. I am running the

Re: [OMPI devel] Ssh tunnelling broken in trunk?

2008-04-02 Thread Jon Mason
I regressed my tree and it looks like it happened between 17590:17917 On Wednesday 02 April 2008 10:22:52 am Jon Mason wrote: > I am noticing that ssh seems to be broken on trunk (and my cpc branch, as > it is based on trunk). When I try to use xterm and gdb to debug, I only > successfu

Re: [OMPI devel] Ssh tunnelling broken in trunk?

2008-04-02 Thread Jon Mason
nch, it happened at r17920. Thanks, Jon > On Apr 2, 2008, at 11:59 AM, Jon Mason wrote: > > I regressed my tree and it looks like it happened between 17590:17917 > > > > On Wednesday 02 April 2008 10:22:52 am Jon Mason wrote: > >> I am noticing that ssh seems to be broken

Re: [OMPI devel] Ssh tunnelling broken in trunk?

2008-04-02 Thread Jon Mason
... > > > >george. > > > > On Apr 2, 2008, at 12:14 PM, Jon Mason wrote: > >> On Wednesday 02 April 2008 11:07:18 am Jeff Squyres wrote: > >>> Are these r numbers relevant on the /tmp-public branch, or the trunk? > >> > >> I pull

Re: [OMPI devel] Ssh tunnelling broken in trunk?

2008-04-02 Thread Jon Mason
On Wednesday 02 April 2008 01:21:31 pm Jon Mason wrote: > On Wednesday 02 April 2008 11:54:50 am Ralph H Castain wrote: > > I remember that someone had found a bug that caused orte_debug_flag to not > > get properly set (local var covering over a global one) - could be that >

Re: [OMPI devel] Ssh tunnelling broken in trunk?

2008-04-03 Thread Jon Mason
I've tried it, it works fine. I'll double check and do a completely fresh svn pull and install and see where that gets me. Thanks for the help, Jon > Ralph > > On 4/2/08 5:41 PM, "Jon Mason" <j...@opengridcomputing.com> wrote: > > On Wednesday 02 April 2008

[OMPI devel] Flush CQ error on iWARP/Out-of-sync shutdown

2008-05-05 Thread Jon Mason
I am seeing some unusual behavior during the shutdown phase of ompi at the end of my testcase. While running a IMB pingpong test over the rdmacm on openib, I get cq flush errors on my iWARP adapters. This error is happening because the remote node is still polling the endpoint while the other

Re: [OMPI devel] NO IP address found

2008-05-06 Thread Jon Mason
On Tuesday 06 May 2008 09:41:53 am Jeff Squyres wrote: > I actually don't know what the RDMA CM requires for the LMC>0 case -- > does it require a unique IP address for every LID? It requires a unique IP address for every hca/port in use by rdmacm. I see the bug in rdmacm (since I don't

Re: [OMPI devel] btl_openib_iwarp.c : making platform specific calls

2008-05-13 Thread Jon Mason
On Tuesday 13 May 2008 09:56:13 am Don Kerr wrote: > I believe btl_open_iwarp.c is making platform specific calls. I don't Silly question, but I thought openib ONLY worked in Linux. If this is not the case, then this whole chunk of code will have to be redesigned (if it can even be done at

Re: [OMPI devel] Threaded progress for CPCs

2008-05-19 Thread Jon Mason
On Mon, May 19, 2008 at 05:17:57PM +0300, Gleb Natapov wrote: > On Mon, May 19, 2008 at 05:08:17PM +0300, Pavel Shamis (Pasha) wrote: > > >> 5. ...? > > >> > > > What about moving posting of receive buffers into main thread. With > > > SRQ it is easy: don't post anything in CPC thread. Main

Re: [OMPI devel] Threaded progress for CPCs

2008-05-19 Thread Jon Mason
On Mon, May 19, 2008 at 01:38:53PM -0400, Jeff Squyres wrote: > On May 19, 2008, at 8:25 AM, Gleb Natapov wrote: > > > Is it possible to have sane SRQ implementation without HW flow > > control? > > It seems pretty unlikely if the only available HW flow control is to > terminate the

Re: [OMPI devel] Threaded progress for CPCs

2008-05-19 Thread Jon Mason
On Mon, May 19, 2008 at 10:12:19PM +0300, Gleb Natapov wrote: > On Mon, May 19, 2008 at 01:52:22PM -0500, Jon Mason wrote: > > On Mon, May 19, 2008 at 05:17:57PM +0300, Gleb Natapov wrote: > > > On Mon, May 19, 2008 at 05:08:17PM +0300, Pavel Shamis (Pasha) w

Re: [OMPI devel] get_iwarp_subnet_id in openib btl

2008-05-20 Thread Jon Mason
On Tue, May 20, 2008 at 02:48:49PM -0400, Pak Lui wrote: > Hi, > > I am not familiar with get_iwarp_subnet_id and I am not sure why it is > causing trunk to barf. I think I am using ofed 1.2.5. See attached for That is in the 1.3 tree, not 1.2. There was a bug in Solaris that was fixed

[OMPI devel] Documentation update

2008-05-22 Thread Jon Mason
I know this is slightly premature, but if someone could update the link below to reflect that iWARP is now supported in OMPI it would be much appreciated. http://www.open-mpi.org/faq/?category=openfabrics#iwarp-support Thanks, Jon