Re: [OMPI devel] InfiniBand timeout errors

2007-07-17 Thread Pavel Shamis (Pasha)
Hi, Try increasing the IB timeout parameter: --mca btl_mvapi_ib_timeout 14 If 14 does not work, try increasing it a little more (16). Thanks, Pasha Neil Ludban wrote: Hi, I'm getting the errors below when calling MPI_Alltoallv() as part of a matrix transpose operation. It's 100% repeata
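
For a sense of what the suggested values 14 and 16 mean, here is a rough sketch. It assumes btl_mvapi_ib_timeout is passed through as the InfiniBand local ACK timeout exponent, where the wait is 4.096 microseconds * 2^value; that mapping is an assumption about the mvapi BTL and is not stated in the message above. The function name ib_ack_timeout_seconds is hypothetical.

```python
def ib_ack_timeout_seconds(exponent: int) -> float:
    """Convert an InfiniBand ACK-timeout exponent to seconds.

    Assumes the InfiniBand convention: timeout = 4.096 us * 2**exponent.
    This sketch only handles exponents 1..31 (the field is 5 bits wide).
    """
    if not 1 <= exponent <= 31:
        raise ValueError("exponent must be in the range 1..31")
    return 4.096e-6 * (2 ** exponent)


if __name__ == "__main__":
    # Values suggested in the thread: 14 first, then 16.
    for value in (14, 16):
        print(f"timeout {value}: {ib_ack_timeout_seconds(value) * 1000:.1f} ms")
```

Under that assumption, each step of 1 doubles the wait, so moving from 14 to 16 gives the fabric four times as long to acknowledge a packet before a retry.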

Re: [OMPI devel] devel Digest, Vol 801, Issue 1

2007-07-17 Thread Jeff Squyres
On Jul 16, 2007, at 2:28 PM, Matthew Moskewicz wrote: MPI-2 does support the MPI_COMM_JOIN and MPI_COMM_ACCEPT/MPI_COMM_CONNECT models. We do support this in Open MPI, but the restrictions (in terms of ORTE) may not be sufficient for you. perhaps i'll experiment -- any clues as to what the o

Re: [OMPI devel] devel Digest, Vol 801, Issue 1

2007-07-17 Thread Ralph Castain
On 7/17/07 5:37 AM, "Jeff Squyres" wrote: > On Jul 16, 2007, at 2:28 PM, Matthew Moskewicz wrote: > >>> MPI-2 does support the MPI_COMM_JOIN and MPI_COMM_ACCEPT/ >>> MPI_COMM_CONNECT models. We do support this in Open MPI, but the >>> restrictions (in terms of ORTE) may not be sufficient for

[OMPI devel] Call for OMPI Binary Distributions

2007-07-17 Thread Terry D. Dontje
This announcement is to request links to Binary Distributions of Open MPI that our community may have on the web for users to download. We'd like to take those links and post them on our download page to make it easier for those who are interested in getting binaries to install and not the so

Re: [OMPI devel] Call for OMPI Binary Distributions

2007-07-17 Thread Jeff Squyres
Cisco distributes binaries for the OpenFabrics Enterprise Distribution (OFED), to include Open MPI. 1. Cisco OFED v1.1 includes Open MPI v1.1.2 2. Cisco OFED v1.1 3. http://www.cisco.com/cgi-bin/tablebuild.pl/sfs-linux (free registration required) 4. 14 Dec 2006 Cisco OFED v1.2 will be post

Re: [OMPI devel] devel Digest, Vol 801, Issue 1

2007-07-17 Thread Rolf vandeVaart
Ralph Castain wrote: On 7/17/07 5:37 AM, "Jeff Squyres" wrote: On Jul 16, 2007, at 2:28 PM, Matthew Moskewicz wrote: MPI-2 does support the MPI_COMM_JOIN and MPI_COMM_ACCEPT/MPI_COMM_CONNECT models. We do support this in Open MPI, but the restrictions (in terms of ORTE) may not be

Re: [OMPI devel] Call for OMPI Binary Distributions

2007-07-17 Thread Manuel Prinz
Hi Terry, Am Dienstag, den 17.07.2007, 08:54 -0400 schrieb Terry D. Dontje: > This announcement is to request links to Binary Distributions of Open > MPI that our community may have on the web for users to download. [...] > What we need is the following: I'm member of a group that maintains Ope

Re: [OMPI devel] devel Digest, Vol 801, Issue 1

2007-07-17 Thread Ralph Castain
I believe that the only problem with that procedure is that it automatically connects the new application with *all* pre-existing applications. There is no discrimination possible as your client doesn't know the server's jobid, nor is there any way for it to "discover" that information. So this is

[OMPI devel] OMPI SVN commit mails

2007-07-17 Thread Jeff Squyres
There was a configuration issue this morning at IU that prevented a few SVN commit e-mails from going out. For example, I committed the minutes of the teleconference this morning and the mail didn't go out. The configuration problem has since been fixed; commit mails should be flowing properl

[OMPI devel] Hostfile changes

2007-07-17 Thread Ralph Castain
Yo all As part of the ORTE discussions that took place today, it was mentioned that we would be modifying the hostfile behavior to support several user requests and to resolve problems when mixing resource manager-provided allocations and hostfiles. This has been an on-going discussion for some ti

[OMPI devel] FW: Major reduction in ORTE

2007-07-17 Thread Ralph Castain
Yo all This was discussed at some length in the weekly core developers telecon - with the discussion continued in a dedicated follow-on telecon later this afternoon. The consensus of those participating in the dedicated telecon was that this plan should be followed, and that the proposed cellid co

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r15474

2007-07-17 Thread George Bosilca
If you really want to have a fully featured barrier, why not use the collective barrier? This double ring barrier has really bad performance, and it will become a real scalability issue. Or do we really need to force this particular connection shape (left & right)? george. Modified

Re: [OMPI devel] Fwd: lsf support / farm use models

2007-07-17 Thread Bill McMillan
> there appear to be some overlaps between the ls_* and lsb_* functions, > but they seem basically compatible as far as i can tell. almost all > the functions have a command line version as well, for example: > lsb_submit()/bsub Like openmpi and orte, there are two layers in LSF. The ls_*

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r15474

2007-07-17 Thread Brian W. Barrett
So first, there's an error in the patch (e-mail with details coming shortly, as there are many errors in the patch). There's no need for both isends (the new one and the one in there already). Second, this is in code that's a crutch around the real issue, which is that for a very small class of a