Re: [OMPI devel] [OMPI svn] svn:open-mpi r24903

2011-07-14 Thread Eugene Loh
Thanks for the clarification. My myopic sense of the issue came out of stumbling on this behavior due to MPI_Comm_spawn_multiple failing. I think *multiple* issues caused this problem to escape notice for so long. One is that if the system thought it was oversubscribed, num_procs_alive was u

Re: [OMPI devel] [OMPI svn] svn:open-mpi r24903

2011-07-14 Thread Ralph Castain
Just to clarify, as this commit message is somewhat misleading. The nested loop problem would cause a problem whenever the system had a specified limit (that we had sensed) on the number of files a process could have open, and that number would have been violated by starting another process. It

Re: [OMPI devel] Libevent visibility problem

2011-07-14 Thread Barrett, Brian W
Looks good, thanks! Brian On 7/14/11 1:12 AM, "Ralph Castain" wrote: >Should be fixed in r24902 - let me know. > > >On Jul 12, 2011, at 4:30 PM, Barrett, Brian W wrote: > >> On 7/12/11 4:21 PM, "Ralph Castain" wrote: >> >>> On Jul 12, 2011, at 12:29 PM, Barrett, Brian W wrote: >>> On 7/

Re: [OMPI devel] orte_odls_base_default_launch_local()

2011-07-14 Thread Ralph Castain
Sorry for the delay - got tied up. This looks sane to me and should work. FWIW: I haven't seen any problem with comm_spawn_multiple. That code will only executes if the specific limit is hit, so I suspect that is an issue of scale and race conditions. On Jul 12, 2011, at 6:44 PM, Eugene Loh wr

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Jeff Squyres
On Jul 14, 2011, at 10:16 AM, Yevgeny Kliteynik wrote: > Also, besides checking if compilation works (assuming that I get > this file), I have no way checking if linkage also works... You can AC_LINK_IFELSE to see if the symbols you need are in the libraries. Er... I'm sorry, I should have not

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Yevgeny Kliteynik
On 14-Jul-11 4:38 PM, Terry Dontje wrote: > > > On 7/14/2011 9:17 AM, Yevgeny Kliteynik wrote: >> On 14-Jul-11 3:30 PM, Jeff Squyres wrote: >>> The real question is: does Solaris have the same data structures required >>> for Linux's dynamic SL support? If so, this header file inquiry is >>> w

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Terry Dontje
On 7/14/2011 9:17 AM, Yevgeny Kliteynik wrote: On 14-Jul-11 3:30 PM, Jeff Squyres wrote: The real question is: does Solaris have the same data structures required for Linux's dynamic SL support? If so, this header file inquiry is worthwhile. If not, then perhaps a separate port will be req

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Terry Dontje
On 7/14/2011 9:30 AM, Yevgeny Kliteynik wrote: On 14-Jul-11 4:21 PM, Paul H. Hargrove wrote: On 7/13/2011 11:42 PM, Yevgeny Kliteynik wrote: [adding Terry] On 14-Jul-11 2:49 AM, Eugene Loh wrote: On 7/13/2011 4:31 PM, Paul H. Hargrove wrote: On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote:

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Yevgeny Kliteynik
On 14-Jul-11 4:21 PM, Paul H. Hargrove wrote: > > > On 7/13/2011 11:42 PM, Yevgeny Kliteynik wrote: >> [adding Terry] >> >> On 14-Jul-11 2:49 AM, Eugene Loh wrote: >>> On 7/13/2011 4:31 PM, Paul H. Hargrove wrote: On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote: >>> Finally, are you sure t

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Paul H. Hargrove
On 7/13/2011 11:42 PM, Yevgeny Kliteynik wrote: [adding Terry] On 14-Jul-11 2:49 AM, Eugene Loh wrote: On 7/13/2011 4:31 PM, Paul H. Hargrove wrote: On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote: Finally, are you sure that infiniband/complib/cl_types_osd.h exists on all platforms? (e.g., S

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Yevgeny Kliteynik
On 14-Jul-11 3:30 PM, Jeff Squyres wrote: > The real question is: does Solaris have the same data structures required for > Linux's dynamic SL support? If so, this header file inquiry is worthwhile. > If not, then perhaps a separate port will be required for Solaris to support > the dynamic SL

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Jeff Squyres
The real question is: does Solaris have the same data structures required for Linux's dynamic SL support? If so, this header file inquiry is worthwhile. If not, then perhaps a separate port will be required for Solaris to support the dynamic SL functionality. On Jul 14, 2011, at 7:24 AM, Ter

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Terry Dontje
I do but my machine room's power is down so I don't have access to it right now. I will grope around once it comes up to see what it has. I also have sent email to our IB team for some direction. --td On 7/14/2011 2:42 AM, Yevgeny Kliteynik wrote: [adding Terry] On 14-Jul-11 2:49 AM, Eugen

Re: [OMPI devel] Libevent visibility problem

2011-07-14 Thread Ralph Castain
Should be fixed in r24902 - let me know. On Jul 12, 2011, at 4:30 PM, Barrett, Brian W wrote: > On 7/12/11 4:21 PM, "Ralph Castain" wrote: > >> On Jul 12, 2011, at 12:29 PM, Barrett, Brian W wrote: >> >>> On 7/11/11 4:31 PM, "Ralph Castain" wrote: >>> On Jul 11, 2011, at 2:51 PM, Barre

Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r24830

2011-07-14 Thread Yevgeny Kliteynik
[adding Terry] On 14-Jul-11 2:49 AM, Eugene Loh wrote: > On 7/13/2011 4:31 PM, Paul H. Hargrove wrote: >> On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote: > Finally, are you sure that infiniband/complib/cl_types_osd.h exists on > all platforms? (e.g., Solaris) I know you said you don't ha