Thanks for the clarification. My myopic sense of the issue came out of
stumbling on this behavior due to MPI_Comm_spawn_multiple failing.
I think *multiple* issues caused this problem to escape notice for so
long. One is that if the system thought it was oversubscribed,
num_procs_alive was u
Just to clarify, as this commit message is somewhat misleading. The nested loop
problem would cause a problem whenever the system had a specified limit (that
we had sensed) on the number of files a process could have open, and that
number would have been violated by starting another process. It
Looks good, thanks!
Brian
On 7/14/11 1:12 AM, "Ralph Castain" wrote:
>Should be fixed in r24902 - let me know.
>
>
>On Jul 12, 2011, at 4:30 PM, Barrett, Brian W wrote:
>
>> On 7/12/11 4:21 PM, "Ralph Castain" wrote:
>>
>>> On Jul 12, 2011, at 12:29 PM, Barrett, Brian W wrote:
>>>
On 7/
Sorry for the delay - got tied up. This looks sane to me and should work.
FWIW: I haven't seen any problem with comm_spawn_multiple. That code will only
executes if the specific limit is hit, so I suspect that is an issue of scale
and race conditions.
On Jul 12, 2011, at 6:44 PM, Eugene Loh wr
On Jul 14, 2011, at 10:16 AM, Yevgeny Kliteynik wrote:
> Also, besides checking if compilation works (assuming that I get
> this file), I have no way checking if linkage also works...
You can AC_LINK_IFELSE to see if the symbols you need are in the libraries.
Er... I'm sorry, I should have not
On 14-Jul-11 4:38 PM, Terry Dontje wrote:
>
>
> On 7/14/2011 9:17 AM, Yevgeny Kliteynik wrote:
>> On 14-Jul-11 3:30 PM, Jeff Squyres wrote:
>>> The real question is: does Solaris have the same data structures required
>>> for Linux's dynamic SL support? If so, this header file inquiry is
>>> w
On 7/14/2011 9:17 AM, Yevgeny Kliteynik wrote:
On 14-Jul-11 3:30 PM, Jeff Squyres wrote:
The real question is: does Solaris have the same data structures required for
Linux's dynamic SL support? If so, this header file inquiry is worthwhile. If
not, then perhaps a separate port will be req
On 7/14/2011 9:30 AM, Yevgeny Kliteynik wrote:
On 14-Jul-11 4:21 PM, Paul H. Hargrove wrote:
On 7/13/2011 11:42 PM, Yevgeny Kliteynik wrote:
[adding Terry]
On 14-Jul-11 2:49 AM, Eugene Loh wrote:
On 7/13/2011 4:31 PM, Paul H. Hargrove wrote:
On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote:
On 14-Jul-11 4:21 PM, Paul H. Hargrove wrote:
>
>
> On 7/13/2011 11:42 PM, Yevgeny Kliteynik wrote:
>> [adding Terry]
>>
>> On 14-Jul-11 2:49 AM, Eugene Loh wrote:
>>> On 7/13/2011 4:31 PM, Paul H. Hargrove wrote:
On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote:
>>> Finally, are you sure t
On 7/13/2011 11:42 PM, Yevgeny Kliteynik wrote:
[adding Terry]
On 14-Jul-11 2:49 AM, Eugene Loh wrote:
On 7/13/2011 4:31 PM, Paul H. Hargrove wrote:
On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote:
Finally, are you sure that infiniband/complib/cl_types_osd.h exists on all
platforms? (e.g., S
On 14-Jul-11 3:30 PM, Jeff Squyres wrote:
> The real question is: does Solaris have the same data structures required for
> Linux's dynamic SL support? If so, this header file inquiry is worthwhile.
> If not, then perhaps a separate port will be required for Solaris to support
> the dynamic SL
The real question is: does Solaris have the same data structures required for
Linux's dynamic SL support? If so, this header file inquiry is worthwhile. If
not, then perhaps a separate port will be required for Solaris to support the
dynamic SL functionality.
On Jul 14, 2011, at 7:24 AM, Ter
I do but my machine room's power is down so I don't have access to it
right now. I will grope around once it comes up to see what it has. I
also have sent email to our IB team for some direction.
--td
On 7/14/2011 2:42 AM, Yevgeny Kliteynik wrote:
[adding Terry]
On 14-Jul-11 2:49 AM, Eugen
Should be fixed in r24902 - let me know.
On Jul 12, 2011, at 4:30 PM, Barrett, Brian W wrote:
> On 7/12/11 4:21 PM, "Ralph Castain" wrote:
>
>> On Jul 12, 2011, at 12:29 PM, Barrett, Brian W wrote:
>>
>>> On 7/11/11 4:31 PM, "Ralph Castain" wrote:
>>>
On Jul 11, 2011, at 2:51 PM, Barre
[adding Terry]
On 14-Jul-11 2:49 AM, Eugene Loh wrote:
> On 7/13/2011 4:31 PM, Paul H. Hargrove wrote:
>> On 7/13/2011 4:20 PM, Yevgeny Kliteynik wrote:
> Finally, are you sure that infiniband/complib/cl_types_osd.h exists on
> all platforms? (e.g., Solaris) I know you said you don't ha
15 matches
Mail list logo