Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-18 Thread Dave Love
"Jeff Squyres (jsquyres)" writes: > On Aug 16, 2016, at 3:07 PM, Reuti wrote: >> >> Thx a bunch - that was it. Despite searching for a solution I found >> only hints that didn't solve the issue. > > FWIW, we talk about this in the HACKING file,

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-17 Thread Orion Poplawski
On 08/12/2016 02:59 PM, r...@open-mpi.org wrote: > >> On Aug 12, 2016, at 1:48 PM, Reuti > > wrote: >> >> >> Am 12.08.2016 um 21:44 schrieb r...@open-mpi.org : >> >>> Don’t know about the toolchain issue - I

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-16 Thread Jeff Squyres (jsquyres)
On Aug 16, 2016, at 3:07 PM, Reuti wrote: > > Thx a bunch - that was it. Despite searching for a solution I found only > hints that didn't solve the issue. FWIW, we talk about this in the HACKING file, but I admit that's not necessarily the easiest place to find:

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-16 Thread Reuti
Am 16.08.2016 um 13:26 schrieb Jeff Squyres (jsquyres): > On Aug 12, 2016, at 2:15 PM, Reuti wrote: >> >> I updated my tools to: >> >> autoconf-2.69 >> automake-1.15 >> libtool-2.4.6 >> >> but I face with Open MPI's ./autogen.pl: >> >> configure.ac:152: error:

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-16 Thread Jeff Squyres (jsquyres)
On Aug 12, 2016, at 2:15 PM, Reuti wrote: > > I updated my tools to: > > autoconf-2.69 > automake-1.15 > libtool-2.4.6 > > but I face with Open MPI's ./autogen.pl: > > configure.ac:152: error: possibly undefined macro: AC_PROG_LIBTOOL > > I recall seeing in

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-12 Thread r...@open-mpi.org
> On Aug 12, 2016, at 1:48 PM, Reuti wrote: > > > Am 12.08.2016 um 21:44 schrieb r...@open-mpi.org : > >> Don’t know about the toolchain issue - I use those same versions, and don’t >> have a problem. I’m on CentOS-7, so that might be the

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-12 Thread Reuti
Am 12.08.2016 um 21:44 schrieb r...@open-mpi.org: > Don’t know about the toolchain issue - I use those same versions, and don’t > have a problem. I’m on CentOS-7, so that might be the difference? > > Anyway, I found the missing code to assemble the cmd line for qrsh - not sure > how/why it

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-12 Thread r...@open-mpi.org
Don’t know about the toolchain issue - I use those same versions, and don’t have a problem. I’m on CentOS-7, so that might be the difference? Anyway, I found the missing code to assemble the cmd line for qrsh - not sure how/why it got deleted. https://github.com/open-mpi/ompi/pull/1960 > On

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-12 Thread Reuti
> Am 12.08.2016 um 16:52 schrieb r...@open-mpi.org: > > IIRC, the rationale behind adding the check was that someone using SGE wanted > to specify a custom launch agent, and we were overriding it with qrsh. > However, the check is incorrect as that MCA param cannot be NULL. > > I have updated

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-12 Thread Reuti
Am 12.08.2016 um 16:52 schrieb r...@open-mpi.org: > IIRC, the rationale behind adding the check was that someone using SGE wanted > to specify a custom launch agent, and we were overriding it with qrsh. > However, the check is incorrect as that MCA param cannot be NULL. > > I have updated

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-12 Thread r...@open-mpi.org
IIRC, the rationale behind adding the check was that someone using SGE wanted to specify a custom launch agent, and we were overriding it with qrsh. However, the check is incorrect as that MCA param cannot be NULL. I have updated this on master - can you see if this fixes the problem for you?

Re: [OMPI users] SGE integration broken in 2.0.0

2016-08-11 Thread Reuti
> Am 11.08.2016 um 13:28 schrieb Reuti : > > Hi, > > In the file orte/mca/plm/rsh/plm_rsh_component I see an if-statement, which > seems to prevent the tight integration with SGE to start: > >if (NULL == mca_plm_rsh_component.agent) { > > Why is it there (it

[OMPI users] SGE integration broken in 2.0.0

2016-08-11 Thread Reuti
Hi, In the file orte/mca/plm/rsh/plm_rsh_component I see an if-statement, which seems to prevent the tight integration with SGE to start: if (NULL == mca_plm_rsh_component.agent) { Why is it there (it wasn't in 1.10.3)? If I just remove it I get: [node17:25001] [[27678,0],0] plm:rsh: