Re: [OMPI users] openmpi/slurm/pmix

2018-04-25 Thread r...@open-mpi.org


> On Apr 25, 2018, at 8:16 AM, Michael Di Domenico  
> wrote:
> 
> On Mon, Apr 23, 2018 at 6:07 PM, r...@open-mpi.org  wrote:
>> Looks like the problem is that you didn’t wind up with the external PMIx. 
>> The component listed in your error is the internal PMIx one which shouldn’t 
>> have built given that configure line.
>> 
>> Check your config.out and see what happened. Also, ensure that your 
>> LD_LIBRARY_PATH is properly pointing to the installation, and that you built 
>> into a “clean” prefix.
> 
> the "clean prefix" part seemed to fix my issue.  i'm not exactly sure
> i understand why/how though.  i recompiled pmix and removed the old
> installation before doing a make install

When you build, we don’t automatically purge the prefix location of any prior 
libraries. Thus, the old install of the internal PMIx library was still 
present. It has a higher priority than the external components, and so it was 
being picked up and used.

Starting clean removed it, leaving the external component to be selected.

> 
> when i recompiled openmpi it seems to have figured itself out
> 
> i think things are still a little wonky, but at least that issue is gone
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] openmpi/slurm/pmix

2018-04-25 Thread Michael Di Domenico
On Mon, Apr 23, 2018 at 6:07 PM, r...@open-mpi.org  wrote:
> Looks like the problem is that you didn’t wind up with the external PMIx. The 
> component listed in your error is the internal PMIx one which shouldn’t have 
> built given that configure line.
>
> Check your config.out and see what happened. Also, ensure that your 
> LD_LIBRARY_PATH is properly pointing to the installation, and that you built 
> into a “clean” prefix.

the "clean prefix" part seemed to fix my issue.  i'm not exactly sure
i understand why/how though.  i recompiled pmix and removed the old
installation before doing a make install

when i recompiled openmpi it seems to have figured itself out

i think things are still a little wonky, but at least that issue is gone
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] openmpi/slurm/pmix

2018-04-24 Thread Charles A Taylor
Hi Gilles,

Yes, I did.  It was ignored AFAICT.I did not look for the reason - only so 
many hours in the day.  

Regards,

Charlie


> On Apr 24, 2018, at 8:07 AM,   wrote:
> 
> Charles,
> 
> have you tried to configure --with-pmix-libdir=/.../lib64 ?
> 
> Cheers,
> 
> Gilles
> 
> - Original Message -
>> I´ll add that when building OpenMPI 3.0.0 with an external PMIx, I 
> found that the OpenMPI configure script only looks in “lib” for the the 
> pmix library but the pmix configure/build uses “lib64” (as it should on 
> a 64-bit system) so the configure script falls back to the internal PMIx.
>  As Robert suggested, check your config.log for “not found” messages.  
>> 
>> In my case, I simply added a “lib -> lib64” symlink in the PMIx 
> installation directory rather than alter the configure script and that 
> did the trick.
>> 
>> Good luck,
>> 
>> Charlie
>> 
>>> On Apr 23, 2018, at 6:07 PM, r...@open-mpi.org wrote:
>>> 
>>> Hi Michael
>>> 
>>> Looks like the problem is that you didn´t wind up with the external 
> PMIx. The component listed in your error is the internal PMIx one which 
> shouldn´t have built given that configure line.
>>> 
>>> Check your config.out and see what happened. Also, ensure that your 
> LD_LIBRARY_PATH is properly pointing to the installation, and that you 
> built into a “clean” prefix.
>>> 
>>> 
 On Apr 23, 2018, at 12:01 PM, Michael Di Domenico  gmail.com> wrote:
 
 i'm trying to get slurm 17.11.5 and openmpi 3.0.1 working with pmix.
 
 everything compiled, but when i run something it get
 
 : symbol lookup error: /openmpi/mca_pmix_pmix2x.so: undefined 
> symbol:
 opal_libevent2022_evthread_use_pthreads
 
 i more then sure i did something wrong, but i'm not sure what, here
> 's what i did
 
 compile libevent 2.1.8
 
 ./configure --prefix=/libevent-2.1.8
 
 compile pmix 2.1.0
 
 ./configure --prefix=/pmix-2.1.0 --with-psm2
 --with-munge=/munge-0.5.13 --with-libevent=/libevent-2.1.8
 
 compile openmpi
 
 ./configure --prefix=/openmpi-3.0.1 --with-slurm=/slurm-17.11.5
 --with-hwloc=external --with-mxm=/opt/mellanox/mxm
 --with-cuda=/usr/local/cuda --with-pmix=/pmix-2.1.0
 --with-libevent=/libevent-2.1.8
 
 when i look at the symbols in the mca_pmix_pmix2x.so library the
 function is indeed undefined (U) in the output, but checking ldd
 against the library doesn't show any missing
 
 any thoughts?
 ___
 users mailing list
 users@lists.open-mpi.org
 https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIGaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=XE6hInyZVJ5VMrO5vdTEKEw3pZBBVnLE7U8Nm67zj2M&s=_sgJVrkRzlv7dIYMvtMfj26AJdbH-fcOOarmN7PyJCI&e=
> 
>>> 
>>> ___
>>> users mailing list
>>> users@lists.open-mpi.org
>>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIGaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=XE6hInyZVJ5VMrO5vdTEKEw3pZBBVnLE7U8Nm67zj2M&s=_sgJVrkRzlv7dIYMvtMfj26AJdbH-fcOOarmN7PyJCI&e=
> 
>> 
>> ___
>> users mailing list
>> users@lists.open-mpi.org
>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIFag&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=0XUVnlQfzGhlRDSBAm8nGvZt27jITo3r1oX9_vg639w&s=ErD6RckR-Uvdpj4CTtNvT9iZck285Vdf6sgYskQ_Z-k&e=
> ___
> users mailing list
> users@lists.open-mpi.org
> https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIFag&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=0XUVnlQfzGhlRDSBAm8nGvZt27jITo3r1oX9_vg639w&s=ErD6RckR-Uvdpj4CTtNvT9iZck285Vdf6sgYskQ_Z-k&e=

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] openmpi/slurm/pmix

2018-04-24 Thread gilles
Charles,

have you tried to configure --with-pmix-libdir=/.../lib64 ?

Cheers,

Gilles

- Original Message -
> I’ll add that when building OpenMPI 3.0.0 with an external PMIx, I 
found that the OpenMPI configure script only looks in “lib” for the the 
pmix library but the pmix configure/build uses “lib64” (as it should on 
a 64-bit system) so the configure script falls back to the internal PMIx.
  As Robert suggested, check your config.log for “not found” messages.  
> 
> In my case, I simply added a “lib -> lib64” symlink in the PMIx 
installation directory rather than alter the configure script and that 
did the trick.
> 
> Good luck,
> 
> Charlie
> 
> > On Apr 23, 2018, at 6:07 PM, r...@open-mpi.org wrote:
> > 
> > Hi Michael
> > 
> > Looks like the problem is that you didn’t wind up with the external 
PMIx. The component listed in your error is the internal PMIx one which 
shouldn’t have built given that configure line.
> > 
> > Check your config.out and see what happened. Also, ensure that your 
LD_LIBRARY_PATH is properly pointing to the installation, and that you 
built into a “clean” prefix.
> > 
> > 
> >> On Apr 23, 2018, at 12:01 PM, Michael Di Domenico  wrote:
> >> 
> >> i'm trying to get slurm 17.11.5 and openmpi 3.0.1 working with pmix.
> >> 
> >> everything compiled, but when i run something it get
> >> 
> >> : symbol lookup error: /openmpi/mca_pmix_pmix2x.so: undefined 
symbol:
> >> opal_libevent2022_evthread_use_pthreads
> >> 
> >> i more then sure i did something wrong, but i'm not sure what, here
's what i did
> >> 
> >> compile libevent 2.1.8
> >> 
> >> ./configure --prefix=/libevent-2.1.8
> >> 
> >> compile pmix 2.1.0
> >> 
> >> ./configure --prefix=/pmix-2.1.0 --with-psm2
> >> --with-munge=/munge-0.5.13 --with-libevent=/libevent-2.1.8
> >> 
> >> compile openmpi
> >> 
> >> ./configure --prefix=/openmpi-3.0.1 --with-slurm=/slurm-17.11.5
> >> --with-hwloc=external --with-mxm=/opt/mellanox/mxm
> >> --with-cuda=/usr/local/cuda --with-pmix=/pmix-2.1.0
> >> --with-libevent=/libevent-2.1.8
> >> 
> >> when i look at the symbols in the mca_pmix_pmix2x.so library the
> >> function is indeed undefined (U) in the output, but checking ldd
> >> against the library doesn't show any missing
> >> 
> >> any thoughts?
> >> ___
> >> users mailing list
> >> users@lists.open-mpi.org
> >> https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIGaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=XE6hInyZVJ5VMrO5vdTEKEw3pZBBVnLE7U8Nm67zj2M&s=_sgJVrkRzlv7dIYMvtMfj26AJdbH-fcOOarmN7PyJCI&e=

> > 
> > ___
> > users mailing list
> > users@lists.open-mpi.org
> > https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIGaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=XE6hInyZVJ5VMrO5vdTEKEw3pZBBVnLE7U8Nm67zj2M&s=_sgJVrkRzlv7dIYMvtMfj26AJdbH-fcOOarmN7PyJCI&e=

> 
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users


Re: [OMPI users] openmpi/slurm/pmix

2018-04-24 Thread Charles A Taylor
I’ll add that when building OpenMPI 3.0.0 with an external PMIx, I found that 
the OpenMPI configure script only looks in “lib” for the the pmix library but 
the pmix configure/build uses “lib64” (as it should on a 64-bit system) so the 
configure script falls back to the internal PMIx.  As Robert suggested, check 
your config.log for “not found” messages.  

In my case, I simply added a “lib -> lib64” symlink in the PMIx installation 
directory rather than alter the configure script and that did the trick.

Good luck,

Charlie

> On Apr 23, 2018, at 6:07 PM, r...@open-mpi.org wrote:
> 
> Hi Michael
> 
> Looks like the problem is that you didn’t wind up with the external PMIx. The 
> component listed in your error is the internal PMIx one which shouldn’t have 
> built given that configure line.
> 
> Check your config.out and see what happened. Also, ensure that your 
> LD_LIBRARY_PATH is properly pointing to the installation, and that you built 
> into a “clean” prefix.
> 
> 
>> On Apr 23, 2018, at 12:01 PM, Michael Di Domenico  
>> wrote:
>> 
>> i'm trying to get slurm 17.11.5 and openmpi 3.0.1 working with pmix.
>> 
>> everything compiled, but when i run something it get
>> 
>> : symbol lookup error: /openmpi/mca_pmix_pmix2x.so: undefined symbol:
>> opal_libevent2022_evthread_use_pthreads
>> 
>> i more then sure i did something wrong, but i'm not sure what, here's what i 
>> did
>> 
>> compile libevent 2.1.8
>> 
>> ./configure --prefix=/libevent-2.1.8
>> 
>> compile pmix 2.1.0
>> 
>> ./configure --prefix=/pmix-2.1.0 --with-psm2
>> --with-munge=/munge-0.5.13 --with-libevent=/libevent-2.1.8
>> 
>> compile openmpi
>> 
>> ./configure --prefix=/openmpi-3.0.1 --with-slurm=/slurm-17.11.5
>> --with-hwloc=external --with-mxm=/opt/mellanox/mxm
>> --with-cuda=/usr/local/cuda --with-pmix=/pmix-2.1.0
>> --with-libevent=/libevent-2.1.8
>> 
>> when i look at the symbols in the mca_pmix_pmix2x.so library the
>> function is indeed undefined (U) in the output, but checking ldd
>> against the library doesn't show any missing
>> 
>> any thoughts?
>> ___
>> users mailing list
>> users@lists.open-mpi.org
>> https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIGaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=XE6hInyZVJ5VMrO5vdTEKEw3pZBBVnLE7U8Nm67zj2M&s=_sgJVrkRzlv7dIYMvtMfj26AJdbH-fcOOarmN7PyJCI&e=
> 
> ___
> users mailing list
> users@lists.open-mpi.org
> https://urldefense.proofpoint.com/v2/url?u=https-3A__lists.open-2Dmpi.org_mailman_listinfo_users&d=DwIGaQ&c=pZJPUDQ3SB9JplYbifm4nt2lEVG5pWx2KikqINpWlZM&r=HOtXciFqK5GlgIgLAxthUQ&m=XE6hInyZVJ5VMrO5vdTEKEw3pZBBVnLE7U8Nm67zj2M&s=_sgJVrkRzlv7dIYMvtMfj26AJdbH-fcOOarmN7PyJCI&e=

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] openmpi/slurm/pmix

2018-04-23 Thread r...@open-mpi.org
Hi Michael

Looks like the problem is that you didn’t wind up with the external PMIx. The 
component listed in your error is the internal PMIx one which shouldn’t have 
built given that configure line.

Check your config.out and see what happened. Also, ensure that your 
LD_LIBRARY_PATH is properly pointing to the installation, and that you built 
into a “clean” prefix.


> On Apr 23, 2018, at 12:01 PM, Michael Di Domenico  
> wrote:
> 
> i'm trying to get slurm 17.11.5 and openmpi 3.0.1 working with pmix.
> 
> everything compiled, but when i run something it get
> 
> : symbol lookup error: /openmpi/mca_pmix_pmix2x.so: undefined symbol:
> opal_libevent2022_evthread_use_pthreads
> 
> i more then sure i did something wrong, but i'm not sure what, here's what i 
> did
> 
> compile libevent 2.1.8
> 
> ./configure --prefix=/libevent-2.1.8
> 
> compile pmix 2.1.0
> 
> ./configure --prefix=/pmix-2.1.0 --with-psm2
> --with-munge=/munge-0.5.13 --with-libevent=/libevent-2.1.8
> 
> compile openmpi
> 
> ./configure --prefix=/openmpi-3.0.1 --with-slurm=/slurm-17.11.5
> --with-hwloc=external --with-mxm=/opt/mellanox/mxm
> --with-cuda=/usr/local/cuda --with-pmix=/pmix-2.1.0
> --with-libevent=/libevent-2.1.8
> 
> when i look at the symbols in the mca_pmix_pmix2x.so library the
> function is indeed undefined (U) in the output, but checking ldd
> against the library doesn't show any missing
> 
> any thoughts?
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

[OMPI users] openmpi/slurm/pmix

2018-04-23 Thread Michael Di Domenico
i'm trying to get slurm 17.11.5 and openmpi 3.0.1 working with pmix.

everything compiled, but when i run something it get

: symbol lookup error: /openmpi/mca_pmix_pmix2x.so: undefined symbol:
opal_libevent2022_evthread_use_pthreads

i more then sure i did something wrong, but i'm not sure what, here's what i did

compile libevent 2.1.8

./configure --prefix=/libevent-2.1.8

compile pmix 2.1.0

./configure --prefix=/pmix-2.1.0 --with-psm2
--with-munge=/munge-0.5.13 --with-libevent=/libevent-2.1.8

compile openmpi

./configure --prefix=/openmpi-3.0.1 --with-slurm=/slurm-17.11.5
--with-hwloc=external --with-mxm=/opt/mellanox/mxm
--with-cuda=/usr/local/cuda --with-pmix=/pmix-2.1.0
--with-libevent=/libevent-2.1.8

when i look at the symbols in the mca_pmix_pmix2x.so library the
function is indeed undefined (U) in the output, but checking ldd
against the library doesn't show any missing

any thoughts?
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users