Re: [OMPI users] **URGENT: Error during testing

2019-08-19 Thread Jeff Squyres (jsquyres) via users
Note that it is not advisable to run autogen from a distribution tarball.

Specifically: we include the autogen script for advanced users who want to 
tweak their own Open MPI copy (without cloning from git), but 99.99% of users 
can just run "./configure ..." directly (without first running autogen).  If 
you're an advanced user and want / need to run autogen -- cool, go for it.  But 
if you're just downloading an Open MPI tarball and are trying to build / 
install it, there's no need to run autogen.

Put simply: in official release tarballs of Open MPI, we've already run autogen 
for you.


On Aug 19, 2019, at 2:17 PM, Steven Varga via users 
mailto:users@lists.open-mpi.org>> wrote:

Hi
this is steven. I am building custom clusters on AWS Ec2 and had some problems 
in the past. I am getting good result with external pmix 3.1.3
./autogen.sh && ./configure --prefix=/usr/local/ --with-platform=optimized 
--with-hwloc=/usr/local --with-libevent=/usr/local --enable-pmix-binaries  
--enable-pmi-backward-compatibility
make -j4 && sudo make install
and ompenmpi 4.0.1
./autogen.pl && ./configure --prefix=/usr/local 
--with-slurm --with-pmix=/usr/local --enable-mpi1-compatibility 
--with-libevent=/usr/local --with-hwloc=/usr/local

making certain linking against the same libevent
This is on linux most recent custom kernel, and most recent SLURM scheduler.
best:
steve





On Mon, Aug 19, 2019 at 2:07 PM Jeff Squyres (jsquyres) via users 
mailto:users@lists.open-mpi.org>> wrote:
Can you provide some more details?

https://www.open-mpi.org/community/help/


On Aug 19, 2019, at 1:18 PM, Riddhi A Mehta via users 
mailto:users@lists.open-mpi.org>> wrote:

Hello

My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:


[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262
[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667
--
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  pmix server init failed
  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users


--
Jeff Squyres
jsquy...@cisco.com




___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users


--
Jeff Squyres
jsquy...@cisco.com




___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] **URGENT: Error during testing

2019-08-19 Thread Jeff Squyres (jsquyres) via users
Do not use Open MPI v2.0.x -- it's ancient.

You should probably use Open MPI v3.1.x or v4.0.x.



On Aug 19, 2019, at 2:12 PM, Riddhi A Mehta via users 
mailto:users@lists.open-mpi.org>> wrote:

Hi

I followed the exact procedure stated in this link: 
http://www.science.smith.edu/dftwiki/index.php/Install_MPI_on_a_MacBook.


It runs correctly until this line : mpicc -o hello helloWorld.c

After which it gives me the error when I do mpirun.

Thank you

Riddhi

From: users 
mailto:users-boun...@lists.open-mpi.org>> on 
behalf of "Riebs, Andy via users" 
mailto:users@lists.open-mpi.org>>
Reply-To: Open MPI Users 
mailto:users@lists.open-mpi.org>>
Date: Monday, August 19, 2019 at 2:10 PM
To: Open MPI Users mailto:users@lists.open-mpi.org>>
Cc: "Riebs, Andy" mailto:andy.ri...@hpe.com>>
Subject: Re: [OMPI users] **URGENT: Error during testing

Is there any chance that the fact that Riddhi appears to be trying to execute 
an uncompiled hello.c could be the problem here?

From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of Jeff Squyres 
(jsquyres) via users
Sent: Monday, August 19, 2019 2:05 PM
To: Open MPI User's List 
mailto:users@lists.open-mpi.org>>
Cc: Jeff Squyres (jsquyres) mailto:jsquy...@cisco.com>>
Subject: Re: [OMPI users] **URGENT: Error during testing

Can you provide some more details?

https://www.open-mpi.org/community/help/




On Aug 19, 2019, at 1:18 PM, Riddhi A Mehta via users 
mailto:users@lists.open-mpi.org>> wrote:

Hello

My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:


[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262
[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667
--
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  pmix server init failed
  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org<mailto:users@lists.open-mpi.org>
https://lists.open-mpi.org/mailman/listinfo/users


--
Jeff Squyres
jsquy...@cisco.com<mailto:jsquy...@cisco.com>



___
users mailing list
users@lists.open-mpi.org<mailto:users@lists.open-mpi.org>
https://lists.open-mpi.org/mailman/listinfo/users


--
Jeff Squyres
jsquy...@cisco.com<mailto:jsquy...@cisco.com>




___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] **URGENT: Error during testing

2019-08-19 Thread Steven Varga via users
Hi
this is steven. I am building custom clusters on AWS Ec2 and had some
problems in the past. I am getting good result with external pmix 3.1.3
./autogen.sh && ./configure --prefix=/usr/local/ --with-platform=optimized
--with-hwloc=/usr/local --with-libevent=/usr/local --enable-pmix-binaries
 --enable-pmi-backward-compatibility
make -j4 && sudo make install
and ompenmpi 4.0.1
./autogen.pl && ./configure --prefix=/usr/local --with-slurm
--with-pmix=/usr/local --enable-mpi1-compatibility
--with-libevent=/usr/local --with-hwloc=/usr/local

making certain linking against the same libevent
This is on linux most recent custom kernel, and most recent SLURM scheduler.
best:
steve





On Mon, Aug 19, 2019 at 2:07 PM Jeff Squyres (jsquyres) via users <
users@lists.open-mpi.org> wrote:

> Can you provide some more details?
>
> https://www.open-mpi.org/community/help/
>
>
> On Aug 19, 2019, at 1:18 PM, Riddhi A Mehta via users <
> users@lists.open-mpi.org> wrote:
>
> Hello
>
> My name is Riddhi and I am a Graduate Research Assistant in the Dept. of
> Physics & Astronomy at Purdue University. About a month ago I correctly
> configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran
> correctly. But today, it gave me the following error:
>
>
> *[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad
> parameter in file orted/pmix/pmix_server.c at line 262*
> *[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad
> parameter in file ess_hnp_module.c at line 667*
>
> *--*
> *It looks like orte_init failed for some reason; your parallel process is*
> *likely to abort.*  *There are many reasons that a parallel process can*
> *fail during orte_init; some of which are due to configuration or*
> *environment problems.*  *This failure appears to be an internal failure;*
> *here's some additional information (which may only be relevant to an*
> *Open MPI developer):*
>
>   *pmix server init failed*
>   *--> Returned value Bad parameter (-5) instead of ORTE_SUCCESS*
>
> I tried to google this error and the only thing I found was to change the
> TMPDIR name to a shorter name. But I don’t think I understand what it’s
> asking me to do. Kindly help since I am not able to understand what went
> wrong in one month and mpirun stopped working.
>
> Thank you
>
> Riddhi Mehta
> Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
> Dept. of Physics & Astronomy
> Purdue University
>
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users
>
>
>
> --
> Jeff Squyres
> jsquy...@cisco.com
>
>
>
>
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] **URGENT: Error during testing

2019-08-19 Thread Riddhi A Mehta via users
Hi

I followed the exact procedure stated in this link: 
http://www.science.smith.edu/dftwiki/index.php/Install_MPI_on_a_MacBook.


It runs correctly until this line : mpicc -o hello helloWorld.c
After which it gives me the error when I do mpirun.

Thank you

Riddhi

From: users  on behalf of "Riebs, Andy via 
users" 
Reply-To: Open MPI Users 
Date: Monday, August 19, 2019 at 2:10 PM
To: Open MPI Users 
Cc: "Riebs, Andy" 
Subject: Re: [OMPI users] **URGENT: Error during testing

Is there any chance that the fact that Riddhi appears to be trying to execute 
an uncompiled hello.c could be the problem here?

From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of Jeff Squyres 
(jsquyres) via users
Sent: Monday, August 19, 2019 2:05 PM
To: Open MPI User's List 
Cc: Jeff Squyres (jsquyres) 
Subject: Re: [OMPI users] **URGENT: Error during testing

Can you provide some more details?

https://www.open-mpi.org/community/help/




On Aug 19, 2019, at 1:18 PM, Riddhi A Mehta via users 
mailto:users@lists.open-mpi.org>> wrote:

Hello

My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:


[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262
[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667
--
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  pmix server init failed
  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org<mailto:users@lists.open-mpi.org>
https://lists.open-mpi.org/mailman/listinfo/users


--
Jeff Squyres
jsquy...@cisco.com<mailto:jsquy...@cisco.com>



___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] **URGENT: Error during testing

2019-08-19 Thread Riebs, Andy via users
Is there any chance that the fact that Riddhi appears to be trying to execute 
an uncompiled hello.c could be the problem here?

From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of Jeff Squyres 
(jsquyres) via users
Sent: Monday, August 19, 2019 2:05 PM
To: Open MPI User's List 
Cc: Jeff Squyres (jsquyres) 
Subject: Re: [OMPI users] **URGENT: Error during testing

Can you provide some more details?

https://www.open-mpi.org/community/help/<https://www.open-mpi.org/community/help/>



On Aug 19, 2019, at 1:18 PM, Riddhi A Mehta via users 
mailto:users@lists.open-mpi.org>> wrote:

Hello

My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:


[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262
[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667
--
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  pmix server init failed
  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org<mailto:users@lists.open-mpi.org>
https://lists.open-mpi.org/mailman/listinfo/users<https://lists.open-mpi.org/mailman/listinfo/users>


--
Jeff Squyres
jsquy...@cisco.com<mailto:jsquy...@cisco.com>



___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] **URGENT: Error during testing

2019-08-19 Thread Jeff Squyres (jsquyres) via users
Can you provide some more details?

https://www.open-mpi.org/community/help/


On Aug 19, 2019, at 1:18 PM, Riddhi A Mehta via users 
mailto:users@lists.open-mpi.org>> wrote:

Hello

My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:


[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262
[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667
--
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  pmix server init failed
  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users


--
Jeff Squyres
jsquy...@cisco.com




___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

[OMPI users] **URGENT: Error during testing

2019-08-19 Thread Riddhi A Mehta via users
Hello



My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:





[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262

[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667

--

It looks like orte_init failed for some reason; your parallel process is

likely to abort.  There are many reasons that a parallel process can

fail during orte_init; some of which are due to configuration or

environment problems.  This failure appears to be an internal failure;

here's some additional information (which may only be relevant to an

Open MPI developer):



  pmix server init failed

  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users