Re: [OMPI users] CUDA supported APIs

2019-08-20 Thread Riddhi A Mehta via users
Sorry I forgot to attach the error log. Here it is.

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University


From: users  on behalf of Riddhi A Mehta via 
users 
Reply-To: Open MPI Users 
Date: Tuesday, August 20, 2019 at 3:06 PM
To: Open MPI Users 
Cc: Riddhi A Mehta 
Subject: Re: [OMPI users] CUDA supported APIs

Hello

I was able to correctly install and test OpenMPI 4.0.1 functionality on my Mac. 
However, I am running into another problem. I am running an astrophysical code 
named PLUTO which uses certain old routines which have been discontinued in the 
newer MPI versions. As a result, I face errors during the ‘make’ process. The 
error log is attached as a text file. Can someone guide me as to how I can fix 
that and make use of new routines?

Thanks
Riddhi


From: users  on behalf of "Zhang, Junchao via 
users" 
Reply-To: Open MPI Users 
Date: Monday, August 19, 2019 at 6:17 PM
To: "Fang, Leo" 
Cc: "Zhang, Junchao" , Open MPI Users 

Subject: Re: [OMPI users] CUDA supported APIs

Leo,
  Thanks for the info. That is interesting.  And yes, Having a CUDA aware MPI 
API list would be very useful.
--Junchao Zhang


On Mon, Aug 19, 2019 at 10:23 AM Fang, Leo 
mailto:leof...@bnl.gov>> wrote:

Hi Junchao,





First, for your second question, the answer is here: 
https://www.mail-archive.com/users@lists.open-mpi.org/msg33279.html. I know 
this because I also asked it earlier  It'd be nice to have this documented in 
the Q though.



As for your first question, I am also interested. It'd be nice for Open MPI 
core devs to keep the supported API list up-to-date. We recently added support 
of CUDA-aware MPI to mpi4py, and such a list is important for us to keep track 
upstream support so that we know whether a test fails due to lack of 
CUDA-awareness or because we messed up (much less likely).



Thanks.




Sincerely,
Leo

---
Yao-Lung Leo Fang
Assistant Computational Scientist
Computational Science Initiative
Brookhaven National Laboratory
Bldg. 725, Room 2-169
P.O. Box 5000, Upton, NY 11973-5000
Office: (631) 344-3265
Email: leof...@bnl.gov<mailto:leof...@bnl.gov>
Website: https://leofang.github.io/

寄件者: users 
mailto:users-boun...@lists.open-mpi.org>> 代表 
Zhang, Junchao via users 
mailto:users@lists.open-mpi.org>>
寄件日期: 2019年8月15日 上午 11:52:56
收件者: Open MPI Users mailto:users@lists.open-mpi.org>>
副本: Zhang, Junchao mailto:jczh...@mcs.anl.gov>>
主旨: Re: [OMPI users] CUDA supported APIs

Another question: If MPI_Allgatherv(const void *sendbuf, int sendcount, 
MPI_Datatype sendtype, void *recvbuf, const int recvcounts[],const int 
displs[], MPI_Datatype recvtype, MPI_Comm comm) is cuda aware, are recvcounts, 
displs in CPU memory or GPU memory?

--Junchao Zhang


On Thu, Aug 15, 2019 at 9:55 AM Junchao Zhang 
mailto:jczh...@mcs.anl.gov>> wrote:
Hi,
   Are the APIs at 
https://www.open-mpi.org/faq/?category=runcuda#mpi-apis-cuda<https://urldefense.proofpoint.com/v2/url?u=https-3A__www.open-2Dmpi.org_faq_-3Fcategory-3Druncuda-23mpi-2Dapis-2Dcuda=DwMGaQ=aTOVZmpUfPKZuaG9NO7J7Mh6imZbfhL47t9CpZ-pCOw=xdA_wfZm0r4KH07in_vhZg=RZswVqXwi-LuVtqni8ecrzkJU3WCkvSRUw1u7n32neQ=FKXyPpx3kLJRAQirASnnXD2Q-HLG3G0XMwvXmJ4sPdQ=>
 latest? I could not find MPI_Neighbor_xxx and MPI_Reduce_local.
  Thanks.

--Junchao Zhang
{\rtf1\ansi\ansicpg1252\cocoartf1671\cocoasubrtf600
{\fonttbl\f0\fswiss\fcharset0 Helvetica;\f1\fnil\fcharset0 Menlo-Bold;\f2\fnil\fcharset0 Menlo-Regular;
}
{\colortbl;\red255\green255\blue255;\red0\green0\blue0;\red202\green51\blue35;\red57\green192\blue38;
}
{\*\expandedcolortbl;;\csgray\c0;\cssrgb\c83898\c28665\c18024;\cssrgb\c25704\c77963\c19556;
}
\margl1440\margr1440\vieww10800\viewh8400\viewkind0
\pard\tx720\tx1440\tx2160\tx2880\tx3600\tx4320\tx5040\tx5760\tx6480\tx7200\tx7920\tx8640\pardirnatural\partightenfactor0

\f0\fs24 \cf0 Mpirun error\
\
\pard\tx560\tx1120\tx1680\tx2240\tx2800\tx3360\tx3920\tx4480\tx5040\tx5600\tx6160\tx6720\pardirnatural\partightenfactor0

\f1\b\fs22 \cf2 \CocoaLigature0 /Users/ram/Purdue_Physics/RProjects/Research_SW/PLUTO/Src/Parallel/al_subarray_.c:88:5: \cf3 error: \cf2 static_assert failed
\f2\b0 \

\f1\b   "MPI_Type_extent was removed in MPI-3.0. Use MPI_Type_get_extent instead."
\f2\b0 \
MPI_Type_extent(oldtype, (MPI_Aint *) );\

\f1\b \cf4 ^~
\f2\b0 \cf2 \

\f1\b /Users/ram/Purdue_Physics/RProjects/Research_SW/mpi/include/mpi.h:2820:31: note: 
\f2\b0 expanded from macro 'MPI_Type_extent'\
#define MPI_Type_extent(...)  THIS_SYMBOL_WAS_REMOVED_IN_MPI30(MPI_Type_extent, MPI_Type_get_extent)\

\f1\b \cf4   ^~
\f2\b0 \cf2 \

\f1\b /Users/ram/Purdue_Physics/RProjects/Research_SW/mpi/include/mpi.h:322:57: note: 
\f2\b0 expanded from ma

Re: [OMPI users] CUDA supported APIs

2019-08-20 Thread Riddhi A Mehta via users
Hello

I was able to correctly install and test OpenMPI 4.0.1 functionality on my Mac. 
However, I am running into another problem. I am running an astrophysical code 
named PLUTO which uses certain old routines which have been discontinued in the 
newer MPI versions. As a result, I face errors during the ‘make’ process. The 
error log is attached as a text file. Can someone guide me as to how I can fix 
that and make use of new routines?

Thanks
Riddhi


From: users  on behalf of "Zhang, Junchao via 
users" 
Reply-To: Open MPI Users 
Date: Monday, August 19, 2019 at 6:17 PM
To: "Fang, Leo" 
Cc: "Zhang, Junchao" , Open MPI Users 

Subject: Re: [OMPI users] CUDA supported APIs

Leo,
  Thanks for the info. That is interesting.  And yes, Having a CUDA aware MPI 
API list would be very useful.
--Junchao Zhang


On Mon, Aug 19, 2019 at 10:23 AM Fang, Leo 
mailto:leof...@bnl.gov>> wrote:

Hi Junchao,





First, for your second question, the answer is here: 
https://www.mail-archive.com/users@lists.open-mpi.org/msg33279.html. I know 
this because I also asked it earlier  It'd be nice to have this documented in 
the Q though.



As for your first question, I am also interested. It'd be nice for Open MPI 
core devs to keep the supported API list up-to-date. We recently added support 
of CUDA-aware MPI to mpi4py, and such a list is important for us to keep track 
upstream support so that we know whether a test fails due to lack of 
CUDA-awareness or because we messed up (much less likely).



Thanks.




Sincerely,
Leo

---
Yao-Lung Leo Fang
Assistant Computational Scientist
Computational Science Initiative
Brookhaven National Laboratory
Bldg. 725, Room 2-169
P.O. Box 5000, Upton, NY 11973-5000
Office: (631) 344-3265
Email: leof...@bnl.gov
Website: https://leofang.github.io/

寄件者: users 
mailto:users-boun...@lists.open-mpi.org>> 代表 
Zhang, Junchao via users 
mailto:users@lists.open-mpi.org>>
寄件日期: 2019年8月15日 上午 11:52:56
收件者: Open MPI Users mailto:users@lists.open-mpi.org>>
副本: Zhang, Junchao mailto:jczh...@mcs.anl.gov>>
主旨: Re: [OMPI users] CUDA supported APIs

Another question: If MPI_Allgatherv(const void *sendbuf, int sendcount, 
MPI_Datatype sendtype, void *recvbuf, const int recvcounts[],const int 
displs[], MPI_Datatype recvtype, MPI_Comm comm) is cuda aware, are recvcounts, 
displs in CPU memory or GPU memory?

--Junchao Zhang


On Thu, Aug 15, 2019 at 9:55 AM Junchao Zhang 
mailto:jczh...@mcs.anl.gov>> wrote:
Hi,
   Are the APIs at 
https://www.open-mpi.org/faq/?category=runcuda#mpi-apis-cuda
 latest? I could not find MPI_Neighbor_xxx and MPI_Reduce_local.
  Thanks.

--Junchao Zhang
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] **URGENT: Error during testing

2019-08-19 Thread Riddhi A Mehta via users
Hi

I followed the exact procedure stated in this link: 
http://www.science.smith.edu/dftwiki/index.php/Install_MPI_on_a_MacBook.


It runs correctly until this line : mpicc -o hello helloWorld.c
After which it gives me the error when I do mpirun.

Thank you

Riddhi

From: users  on behalf of "Riebs, Andy via 
users" 
Reply-To: Open MPI Users 
Date: Monday, August 19, 2019 at 2:10 PM
To: Open MPI Users 
Cc: "Riebs, Andy" 
Subject: Re: [OMPI users] **URGENT: Error during testing

Is there any chance that the fact that Riddhi appears to be trying to execute 
an uncompiled hello.c could be the problem here?

From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of Jeff Squyres 
(jsquyres) via users
Sent: Monday, August 19, 2019 2:05 PM
To: Open MPI User's List 
Cc: Jeff Squyres (jsquyres) 
Subject: Re: [OMPI users] **URGENT: Error during testing

Can you provide some more details?

https://www.open-mpi.org/community/help/




On Aug 19, 2019, at 1:18 PM, Riddhi A Mehta via users 
mailto:users@lists.open-mpi.org>> wrote:

Hello

My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:


[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262
[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667
--
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  pmix server init failed
  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org<mailto:users@lists.open-mpi.org>
https://lists.open-mpi.org/mailman/listinfo/users


--
Jeff Squyres
jsquy...@cisco.com<mailto:jsquy...@cisco.com>



___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

[OMPI users] **URGENT: Error during testing

2019-08-19 Thread Riddhi A Mehta via users
Hello



My name is Riddhi and I am a Graduate Research Assistant in the Dept. of 
Physics & Astronomy at Purdue University. About a month ago I correctly 
configured openmpi on my mac and the ‘mpirun -np 2 ./hello.c’ ran correctly. 
But today, it gave me the following error:





[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file orted/pmix/pmix_server.c at line 262

[Riddhis-MacBook-Air.local:89582] [[56525,0],0] ORTE_ERROR_LOG: Bad parameter 
in file ess_hnp_module.c at line 667

--

It looks like orte_init failed for some reason; your parallel process is

likely to abort.  There are many reasons that a parallel process can

fail during orte_init; some of which are due to configuration or

environment problems.  This failure appears to be an internal failure;

here's some additional information (which may only be relevant to an

Open MPI developer):



  pmix server init failed

  --> Returned value Bad parameter (-5) instead of ORTE_SUCCESS

I tried to google this error and the only thing I found was to change the 
TMPDIR name to a shorter name. But I don’t think I understand what it’s asking 
me to do. Kindly help since I am not able to understand what went wrong in one 
month and mpirun stopped working.

Thank you

Riddhi Mehta
Research Group: Maxim Lyutikov, Theoretical High Energy Astrophysics
Dept. of Physics & Astronomy
Purdue University

___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users