Re: [OMPI users] Beginner Troubleshooting OpenMPI Installation - pmi.h Error

2022-10-04 Thread Barrett, Brian via users
Can you include the configure command you used for Open MPI, as well as the 
output of “make all V=1”? It’s ok if that’s from a tree you’ve already tried to 
build; the full compile command for the file that is failing to compile is the 
part of interest.
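
A minimal sketch of one way to capture that output for a reply. The helper name log_run and the log file name are hypothetical; only “make all V=1” comes from the request above.

```shell
#!/bin/sh
# Run a command, saving stdout and stderr to a log file while still
# showing them on the terminal.
# Usage: log_run LOGFILE COMMAND [ARGS...]
log_run() {
    log=$1
    shift
    # Record the exact command line first so it can be quoted in a reply.
    printf '+ %s\n' "$*" > "$log"
    # Append everything the command prints, including compiler errors.
    "$@" 2>&1 | tee -a "$log"
}

# Hypothetical invocation from the top of the Open MPI source tree:
#   log_run make-v1.log make all V=1
```

The configure invocation itself is also recorded near the top of config.log, so it can be copied from there if the original command is no longer at hand.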

Thanks,

Brian

On 10/4/22, 9:06 AM, "users on behalf of Jeffrey D. (JD) Tamucci via users" 
<users-boun...@lists.open-mpi.org on behalf of users@lists.open-mpi.org> wrote:


Hi,

I have been trying to install OpenMPI v4.1.4 on a university HPC cluster. We 
use the Bright cluster manager and have SLURM v21.08.8 and RHEL 8.6. I used a 
script to install OpenMPI that a former co-worker had used to successfully 
install OpenMPI v3.0.0 previously. I updated it to include new versions of the 
dependencies and new paths to those installs.

Each time, it fails in the make install step. There is a fatal error about 
finding pmi.h. It specifically says:

make[2]: Entering directory '/shared/maylab/src/openmpi-4.1.4/opal/mca/pmix/s1'
  CC   libmca_pmix_s1_la-pmix_s1_component.lo
  CC   libmca_pmix_s1_la-pmix_s1.lo
pmix_s1.c:29:10: fatal error: pmi.h: No such file or directory
   29 | #include <pmi.h>

I've looked through the archives and seen others face similar errors in years 
past but I couldn't understand the solutions. One person suggested that SLURM 
may be missing PMI libraries. I think I've verified that SLURM has PMI. I 
include paths to those files and it seems to find them earlier in the process.

I'm not sure what the next step is in troubleshooting this. I have included a 
bz2 file containing my install script, a log file containing the script output 
(from build, make, make install), the config.log, and the opal_config.h file. 
If anyone could provide any guidance, I'd sincerely appreciate it.

Best,
JD


[OMPI users] Beginner Troubleshooting OpenMPI Installation - pmi.h Error

2022-10-04 Thread Jeffrey D. (JD) Tamucci via users
Hi,

I have been trying to install OpenMPI v4.1.4 on a university HPC cluster.
We use the Bright cluster manager and have SLURM v21.08.8 and RHEL 8.6. I
used a script to install OpenMPI that a former co-worker had used to
successfully install OpenMPI v3.0.0 previously. I updated it to include new
versions of the dependencies and new paths to those installs.

Each time, it fails in the *make install* step. There is a fatal error
about finding pmi.h. It specifically says:

make[2]: Entering directory
'/shared/maylab/src/openmpi-4.1.4/opal/mca/pmix/s1'
  CC   libmca_pmix_s1_la-pmix_s1_component.lo
  CC   libmca_pmix_s1_la-pmix_s1.lo
pmix_s1.c:29:10: fatal error: pmi.h: No such file or directory
   29 | #include <pmi.h>

I've looked through the archives and seen others face similar errors in
years past but I couldn't understand the solutions. One person suggested
that SLURM may be missing PMI libraries. I think I've verified that SLURM
has PMI. I include paths to those files and it seems to find them earlier
in the process.
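
One way to double-check that the header really exists where the build is being pointed is sketched below. It assumes pmi.h sits either directly under PREFIX/include or under PREFIX/include/slurm; the function name find_pmi_header and the example prefixes are hypothetical, not taken from the install script.

```shell
#!/bin/sh
# Print the path of every pmi.h found under the given install prefixes.
# Returns non-zero if no header is found under any of them.
find_pmi_header() {
    found=1
    for prefix in "$@"; do
        # Check both layouts: include/pmi.h and include/slurm/pmi.h.
        for candidate in "$prefix/include/pmi.h" "$prefix/include/slurm/pmi.h"; do
            if [ -f "$candidate" ]; then
                printf '%s\n' "$candidate"
                found=0
            fi
        done
    done
    return "$found"
}

# Hypothetical prefixes; substitute wherever SLURM is installed on the cluster:
#   find_pmi_header /usr /cm/shared/apps/slurm/current
```

If nothing is printed for the prefix passed to configure, the SLURM development headers are probably installed elsewhere or not at all, which would be consistent with the compile error above.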

I'm not sure what the next step is in troubleshooting this. I have included
a bz2 file containing my install script, a log file containing the script
output (from build, make, make install), the config.log, and the
opal_config.h file. If anyone could provide any guidance, I'd sincerely
appreciate it.

Best,
JD