On 4 Feb 2017, at 20:13, Gwyneth Allwright <[email protected]> wrote:

> Hi All,
> 
> I'm trying to get the Einstein Toolkit installed on an HPC cluster running 
> SLES. The trouble is that Cactus tries to use all the available processors 
> even when I specify a smaller number (by setting ppn in my PBS script).
> 
> As a test, I tried running the compilation with a parameter file that 
> required about 5 GB of RAM. In my PBS script, I set nodes=1 and ppn=3, and 
> then ran using openmpi-1.10.1:
> 
> mpirun -hostfile $PBS_NODEFILE <ET exe> <parameter file>
> 
> This resulted in the simulation running on all 24 available processors, even 
> though I'd only requested 3. Since PBS and MPI are integrated, I was told 
> that using -np with mpirun wouldn't help. 
> 
> Does anyone know how to address this issue?

Hi Gwyneth,

A few questions:

Can you check which nodes are appearing in $PBS_NODEFILE?

When you say it's running on all 24 available processors, what do you mean?  Do 
you mean that "top" shows 24 processes, or that the CPU is at 100%?  Could it 
be that you do in fact have three processes, but due to OpenMP, each process 
actually has 12 threads?

Can you do

ps -u $USER f

on the node while Cactus is running?  This will show you a tree view of the 
processes that are running.

Is the parameter file using PUGH or Carpet?  The standard output from the first 
process should show you the number of processes that Cactus thinks it is 
running on.

When you compiled the ET, did you specify the location of the MPI library with 
MPI_DIR?  If Cactus failed to find the library automatically, it will have 
automatically built its own version of OpenMPI and linked with that version.  
If you then use an mpirun from a different installation of MPI, things will not 
work correctly.

Maybe you can increase the verbosity of mpirun, to check how many processes it 
is trying to start.

-- 
Ian Hinder
http://members.aei.mpg.de/ianhin

_______________________________________________
Users mailing list
[email protected]
http://lists.einsteintoolkit.org/mailman/listinfo/users

Reply via email to