Dear all,

I finally got some time to come back to this problem. I just want to thank
Moe and Matthieu for their advice. I decided to implement the signal
trapping in my application itself, to avoid messing around with SLURM and
breaking things. In case someone finds themselves in a similar situation
and has access to the source code of their application, here are some
useful links:

* Sharcnet documentation on implementing signal trapping in Fortran/C
  applications, with subsequent action triggering:
  https://www.sharcnet.ca/help/index.php/Signal_Handling_and_Checkpointing

* Get Remaining Time library (though I haven't tested it yet):
   https://github.com/chaos/libyogrt.git

   or Matthieu's piece of code, for that matter.
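
For anyone who wants to see the general shape of signal trapping inside
an application, here is a minimal sketch in C. It is not the Sharcnet code
and not the code from my application. The names install_handlers and
run_steps, the checkpoint placeholder, and the choice of SIGTERM and
SIGUSR1 are all assumptions made for illustration; which signal your
scheduler actually delivers before the time limit has to be checked on
your own system.

```c
#define _POSIX_C_SOURCE 200809L  /* for sigaction under strict -std= modes */
#include <signal.h>
#include <stdio.h>
#include <string.h>

/* Flag set by the handler. volatile sig_atomic_t is the type that can be
   written safely from inside a signal handler. */
static volatile sig_atomic_t stop_requested = 0;

/* The handler only records that a signal arrived; the real work
   (checkpointing, cleanup) is done later in the main loop. */
static void on_signal(int signo)
{
    (void)signo;
    stop_requested = 1;
}

/* Install the handler for SIGTERM and SIGUSR1 (assumed signals).
   Returns 0 on success, -1 on failure. */
int install_handlers(void)
{
    struct sigaction sa;
    memset(&sa, 0, sizeof sa);
    sa.sa_handler = on_signal;
    sigemptyset(&sa.sa_mask);
    sa.sa_flags = SA_RESTART;  /* restart interrupted system calls */

    if (sigaction(SIGTERM, &sa, NULL) != 0) return -1;
    if (sigaction(SIGUSR1, &sa, NULL) != 0) return -1;
    return 0;
}

/* Hypothetical main loop: runs up to max_steps iterations and stops early
   once a trapped signal has been seen. Returns the steps completed. */
int run_steps(int max_steps)
{
    int step = 0;
    while (step < max_steps && !stop_requested) {
        /* ... one unit of work would go here ... */
        step++;
    }
    if (stop_requested) {
        /* A real application would write its checkpoint here. */
        fprintf(stderr, "signal caught after %d steps, stopping\n", step);
    }
    return step;
}
```

The application would call install_handlers() once at startup and then
enter its loop. Keeping the handler down to setting a flag avoids calling
non-async-signal-safe functions (I/O, malloc) from signal context.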

Finally, regarding the PMI interface of Intel MPI: it is true that version
4.0.0.28 offers the PMI interface, but it seems to have some problems with
SLURM. I posted this problem to the Intel forums some time ago and got the
same advice of upgrading to version 4.0 Update 2.
Thanks, David.


Best Regards,
Domingos

___________________________________
Domingos Rodrigues, PhD
Laboratório de Computação Científica
ICeX, Sala 2040
Universidade Federal de Minas Gerais,
Av. Antônio Carlos, 6627 - Pampulha
31270-901 - Belo Horizonte - MG - Brasil
Tel +55 31 3409-4909
VOIP: +55 31 3409-3333 / 10811803
Fax +55 31 3409-5390
http://www.cenapad.ufmg.br
Email: ddcr(at)lcc.ufmg.br, ddcr(at)ufmg.br
___________________________________



On Tue, Oct 18, 2011 at 12:48 PM, David N. Lombard <
dnlom...@ichips.intel.com> wrote:

> On Sat, Oct 15, 2011 at 11:23:59AM -0600, Domingos wrote:
> > Dear community,
> >
> > I am trying to design a batch script that launches a parallel job with
> > mpirun (the Intel MPI version
> > i'm using does not have PMI interface so i can't launch via srun).
>
> Intel MPI does offer a PMI interface. Here's a quick example:
>
>  $ export I_MPI_PMI_LIBRARY=/full/path/to/slurm/libpmi.so
>  $ export I_MPI_FABRICS=shm:ofa
>  $ srun -n 2 ./hello_world
>
> You can also find more info at
>
> http://software.intel.com/en-us/articles/how-to-use-slurm-pmi-with-the-intel-mpi-library-for-linux
>
> I have been told that you should use 4.0.3 due to some fixes.
>
> --
> David N. Lombard, Intel, Irvine, CA
> I do not speak for Intel Corporation; all comments are strictly my own.
>
