Re: [OMPI users] Fault tolerant ompi - Error: Unable to find a list of active MPIRUN processes on this machine.

2011-03-31 Thread Hellmüller Roman
solved don't know exactly how. just work on it, set some other parameters/directorys. cheers roman Von: users-boun...@open-mpi.org [users-boun...@open-mpi.org] im Auftrag von Hellmüller Roman [hro...@student.ethz.ch] Gesendet: Donnerstag, 31. März 2011

Re: [OMPI users] Fault tolerant ompi - Error: Unable to find a list of active MPIRUN processes on this machine.

2011-03-31 Thread Hellmüller Roman
Hi I noticed that the directory /tmp/openmpi-sessions-hroman@cbl1_0 is created on the login nodes but not on the compute nodes. By setting orte_tmpdir_base=/tmp in \$prefix/ect/openmpi-mca-params.conf i could make sure that the session directory is created. But when i now try to checkpoint

[OMPI users] Fault tolerant ompi - Error: Unable to find a list of active MPIRUN processes on this machine.

2011-03-30 Thread Hellmüller Roman
Hi I'm trying to get fault tolerant ompi running on our cluster for my semesterthesis. On the login node i was successful, checkpointing works. Since the compute nodes have different kernels, i had to compile blcr on the compute nodes again. blcr on the compute nodes works. after that i