Re: [OMPI users] how this is possible?

2007-10-02 Thread Rolf . Vandevaart
My guess is you must have a mismatched MPI_Bcast somewhere in the code. Presumably, there is a call to MPI_Bcast on the head node that broadcasts something larger than 1 MPI_INT and does not have the matching call on the worker nodes. Then, when the MPI_Bcast on the worker nodes is called, they

[OMPI users] MPI_Gatherv on One Process

2007-10-02 Thread Chris Johnson
Hi, I'm trying to run an MPI program of mine under OpenMPI 1.2 using just one process (mpirun -np 1 ./a.out) and I'm getting some unexpected results. The problem is that I'm getting unexpected results from an MPI_Gatherv call when the offset for rank is nonzero. I've worked up a small sample that

Re: [OMPI users] how this is possible?

2007-10-02 Thread Marcin Skoczylas
Sorry I forgot to mention: Open MPI version 1.2.4 Marcin Skoczylas wrote: Hello, After whole day of coding I'm fighting little bit with one small fragment which seems strange for me. For testing I have one head node and two worker nodes on localhost. Having this code (with debug stuff added

[OMPI users] how this is possible?

2007-10-02 Thread Marcin Skoczylas
Hello, After whole day of coding I'm fighting little bit with one small fragment which seems strange for me. For testing I have one head node and two worker nodes on localhost. Having this code (with debug stuff added like sleeps, barriers, etc): void CImageData::SpreadToNodes() { sleep(5)

Re: [OMPI users] mpirun ERROR: The daemon exited unexpectedly with status 255.

2007-10-02 Thread Dino Rossegger
Here the Syntax & Output of the Command: root@sun:~# mpirun --hostfile hostfile saturn [sun:28777] [0,0,0] ORTE_ERROR_LOG: Timeout in file base/pls_base_orted_cmds.c at line 275 [sun:28777] [0,0,0] ORTE_ERROR_LOG: Timeout in file pls_rsh_module.c at line 1164 [sun:28777] [0,0,0] ORTE_ERROR_LOG: Tim

Re: [OMPI users] mpirun ERROR: The daemon exited unexpectedly with status 255.

2007-10-02 Thread Tim Prins
This is very odd. The daemon is being launched properly, but then things get strange. It looks like mpirun is sending a message to kill application processes on saturn. What version of Open MPI are you using? Are you sure that the same version of Open MPI us being used everywhere? Can you try

Re: [OMPI users] problem with 'orted'

2007-10-02 Thread Sebastian Schulz
Amit Kumar Saha wrote: > what i find bizarre is that I used Open MPI 1.2.3 to install on all my > 4 machines. whereas, 'orted' is installed in /usr/local/bin on all the > other 3 machines, the 4th machine which is giving me trouble has got > it installed in '/usr/bin' . Yes, 'orted' is accessible

Re: [OMPI users] mpirun ERROR: The daemon exited unexpectedly with status 255.

2007-10-02 Thread Dino Rossegger
Hi again, Tim Prins schrieb: > Hi, > > On Monday 01 October 2007 03:56:16 pm Dino Rossegger wrote: >> Hi again, >> >> Yes the error output is the same: >> root@sun:~# mpirun --hostfile hostfile main >> [sun:23748] [0,0,0] ORTE_ERROR_LOG: Timeout in file >> base/pls_base_orted_cmds.c at line 275 >

Re: [OMPI users] problem with 'orted'

2007-10-02 Thread Amit Kumar Saha
On 10/1/07, Si Hammond wrote: > Can you make sure that orted is in the PATH variable for each machine? > You can do this by adding it to /etc/bash.bashrc or an appropriate file? > what i find bizarre is that I used Open MPI 1.2.3 to install on all my 4 machines. whereas, 'orted' is installed in /