My guess is you must have a mismatched MPI_Bcast somewhere
in the code. Presumably, there is a call to MPI_Bcast on the head
node that broadcasts something larger than 1 MPI_INT and does not
have the matching call on the worker nodes. Then, when the MPI_Bcast
on the worker nodes is called, they
Hi, I'm trying to run an MPI program of mine under OpenMPI 1.2 using
just one process (mpirun -np 1 ./a.out) and I'm getting some
unexpected results. The problem is that I'm getting unexpected
results from an MPI_Gatherv call when the offset for rank is nonzero.
I've worked up a small sample that
Sorry I forgot to mention: Open MPI version 1.2.4
Marcin Skoczylas wrote:
Hello,
After whole day of coding I'm fighting little bit with one small
fragment which seems strange for me.
For testing I have one head node and two worker nodes on localhost.
Having this code (with debug stuff added
Hello,
After whole day of coding I'm fighting little bit with one small
fragment which seems strange for me.
For testing I have one head node and two worker nodes on localhost.
Having this code (with debug stuff added like sleeps, barriers, etc):
void CImageData::SpreadToNodes()
{
sleep(5)
Here the Syntax & Output of the Command:
root@sun:~# mpirun --hostfile hostfile saturn
[sun:28777] [0,0,0] ORTE_ERROR_LOG: Timeout in file
base/pls_base_orted_cmds.c at line 275
[sun:28777] [0,0,0] ORTE_ERROR_LOG: Timeout in file pls_rsh_module.c at
line 1164
[sun:28777] [0,0,0] ORTE_ERROR_LOG: Tim
This is very odd. The daemon is being launched properly, but then things
get strange. It looks like mpirun is sending a message to kill
application processes on saturn.
What version of Open MPI are you using?
Are you sure that the same version of Open MPI us being used everywhere?
Can you try
Amit Kumar Saha wrote:
> what i find bizarre is that I used Open MPI 1.2.3 to install on all my
> 4 machines. whereas, 'orted' is installed in /usr/local/bin on all the
> other 3 machines, the 4th machine which is giving me trouble has got
> it installed in '/usr/bin' . Yes, 'orted' is accessible
Hi again,
Tim Prins schrieb:
> Hi,
>
> On Monday 01 October 2007 03:56:16 pm Dino Rossegger wrote:
>> Hi again,
>>
>> Yes the error output is the same:
>> root@sun:~# mpirun --hostfile hostfile main
>> [sun:23748] [0,0,0] ORTE_ERROR_LOG: Timeout in file
>> base/pls_base_orted_cmds.c at line 275
>
On 10/1/07, Si Hammond wrote:
> Can you make sure that orted is in the PATH variable for each machine?
> You can do this by adding it to /etc/bash.bashrc or an appropriate file?
>
what i find bizarre is that I used Open MPI 1.2.3 to install on all my
4 machines. whereas, 'orted' is installed in /