Hello,

Version - Open MPI 1.2.1.

I have a simple program as below:

#include<string.h>
#include<stdlib.h>
#include<stdio.h>
#include"mpi.h"

void
main(int argc, char **argv)
{

        int             tag = 0;
        int             my_rank;
        int             num_proc;
        char            message_0[] = "hello slave, i'm your master";
        char            message_1[50];
        char            master_data[] = "slaves to work";
        int             num;
        MPI_Status      status;
        MPI_Comm        inter_comm;
        MPI_Info        info;
        int             arr[1];
        int             rc1;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &my_rank);
        MPI_Comm_size(MPI_COMM_WORLD, &num_proc);
        rc1 = MPI_Comm_spawn("/bin/hostname", MPI_ARGV_NULL, 1,
                             MPI_INFO_NULL, 0, MPI_COMM_WORLD, &inter_comm, arr);
        printf("MASTER : send a message to master of slaves ...\n");
        MPI_Send(message_0, 50, MPI_CHAR,0 , tag, inter_comm);
        MPI_Recv(message_1, 50, MPI_CHAR, 0, tag, inter_comm, &status);
        printf("MASTER : message received : %s\n", message_1);
        MPI_Send(master_data, 50, MPI_CHAR,0 , tag, inter_comm);
        MPI_Finalize();
        exit(0);
}

When I run it, all I get is:
>~/thesis/CS/Samples/x86_64> mpirun -np 4 --pernode --hostfile machinefile --prefix /usr/local/openmpi-1.2 ./master1
MASTER : spawning 3 slaves ... 
MASTER : spawning 3 slaves ... 
MASTER : spawning 3 slaves ... 
MASTER : spawning 3 slaves ... 
src is (null) and orte type is 0
[bmi-opt2-01:25441] [0,0,0] ORTE_ERROR_LOG: Bad parameter in file dss/dss_copy.c at line 43
[bmi-opt2-01:25441] [0,0,0] ORTE_ERROR_LOG: Bad parameter in file gpr_replica_put_get_fn.c at line 410
[bmi-opt2-01:25441] [0,0,0] ORTE_ERROR_LOG: Bad parameter in file base/rmaps_base_registry_fns.c at line 612
[bmi-opt2-01:25441] [0,0,0] ORTE_ERROR_LOG: Bad parameter in file base/rmaps_base_map_job.c at line 93
[bmi-opt2-01:25441] [0,0,0] ORTE_ERROR_LOG: Bad parameter in file base/rmaps_base_receive.c at line 139
mpirun: killing job...

mpirun noticed that job rank 0 with PID 25447 on node bmi-opt2-01 exited on signal 15 (Terminated).
3 additional processes aborted (not shown)

Any idea what is wrong with this?

Thanks,
Prakash
