Ralph,

thanks for the info

can you detail your full mpirun command line, the number of servers you are
> using, the btl involved and the ompi release that can be used to reproduce
> the issue ?
>
>
> Running on only one server, using the current head of the svn repo. My
> cluster only has Ethernet, and I let it freely choose the BTLs (so I
> imagine the candidates are sm,self,tcp,vader). The cmd line is really
> trivial:
>
>
is MPSS installed and loaded ?
if yes, scif is also a candidate


> mpirun -n 1 ./loop_spawn
>
> I modified loop_spawn to only run 100 iterations as I am not patient
> enough to wait for 1000, and the number of iters isn't a factor so long as
> it is greater than 1. When the parent calls finalize, I get one of the
> following emitted for every iteration that was done:
>
> dpm_base_disconnect_init: error -12 in isend to process 0
>
>
so we do the same thing but have different behaviour ...

just to be sure :
are we talking about the loop_spawn test from the ibm test suite available
at
http://svn.open-mpi.org/svn/ompi-tests/trunk/ibm/dynamic/loop_spawn.c
and
http://svn.open-mpi.org/svn/ompi-tests/trunk/ibm/dynamic/loop_child.c

number of iterations is 2000 (and not 1000)
MPI_Comm_disconnect is invoked by both parent in loop_spawn.c :

                MPI_Comm_free(&comm_merged);
                MPI_Comm_disconnect(&comm_spawned);


and children in loop_child.c :

        MPI_Comm_free(&merged);
        MPI_Comm_disconnect(&parent);


is there any possibility you are running a different test called loop_spawn
or an older version of the dynamic/loop_spawn test from the ibm test suite ?

Cheers,

Gilles

Reply via email to