Emmanuel, I tried with both 2.x and master (they are only syntactically different with regard to reduce_scatter) and I can't reproduce your issue. I run the OSU test with the following command line:
mpirun -n 97 --mca coll basic,libnbc,self --mca pml ob1 ./osu_reduce_scatter -m 524288: -i 1 -x 0 I used the IB, TCP, vader and self BTLs. George. On Tue, Mar 28, 2017 at 6:21 AM, Howard Pritchard <hpprit...@gmail.com> wrote: > Hello Emmanuel, > > Which version of Open MPI are you using? > > Howard > > > 2017-03-28 3:38 GMT-06:00 BRELLE, EMMANUEL <emmanuel.bre...@atos.net>: > >> Hi, >> >> We are working on a portals4 components and we have found a bug >> (causing a segmentation fault ) which must be related to the coll/basic >> component. >> Due to a lack of time, we cannot investigate further but this seems to be >> caused by a “free(disps);“ (around line 300 in coll_basic_reduce_scatter) >> in some specific situations. In our case it happens on a >> osu_reduce_scatter (from the OSU microbenchmarks) with at least 97 procs >> for sizes bigger than 512Ko >> >> Step to reproduce : >> export OMPI_MCA_mtl=^portals4 >> export OMPI_MCA_btl=^portals4 >> export OMPI_MCA_coll=basic,libnbc,self,tuned >> export OMPI_MCA_osc=^portals4 >> export OMPI_MCA_pml=ob1 >> mpirun -n 97 osu_reduce_scatter -m 524288: >> >> ( reducing the number of iterations with –i 1 –x 0 should keep the bug) >> Our git branch is based on the v2.x branch and the files differ almost >> only on portals4 parts. >> >> Could someone confirm this bug ? >> >> Emmanuel BRELLE >> >> >> >> >> _______________________________________________ >> devel mailing list >> devel@lists.open-mpi.org >> https://rfd.newmexicoconsortium.org/mailman/listinfo/devel >> > > > _______________________________________________ > devel mailing list > devel@lists.open-mpi.org > https://rfd.newmexicoconsortium.org/mailman/listinfo/devel >
_______________________________________________ devel mailing list devel@lists.open-mpi.org https://rfd.newmexicoconsortium.org/mailman/listinfo/devel