Hello developers,

I am trying to debug the MCA component choices that the library makes for collective operations, because I want to force the library to choose a particular module and compare it with a different one. One thing I have noticed is that I can do:

    mpirun --mca coll basic,libnbc --np 4 ./iallreduce

for an "iallreduce" operation, but I get an error if I do

    mpirun --mca coll libnbc --np 4 ./iallreduce

or

    mpirun --mca coll basic --np 4 ./iallreduce

The error output is:

--------------------------------------------------------------------------
Although some coll components are available on your system, none of
them said that they could be used for iallgather on a new communicator.

This is extremely unusual -- either the "basic", "libnbc" or "self"
components should be able to be chosen for any communicator. As such,
this likely means that something else is wrong (although you should
double check that the "basic", "libnbc" and "self" coll components are
available on your system -- check the output of the "ompi_info" command).

A coll module failed to finalize properly when a communicator that was
using it was destroyed.

This is somewhat unusual: the module itself may be at fault, or this
may be a symptom of another issue (e.g., a memory problem).

mca_coll_base_comm_select(MPI_COMM_WORLD) failed
--> Returned "Not found" (-13) instead of "Success" (0)

Can you please help?

Regards,
Luis