for (...) { loop_spawn spawns loop_child parent and child execute MPI_Intercomm_merge parent and child execute MPI_Comm_free parent and child execute MPI_Comm_disconnect }
If the openib BTL is involved and you run long enough, it appears that you run out of locked memory. Does anyone have a sense for whether that is expected or it shows a resource leak?