Hello: I am testing a rather large code on several computers. It works fine on all except for a Linux Pop!_OS machine. I tried both OpenMPI 2.1.1 and 4.0.5. I fear there is an issue because of the Pop!_OS but before I contact System76 I would like to explore things further.
I get the following message while running the code on a box called jp1: [jp1:3331418] 7 more processes have sent help message help-mpi-btl-base.txt / btl:no-nics [jp1:3331418] Set MCA parameter "orte_base_help_aggregate" to 0 to see all help / error messages and then mpirun noticed that process rank 3 with PID 0 on node jp1 exited on signal 9 (Killed). It seems I should set this MCA parameter "orte_base_help_aggregate" to 0 in order to see the error messages. How can I do this? I suppose I should do it before running the code. Is this correct? Thank you, Paul