Package: libopenmpi3
Version: 3.1.3-11
Severity: serious

Even with the fixed libpmix2_4.0.0~rc1-2, I am getting runtime failures
trying to run MPI programs, e.g. the nwchem autopkgtests all fail like
this:

| Running tests/water/water_md 
| 
|     cleaning scratch
|     copying input and verified output files
|     running nwchem (/usr/bin/nwchem)  with 1 processors 
| 
|     NWChem execution failed
|[kohn:13218] [[5127,0],0] ORTE_ERROR_LOG: Not found in file ../../../../../../orte/mca/ess/hnp/ess_hnp_module.c at line 320
|--------------------------------------------------------------------------
|It looks like orte_init failed for some reason; your parallel process is
|likely to abort.  There are many reasons that a parallel process can
|fail during orte_init; some of which are due to configuration or
|environment problems.  This failure appears to be an internal failure;
|here's some additional information (which may only be relevant to an
|Open MPI developer):
|
|  opal_pmix_base_select failed
|  --> Returned value Not found (-13) instead of ORTE_SUCCESS
|--------------------------------------------------------------------------

I am not sure whether the bug is in libopenmpi3, openmpi-bin, libpmix2 or
something else, so please reassign as needed. In any case, the openmpi
excuses page is full of ci.debian.net regressions:

https://qa.debian.org/excuses.php?package=openmpi

Or is there something needed on the application side, like a new
environment variable or library to be linked in?


Michael
