Hi Siegmar
The patch was merged into the v4.0.0 branch on Oct 10th, so should be available
in the nightly tarball from that date onward.
> On Oct 6, 2018, at 2:12 AM, Siegmar Gross
> wrote:
>
> Hi Jeff, hi Ralph,
>
> Great, it works again! Thank you very much for your help. I'm really happy,
> if the undefined references for Sun C are resolved and there are no new
> problems for that compiler :-)). Do you know when the pmix patch will be
> integrated into version 4.0.0?
>
>
> Best regards
>
> Siegmar
>
>
> On 10/5/18 4:33 PM, Jeff Squyres (jsquyres) via users wrote:
>> Oops! We had a typo in yesterday's fix -- fixed:
>> https://github.com/open-mpi/ompi/pull/5847
>> Ralph also put double extra super protection to make triple sure that this
>> error can't happen again in:
>> https://github.com/open-mpi/ompi/pull/5846
>> Both of these should be in tonight's nightly snapshot.
>> Thank you!
>>> On Oct 5, 2018, at 5:45 AM, Ralph H Castain wrote:
>>>
>>> Please send Jeff and I the opal/mca/pmix/pmix4x/pmix/config.log again -
>>> we’ll need to see why it isn’t building. The patch definitely is not in the
>>> v4.0 branch, but it should have been in master.
>>>
>>>
On Oct 5, 2018, at 2:04 AM, Siegmar Gross
wrote:
Hi Ralph, hi Jeff,
On 10/3/18 8:14 PM, Ralph H Castain wrote:
> Jeff and I talked and believe the patch in
> https://github.com/open-mpi/ompi/pull/5836 should fix the problem.
Today I've installed openmpi-master-201810050304-5f1c940 and
openmpi-v4.0.x-201810050241-c079666. Unfortunately, I still get the
same error for all seven versions that I was able to build.
loki hello_1 114 mpicc --showme
gcc -I/usr/local/openmpi-master_64_gcc/include -fexceptions -pthread
-std=c11 -m64 -Wl,-rpath -Wl,/usr/local/openmpi-master_64_gcc/lib64
-Wl,--enable-new-dtags -L/usr/local/openmpi-master_64_gcc/lib64 -lmpi
loki hello_1 115 ompi_info | grep "Open MPI repo revision"
Open MPI repo revision: v2.x-dev-6262-g5f1c940
loki hello_1 116 mpicc hello_1_mpi.c
loki hello_1 117 mpiexec -np 2 a.out
[loki:25575] [[64603,0],0] ORTE_ERROR_LOG: Not found in file
../../../../../openmpi-master-201810050304-5f1c940/orte/mca/ess/hnp/ess_hnp_module.c
at line 320
--
It looks like orte_init failed for some reason; your parallel process is
likely to abort. There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):
opal_pmix_base_select failed
--> Returned value Not found (-13) instead of ORTE_SUCCESS
--
loki hello_1 118
I don't know, if you have already applied your suggested patch or if the
error message is still from a version without that patch. Do you need
anything else?
Best regards
Siegmar
>> On Oct 2, 2018, at 2:50 PM, Jeff Squyres (jsquyres) via users
>> wrote:
>>
>> (Ralph sent me Siegmar's pmix config.log, which Siegmar sent to him
>> off-list)
>>
>> It looks like Siegmar passed --with-hwloc=internal.
>>
>> Open MPI's configure understood this and did the appropriate things.
>> PMIX's configure didn't.
>>
>> I think we need to add an adjustment into the PMIx configure.m4 in
>> OMPI...
>>
>>
>>> On Oct 2, 2018, at 5:25 PM, Ralph H Castain wrote:
>>>
>>> Hi Siegmar
>>>
>>> I honestly have no idea - for some reason, the PMIx component isn’t
>>> seeing the internal hwloc code in your environment.
>>>
>>> Jeff, Brice - any ideas?
>>>
>>>
On Oct 2, 2018, at 1:18 PM, Siegmar Gross
wrote:
Hi Ralph,
how can I confirm that HWLOC built? Some hwloc files are available
in the built directory.
loki openmpi-master-201809290304-73075b8-Linux.x86_64.64_gcc 111 find
. -name '*hwloc*'
./opal/mca/btl/usnic/.deps/btl_usnic_hwloc.Plo
./opal/mca/hwloc
./opal/mca/hwloc/external/.deps/hwloc_external_component.Plo
./opal/mca/hwloc/base/hwloc_base_frame.lo
./opal/mca/hwloc/base/.deps/hwloc_base_dt.Plo
./opal/mca/hwloc/base/.deps/hwloc_base_maffinity.Plo
./opal/mca/hwloc/base/.deps/hwloc_base_frame.Plo
./opal/mca/hwloc/base/.deps/hwloc_base_util.Plo
./opal/mca/hwloc/base/hwloc_base_dt.lo
./opal/mca/hwloc/base/hwloc_base_util.lo
./opal/mca/hwloc/base/hwloc_base_maffinity.lo