Bug#903492: Runtime error "PMIX-XFER-VALUE: UNSUPPORTED TYPE 28016"

2018-07-11 Thread Alastair McKinstry
This may also be due to recent uploads of a new PMIX (OpenMPI now uses 
an external pmix library).


This linkage seems fragile and I need to come up with package tests in 
pmix / openmpi to test it.


regards

Alastair


On 10/07/2018 18:57, Anton Gladky wrote:

Source: openmpi
Version: 3.1.1.real-1
Severity: serious

Dear maintainer,

it looks like the version 3.1.1.real-1 introduces the regression in autopkgtest.

We are preparing some set of autopkgtests for the boost libraries (not in
the package yet) [1]. And the MPI-autopkgtests are failing, when they
are built against 3.1.1.real-1 [2]:

===
build: OK
[runner-103358cc-project-28133-concurrent-0:04241] PMIX-XFER-VALUE: UNSUPPORTED 
TYPE 28016
[runner-103358cc-project-28133-concurrent-0:04241] PMIX ERROR: ERROR in file 
../../../src/server/pmix_server.c at line 332
ERROR: Job failed: exit code 1
===

The compiled code is here [3] and the run-command is the following:
mpirun --oversubscribe --allow-run-as-root -np 8 ./demo1

It compiles just fine in the Debian Buster against the version 3.1.0-7 [4].

[1] https://salsa.debian.org/gladk/boost/pipelines
[2] https://salsa.debian.org/gladk/boost/-/jobs/30504
[3] 
https://salsa.debian.org/gladk/boost/blob/master/debian/tests/srcs/graph-parallel/demo1.cpp
[4] https://salsa.debian.org/gladk/boost/-/jobs/30510


Thanks,

Anton


--
Alastair McKinstry, , , 
https://diaspora.sceal.ie/u/amckinstry
Commander Vimes didn’t like the phrase “The innocent have nothing to fear,”
 believing the innocent had everything to fear, mostly from the guilty but in 
the longer term
 even more from those who say things like “The innocent have nothing to fear.”
 - T. Pratchett, Snuff



Bug#903492: Runtime error "PMIX-XFER-VALUE: UNSUPPORTED TYPE 28016"

2018-07-11 Thread Drew Parsons
On Wed, 11 Jul 2018 13:29:54 +0800 Drew Parsons 
wrote:
> 
> Not just the autopkgtest, it's fouling up petsc and dolfin too ;(

Might be worth mentioning though, openmpi 3.1.1.real-1 has been with us
over a week now, but the problem with the dolfin build (unable to build
with petsc, and getting the 28016 error) only started today.  Something
else might be involved.

Drew



Bug#903492: Runtime error "PMIX-XFER-VALUE: UNSUPPORTED TYPE 28016"

2018-07-10 Thread Drew Parsons
On Tue, 10 Jul 2018 19:57:57 +0200 Anton Gladky >  
> it looks like the version 3.1.1.real-1 introduces the regression in
autopkgtest.
> 

Not just the autopkgtest, it's fouling up petsc and dolfin too ;(



Bug#903492: Runtime error "PMIX-XFER-VALUE: UNSUPPORTED TYPE 28016"

2018-07-10 Thread Anton Gladky
Source: openmpi
Version: 3.1.1.real-1
Severity: serious

Dear maintainer,

it looks like the version 3.1.1.real-1 introduces the regression in autopkgtest.

We are preparing some set of autopkgtests for the boost libraries (not in
the package yet) [1]. And the MPI-autopkgtests are failing, when they
are built against 3.1.1.real-1 [2]:

===
build: OK
[runner-103358cc-project-28133-concurrent-0:04241] PMIX-XFER-VALUE: UNSUPPORTED 
TYPE 28016
[runner-103358cc-project-28133-concurrent-0:04241] PMIX ERROR: ERROR in file 
../../../src/server/pmix_server.c at line 332
ERROR: Job failed: exit code 1
===

The compiled code is here [3] and the run-command is the following:
mpirun --oversubscribe --allow-run-as-root -np 8 ./demo1

It compiles just fine in the Debian Buster against the version 3.1.0-7 [4].

[1] https://salsa.debian.org/gladk/boost/pipelines
[2] https://salsa.debian.org/gladk/boost/-/jobs/30504
[3] 
https://salsa.debian.org/gladk/boost/blob/master/debian/tests/srcs/graph-parallel/demo1.cpp
[4] https://salsa.debian.org/gladk/boost/-/jobs/30510


Thanks,

Anton