Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread John Hearns via users
Gabriele, please run 'ibv_devinfo'.
It looks to me like you may have the physical interface cards in these
systems, but you do not have the correct drivers or libraries loaded.

I have had similar messages when using Infiniband on x86 systems - which
did not have libibverbs installed.
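
A rough sketch of that check, in case it helps. It assumes the usual ibv_devinfo output, in which each port has a line of the form "state: PORT_ACTIVE (4)". The function names count_active_ports and check_ib_stack are made up for this example.

```shell
# Count the ports reported as PORT_ACTIVE in ibv_devinfo output read from stdin.
count_active_ports() {
    # grep -c prints 0 but exits non-zero when nothing matches, hence "|| true"
    grep -c 'state:[[:space:]]*PORT_ACTIVE' || true
}

# Report whether the verbs utilities are present and how many ports are active.
check_ib_stack() {
    if ! command -v ibv_devinfo >/dev/null 2>&1; then
        echo "ibv_devinfo not found: the libibverbs utilities are probably not installed"
        return 1
    fi
    echo "active ports: $(ibv_devinfo | count_active_ports)"
}
```

Calling check_ib_stack on a compute node either points at a missing tool or prints the number of active ports. A count of zero would be consistent with the "no active IB device ports" message quoted below.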


On 19 May 2017 at 08:41, Gabriele Fatigati  wrote:

> Hi Gilles, using your command:
>
> [openpower:88536] mca: base: components_register: registering framework
> pml components
> [openpower:88536] mca: base: components_register: found loaded component
> pami
> [openpower:88536] mca: base: components_register: component pami register
> function successful
> [openpower:88536] mca: base: components_open: opening pml components
> [openpower:88536] mca: base: components_open: found loaded component pami
> [openpower:88536] mca: base: components_open: component pami open function
> successful
> [openpower:88536] select: initializing pml component pami
> findActiveDevices Error
> We found no active IB device ports
> [openpower:88536] select: init returned failure for component pami
> [openpower:88536] PML pami cannot be selected
> --
> No components were able to be opened in the pml framework.
>
> This typically means that either no components of this type were
> installed, or none of the installed componnets can be loaded.
> Sometimes this means that shared libraries required by these
> components are unable to be found/loaded.
>
>   Host:  openpower
>   Framework: pml
> --
>
>
> 2017-05-19 7:03 GMT+02:00 Gilles Gouaillardet :
>
>> Gabriele,
>>
>>
>> pml/pami is here, at least according to ompi_info
>>
>>
>> can you update your mpirun command like this
>>
>> mpirun --mca pml_base_verbose 100 ..
>>
>>
>> and post the output ?
>>
>>
>> Cheers,
>>
>> Gilles
>>
>> On 5/18/2017 10:41 PM, Gabriele Fatigati wrote:
>>
>>> Hi Gilles, attached the requested info
>>>
>>> 2017-05-18 15:04 GMT+02:00 Gilles Gouaillardet <
>>> gilles.gouaillar...@gmail.com >:
>>>
>>> Gabriele,
>>>
>>> can you
>>> ompi_info --all | grep pml
>>>
>>> also, make sure there is nothing in your environment pointing to
>>> an other Open MPI install
>>> for example
>>> ldd a.out
>>> should only point to IBM libraries
>>>
>>> Cheers,
>>>
>>> Gilles
>>>
>>>
>>> On Thursday, May 18, 2017, Gabriele Fatigati wrote:
>>>
>>> Dear OpenMPI users and developers, I'm using IBM Spectrum MPI
>>> 10.1.0 based on OpenMPI, so I hope there are some MPI expert
>>> can help me to solve the problem.
>>>
>>> When I run a simple Hello World MPI program, I get the follow
>>> error message:
>>>
>>>
>>> A requested component was not found, or was unable to be
>>> opened.  This
>>> means that this component is either not installed or is unable
>>> to be
>>> used on your system (e.g., sometimes this means that shared
>>> libraries
>>> that the component requires are unable to be found/loaded).
>>>Note that
>>> Open MPI stopped checking at the first component that it did
>>> not find.
>>>
>>> Host:  openpower
>>> Framework: pml
>>> Component: pami
>>> 
>>> --
>>> 
>>> --
>>> It looks like MPI_INIT failed for some reason; your parallel
>>> process is
>>> likely to abort. There are many reasons that a parallel
>>> process can
>>> fail during MPI_INIT; some of which are due to configuration
>>> or environment
>>> problems.  This failure appears to be an internal failure;
>>> here's some
>>> additional information (which may only be relevant to an Open MPI
>>> developer):
>>>
>>> mca_pml_base_open() failed
>>>   --> Returned "Not found" (-13) instead of "Success" (0)
>>> 
>>> --
>>> *** An error occurred in MPI_Init
>>> *** on a NULL communicator
>>> *** MPI_ERRORS_ARE_FATAL (processes in this communicator will
>>> now abort,
>>> ***and potentially your MPI job)
>>>
>>> My sysadmin used official IBM Spectrum packages to install
>>> MPI, so It's quite strange that there are some components
>>> missing (pami). Any help? Thanks
>>>
>>>
>>> -- Ing. Gabriele Fatigati
>>>
>>> HPC specialist
>>>
>>> SuperComputing Applications and Innovation Department
>>>
>>> Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>>>
>>> www.cineca.it  

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Gabriele Fatigati
Hi Gilles, using your command:

[openpower:88536] mca: base: components_register: registering framework pml
components
[openpower:88536] mca: base: components_register: found loaded component
pami
[openpower:88536] mca: base: components_register: component pami register
function successful
[openpower:88536] mca: base: components_open: opening pml components
[openpower:88536] mca: base: components_open: found loaded component pami
[openpower:88536] mca: base: components_open: component pami open function
successful
[openpower:88536] select: initializing pml component pami
findActiveDevices Error
We found no active IB device ports
[openpower:88536] select: init returned failure for component pami
[openpower:88536] PML pami cannot be selected
--
No components were able to be opened in the pml framework.

This typically means that either no components of this type were
installed, or none of the installed componnets can be loaded.
Sometimes this means that shared libraries required by these
components are unable to be found/loaded.

  Host:  openpower
  Framework: pml
--
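
For longer verbose logs, a small filter can pull out just the components whose init step failed. This is only a sketch that keys on the "select: init returned failure for component" line shown above. The function name failed_components is invented here.

```shell
# Print the name of every component whose init failed, one per line.
# Reads a pml_base_verbose log on stdin.
failed_components() {
    sed -n 's/.*select: init returned failure for component \([^[:space:]]*\).*/\1/p'
}
```

It can be fed the combined stdout and stderr of the mpirun command (for example by piping with 2>&1). For the log above it would print pami.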


2017-05-19 7:03 GMT+02:00 Gilles Gouaillardet :

> Gabriele,
>
>
> pml/pami is here, at least according to ompi_info
>
>
> can you update your mpirun command like this
>
> mpirun --mca pml_base_verbose 100 ..
>
>
> and post the output ?
>
>
> Cheers,
>
> Gilles
>
> On 5/18/2017 10:41 PM, Gabriele Fatigati wrote:
>
>> Hi Gilles, attached the requested info
>>
>> 2017-05-18 15:04 GMT+02:00 Gilles Gouaillardet <
>> gilles.gouaillar...@gmail.com >:
>>
>> Gabriele,
>>
>> can you
>> ompi_info --all | grep pml
>>
>> also, make sure there is nothing in your environment pointing to
>> an other Open MPI install
>> for example
>> ldd a.out
>> should only point to IBM libraries
>>
>> Cheers,
>>
>> Gilles
>>
>>
>> On Thursday, May 18, 2017, Gabriele Fatigati wrote:
>>
>> Dear OpenMPI users and developers, I'm using IBM Spectrum MPI
>> 10.1.0 based on OpenMPI, so I hope there are some MPI expert
>> can help me to solve the problem.
>>
>> When I run a simple Hello World MPI program, I get the follow
>> error message:
>>
>>
>> A requested component was not found, or was unable to be
>> opened.  This
>> means that this component is either not installed or is unable
>> to be
>> used on your system (e.g., sometimes this means that shared
>> libraries
>> that the component requires are unable to be found/loaded).
>>Note that
>> Open MPI stopped checking at the first component that it did
>> not find.
>>
>> Host:  openpower
>> Framework: pml
>> Component: pami
>> 
>> --
>> 
>> --
>> It looks like MPI_INIT failed for some reason; your parallel
>> process is
>> likely to abort. There are many reasons that a parallel
>> process can
>> fail during MPI_INIT; some of which are due to configuration
>> or environment
>> problems.  This failure appears to be an internal failure;
>> here's some
>> additional information (which may only be relevant to an Open MPI
>> developer):
>>
>> mca_pml_base_open() failed
>>   --> Returned "Not found" (-13) instead of "Success" (0)
>> 
>> --
>> *** An error occurred in MPI_Init
>> *** on a NULL communicator
>> *** MPI_ERRORS_ARE_FATAL (processes in this communicator will
>> now abort,
>> ***and potentially your MPI job)
>>
>> My sysadmin used official IBM Spectrum packages to install
>> MPI, so It's quite strange that there are some components
>> missing (pami). Any help? Thanks
>>
>>
>> -- Ing. Gabriele Fatigati
>>
>> HPC specialist
>>
>> SuperComputing Applications and Innovation Department
>>
>> Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>>
>> www.cineca.it    Tel: +39
>> 051 6171722 
>>
>> g.fatigati [AT] cineca.it 
>>
>>
>> ___
>> users mailing list
>> users@lists.open-mpi.org 
>> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>> 
>>
>>
>>
>>
>> --
>> Ing. Gabriele Fatigati
>>
>> HPC specialist
>>
>> SuperComputing Applications and Innovation D

Re: [OMPI users] Many different errors with ompi version 2.1.1

2017-05-18 Thread Gilles Gouaillardet

Allan,


- on which node is mpirun invoked ?

- are you running from a batch manager ?

- is there any firewall running on your nodes ?

- how many interfaces are part of bond0 ?


the error is likely occurring when wiring up mpirun/orted

what if you

mpirun -np 2 --hostfile nodes --mca oob_tcp_if_include 192.168.1.0/24 
--mca oob_base_verbose 100 true


then (if the previous command worked)

mpirun -np 12 --hostfile nodes --mca oob_tcp_if_include 192.168.1.0/24 
--mca oob_base_verbose 100 true


and finally (if both previous commands worked)

mpirun -np 2 --hostfile nodes --mca oob_tcp_if_include 192.168.1.0/24 
--mca oob_base_verbose 100 ring
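
The three steps above can be chained so the sequence stops at the first failure. This is a sketch only. The LAUNCHER variable is an assumption added so the sequence can be dry-run with another command, and ./ring is the test binary from the original post.

```shell
# Run the three checks in order and stop at the first one that fails.
# LAUNCHER is a hypothetical override; it defaults to mpirun.
staged_check() {
    launcher="${LAUNCHER:-mpirun}"
    common="--hostfile nodes --mca oob_tcp_if_include 192.168.1.0/24 --mca oob_base_verbose 100"
    # Intentionally unquoted so the options split into separate words.
    $launcher -np 2  $common true   || { echo "step 1 failed"; return 1; }
    $launcher -np 12 $common true   || { echo "step 2 failed"; return 2; }
    $launcher -np 2  $common ./ring || { echo "step 3 failed"; return 3; }
    echo "all steps passed"
}
```

Running LAUNCHER=echo staged_check prints the three command lines without launching anything, which is a cheap way to review them first.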



Cheers,

Gilles


On 5/19/2017 3:07 PM, Allan Overstreet wrote:
I am experiencing many different errors with openmpi version 2.1.1. I 
have had a suspicion that this might be related to the way the servers 
were connected and configured. Regardless, below is a diagram of how 
the servers are configured.


                         __  _
                        [__]|=|
                        /::/|_|
                        HOST: smd
                        Dual 1Gb Ethernet Bonded
        .-------------> Bond0 IP: 192.168.1.200
        |               Infiniband Card: MHQH29B-XTR <-----------.
        |               Ib0 IP: 10.1.0.1                         |
        |               OS: Ubuntu Mate                          |
        |                __  _                                   |
        |               [__]|=|                                  |
        |               /::/|_|                                  |
        |               HOST: sm1                                |
        |               Dual 1Gb Ethernet Bonded                 |
        |-------------> Bond0 IP: 192.168.1.196                  |
        |               Infiniband Card: QLOGIC QLE7340 <--------|
        |               Ib0 IP: 10.1.0.2                         |
        |               OS: Centos 7 Minimal                     |
        |                __  _                                   |
        |               [__]|=|                                  |
        |--------.      /::/|_|                                  |
        |        |      HOST: sm2                                |
        |        |      Dual 1Gb Ethernet Bonded                 |
        |        '----> Bond0 IP: 192.168.1.199                  |
      ______            Infiniband Card: QLOGIC QLE7340        ______
     [_|||_°]           Ib0 IP: 10.1.0.3                      [_|||_°]
     [_|||_°]           OS: Centos 7 Minimal                  [_|||_°]
     [_|||_°]            __  _                                [_|||_°]
Gb Ethernet Switch      [__]|=|                      Voltaire 4036 QDR Switch
        |               /::/|_|                                  |
        |               HOST: sm3                                |
        |               Dual 1Gb Ethernet Bonded                 |
        |-------------> Bond0 IP: 192.168.1.203                  |
        |               Infiniband Card: QLOGIC QLE7340 <--------|
        |               Ib0 IP: 10.1.0.4                         |
        |               OS: Centos 7 Minimal                     |
        |                __  _                                   |
        |               [__]|=|                                  |
        |               /::/|_|                                  |
        |               HOST: sm4                                |
        |               Dual 1Gb Ethernet Bonded                 |
        |-------------> Bond0 IP: 192.168.1.204                  |
        |               Infiniband Card: QLOGIC QLE7340 <--------|
        |               Ib0 IP: 10.1.0.5                         |
        |               OS: Centos 7 Minimal                     |
        |                __  _                                   |
        |               [__]|=|                                  |
        |               /::/|_|                                  |
        |               HOST: dl580                              |
        |               Dual 1Gb Ethernet Bonded                 |
        '-------------> Bond0 IP: 192.168.1.201                  |
                        Infiniband Card: QLOGIC QLE7340 <--------'
                        Ib0 IP: 10.1.0.6
                        OS: Centos 7 Minimal

I have ensured that the Infiniband adapters can ping each other and 
every node can passwordless ssh into every other node. Every node has 
the same /etc/hosts file,


cat /etc/hosts

127.0.0.1       localhost
192.168.1.200   smd
192.168.1.196   sm1
192.168.1.199   sm2
192.168.1.203   sm3
192.168.1.204   sm4
192.168.1.201   dl580

10.1.0.1        smd-ib
10.1.0.2        sm1-ib
10.1.0.3        sm2-ib
10.1.0.4        sm3-ib
10.1.0.5        sm4-ib
10.1.0.6        dl580-ib

I have been using a simple ring test program to test openmpi. The code 
for this program is attached.


The hostfile used in all the commands is,

cat ./nodes

smd slots=2
sm1 s

[OMPI users] Many different errors with ompi version 2.1.1

2017-05-18 Thread Allan Overstreet
I am experiencing many different errors with openmpi version 2.1.1. I have 
had a suspicion that this might be related to the way the servers were 
connected and configured. Regardless, below is a diagram of how the 
servers are configured.


                         __  _
                        [__]|=|
                        /::/|_|
                        HOST: smd
                        Dual 1Gb Ethernet Bonded
        .-------------> Bond0 IP: 192.168.1.200
        |               Infiniband Card: MHQH29B-XTR <-----------.
        |               Ib0 IP: 10.1.0.1                         |
        |               OS: Ubuntu Mate                          |
        |                __  _                                   |
        |               [__]|=|                                  |
        |               /::/|_|                                  |
        |               HOST: sm1                                |
        |               Dual 1Gb Ethernet Bonded                 |
        |-------------> Bond0 IP: 192.168.1.196                  |
        |               Infiniband Card: QLOGIC QLE7340 <--------|
        |               Ib0 IP: 10.1.0.2                         |
        |               OS: Centos 7 Minimal                     |
        |                __  _                                   |
        |               [__]|=|                                  |
        |--------.      /::/|_|                                  |
        |        |      HOST: sm2                                |
        |        |      Dual 1Gb Ethernet Bonded                 |
        |        '----> Bond0 IP: 192.168.1.199                  |
      ______            Infiniband Card: QLOGIC QLE7340        ______
     [_|||_°]           Ib0 IP: 10.1.0.3                      [_|||_°]
     [_|||_°]           OS: Centos 7 Minimal                  [_|||_°]
     [_|||_°]            __  _                                [_|||_°]
Gb Ethernet Switch      [__]|=|                      Voltaire 4036 QDR Switch
        |               /::/|_|                                  |
        |               HOST: sm3                                |
        |               Dual 1Gb Ethernet Bonded                 |
        |-------------> Bond0 IP: 192.168.1.203                  |
        |               Infiniband Card: QLOGIC QLE7340 <--------|
        |               Ib0 IP: 10.1.0.4                         |
        |               OS: Centos 7 Minimal                     |
        |                __  _                                   |
        |               [__]|=|                                  |
        |               /::/|_|                                  |
        |               HOST: sm4                                |
        |               Dual 1Gb Ethernet Bonded                 |
        |-------------> Bond0 IP: 192.168.1.204                  |
        |               Infiniband Card: QLOGIC QLE7340 <--------|
        |               Ib0 IP: 10.1.0.5                         |
        |               OS: Centos 7 Minimal                     |
        |                __  _                                   |
        |               [__]|=|                                  |
        |               /::/|_|                                  |
        |               HOST: dl580                              |
        |               Dual 1Gb Ethernet Bonded                 |
        '-------------> Bond0 IP: 192.168.1.201                  |
                        Infiniband Card: QLOGIC QLE7340 <--------'
                        Ib0 IP: 10.1.0.6
                        OS: Centos 7 Minimal

I have ensured that the Infiniband adapters can ping each other and 
every node can passwordless ssh into every other node. Every node has 
the same /etc/hosts file,


cat /etc/hosts

127.0.0.1       localhost
192.168.1.200   smd
192.168.1.196   sm1
192.168.1.199   sm2
192.168.1.203   sm3
192.168.1.204   sm4
192.168.1.201   dl580

10.1.0.1        smd-ib
10.1.0.2        sm1-ib
10.1.0.3        sm2-ib
10.1.0.4        sm3-ib
10.1.0.5        sm4-ib
10.1.0.6        dl580-ib

I have been using a simple ring test program to test openmpi. The code 
for this program is attached.


The hostfile used in all the commands is,

cat ./nodes

smd slots=2
sm1 slots=2
sm2 slots=2
sm3 slots=2
sm4 slots=2
dl580 slots=2
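
As a quick sanity check on a hostfile like this one, the slots can be summed to see how large -np can be before the job oversubscribes. This is a minimal sketch. The function name total_slots is made up, and it only understands the plain "host slots=N" form used here.

```shell
# Sum the slots=N entries of a hostfile read from stdin.
total_slots() {
    awk '
        /^#/ { next }                      # skip comment lines
        {
            for (i = 2; i <= NF; i++)
                if ($i ~ /^slots=[0-9]+$/) {
                    split($i, kv, "=")
                    sum += kv[2]
                }
        }
        END { print sum + 0 }              # prints 0 for an empty file
    '
}
```

For example, total_slots < ./nodes reports the combined slot count of the six hosts listed above.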

When running the following command on smd,

mpirun -mca btl openib,self -np 2 --hostfile nodes ./ring

I obtain the following error,


A process or daemon was unable to complete a TCP connection
to another process:
  Local host:    sm1
  Remote host:   192.168.1.200
This is usually caused by a firewall on the remote host. Please
check that any firewall (e.g., iptables) has been disabled and
try again.

--
No Op

Re: [OMPI users] MPI the correct solution?

2017-05-18 Thread doark
On Tue, 9 May 2017 00:30:38 +0200
Reuti  wrote:
> Hi,
> 
> On 08.05.2017 at 23:25, David Niklas wrote:
> 
> > Hello,
> > I originally posted this question at LQ, but the answer I got back
> > shows rather poor insight on the subject of MPI, so I'm taking the
> > liberty of posting here also.
> > 
> > https://www.linuxquestions.org/questions/showthread.php?p=5707962
> > 
> > What I'm trying to do is figure out how/what to use to update an osm
> > file (open street map), in a cross system manner. I know the correct
> > program osmosis and for de/re-compression lbzip2 but how to do this
> > across computers is confusing me, even after a few hours of searching
> > online.  
> 
> lbzip2 is only thread-parallel on a single machine. It's the same with
> the pbzip2 you mention, but there is an MPI version, MPIBZIP2 -
I can't find the project, do you have a link?

> unfortunately it looks unmaintained since 2007. Maybe you can contact
> the author about its state. Without an MPI application like this, the
> MPI library is nothing on its own which would divide and distribute one
> task to several machines automatically.
Well, there might be other ways to cause a program to run on multiple
computers. Perhaps a virtual machine made of multiple physical
machines?

> osmosis itself seems to run in serial only (they don't say any word
> whether it uses any parallelism).
Yes, it does run multiple threads, you just start another task (and add a
buffer). I tested this on my machine, I think it is --read-xml
--write-xml and --read-xml-change that start new threads. The question is
whether java is natively MPI-aware, or whether the app needs special
coding?

> For the intended task the only option is to use a single machine with
> as many cores as possible AFAICS.
I thought about that, and it is doable with respect to memory and disk
constraints. The problem is that it would take a *long* time, esp. with the
amount of updates I must do, hence my inquiry.

Thanks,
David
___
users mailing list
users@lists.open-mpi.org
https://rfd.newmexicoconsortium.org/mailman/listinfo/users


Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Gabriele Fatigati
Hi Gilles, attached the requested info

2017-05-18 15:04 GMT+02:00 Gilles Gouaillardet <
gilles.gouaillar...@gmail.com>:

> Gabriele,
>
> can you
> ompi_info --all | grep pml
>
> also, make sure there is nothing in your environment pointing to an other
> Open MPI install
> for example
> ldd a.out
> should only point to IBM libraries
>
> Cheers,
>
> Gilles
>
>
> On Thursday, May 18, 2017, Gabriele Fatigati  wrote:
>
>> Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0 based
>> on OpenMPI, so I hope there are some MPI expert can help me to solve the
>> problem.
>>
>> When I run a simple Hello World MPI program, I get the follow error
>> message:
>>
>> A requested component was not found, or was unable to be opened.  This
>> means that this component is either not installed or is unable to be
>> used on your system (e.g., sometimes this means that shared libraries
>> that the component requires are unable to be found/loaded).  Note that
>> Open MPI stopped checking at the first component that it did not find.
>>
>> Host:  openpower
>> Framework: pml
>> Component: pami
>> 
>> --
>> 
>> --
>> It looks like MPI_INIT failed for some reason; your parallel process is
>> likely to abort.  There are many reasons that a parallel process can
>> fail during MPI_INIT; some of which are due to configuration or
>> environment
>> problems.  This failure appears to be an internal failure; here's some
>> additional information (which may only be relevant to an Open MPI
>> developer):
>>
>>   mca_pml_base_open() failed
>>   --> Returned "Not found" (-13) instead of "Success" (0)
>> 
>> --
>> *** An error occurred in MPI_Init
>> *** on a NULL communicator
>> *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
>> ***and potentially your MPI job)
>>
>> My sysadmin used official IBM Spectrum packages to install MPI, so It's
>> quite strange that there are some components missing (pami). Any help?
>> Thanks
>>
>> --
>> Ing. Gabriele Fatigati
>>
>> HPC specialist
>>
>> SuperComputing Applications and Innovation Department
>>
>> Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>>
>> www.cineca.it    Tel:   +39 051 6171722
>>
>> g.fatigati [AT] cineca.it
>>
>
> ___
> users mailing list
> users@lists.open-mpi.org
> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>



-- 
Ing. Gabriele Fatigati

HPC specialist

SuperComputing Applications and Innovation Department

Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy

www.cineca.it    Tel:   +39 051 6171722

g.fatigati [AT] cineca.it


info
Description: Binary data

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Chuck Schneider
Another thing Gabriele might want to check is whether the Spectrum MPI 
launcher is being used.  In Spectrum MPI, "mpirun" is a script that, among 
other things, sets up the environment for Spectrum-specific libraries to be found.


In my experience, trying to "short circuit" the use of that script - for 
example, by using the orterun binary - can lead to problems similar to what 
Gabriele is experiencing.


My suggestion would be to run file(1) on the launcher to make sure it is the 
Spectrum script.  For example, if the command used to launch is simply 
"mpirun", then a suitable check would be:


file `which mpirun`
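
If file(1) is not available on the node, a rough equivalent is to look at the first two bytes of the launcher. This is only a sketch: the function name launcher_kind is invented, and it treats anything that does not start with "#!" as a binary.

```shell
# Report whether the file at $1 looks like a script (starts with "#!") or not.
launcher_kind() {
    if [ "$(head -c 2 "$1")" = '#!' ]; then
        echo script
    else
        echo binary
    fi
}
```

Something like launcher_kind "$(command -v mpirun)" would then be expected to print "script" when the Spectrum wrapper is the launcher being picked up.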

Regards,

Chuck Schneider | Software Engineer
Rogue Wave Software, Inc.
Innovate with Confidence
P 508.652.7718 | F 508.652.7701
www.roguewave.com | chuck.schnei...@roguewave.com

On 05/18/2017 09:04 AM, Gilles Gouaillardet wrote:

Gabriele,

can you
ompi_info --all | grep pml

also, make sure there is nothing in your environment pointing to an other Open
MPI install
for example
ldd a.out
should only point to IBM libraries

Cheers,

Gilles

On Thursday, May 18, 2017, Gabriele Fatigati <g.fatig...@cineca.it> wrote:

Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0 based
on OpenMPI, so I hope there are some MPI expert can help me to solve the
problem.

When I run a simple Hello World MPI program, I get the follow error message:


A requested component was not found, or was unable to be opened.  This
means that this component is either not installed or is unable to be
used on your system (e.g., sometimes this means that shared libraries
that the component requires are unable to be found/loaded).  Note that
Open MPI stopped checking at the first component that it did not find.

Host:  openpower
Framework: pml
Component: pami
--
--
It looks like MPI_INIT failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during MPI_INIT; some of which are due to configuration or environment
problems.  This failure appears to be an internal failure; here's some
additional information (which may only be relevant to an Open MPI
developer):

   mca_pml_base_open() failed
   --> Returned "Not found" (-13) instead of "Success" (0)
--
*** An error occurred in MPI_Init
*** on a NULL communicator
*** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
***and potentially your MPI job)

My sysadmin used official IBM Spectrum packages to install MPI, so It's
quite strange that there are some components missing (pami). Any help? 
Thanks


--
Ing. Gabriele Fatigati

HPC specialist

SuperComputing Applications and Innovation Department

Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy

www.cineca.it    Tel:   +39 051 6171722

g.fatigati [AT] cineca.it 





Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Gilles Gouaillardet
Gabriele,

can you
ompi_info --all | grep pml

also, make sure there is nothing in your environment pointing to an other
Open MPI install
for example
ldd a.out
should only point to IBM libraries
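
The ldd check can be partly automated by filtering for MPI libraries that resolve outside the expected install prefix. This is a sketch under assumptions: the prefix is whatever directory the IBM packages were installed into (passed as an argument, not known from this thread), the library name patterns are a guess at what to look for, and the function name foreign_mpi_libs is made up.

```shell
# Print MPI-related libraries from ldd output (stdin) that do not live under
# the expected install prefix given as $1.
foreign_mpi_libs() {
    awk -v prefix="$1" '
        # ldd lines look like: "libfoo.so.1 => /path/libfoo.so.1 (0x...)"
        /libmpi|libopen-(rte|pal)/ && $2 == "=>" && index($3, prefix) != 1 {
            print $3
        }
    '
}
```

Usage would be along the lines of ldd a.out | foreign_mpi_libs /path/to/ibm/install, where the path is a placeholder. Any line printed is a library picked up from some other MPI installation.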

Cheers,

Gilles

On Thursday, May 18, 2017, Gabriele Fatigati  wrote:

> Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0 based
> on OpenMPI, so I hope there are some MPI expert can help me to solve the
> problem.
>
> When I run a simple Hello World MPI program, I get the follow error
> message:
>
> A requested component was not found, or was unable to be opened.  This
> means that this component is either not installed or is unable to be
> used on your system (e.g., sometimes this means that shared libraries
> that the component requires are unable to be found/loaded).  Note that
> Open MPI stopped checking at the first component that it did not find.
>
> Host:  openpower
> Framework: pml
> Component: pami
> --
> --
> It looks like MPI_INIT failed for some reason; your parallel process is
> likely to abort.  There are many reasons that a parallel process can
> fail during MPI_INIT; some of which are due to configuration or environment
> problems.  This failure appears to be an internal failure; here's some
> additional information (which may only be relevant to an Open MPI
> developer):
>
>   mca_pml_base_open() failed
>   --> Returned "Not found" (-13) instead of "Success" (0)
> --
> *** An error occurred in MPI_Init
> *** on a NULL communicator
> *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
> ***and potentially your MPI job)
>
> My sysadmin used official IBM Spectrum packages to install MPI, so It's
> quite strange that there are some components missing (pami). Any help?
> Thanks
>
> --
> Ing. Gabriele Fatigati
>
> HPC specialist
>
> SuperComputing Applications and Innovation Department
>
> Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>
> www.cineca.it    Tel:   +39 051 6171722
>
> g.fatigati [AT] cineca.it
>

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread John Hearns via users
One very stupid question...what does  'ibv_devinfo'  say when you run
it on the compute nodes?

ps. I know nothing about IBM MPI or pami but I think this is as you say
some simple library being missing etc.


On 18 May 2017 at 14:20, Gabriele Fatigati  wrote:

> Hi John, about btl this is the output of ompi_info:
>
> MCA btl: self (MCA v2.1.0, API v3.0.0, Component v10.1.0)
> MCA btl: sm (MCA v2.1.0, API v3.0.0, Component v10.1.0)
> MCA btl: tcp (MCA v2.1.0, API v3.0.0, Component v10.1.0)
> MCA btl: vader (MCA v2.1.0, API v3.0.0, Component v10.1.0)
> MCA fbtl: posix (MCA v2.1.0, API v2.0.0, Component v10.1.0)
>
>
> about mtl, ompi_info returns no information
>
>
> 2017-05-18 14:13 GMT+02:00 John Hearns via users:
>
>> Gabriele,  as this is based on OpenMPI can you run ompi_info
>> then look for the btl which are available and the mtl which are available?
>>
>>
>>
>> On 18 May 2017 at 14:10, Reuti  wrote:
>>
>>> Hi,
>>>
>>> > On 18.05.2017 at 14:02, Gabriele Fatigati wrote:
>>> >
>>> > Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0
>>>
>>> I noticed this on IBM's website too. Is this freely available? Up to now
>>> I was always bounced back to their former Platform MPI when trying to
>>> download the community edition (even the evaluation link on the Spectrum
>>> MPI page does the same).
>>>
>>> -- Reuti
>>>
>>>
>>> >  based on OpenMPI, so I hope there are some MPI expert can help me to
>>> solve the problem.
>>> >
>>> > When I run a simple Hello World MPI program, I get the follow error
>>> message:
>>> >
>>> > A requested component was not found, or was unable to be opened.  This
>>> > means that this component is either not installed or is unable to be
>>> > used on your system (e.g., sometimes this means that shared libraries
>>> > that the component requires are unable to be found/loaded).  Note that
>>> > Open MPI stopped checking at the first component that it did not find.
>>> >
>>> > Host:  openpower
>>> > Framework: pml
>>> > Component: pami
>>> > 
>>> --
>>> > 
>>> --
>>> > It looks like MPI_INIT failed for some reason; your parallel process is
>>> > likely to abort.  There are many reasons that a parallel process can
>>> > fail during MPI_INIT; some of which are due to configuration or
>>> environment
>>> > problems.  This failure appears to be an internal failure; here's some
>>> > additional information (which may only be relevant to an Open MPI
>>> > developer):
>>> >
>>> >   mca_pml_base_open() failed
>>> >   --> Returned "Not found" (-13) instead of "Success" (0)
>>> > 
>>> --
>>> > *** An error occurred in MPI_Init
>>> > *** on a NULL communicator
>>> > *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now
>>> abort,
>>> > ***and potentially your MPI job)
>>> >
>>> > My sysadmin used official IBM Spectrum packages to install MPI, so
>>> It's quite strange that there are some components missing (pami). Any help?
>>> Thanks
>>> >
>>> > --
>>> > Ing. Gabriele Fatigati
>>> >
>>> > HPC specialist
>>> >
>>> > SuperComputing Applications and Innovation Department
>>> >
>>> > Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>>> >
>>> > www.cineca.it    Tel:   +39 051 6171722
>>> >
>>> > g.fatigati [AT] cineca.it
>>> > ___
>>> > users mailing list
>>> > users@lists.open-mpi.org
>>> > https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>>>
>>> ___
>>> users mailing list
>>> users@lists.open-mpi.org
>>> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>>>
>>
>>
>> ___
>> users mailing list
>> users@lists.open-mpi.org
>> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>>
>
>
>
> --
> Ing. Gabriele Fatigati
>
> HPC specialist
>
> SuperComputing Applications and Innovation Department
>
> Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>
> www.cineca.it    Tel:   +39 051 6171722
>
> g.fatigati [AT] cineca.it
>
> ___
> users mailing list
> users@lists.open-mpi.org
> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Gabriele Fatigati
Hi John, about btl this is the output of ompi_info:

MCA btl: self (MCA v2.1.0, API v3.0.0, Component v10.1.0)
MCA btl: sm (MCA v2.1.0, API v3.0.0, Component v10.1.0)
MCA btl: tcp (MCA v2.1.0, API v3.0.0, Component v10.1.0)
MCA btl: vader (MCA v2.1.0, API v3.0.0, Component v10.1.0)
MCA fbtl: posix (MCA v2.1.0, API v2.0.0, Component v10.1.0)


about mtl, ompi_info returns no information


2017-05-18 14:13 GMT+02:00 John Hearns via users :

> Gabriele,  as this is based on OpenMPI can you run ompi_info
> then look for the btl which are available and the mtl which are available?
>
>
>
> On 18 May 2017 at 14:10, Reuti  wrote:
>
>> Hi,
>>
>> > On 18.05.2017 at 14:02, Gabriele Fatigati wrote:
>> >
>> > Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0
>>
>> I noticed this on IBM's website too. Is this freely available? Up to now
>> I was always bounced back to their former Platform MPI when trying to
>> download the community edition (even the evaluation link on the Spectrum
>> MPI page does the same).
>>
>> -- Reuti
>>
>>
>> >  based on OpenMPI, so I hope there are some MPI expert can help me to
>> solve the problem.
>> >
>> > When I run a simple Hello World MPI program, I get the follow error
>> message:
>> >
>> > A requested component was not found, or was unable to be opened.  This
>> > means that this component is either not installed or is unable to be
>> > used on your system (e.g., sometimes this means that shared libraries
>> > that the component requires are unable to be found/loaded).  Note that
>> > Open MPI stopped checking at the first component that it did not find.
>> >
>> > Host:  openpower
>> > Framework: pml
>> > Component: pami
>> > 
>> --
>> > 
>> --
>> > It looks like MPI_INIT failed for some reason; your parallel process is
>> > likely to abort.  There are many reasons that a parallel process can
>> > fail during MPI_INIT; some of which are due to configuration or
>> environment
>> > problems.  This failure appears to be an internal failure; here's some
>> > additional information (which may only be relevant to an Open MPI
>> > developer):
>> >
>> >   mca_pml_base_open() failed
>> >   --> Returned "Not found" (-13) instead of "Success" (0)
>> > 
>> --
>> > *** An error occurred in MPI_Init
>> > *** on a NULL communicator
>> > *** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
>> > ***and potentially your MPI job)
>> >
>> > My sysadmin used official IBM Spectrum packages to install MPI, so It's
>> quite strange that there are some components missing (pami). Any help?
>> Thanks
>> >
>> > --
>> > Ing. Gabriele Fatigati
>> >
>> > HPC specialist
>> >
>> > SuperComputing Applications and Innovation Department
>> >
>> > Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy
>> >
>> > www.cineca.it    Tel:   +39 051 6171722
>> >
>> > g.fatigati [AT] cineca.it
>> > ___
>> > users mailing list
>> > users@lists.open-mpi.org
>> > https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>>
>> ___
>> users mailing list
>> users@lists.open-mpi.org
>> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>>
>
>
> ___
> users mailing list
> users@lists.open-mpi.org
> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
>



-- 
Ing. Gabriele Fatigati

HPC specialist

SuperComputing Applications and Innovation Department

Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy

www.cineca.it    Tel:   +39 051 6171722

g.fatigati [AT] cineca.it

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Gabriele Fatigati
Hi Reuti, I think it is freely available. I also posted on the IBM Spectrum
forum and am waiting for a reply.

2017-05-18 14:10 GMT+02:00 Reuti:

> Hi,
>
> > On 18.05.2017 at 14:02, Gabriele Fatigati wrote:
> >
> > Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0
>
> I noticed this on IBM's website too. Is this freely available? Up to now I
> was always bounced back to their former Platform MPI when trying to
> download the community edition (even the evaluation link on the Spectrum
> MPI page does the same).
>
> -- Reuti
>



-- 
Ing. Gabriele Fatigati

HPC specialist

SuperComputing Applications and Innovation Department

Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy

www.cineca.it    Tel: +39 051 6171722

g.fatigati [AT] cineca.it

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread John Hearns via users
Gabriele, as this is based on OpenMPI, can you run ompi_info
and then check which btl and mtl components are available?
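
For anyone following along, here is a minimal sketch of that check in Python. It groups the components that ompi_info reports by framework. It assumes the usual "MCA <framework>: <component> (...)" line layout. The names list_components and SAMPLE are made up for illustration, and the sample text is not output from the system discussed in this thread.

```python
import re
from collections import defaultdict

# Matches ompi_info component lines such as:
#   "                 MCA btl: tcp (MCA v2.1.0, API v3.0.0, Component v2.1.0)"
_MCA_LINE = re.compile(r"^\s*MCA\s+(?P<framework>\w+):\s+(?P<component>\w+)\b")


def list_components(ompi_info_text, frameworks=("pml", "btl", "mtl")):
    """Group component names by framework from ompi_info-style text.

    Hypothetical helper: only lines for the requested frameworks are kept.
    """
    found = defaultdict(list)
    for line in ompi_info_text.splitlines():
        match = _MCA_LINE.match(line)
        if match and match.group("framework") in frameworks:
            found[match.group("framework")].append(match.group("component"))
    return dict(found)


# Illustrative sample only; NOT taken from the machine in this thread.
SAMPLE = """\
                 MCA btl: self (MCA v2.1.0, API v3.0.0, Component v2.1.0)
                 MCA btl: tcp (MCA v2.1.0, API v3.0.0, Component v2.1.0)
                 MCA pml: ob1 (MCA v2.1.0, API v2.0.0, Component v2.1.0)
                 MCA mtl: psm (MCA v2.1.0, API v2.0.0, Component v2.1.0)
"""

if __name__ == "__main__":
    for framework, names in sorted(list_components(SAMPLE).items()):
        print(f"{framework}: {', '.join(names)}")
```

On a real installation the text could come from subprocess.run(["ompi_info"], capture_output=True, text=True).stdout instead of SAMPLE. If a component is listed there but still cannot be opened at run time, that would point away from a missing package and towards the runtime environment, though this sketch cannot tell those cases apart.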



On 18 May 2017 at 14:10, Reuti wrote:

> Hi,
>
> > On 18.05.2017 at 14:02, Gabriele Fatigati wrote:
> >
> > Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0
>
> I noticed this on IBM's website too. Is this freely available? Up to now I
> was always bounced back to their former Platform MPI when trying to
> download the community edition (even the evaluation link on the Spectrum
> MPI page does the same).
>
> -- Reuti
>

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Reuti
Hi,

> On 18.05.2017 at 14:02, Gabriele Fatigati wrote:
> 
> Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0

I noticed this on IBM's website too. Is this freely available? Up to now I was 
always bounced back to their former Platform MPI when trying to download the 
community edition (even the evaluation link on the Spectrum MPI page does the 
same).

-- Reuti




[OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread Gabriele Fatigati
Dear OpenMPI users and developers, I'm using IBM Spectrum MPI 10.1.0, which is
based on OpenMPI, so I hope some MPI experts can help me solve the
problem.

When I run a simple Hello World MPI program, I get the following error message:

A requested component was not found, or was unable to be opened.  This
means that this component is either not installed or is unable to be
used on your system (e.g., sometimes this means that shared libraries
that the component requires are unable to be found/loaded).  Note that
Open MPI stopped checking at the first component that it did not find.

Host:  openpower
Framework: pml
Component: pami
--
--
It looks like MPI_INIT failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during MPI_INIT; some of which are due to configuration or environment
problems.  This failure appears to be an internal failure; here's some
additional information (which may only be relevant to an Open MPI
developer):

  mca_pml_base_open() failed
  --> Returned "Not found" (-13) instead of "Success" (0)
--
*** An error occurred in MPI_Init
*** on a NULL communicator
*** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
***    and potentially your MPI job)

My sysadmin used the official IBM Spectrum packages to install MPI, so it's
quite strange that some components (pami) are missing. Any help?
Thanks

-- 
Ing. Gabriele Fatigati

HPC specialist

SuperComputing Applications and Innovation Department

Via Magnanelli 6/3, Casalecchio di Reno (BO) Italy

www.cineca.it    Tel: +39 051 6171722

g.fatigati [AT] cineca.it