Re: [OMPI users] Fwd: Unable to run basic mpirun command (OpenMPI v5.0.3)

2024-05-05 Thread John Hearns via users
Terence, if you are using or compiling multiple MPI versions on your system are you setting up a Modules environment? Modules is heavily used in HPC systems - it will configure your environment, for example PATH and LD_LIBRARY_PATH in a fashion which is correct for the MPI version you want to use
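For anyone without Modules to hand, here is a rough sketch of what a module load ends up doing to the environment for one MPI install. The function name use_mpi and the install prefix are made up for illustration, and a real modulefile also handles unloading and conflicts, which this does not.

```shell
# Roughly what "module load <some-mpi>" does for an install under a prefix.
# use_mpi and the prefix are placeholders, not part of Modules or Open MPI.
use_mpi() {
    prefix=$1
    PATH="$prefix/bin:$PATH"
    # Avoid a leading or trailing ':' when LD_LIBRARY_PATH was empty or unset.
    LD_LIBRARY_PATH="$prefix/lib${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
    export PATH LD_LIBRARY_PATH
}

# Example (hypothetical install location):
#   use_mpi "$HOME/openmpi"
#   which mpirun
```

Prepending rather than appending means the chosen MPI wins over any distribution-packaged MPI already on the path.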

Re: [OMPI users] [EXTERNAL] Help deciphering error message

2024-03-08 Thread John Hearns via users
We always advise against --allow-run-as-root Just saying that in my experience it is common to run IO tests as root. I agree - run the thing as a normal user and if that means a few seconds extra work to set up permissions on the target filesystem it is time well spent. On Fri, 8 Mar 2024 at

Re: [OMPI users] Homebrew-installed OpenMPI 5.0.1 can't run a simple test program

2024-02-05 Thread John Hearns via users
Stupid question... Why is it going 'out' to the loopback address? Is shared memory not being used these days? On Mon, Feb 5, 2024, 8:31 PM John Haiducek via users < users@lists.open-mpi.org> wrote: > Adding '--pmixmca ptl_tcp_if_include lo0' to the mpirun argument list > seems to fix (or at

Re: [hwloc-users] Support for Intel's hybrid architecture - can I restrict hwloc-distrib to P cores only?

2023-11-23 Thread John Hearns
Good question. Maybe not an answer referring to hwloc. When managing a large NUMA machine, SGI UV, I ran the OS processes in a boot cpuset which was restricted to (AFAIR) the first 8 CPUs. On Intel architectures with E and P cores could we think of running OS on E cores only and having the batch

Re: [OMPI users] [EXTERNAL] Re: MPI_Init_thread error

2023-07-26 Thread John Hearns via users
Another idiot question... Is there a Spack or EasyBuild recipe for this software? Should help you get it built. On Wed, 26 Jul 2023, 10:27 Aziz Ogutlu via users, wrote: > Hi Howard, > > I'm using with salloc+mpirun command that explained on faq page that you > send, this time I'm getting error

Re: [OMPI users] [EXTERNAL] Re: MPI_Init_thread error

2023-07-26 Thread John Hearns via users
Very stupid question from me... I see you do a module load su2. Is it necessary to load the module for openmpi? Run 'ldd SU2_CFD' and look to see if there are missing libraries. Apologies if this is a nonsense question. On Tue, 25 Jul 2023 at 18:00, Pritchard Jr., Howard via users <
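The ldd check above is easy to script. This is a minimal sketch, assuming ldd's usual "libname => not found" line for an unresolved library. The function name missing_libs is invented for illustration.

```shell
# Print the names of shared libraries that ldd could not resolve.
# Reads ldd output on stdin, so it works for a local or a remote (ssh) run.
missing_libs() {
    awk '/=> not found/ { print $1 }'
}

# Typical use, after loading the same modules as the batch job:
#   ldd ./SU2_CFD | missing_libs
```

No output means every library resolved in that environment. It is worth repeating the check on a compute node, since the login node may have different libraries installed.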

Re: [OMPI users] Set maximum number of CPU (or threads) for a user

2023-06-26 Thread John Hearns via users
somehow to the cgroup of a job (assuming cgroups are in use)? On Tue, 27 Jun 2023 at 03:27, John Hearns wrote: > There is a way to do this. > The best way would be a resource management system which allocates cgroups. > Without the RMS it is possible to define a cgroup for a given us

Re: [OMPI users] Set maximum number of CPU (or threads) for a user

2023-06-26 Thread John Hearns via users
The other method is the use of the clue stick. https://www.urbandictionary.com/define.php?term=LART On Tue, 27 Jun 2023 at 03:31, John Hearns wrote: > There are also PAM modules available which forbid user logins to compute > nodes unless the user has a job running on that node. >

Re: [OMPI users] Set maximum number of CPU (or threads) for a user

2023-06-26 Thread John Hearns via users
There is a way to do this. The best way would be a resource management system which allocates cgroups. Without the RMS it is possible to define a cgroup for a given user (group ?) such that they are within the cgroup when logging in. https://github.com/plaguedbypenguins/splosh It would be

Re: [OMPI users] Issue with Running MPI Job on CentOS 7

2023-06-14 Thread John Hearns via users
ifferent versions. > > I would like to express my sincere appreciation for your patient > assistance throughout this troubleshooting process. Your guidance has been > invaluable in helping me understand and resolve the challenges I > encountered. > > Thank you once again fo

Re: [OMPI users] Issue with Running MPI Job on CentOS 7

2023-06-13 Thread John Hearns via users
You talk about adjusting your PATH and LD_LIBRARY_PATH in your .bashrc Jeff Squyres has given you some guidance on this. Please investigate the following. It is common to use Modules in an HPC environment https://www.admin-magazine.com/HPC/Articles/Lmod-Alternative-Environment-Modules For

Re: [OMPI users] MPI with RoCE

2022-09-05 Thread John Hearns via users
Stupid reply from me. You do know that Infiniband adapters operate without an IP address? Yes, configuring IPOIB is a good idea - however Infiniband adapters are more than 'super ethernet adapters' I would run the following utilities to investigate your Infiniband fabric sminfo ibhosts ibdiagnet

Re: [OMPI users] Multiple IPs on network interface

2022-07-08 Thread John Hearns via users
As Jeff says the output of ip addr is critical. Stupid question from me - what is the network topology and type here? Do you have two physical networks? On Thu, 7 Jul 2022 at 16:11, Jeff Squyres (jsquyres) via users < users@lists.open-mpi.org> wrote: > Can you send the full output of "ifconfig"

Re: [OMPI users] Trouble compiling OpenMPI with Infiniband support

2022-03-01 Thread John Hearns via users
Stupid answer from me. If latency/bandwidth numbers are bad then check that you are really running over the interface that you think you should be. You could be falling back to running over Ethernet. On Mon, 28 Feb 2022 at 20:10, Angel de Vicente via users < users@lists.open-mpi.org> wrote: >

Re: [OMPI users] Verbose logging options to track IB communication issues

2022-02-17 Thread John Hearns via users
I would start at a lower level. Clear your error counters then run some traffic over the network, maybe using an IMB or OSU benchmark. Then look to see if any ports are very noisy - that usually indicates a cable needing a reseat or replacement. Now start at a low level. Run IMB or OSU bandwidth

Re: [OMPI users] OpenMPI - Intel MPI

2022-01-26 Thread John Hearns via users
Luis, it is perfectly possible to use different MPI on the same cluster. May we ask what your OS and cluster management stack is? Normally you use the Modules system to configure your job to use a chosen MPI On Wed, 26 Jan 2022 at 15:01, Ralph Castain via users < users@lists.open-mpi.org>

Re: [OMPI users] Configure failure for thread support in libevent

2021-04-12 Thread John Hearns via users
he lib event that configure found / tried to > use did not have thread support enabled. > > What was the command line you used to invoke configure? > > > > On Apr 11, 2021, at 1:52 PM, John Hearns via users < > users@lists.open-mpi.org> wrote: > > > > OS - Ce

[OMPI users] Configure failure for thread support in libevent

2021-04-11 Thread John Hearns via users
OS - CentOS 7.9 OpenMPI 3.1.6 libevent libevent-2.0.21-4.el7.x86_6 OFED MLNX_OFED_LINUX-5.2-2.2.0.0 When trying to configure openmpi-3.1.6 I get this error checking for evthread_set_lock_callbacks in -levent... no configure: WARNING: External libevent does not have thread support configure:

Re: [OMPI users] Building Open-MPI with Intel C

2021-04-07 Thread John Hearns via users
gt; > E.g. > > ssh othernode ldd /path/to/orted > > Your shell startup files may well differentiate between interactive and > non-interactive logins (i.e., it may set PATH / LD_LIBRARY_PATH / etc. > differently). > > > On Apr 7, 2021, at 7:21 AM, John Hearns via us

Re: [OMPI users] Building Open-MPI with Intel C

2021-04-07 Thread John Hearns via users
Manually log into one of your nodes. Load the modules you use in a batch job. Run 'ldd' on your executable. Start at the bottom and work upwards... By the way, have you looked at using Easybuild? Would be good to have your input there maybe. On Wed, 7 Apr 2021 at 01:01, Heinz, Michael William

Re: [OMPI users] Newbie With Issues

2021-04-01 Thread John Hearns via users
Hello Ben, nice to meet you! You refer to Liqid, which is interesting. I would say you should take a step back and consider using Easybuild or Spack https://easybuild.io/ https://spack.readthedocs.io/en/latest/ The bit of time spent getting to know these tools will be repaid many times over.

Re: [OMPI users] Stable and performant openMPI version for Ubuntu20.04 ?

2021-03-04 Thread John Hearns via users
How are you installing the OpenMPI versions? Are you using packages which are distributed by the OS? It might be worth looking at using Easybuild or Spack https://docs.easybuild.io/en/latest/Introduction.html https://spack.readthedocs.io/en/latest/ On Thu, 4 Mar 2021 at 07:35, Raut, S Biplab via

Re: [OMPI users] MPI executable fails on ArchLinux on Termux

2021-02-26 Thread John Hearns via users
Recommend then using 'less' or 'grep' to parse out the lines which are of interest On Fri, 26 Feb 2021 at 23:03, LINUS FERNANDES via users < users@lists.open-mpi.org> wrote: > The following command lists all the MCA parameters used by OpenMPI. > > ~ $ ompi_info --all > > The output is over 2000

Re: [OMPI users] MPI executable fails on ArchLinux on Termux

2021-02-25 Thread John Hearns via users
Let's back up a step. What is the hardware involved here? Then go forward a step - what application do you want to run? Or is this perhaps self learning for MPI? On Thu, 25 Feb 2021 at 15:57, LINUS FERNANDES via users < users@lists.open-mpi.org> wrote: > Do you think executing mpirun as a

Re: [OMPI users] MPI executable fails on ArchLinux on Termux

2021-02-25 Thread John Hearns via users
ib/libpthread.so.0 (0x007faaf62000) > libudev.so.1 => /usr/lib/libudev.so.1 (0x007faaf16000) > > > ldd output on Termux: > > libandroid-shmem.so > libopen-rte.so > libopen-pal.so > libm.so > libdl.so > libc.so > > On Thu, 25 Feb 2021, 13:46 John

Re: [OMPI users] MPI executable fails on ArchLinux on Termux

2021-02-25 Thread John Hearns via users
Perhaps an idiot question from me.. In both environments run ldd Are there any libraries missing? On Thu, 25 Feb 2021 at 05:36, Gilles Gouaillardet via users < users@lists.open-mpi.org> wrote: > Can you run > > > ifconfig > > or > > ip addr > > > in both Termux and ArchLinux for Termux? > >

Re: [OMPI users] OpenMPI and maker - Multiple messages

2021-02-18 Thread John Hearns via users
From a quick look at the documentation, the Maker tool uses mpich2 http://gmod.org/wiki/MAKER_Tutorial#Prerequisites Should you not install mpich on your cluster? Are you trying to use MPI to run multiple copies of the Maker tool? You would probably be better using Slurm srun or an array job

Re: [OMPI users] Limiting IP addresses used by OpenMPI

2020-09-01 Thread John Hearns via users
Charles, I recall using the I_MPI_NETMASK to choose which interface for MPI to use. I guess you are asking the same question for OpenMPI? On Tue, 1 Sep 2020 at 17:03, Charles Doland via users < users@lists.open-mpi.org> wrote: > Is there a way to limit the IP addresses or network interfaces used

Re: [OMPI users] Problem in starting openmpi job - no output just hangs

2020-08-25 Thread John Hearns via users
I apologise. That was an Omnipath issue https://www.beowulf.org/pipermail/beowulf/2017-March/034214.html On Tue, 25 Aug 2020 at 08:17, John Hearns wrote: > Aha. I dimly remember a problem with the ibverbs /dev device - maybe the > permissions, > or more likely the owner account for th

Re: [OMPI users] Problem in starting openmpi job - no output just hangs

2020-08-25 Thread John Hearns via users
s not encouraging when you a floundering. I know its a > newish project but I have used openib for 10+ years and its never had a > problem until now. I think this is not so much openib as the software > below. One other thing I should say is that if I run any recent version of > mstflin

Re: [OMPI users] Problem in starting openmpi job - no output just hangs

2020-08-23 Thread John Hearns via users
Tony, start at a low level. Is the Infiniband fabric healthy? Run ibstatus on every node sminfo on one node ibdiagnet on one node On Sun, 23 Aug 2020 at 05:02, Tony Ladd via users wrote: > Hi Jeff > > I installed ucx as you suggested. But I can't get even the simplest code >

Re: [OMPI users] Books/resources to learn (open)MPI from

2020-08-21 Thread John Hearns via users
Thank you for the reference to the Divakar Viswanath book. It is very generous that it is available online - I just wish it was in ePub format. I guess that is a decision of MIT Press. I would happily pay for an ePub edition. I just cannot justify more shelf space for physical books. On

Re: [OMPI users] Correct mpirun Options for Hybrid OpenMPI/OpenMP

2020-08-04 Thread John Hearns via users
ps -eaf --forest or indeed pstree is a good way to see what is going on. Also 'htop' is a very useful utility. Also well worth running 'lstopo' to look at the layout of cores and caches on your machines. On Mon, 3 Aug 2020 at 09:40, John Duffy via users wrote: > Hi > > I’m experimenting with

Re: [OMPI users] Vader - Where to Look for Shared Memory Use

2020-07-22 Thread John Hearns via users
John, as an aside it is always worth running 'lstopo' from the hwloc package to look at the layout of your cpus cores and caches. Getting a bit late now so I apologise for being too lazy to boot up my Pi to capture the output. On Wed, 22 Jul 2020 at 19:55, George Bosilca via users <

Re: [OMPI users] Question about virtual interface

2020-06-25 Thread John Hearns via users
Hello Vipul. Could you tell us more about the grid you will be using? Is this the Gridengine scheduler on a local HPC cluster? Or is it running on Kubernetes maybe? You say that it is difficult to specify the IP address for btl_tcp_if_include I agree with you! But it is quite common to have to

Re: [OMPI users] Running mpirun with grid

2020-06-02 Thread John Hearns via users
As a suggestion can we see the configuration of your Parallel Environment? qconf -spl qconf -sp orte2 On Mon, 1 Jun 2020 at 22:20, Ralph Castain via users < users@lists.open-mpi.org> wrote: > Afraid I have no real ideas here. Best I can suggest is taking the qrsh > cmd line from the prior

Re: [OMPI users] Running mpirun with grid

2020-05-30 Thread John Hearns via users
Good morning Vipul. I would like to ask some higher level questions regarding your HPC cluster. Who are the manufacturers of the cluster nodes? How many compute nodes? What network interconnect do you have - gigabit ethernet, 10gig ethernet, Infiniband, Omnipath? Which cluster middleware -

Re: [OMPI users] Can't start jobs with srun.

2020-05-07 Thread John Hearns via users
Patrick, I am sure that you have asked Dell for support on this issue? On Sun, 26 Apr 2020 at 18:09, Patrick Bégou via users < users@lists.open-mpi.org> wrote: > I have also this problem on servers I'm benching at DELL's lab with > OpenMPI-4.0.3. I've tried a new build of OpenMPI with

Re: [OMPI users] mpirun error only with one node

2020-04-03 Thread John Hearns via users
Are you SURE node125 is identical to the others? Systems can boot up and disable DIMMs for instance. I would log on there and run free lscpu lspci dmidecode. Take those outputs and run a diff against outputs from a known good node. Also hwloc/lstopo might show some difference? On Thu, 2
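The compare-against-a-known-good-node idea above can be sketched as two small functions. The names hw_report and hw_diff and the report file names are invented for illustration. dmidecode is left out because it normally needs root, and only total memory is taken from free so that changing usage figures do not pollute the diff.

```shell
# Collect a small hardware summary on the local node.
# lscpu may report a current clock speed, so expect a little noise there.
hw_report() {
    lscpu
    lspci
    free -m | awk '/^Mem:/ { print "MemTotalMB", $2 }'
}

# Show only the lines that differ between two saved reports.
hw_diff() {
    diff "$1" "$2" | grep '^[<>]'
}

# On each node:  hw_report > /tmp/hw-$(hostname).txt
# Then:          hw_diff hw-goodnode.txt hw-node125.txt
```

A missing DIMM would show up as a different MemTotalMB line, and a disabled or failed card as a missing lspci entry.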

Re: [OMPI users] How to use OPENMPI with different Service Level in Infiniband Virtual Lane?

2020-02-26 Thread John Hearns via users
Hi Kihang. I did work on this topic a couple of years ago. Give me time and I might find any notes I made. May I ask which kernel version you are using? On Tue, 25 Feb 2020, 05:28 Kihang Youn via users, wrote: > > > Hello, > > I am searching the option to apply different service level(SL) in >

Re: [OMPI users] can't run MPI job under SGE

2019-07-25 Thread John Hearns via users
Have you checked your ssh between nodes? Also how is your PATH set up? There is a difference between interactive and non-interactive login sessions. I advise A. Construct a hosts file and mpirun by hand B. Use modules rather than .bashrc files C. Slurm On Thu, 25 Jul 2019, 18:00 David Laidlaw

Re: [OMPI users] How it the rank determined (Open MPI and Podman)

2019-07-11 Thread John Hearns via users
Not really a relevant reply, however Nomad has task drivers for Docker and Singularity https://www.hashicorp.com/blog/singularity-and-hashicorp-nomad-a-perfect-fit I'm not sure if it would be easier to set up an MPI environment with Nomad though On Thu, 11 Jul 2019 at 11:08, Adrian Reber via

Re: [OMPI users] growing memory use from MPI application

2019-06-20 Thread John Hearns via users
Errr.. you have dropped caches? echo 3 > /proc/sys/vm/drop_caches On Thu, 20 Jun 2019 at 15:59, Yann Jobic via users wrote: > Hi, > > Le 6/20/2019 à 3:31 PM, Noam Bernstein via users a écrit : > > > > > >> On Jun 20, 2019, at 4:44 AM, Charles A Taylor >> > wrote:

Re: [OMPI users] growing memory use from MPI application

2019-06-20 Thread John Hearns via users
The kernel using memory is why I suggested running slabtop, to see the kernel slab allocations. Clearly I was barking up a wrong tree there... On Thu, 20 Jun 2019 at 14:41, Jeff Squyres (jsquyres) via users < users@lists.open-mpi.org> wrote: > On Jun 20, 2019, at 9:31 AM, Noam Bernstein via

Re: [OMPI users] growing memory use from MPI application

2019-06-19 Thread John Hearns via users
Noam, it may be a stupid question. Could you try running slabtop as the program executes? Also 'watch cat /proc/meminfo' is a good diagnostic. On Wed, 19 Jun 2019 at 18:32, Noam Bernstein via users < users@lists.open-mpi.org> wrote: > Hi - we’re having a weird problem with OpenMPI on

Re: [OMPI users] job termination

2019-04-17 Thread John Hearns via users
I would do the normal things. Log into those nodes. Run dmesg and look at /var/log/messages Look at the Slurm log on the node and look for the job ending. Also look at the sysstat files and see if there was a lot of memory being used http://sebastien.godard.pagesperso-orange.fr/ On Wed, 17 Apr

Re: [OMPI users] Open MPI installation problem

2019-01-23 Thread John Hearns via users
Sorry if I am being stupid, Serdar might also have to set the location for the includes by setting MPI_INC On Wed, 23 Jan 2019 at 14:47, Ralph H Castain wrote: > Your PATH and LD_LIBRARY_PATH setting is incorrect. You installed OMPI > into $HOME/openmpi, so you should have done: > >

Re: [OMPI users] no openmpi over IB on new CentOS 7 system

2018-10-10 Thread John Hearns via users
On that system please tell us what these return: ibstat ibstatus sminfo ibdiagnet On Wed, 10 Oct 2018 at 12:49, John Hearns wrote: > > Noam, what does ompi_info say - specifically which BTLs are available? > Stupid question though - this is a single system with no connection to a

Re: [OMPI users] no openmpi over IB on new CentOS 7 system

2018-10-10 Thread John Hearns via users
Noam, what does ompi_info say - specifically which BTLs are available? Stupid question though - this is a single system with no connection to a switch? You probably don't have an OpenSM subnet manager running then - could that be the root cause? On Wed, 10 Oct 2018 at 09:53, Dave Love wrote: > >

Re: [OMPI users] Cannot run MPI code on multiple cores with PBS

2018-10-09 Thread John Hearns via users
libssl.so.0.9.8, I > still have one error message left from MPI: > > mca_base_component_repository_open: unable to open mca_btl_openib: > libibverbs.so.1: cannot open shared object file: No such file or directory > (ignored) > > Please let me know if you have any suggestions. >

Re: [OMPI users] Cannot run MPI code on multiple cores with PBS

2018-10-04 Thread John Hearns via users
PORT_URL="https://bugs.centos.org/; > > CENTOS_MANTISBT_PROJECT="CentOS-7" > CENTOS_MANTISBT_PROJECT_VERSION="7" > REDHAT_SUPPORT_PRODUCT="centos" > REDHAT_SUPPORT_PRODUCT_VERSION=“7" > > May you please tell me how to check whether the batch system

Re: [OMPI users] Cannot run MPI code on multiple cores with PBS

2018-10-04 Thread John Hearns via users
Michele, one tip: log into a compute node using ssh and as your own username. If you use the Modules environment then load the modules you use in the job script, then use the ldd utility to check if you can load all the libraries in the code.io executable. Actually you are better to submit a

Re: [OMPI users] OpenMPI building fails on Windows Linux Subsystem(WLS).

2018-09-19 Thread John Hearns via users
2018 at 17:36, John Hearns wrote: > > Oleg, I have a Windows 10 system and could help by testing this also. > But I have to say - it will be quicker just to install VirtualBox and > a CentOS VM. Or an Ubuntu VM. > You can then set up a small test network of VMs using the Virtua

Re: [OMPI users] OpenMPI building fails on Windows Linux Subsystem(WLS).

2018-09-19 Thread John Hearns via users
Oleg, I have a Windows 10 system and could help by testing this also. But I have to say - it will be quicker just to install VirtualBox and a CentOS VM. Or an Ubuntu VM. You can then set up a small test network of VMs using the VirtualBox HostOnly network for tests of your MPI code. On Wed, 19

Re: [OMPI users] RDMA over Ethernet in Open MPI - RoCE on AWS?

2018-09-07 Thread John Hearns via users
Ben, ping me off list. I know the guy who heads the HPC Solutions Architect team for AWS and an AWS Solutions Architect here in the UK. On Fri, 7 Sep 2018 at 03:11, Benjamin Brock wrote: > > I'm setting up a cluster on AWS, which will have a 10Gb/s or 25Gb/s Ethernet > network. Should I expect

Re: [OMPI users] MPI advantages over PBS

2018-08-25 Thread John Hearns via users
Diego, I am sorry but you have different things here. PBS is a resource allocation system. It will reserve the use of a compute server, or several compute servers, for you to run your parallel job on. PBS can launch the MPI job - there are several mechanisms for launching parallel jobs. MPI is an

Re: [OMPI users] need help finding mpi for Raspberry pi Raspian Streach

2018-05-30 Thread John Hearns via users
Forgive me for chipping in here. There is definitely a momentum behind the ARM architecture in HPC. However it seems to me that there are a lot of architectures under the 'ARM' umbrella. Does anyone have a simplified guide to what they all mean? On 30 May 2018 at 02:26, Gilles Gouaillardet

Re: [OMPI users] peformance abnormality with openib and tcp framework

2018-05-15 Thread John Hearns via users
n open the opensmd > service because it seems unnecessary in this situation. Can this be the > reason why IB performs poorer? > > Interconnection details are in the attachment. > > > > Best Regards, > > Xie Bin > > > John Hearns via users <users@lists.open-mpi.o

Re: [OMPI users] peformance abnormality with openib and tcp framework

2018-05-14 Thread John Hearns via users
Xie Bin, I do hate to ask this. You say "in a two-node cluster (IB direcet-connected). " Does that mean that you have no IB switch, and that there is a single IB cable joining up these two servers? If so please run: ibstatus ibhosts ibdiagnet I am trying to check if the IB fabric is

Re: [OMPI users] OpenMPI 3.0.1 - mpirun hangs with 2 hosts

2018-05-14 Thread John Hearns via users
One very, very stupid question here. This arose over on the Slurm list actually. Those hostnames look like quite generic names, ie they are part of an HPC cluster? Do they happen to have independent home directories for your userid? Could that possibly make a difference to the MPI launcher? On

Re: [OMPI users] problem

2018-05-09 Thread John Hearns via users
Ankita, looks like your program is not launching correctly. I would try the following: define two hosts in a machinefile. Use mpirun -np 2 --hostfile machinefile date, i.e. can you use mpirun just to run the command 'date'? Secondly, compile and try to run an MPI 'Hello World' program On 9 May 2018 at
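A minimal sketch of the machinefile test above, assuming Open MPI's mpirun and its --hostfile option. The helper name make_machinefile and the host names node01 and node02 are placeholders, so substitute two real hosts from the cluster.

```shell
# Write one host name per line into the given file.
# make_machinefile is a made-up helper; the hosts are placeholders.
make_machinefile() {
    out=$1
    shift
    printf '%s\n' "$@" > "$out"
}

# Then launch a non-MPI command to exercise the launcher alone:
#   make_machinefile machinefile node01 node02
#   mpirun -np 2 --hostfile machinefile date
```

If 'date' prints twice, the launcher, ssh and paths are working, and any remaining failure is more likely in the MPI program itself.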

Re: [OMPI users] MPI cartesian grid : cumulate a scalar value through the procs of a given axis of the grid

2018-05-02 Thread John Hearns via users
Peter is correct. We need to find out what K is. But we may never find out https://en.wikipedia.org/wiki/The_Trial It would be fun if we could get some real-world dimensions here and some real-world numbers. What range of numbers are these also? On 2 May 2018 at 15:21, Peter Kjellström

Re: [OMPI users] MPI cartesian grid : cumulate a scalar value through the procs of a given axis of the grid

2018-05-02 Thread John Hearns via users
Pierre, I may not be able to help you directly. But I had better stop listening to the voices. Mail me off list please. This might do the trick using Julia http://juliadb.org/latest/api/aggregation.html On 2 May 2018 at 14:11, John Hearns <hear...@googlemail.com> wrote: > Also my in

Re: [OMPI users] MPI cartesian grid : cumulate a scalar value through the procs of a given axis of the grid

2018-05-02 Thread John Hearns via users
to the voices. On 2 May 2018 at 14:08, John Hearns <hear...@googlemail.com> wrote: > Peter, how large are your models, ie how many cells in each direction? > Something inside of me is shouting that if the models are small enough > then MPI is not the way here. > Assuming use o

Re: [OMPI users] MPI cartesian grid : cumulate a scalar value through the procs of a given axis of the grid

2018-05-02 Thread John Hearns via users
Peter, how large are your models, ie how many cells in each direction? Something inside of me is shouting that if the models are small enough then MPI is not the way here. Assuming use of a Xeon processor there should be some AVX instructions which can do this. This is rather out of date, but is

Re: [OMPI users] Fwd: Fwd: problem in cluster

2018-04-25 Thread John Hearns via users
ores > so total 64 processes. > > On Wed, Apr 25, 2018 at 2:57 PM, John Hearns via users < > users@lists.open-mpi.org> wrote: > >> I do not see much wrong with that. >> However nodes=4 ppn=2 makes 8 processes in all. >> You are using mpirun -np 64 >

Re: [OMPI users] Fwd: Fwd: problem in cluster

2018-04-25 Thread John Hearns via users
I do not see much wrong with that. However nodes=4 ppn=2 makes 8 processes in all. You are using mpirun -np 64. Actually it is better practice to use the PBS supplied environment variables during the job, rather than hard-wiring 64. I don't have access to a PBS cluster from my desk at the
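A sketch of the "use the PBS supplied environment variables" advice above. It assumes a PBS/Torque job in which $PBS_NODEFILE names a file with one line per allocated slot, so nodes=4 ppn=2 would give 8 lines. The function name pbs_slots and the program name are placeholders.

```shell
# Count the slots PBS allocated instead of hard-wiring -np.
# Assumes $PBS_NODEFILE lists one line per slot, as PBS/Torque normally does.
pbs_slots() {
    awk 'END { print NR }' "$PBS_NODEFILE"
}

# In the job script:
#   mpirun -np "$(pbs_slots)" ./my_program    # my_program is a placeholder
```

When Open MPI is built with PBS (tm) support, mpirun can usually pick up the allocation by itself, so leaving out -np altogether may be an option too.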

Re: [OMPI users] Fwd: problem in cluster

2018-04-25 Thread John Hearns via users
Ankita, this is a problem with your batch queuing system. Do you know which batch system you are using on this cluster? Can you share with us what command you use to submit a job? Also please do not share your teamviewer password with us. I doubt this is of much use to anyone, but... On 25 April

Re: [OMPI users] Old version openmpi 1.2 support infiniband?

2018-03-21 Thread John Hearns via users
California, Riverside > 900 University Avenue, Riverside, CA 92521 > > > On Tue, Mar 20, 2018 at 10:46 AM, John Hearns via users < > users@lists.open-mpi.org> wrote: > >> "It does not handle more recent improvements such as Intel's turbo >> mode and the proce

Re: [OMPI users] Old version openmpi 1.2 support infiniband?

2018-03-20 Thread John Hearns via users
"It does not handle more recent improvements such as Intel's turbo mode and the processor performance inhomogeneity that comes with it." I guess it is easy enough to disable Turbo mode in the BIOS though. On 20 March 2018 at 17:48, Kaiming Ouyang wrote: > I think the problem

Re: [OMPI users] Fwd: MCA version error

2017-10-13 Thread John Hearns via users
Abhisek ... Gilles asked which program you are trying to run, and how it was linked with OpenMPI. Also please realise that you do not HAVE to use the openmpi packages provided by your Linux distribution. It is perfectly OK to download, compile and install another version. On 13 October 2017 at

Re: [OMPI users] Open MPI internal error

2017-09-28 Thread John Hearns via users
ps. Before you do the reboot of a compute node, have you run 'ibdiagnet' ? On 28 September 2017 at 11:17, John Hearns <hear...@googlemail.com> wrote: > > Google turns this up: > https://groups.google.com/forum/#!topic/ulfm/OPdsHTXF5ls > > > On 28 September 2017

Re: [OMPI users] Open MPI internal error

2017-09-28 Thread John Hearns via users
Google turns this up: https://groups.google.com/forum/#!topic/ulfm/OPdsHTXF5ls On 28 September 2017 at 01:26, Ludovic Raess wrote: > Hi, > > > we have a issue on our 32 nodes Linux cluster regarding the usage of Open > MPI in a Infiniband dual-rail configuration (2 IB

Re: [OMPI users] Error building openmpi on Raspberry pi 2

2017-09-27 Thread John Hearns via users
This might be of interest for ARM users: https://developer.arm.com/products/software-development-tools/hpc/arm-compiler-for-hpc On 27 September 2017 at 06:58, Gilles Gouaillardet < gilles.gouaillar...@gmail.com> wrote: > Faraz, > > which OS are you running ? > > iirc, i faced similar issues,

Re: [OMPI users] mpif90 unable to find ibverbs

2017-09-14 Thread John Hearns via users
Jeff, from what I read yesterday it is OpenMPI 2, I am not sure of the minor version. I do acknowledge that Mahmood reports that the Rocks 7 beta is available - when I last used Rocks this was not available. But still - look at something more up to date, such as OpenHPC. There is nothing

Re: [OMPI users] mpif90 unable to find ibverbs

2017-09-14 Thread John Hearns via users
Then let me add in my thoughts please.. Rocks is getting out of date. Mahmood, I would imagine that you are not given the choice of installing something more modern, ie the place where you work has an existing Rocks cluster and is unwilling to re-install it. So what is wrong with using the

Re: [OMPI users] Setting LD_LIBRARY_PATH for orted

2017-08-22 Thread John Hearns via users
Gary, are you using Modules? http://www.admin-magazine.com/HPC/Articles/Environment-Modules On 22 August 2017 at 02:04, Gilles Gouaillardet wrote: > Gary, > > > one option (as mentioned in the error message) is to configure Open MPI > with --enable-orterun-prefix-by-default.

Re: [OMPI users] Q: Basic invoking of InfiniBand with OpenMPI

2017-07-17 Thread John Hearns via users
System image GUID: 0x248a0703005abb30 > Port 1: > State: Down > Physical state: Disabled > Rate: 100 > Base lid: 0 > LMC: 0 > SM lid: 0 > Capability mask: 0x3c01 > Port GUID: 0x268a07fffe5abb31 > Link layer:

Re: [OMPI users] Q: Basic invoking of InfiniBand with OpenMPI

2017-07-14 Thread John Hearns via users
Boris, as Gilles says - first do some lower level checkouts of your Infiniband network. I suggest running: ibdiagnet ibhosts and then as Gilles says 'ibstat' on each node On 14 July 2017 at 03:58, Gilles Gouaillardet wrote: > Boris, > > > Open MPI should automatically

[OMPI users] Openmpi with btl_openib_ib_service_level

2017-06-22 Thread John Hearns via users
I may have asked this recently (if so sorry). If anyone has worked with QoS settings with OpenMPI please ping me off list, eg mpirun --mca btl_openib_ib_service_level N

Re: [OMPI users] disable slurm/munge from mpirun

2017-06-22 Thread John Hearns via users
@open-mpi.org> wrote: > You can add "OMPI_MCA_plm=rsh OMPI_MCA_sec=^munge” to your environment > > > On Jun 22, 2017, at 7:28 AM, John Hearns via users < > users@lists.open-mpi.org> wrote: > > Michael, try > --mca plm_rsh_agent ssh > > I've been fooling wit

Re: [OMPI users] disable slurm/munge from mpirun

2017-06-22 Thread John Hearns via users
Michael, try --mca plm_rsh_agent ssh I've been fooling with this myself recently, in the context of a PBS cluster On 22 June 2017 at 16:16, Michael Di Domenico wrote: > is it possible to disable slurm/munge/psm/pmi(x) from the mpirun > command line or (better) using

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-19 Thread John Hearns via users
neral case. Supercomputer clusters running over high performance fabrics are complicated beasts. It is not sufficient to plug in cards and cable. On 19 May 2017 at 11:12, John Hearns <hear...@googlemail.com> wrote: > I am not sure I agree with that. > (a) the original error

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-19 Thread John Hearns via users
he error message. >>> >>> >>> 2017-05-19 9:10 GMT+02:00 Gilles Gouaillardet <gil...@rist.or.jp >>> <mailto:gil...@rist.or.jp>>: >>> >>> Gabriele, >>> >>> >>> so it seems pml/pami

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-19 Thread John Hearns via users
is an infiniband card available (!) >>> >>> i guess IBM folks will comment on that shortly. >>> >>> >>> meanwhile, you do not need pami since you are running on a single node >>> >>> mpirun --mca pml ^pami ... >>> >

Re: [OMPI users] Many different errors with ompi version 2.1.1

2017-05-19 Thread John Hearns via users
Allan, remember that Infiniband is not Ethernet. You don't NEED to set up IPoIB interfaces. Two diagnostics for you to run, please: ibnetdiscover and ibdiagnet. Let us please have the results of ibnetdiscover. On 19 May 2017 at 09:25, John Hearns <hear...@googlemail.com> wrote:

Re: [OMPI users] Many different errors with ompi version 2.1.1

2017-05-19 Thread John Hearns via users
Gilles, Allan, if the host 'smd' is acting as a cluster head node it is not a must for it to have an Infiniband card. So you should be able to run jobs across the other nodes, which have Qlogic cards. I may have something mixed up here, if so I am sorry. If you want also to run jobs on the smd

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-19 Thread John Hearns via users
ck > > (if it does not work, can run and post the logs) > > mpirun --mca pml ^pami --mca pml_base_verbose 100 ... > > > Cheers, > > > Gilles > > > On 5/19/2017 4:01 PM, Gabriele Fatigati wrote: > >> Hi John, >> Infiniband is not used, there is a s

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-19 Thread John Hearns via users
Gabriele, please run 'ibv_devinfo'. It looks to me like you may have the physical interface cards in these systems, but you do not have the correct drivers or libraries loaded. I have had similar messages when using Infiniband on x86 systems - which did not have libibverbs installed. On 19 May

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread John Hearns via users
0.1.0) > MCA fbtl: posix (MCA v2.1.0, API v2.0.0, Component v10.1.0) > > > about mtl no information retrieve ompi_info > > > 2017-05-18 14:13 GMT+02:00 John Hearns via users <users@lists.open-mpi.org > >: > >> Gabriele, as this is based on OpenMPI can

Re: [OMPI users] IBM Spectrum MPI problem

2017-05-18 Thread John Hearns via users
Gabriele, as this is based on OpenMPI can you run ompi_info then look for the btl which are available and the mtl which are available? On 18 May 2017 at 14:10, Reuti wrote: > Hi, > > > Am 18.05.2017 um 14:02 schrieb Gabriele Fatigati : > > >
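A minimal sketch of the check asked for here, assuming `ompi_info` prints one `MCA <framework>: <component> ...` line per component (the format visible in the quoted output elsewhere in this thread). The helper name `filter_btl_mtl` is made up for illustration.

```shell
#!/bin/sh
# Keep only the lines describing btl and mtl components.
# The pattern deliberately does not match other frameworks such as fbtl.
filter_btl_mtl() {
    grep -E 'MCA (btl|mtl):'
}

if command -v ompi_info >/dev/null 2>&1; then
    ompi_info | filter_btl_mtl
else
    echo "ompi_info not found in PATH"
fi
```

If no mtl lines appear, that suggests no mtl components were built or installed, which is itself useful information to post back to the list.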

Re: [OMPI users] Basic build trouble on RHEL7

2017-04-27 Thread John Hearns via users
Ray, probably a stupid question but do you have the hwloc-devel package installed? And also the libxml2-devel package? On 27 April 2017 at 21:54, Ray Sheppard wrote: > Hi All, > I have searched the mail archives because I think this issue was > addressed earlier, but I can

Re: [OMPI users] install OpenMPI on CentOS in HPC

2016-12-18 Thread John Hearns via users
Mahmoud, you should look at the OpenHPC project. http://www.openhpc.community/ On 15 December 2016 at 19:50, Mahmoud MIRZAEI wrote: > Dears, > > May you please let me know if there is any procedure to install OpenMPI on > CentOS in HPC? > > Thanks. > Mahmoud > > > >

Re: [OMPI users] OpenMPI + InfiniBand

2016-11-01 Thread John Hearns via users
Sergei, can you run: ibhosts, ibstat and ibdiagnet? Lord help me for being so naive, but do you have a subnet manager running? On 1 November 2016 at 06:40, Sergei Hrushev wrote: > Hi Jeff ! > > What does "ompi_info | grep openib" show? >> >> > $ ompi_info | grep openib >
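The three checks named above can be wrapped in a short script so that the output is easy to paste into a reply. This is a sketch, not a complete fabric health check: the helper name `run_check` is hypothetical, and it assumes the tools, where installed, can be run without arguments by the current user.

```shell
#!/bin/sh
# Run one diagnostic if it is installed; otherwise say so and move on.
run_check() {
    tool=$1
    if command -v "$tool" >/dev/null 2>&1; then
        echo "== $tool =="
        "$tool"
    else
        echo "== $tool: not installed, skipping =="
    fi
}

# Tools taken from the message above.
for t in ibstat ibhosts ibdiagnet; do
    run_check "$t"
done
```

In the `ibstat` output, the port "State" and "SM lid" fields are the ones most relevant to the subnet manager question.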

Re: [OMPI users] OpenMPI + InfiniBand

2016-10-28 Thread John Hearns via users
Sorry - shoot down my idea. Over to someone else (me hides head in shame) On 28 October 2016 at 11:28, Sergei Hrushev wrote: > Sergei, what does the command "ibv_devinfo" return please? >> >> I had a recent case like this, but on Qlogic hardware. >> Sorry if I am mixing

Re: [OMPI users] OpenMPI + InfiniBand

2016-10-28 Thread John Hearns via users
Sergei, what does the command "ibv_devinfo" return please? I had a recent case like this, but on Qlogic hardware. Sorry if I am mixing things up. On 28 October 2016 at 10:48, Sergei Hrushev wrote: > Hello, All ! > > We have a problem with OpenMPI version 1.10.2 on a

Re: [OMPI users] job aborts "readv failed: Connection reset by peer"

2016-09-02 Thread John Hearns via users
Thank you. That is helpful. Could you run an 'ldd' on your executable, on one of the compute nodes if possible? I will not be able to solve your problem, but at least we now know what the application is, and can look at the libraries it is using. On 2 September 2016 at 17:19, Mahmood Naderan

Re: [OMPI users] job aborts "readv failed: Connection reset by peer"

2016-09-02 Thread John Hearns via users
Mahmood, are you compiling and linking this application? Or are you using an executable which someone else has prepared? It would be very useful if we could know the application. On 2 September 2016 at 16:35, Mahmood Naderan wrote: > >Did you ran > >ulimit -c unlimited

Re: [OMPI users] job aborts "readv failed: Connection reset by peer"

2016-09-02 Thread John Hearns via users
Mahmood, as Gilles says, start by looking at how that application is compiled and linked. Run 'ldd' on the executable and look closely at the libraries. Do this on a compute node if you can. There was a discussion on another mailing list recently about how to fingerprint executables and see which
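As a sketch of the 'ldd' check described here: the function below lists the shared libraries of an executable and keeps those whose names look MPI- or InfiniBand-related. The function name and the library name patterns are assumptions chosen for illustration; adjust them to the MPI stack actually in use. `/bin/sh` is only a stand-in default so the script runs without arguments.

```shell
#!/bin/sh
# Print the dependencies of "$1" that look MPI- or fabric-related.
# The patterns are illustrative guesses, not an exhaustive list.
list_mpi_libs() {
    ldd "$1" 2>/dev/null | grep -E 'libmpi|libopen-(rte|pal)|libibverbs' \
        || echo "no MPI-related libraries found"
}

# Pass the application path as the first argument.
list_mpi_libs "${1:-/bin/sh}"
```

Running it on a compute node and on the head node and comparing the two outputs shows whether both resolve to the same MPI installation.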

Re: [OMPI users] New to (Open)MPI

2016-09-02 Thread John Hearns via users
Hello Lachlan. I think Jeff Squyres will be along in a short while! He is of course the expert on Cisco. In the meantime a quick Google turns up: http://www.cisco.com/c/en/us/td/docs/unified_computing/ucs/usnic/c/deployment/2_0_X/b_Cisco_usNIC_Deployment_Guide_For_Standalone_C-SeriesServers.html
