Thanks Ralph,
adding --hetero-nodes to the command line solved this issue.
Alina.
On Mon, Sep 15, 2014 at 6:51 AM, Ralph Castain wrote:
> Try adding --hetero-nodes to your mpirun cmd line
>
> On Sep 14, 2014, at 8:25 AM, Alina Sklarevich
> wrote:
>
> Hello,
>
>
> I am using ompi-v1.8 and have
I wonder if anyone else is seeing this failure. Not sure when this started but
it is only on the trunk. Here is a link to my failures as well as an example
below that. There are a variety of nonblocking collectives failing like this.
http://mtt.open-mpi.org/index.php?do_redir=2208
[rvandevaar
From time-to-time, and have a need for running Open MPI apps using the openib
btl on a single node, where port 1 on the HCA is connected to port 2 on the
same HCA.
Using a vintage 1.5.4, my command line would read:
mpiexec --mca btl self,openib --mca btl_openib_cpc_include oob \
-np 1 /usr/b
Hi Rolf,
This may be related to change set 32659.
If you back this change out, do the tests pass?
Howard
From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart
Sent: Monday, September 15, 2014 8:55 AM
To: de...@open-mpi.org
Subject: [OMPI devel] coll ml error with some
Confirmed that trunk version r32658 does pass the test.
From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Pritchard Jr.,
Howard
Sent: Monday, September 15, 2014 4:16 PM
To: Open MPI Developers
Subject: Re: [OMPI devel] coll ml error with some nonblocking collectives
Hi Rolf,
This may
Hi Folks,
I'd like to rip out the cnos ess/alps code from ompi. Its dead - no
one is using CNOS (old cray xt systems) - and its very confusing
to leave around.
Any objections?
Howard
-
Howard Pritchard
HPC-5
Los Alamos National Laboratory
No objection from me - that pretty much belongs to you folks
On Sep 15, 2014, at 2:51 PM, Pritchard Jr., Howard wrote:
> Hi Folks,
>
> I’d like to rip out the cnos ess/alps code from ompi. Its dead – no
> one is using CNOS (old cray xt systems) – and its very confusing
> to leave around.
>
HI Rolf,
Okay. I'll work with ORNL folks to see how to really fix this.
Howard
From: devel [mailto:devel-boun...@open-mpi.org] On Behalf Of Rolf vandeVaart
Sent: Monday, September 15, 2014 3:10 PM
To: Open MPI Developers
Subject: Re: [OMPI devel] coll ml error with some nonblocking collectives
Howard, and Rolf,
i initially reported the issue at
http://www.open-mpi.org/community/lists/devel/2014/09/15767.php
r32659 is not a fix nor a regression, it simply aborts instead of
OBJ_RELEASE(mpi_comm_world).
/* my point here is we should focus on the root cause and not the
consequence */
firs