Like I said, I haven't tried any of that, so I have no idea if/how it would
work. I don't have access to any hetero system and we don't see it very
often at all, so it is quite possible the hetero support really isn't there.
I'll look at some of the Java-specific issues later.
On Thu, Oct 11,
I'm afraid I'm confused - I don't understand what is and isn't working.
What "next process" isn't starting?
On Thu, Oct 11, 2012 at 9:41 AM, Michael Di Domenico wrote:
> adding some additional info
>
> did an strace on an orted process where xhpl failed to start, i did
Hi,
The error I get I couldn't find in the mails from your link. But I also
didn't set CXX, F77 and FC. I'll try that tomorrow and we'll see if it
changes anything.
I find the error I get weird because some file is not found which
I guess should not occur when switching compilers ...
On Thu,
Hi Christian
Would your problem be similar to the one reported two days ago on
this thread? [It also failed to compile vampir trace tools,
it also didn't have the Intel C++ compiler specified to configure.]
http://www.open-mpi.org/community/lists/users/2012/10/20449.php
Have you tried to
adding some additional info
did an strace on an orted process where xhpl failed to start, i did
this after the mpirun execution, so i probably missed some output, but
it keeps scrolling
poll([{fd=4, events=POLLIN},{fd=7, events=POLLIN},{fd=8,
events=POLLIN},{fd=10, events=POLLIN},{fd=12,
too add a little more detail, it looks like xhpl is not actually
starting on all nodes when i kick off the mpirun
each time i cancel and restart the job, the nodes that do not start
change, so i can't call it a bad node
if i disable infiniband with --mca btl self,sm,tcp on occasion i can
get
I'm trying to diagnose an MPI job (in this case xhpl), that fails to
start when the rank count gets fairly high into the thousands.
My symptom is the jobs fires up via slurm, and I can see all the xhpl
processes on the nodes, but it never kicks over to the next process.
My question is, what
Hi,
I tried to compile the current OpenMPI 1.6.2 with the Intel Compiler
# icc --version
icc (ICC) 12.0.4 20110427
The error I get is the following (I changed directly in the vtfilter
directory where the error occurs to reduce output for this mail):
# cd ompi/contrib/vt/vt/tools/vtfilter/
#
Just to follow up my earlier post, checking out master and building that gives
me the same lock up in ompi_info
> ompi_info.exe!opal_atomic_lifo_push(opal_atomic_lifo_t * lifo,
> opal_list_item_t * item) Line 73 C
ompi_info.exe!ompi_free_list_grow(ompi_free_list_t * flist, unsigned
Hi,
I have built openmpi-1.9a1r27380 with Java support and try some small
programs. When I try to scatter an object, I get a ClassCastException.
I use the following object.
public class MyData implements java.io.Serializable
{
static final long serialVersionUID = -5243516570672186644L;
Hi,
> I haven't tried heterogeneous apps on the Java code yet - could well not
> work. At the least, I would expect you need to compile your Java app
> against the corresponding OMPI install on each architecture, and ensure the
> right one gets run on each node. Even though it's a Java app, the
Hi list,
I searched the archives, but didn't turn anything up...
I have a new machine which I've installed windows 8 x64 + MSVC 2012 (MSVC 11)
and have compiled openmpi from the git svn clone(on branch origin/v1.6) using
these settings ...
cmake -DOMPI_ENABLE_THREAD_MULTIPLE=true
12 matches
Mail list logo