I used one of the LANL platform files to build,

$ configure --with-platform=contrib/platform/lanl/rr-class/debug- panasas-nocell

Did the same thing with the non-debug platform file and it dies in the same location.

-david

--
David Gunter
HPC-3: Infrastructure Team
Los Alamos National Laboratory




On Nov 5, 2009, at 3:07 PM, Jeff Squyres wrote:

How did you build?

I see one carto component named "auto_detect" in the 1.3.4 source tree, but I don't see it in your ompi_info output.

Did that component not build?


On Nov 4, 2009, at 7:20 PM, Samuel K. Gutierrez wrote:

Hi All,

I just built OMPI 1.3.4rc4 on one of our Roadrunner machines.  When I
try to launch a simple MPI job, I get the following:

[rra011a.rr.lanl.gov:31601] mca: base: components_open: Looking for
carto components
[rra011a.rr.lanl.gov:31601] mca: base: components_open: opening carto
components
[rra011a.rr.lanl.gov:31601] mca:base:select: Auto-selecting carto
components
[rra011a.rr.lanl.gov:31601] mca:base:select:(carto) No component
selected!
--------------------------------------------------------------------------
It looks like opal_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during opal_init; some of which are due to configuration or
environment problems. This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  opal_carto_base_select failed
  --> Returned value -13 instead of OPAL_SUCCESS
--------------------------------------------------------------------------
[rra011a.rr.lanl.gov:31601] [[INVALID],INVALID] ORTE_ERROR_LOG: Not
found in file runtime/orte_init.c at line 77
[rra011a.rr.lanl.gov:31601] [[INVALID],INVALID] ORTE_ERROR_LOG: Not
found in file orterun.c at line 541

This may be an issue on our end regarding a runtime parameter that
isn't set correctly.  See attached.  Please let me know if you need
any more info.

Thanks!
--
Samuel K. Gutierrez
Los Alamos National Laboratory


<lanl-rr-class-1.3.4rc4.tar.gz><ATT2634783.txt><ATT2634785.txt>


--
Jeff Squyres
jsquy...@cisco.com

_______________________________________________
devel mailing list
de...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/devel

Reply via email to