Brad,
Many thanks for the update.
Greg
On Oct 22, 2008, at 8:43 PM, Brad Benton wrote:
Greg,
Here is the latest schedule that we have for getting 1.3 out the door:
https://svn.open-mpi.org/trac/ompi/milestone/Open%20MPI%201.3
Basically, this schedule sets Nov. 10 as the release date with a
Greg,
Here is the latest schedule that we have for getting 1.3 out the door:
https://svn.open-mpi.org/trac/ompi/milestone/Open%20MPI%201.3
Basically, this schedule sets Nov. 10 as the release date with a backup date
of Nov. 17.
Here is a bit more detail as to the release to beta and then to releas
Leonardo,
As you say, there is the possiblity that moving from one node to
another has caused problems due to different shared libraries. The
result from this could be a segmentation fault, an illegal instruction
or even a bus error. In all three cases, however, this failure
generates a si
Ralf Wildenhues wrote:
Jeff Squyres wrote:
We use lt_dlopen() to open the plugins (Libtool's wrapper for a
portable dlopen). It opens all plugins (DSOs) in a private scope.
That private scope is kept deep in the OPAL MCA base and not exposed
elsewhere in the c
Hello Jeff, Eugene,
> Jeff Squyres wrote:
>
>> We use lt_dlopen() to open the plugins (Libtool's wrapper for a
>> portable dlopen). It opens all plugins (DSOs) in a private scope.
>> That private scope is kept deep in the OPAL MCA base and not exposed
>> elsewhere in the code base. So
Jeff Squyres wrote:
We use lt_dlopen() to open the plugins (Libtool's wrapper for a
portable dlopen). It opens all plugins (DSOs) in a private scope.
That private scope is kept deep in the OPAL MCA base and not exposed
elsewhere in the code base. So if you manually dlopen a plugin
again
Hmmm...interesting. I see what's going on - I'm having a build system
issue that is causing some of the dynamic libraries to not be seen.
Red herring - thanks for clarifying!
Camille: thanks for fixing this way back when.
Ralph
On Oct 22, 2008, at 1:17 PM, George Bosilca wrote:
Ralph,
Th
Ralph,
This problem was fixed long ago by some of the work Camille did. The
exact revision number is r15402 (https://svn.open-mpi.org/trac/ompi/changeset/15402
). I'm using this feature daily and so far I had any problems with it.
To reuse your example here is what Camille came up with.
$ m
I can't swear to this because I haven't fully grokked it yet, but I
believe the answer is:
1. if child jobs have completed, it won't hurt. I think the various
subsystem cleanup their bookkeeping when a job completes, so we could
possibly reuse the number. Might be some race conditions we wo
What's happened if we roll around with the counter ?
george.
On Oct 22, 2008, at 2:49 PM, Ralph Castain wrote:
There recently was activity on the mailing lists where someone was
attempting to call comm_spawn 100,000 times. Setting aside the
threading issues that were the focus of that exc
There recently was activity on the mailing lists where someone was
attempting to call comm_spawn 100,000 times. Setting aside the
threading issues that were the focus of that exchange, the fact is
that OMPI currently cannot handle that many comm_spawns.
The ORTE jobid is composed of two ele
George reminds me that I forgot to explain why you couldn't dlsym
We use lt_dlopen() to open the plugins (Libtool's wrapper for a
portable dlopen). It opens all plugins (DSOs) in a private scope.
That private scope is kept deep in the OPAL MCA base and not exposed
elsewhere in the cod
Youpiii!
george.
On Oct 21, 2008, at 4:53 PM, Ralph Castain wrote:
Hello all
I am working on adding a new radix tree routed module and am
simultaneously doing a little streamlining to the overall routed-
related code for scalability. One thing that would help cleanup
several areas o
Short answer because we're all still in Chicago...
Terry tells me that you're just hacking around trying to see what
works, etc. So adding direct calls to the BTL in this kind of
scenario is ok. I'm sure you're aware that this is not good for real
code. :-)
To directly call a BTL funct
Sorry for delayed response - had some things to finish, then had to
stare at this code for awhile.
Unfortunately, the OOB is a snarled can of hideous worms. It looks to
me that the OOB continues to attempt to complete any pending message
requests once it detects that retries have exceeded t
Sounds good to me.
On Oct 21, 2008, at 3:53 PM, Ralph Castain wrote:
Hello all
I am working on adding a new radix tree routed module and am
simultaneously doing a little streamlining to the overall routed-
related code for scalability. One thing that would help cleanup
several areas of th
I'm trying to prototype an idea inside OMPI and am running into a problem.
I want to add a new function to a BTL and to have the PML call this
function. I can't just put such a function call into the PML (not even
for my prototype) since the PML is loaded before the BTL and so the PML
will co
Ralph,
I guess the issue for us is that we will have to run two commands to
get the information we need. One to get the configuration information,
such as version and MCA parameters, and one to get the host
information, whereas it would seem more logical that this should all
be available
I've been digging a little into optimization and found something that
seems counterintuitive in the way OMPI is handling components.
Specifically, if I specify a component I want used for a framework,
OMPI still does a component load and open on every component in the
framework - it only us
Hi All,
I´m trying to implement my FT architecture in Open MPI. Just now I need
to restart a faulty process from a checkpoint. I saw that Josh uses
orte-restart which call opal-restart through an ordinary mpirun call.
It´s now good for me because in this case the restarted process becomes
in
20 matches
Mail list logo