Re: [OMPI devel] Amateur Guidance

2008-11-03 Thread Shipman, Galen M.
The TEG paper is woefully out of date, we don't use that interface anymore. Try the following for dated, but more relevant info: http://www.open-mpi.org/papers/euro-pvmmpi-2006-hpc-protocols http://www.open-mpi.org/papers/workshop-2006/wed_01_pt2pt.pdf These cover the point-to-point infrastruc

Re: [OMPI devel] ob1 and req->req_state

2008-06-23 Thread Shipman, Galen M.
Jun 23, 2008, at 5:45 PM, Shipman, Galen M. wrote: Oh, I see, you are confusing the req_state on the OMPI request with the req_state on the PML request. The ompi request state is for persistent requests, the PML request state is not and does not use that enum. - Galen On Jun 23, 2008, at

Re: [OMPI devel] ob1 and req->req_state

2008-06-23 Thread Shipman, Galen M.
4:43 PM, Shipman, Galen M. wrote: We use req_state currently to track that we receive both RNDV completion and RNDV ack prior to freeing the request.. Does that mean you're not using the enum values, but rather just to indicate that the value is >= 0? -- Jeff Squyres Cisco

Re: [OMPI devel] ob1 and req->req_state

2008-06-23 Thread Shipman, Galen M.
We use req_state currently to track that we receive both RNDV completion and RNDV ack prior to freeing the request.. - Galen On Jun 23, 2008, at 3:23 PM, Brian W. Barrett wrote: On Mon, 23 Jun 2008, Jeff Squyres wrote: On Jun 23, 2008, at 3:17 PM, Brian W. Barrett wrote: Just because

Re: [OMPI devel] PML selection logic

2008-06-23 Thread Shipman, Galen M.
I concur - galen On Jun 23, 2008, at 3:44 PM, Brian W. Barrett wrote: That sounds like a reasonable plan to me. Brian On Mon, 23 Jun 2008, Ralph H Castain wrote: Okay, so let's explore an alternative that preserves the support you are seeking for the "ignorant user", but doesn't penalize e

Re: [OMPI devel] Memory hooks stuff

2008-05-27 Thread Shipman, Galen M.
I will be there as well. - Galen On May 27, 2008, at 10:14 AM, Jeff Squyres wrote: To accommodate timezones spanning from US Mountain to Israel, let's have the teleconference tomorrow, Wednesday 28 May 2008: - 10:30am US Eastern time - 8:30am US Mountain time - 5:30pm Israel time I'll send a

Re: [OMPI devel] Open MPI session directory location

2008-05-27 Thread Shipman, Galen M.
Make that "ompi_info". We need to make that visible via orte_info. I thought this was done at some point, perhaps it got overwritten? Thanks, Galen On May 27, 2008, at 10:27 AM, Ralph H Castain wrote: -mca orte_tmpdir_base foo On 5/27/08 8:24 AM, "Gleb Natapov" wrote: Hi, Is there a

Re: [OMPI devel] Open MPI session directory location

2008-05-27 Thread Shipman, Galen M.
We need to make that visible via orte_info. I thought this was done at some point, perhaps it got overwritten? Thanks, Galen On May 27, 2008, at 10:27 AM, Ralph H Castain wrote: -mca orte_tmpdir_base foo On 5/27/08 8:24 AM, "Gleb Natapov" wrote: Hi, Is there a way to change where Ope

Re: [OMPI devel] Session directories in $HOME?

2008-04-01 Thread Shipman, Galen M.
On Apr 1, 2008, at 2:12 PM, Ralph H Castain wrote: Sure - I'll rename it "orte_tmpdir_base" so it shows up. Perfect, do we also need to carry on support for "OMPI_MCA_tmdir_base"? - Galen On 4/1/08 12:05 PM, "Shipman, Galen M." wrote: On Apr 1

Re: [OMPI devel] Session directories in $HOME?

2008-04-01 Thread Shipman, Galen M.
with Galen. Aurelien Le 31 mars 08 à 16:00, Shipman, Galen M. a écrit : Slightly OT but along the same lines.. We currently have an argument to mpirun to set the HNP tmpdir (-- tmpdir). Why don't we have an mca param to set the tmpdir for all the orted's and such? - Galen On Mar

Re: [OMPI devel] Session directories in $HOME?

2008-03-31 Thread Shipman, Galen M.
Slightly OT but along the same lines.. We currently have an argument to mpirun to set the HNP tmpdir (-- tmpdir). Why don't we have an mca param to set the tmpdir for all the orted's and such? - Galen On Mar 31, 2008, at 3:51 PM, George Bosilca wrote: I looked over the code and I don't se

Re: [OMPI devel] Ticket 1224: disable early completion in v1.2.x series

2008-03-03 Thread Shipman, Galen M.
Unfortunately this adds an "if" to the critical path. You should at least use OPAL_UNLIKELY.. On Mar 3, 2008, at 12:28 PM, Jeff Squyres wrote: The topic of the "early completion" behavior in OB1 for IB optimizations has come up several times in the v1.2 series (it causes problems in some scena

Re: [OMPI devel] [RFC] Non-blocking collectives (LibNBC) merge to trunk

2008-02-14 Thread Shipman, Galen M.
I am in favor of bringing this in. - Galen On Feb 14, 2008, at 1:15 PM, Jeff Squyres wrote: So I don't think that we ever concluded this discussion/RFC. I am in favor of bringing in libnbc, given the qualifications below. Others? On Feb 8, 2008, at 12:16 PM, Jeff Squyres wrote: Terry --

Re: [OMPI devel] RTE Issue IV: RTE/MPI relative modex responsibilities

2007-12-06 Thread Shipman, Galen M.
Sorry, to be clear that should have been: > One option is for the RTE to just pass in an enviro variable with a > comma-separated list of your local ranks, but that creates a problem down > the road when trying to integrate tighter with systems like SLURM where the > procs would get mass-launched

Re: [OMPI devel] RTE Issue IV: RTE/MPI relative modex responsibilities

2007-12-06 Thread Shipman, Galen M.
>> >> Do we really need a complete node map? A far as I can tell, it looks >> like the MPI layer only needs a list of local processes. So maybe it >> would be better to forget about the node ids at the mpi layer and just >> return the local procs. > > I agree, though I don't think we want a paral

Re: [OMPI devel] ORTE process name,, nodeid..

2007-11-22 Thread Shipman, Galen M.
x27;ll complete the orte revisions on a LANL server. I hate to do this, > though, as it means someone (LANL, IBM, Voltaire, some combination, or > whomever) will be left with the problem of dealing with either re-merging > the branches or supporting a split code. I only offer it as an opt

Re: [OMPI devel] IB/OpenFabrics pow wow

2007-11-19 Thread Shipman, Galen M.
I am off all next week, but I will try to call in anyway.. - Galen On 11/19/07 10:09 AM, "Don Kerr" wrote: > Sun would like to be represented at this meeting. Mon 10AM might be a > problem for Sun, any of the other times are good. > > -DON > > Jeff Squyres wrote: > >> Based on discussions

[OMPI devel] ORTE process name and nodeid

2007-11-17 Thread Shipman, Galen M.
I am doing some work on Cray's CNL to support shared memory. To support shared memory I need to know if processes are local or remote. For other systems we simply use the modex in ompi_proc_get_info to get the proc's nodeid. When using CNOS I don't need the modex to get a remote processes nodeid.

Re: [OMPI devel] collective problems

2007-11-07 Thread Shipman, Galen M.
The lengths we go to avoid progress :-) On 11/7/07 10:19 PM, "Richard Graham" wrote: > The real problem, as you and others have pointed out is the lack of > predictable time slices for the progress engine to do its work, when relying > on the ULP to make calls into the library... > > Rich >

Re: [OMPI devel] collective problems

2007-10-23 Thread Shipman, Galen M.
So this problem goes WAY back.. The problem here is that the PML marks MPI completion just prior to calling btl_send and then returns to the user. This wouldn't be a problem if the BTL then did something, but in the case of OpenIB this fragment may not actually be on the wire (the joys of user lev

Re: [OMPI devel] issue with --without-tm in configure?

2007-10-22 Thread Shipman, Galen M.
Jenny, I don't have access to a machine with TM on it right now, but as a workaround try adding this to your configure line: --enable-mca-no-build="ras_tm,pls_tm" - Galen On 10/22/07 12:03 PM, "Jennis Pruett" wrote: > Hello again. > > The last email I sent with attachements showed up o

Re: [OMPI devel] [RFC] change wrapper compilers from binaries to shell scripts

2007-10-12 Thread Shipman, Galen M.
I vote for it. - Galen On 10/11/07 5:32 PM, "George Bosilca" wrote: > Sounds perfect. I'll vote for it. > >Thanks, > george. > > On Oct 11, 2007, at 5:23 PM, Jeff Squyres wrote: > >> On Oct 11, 2007, at 5:17 PM, George Bosilca wrote: >> >>> I know that [with few exception] nobo