I'd prefer we not commit something in opal/hwloc until we have a plan for 
supporting platforms without hwloc support (ie, Red Storm).  I have no 
objection to your original RFC, but I had the impression at the time that you 
had a plan in place for non-hwloc supported platforms.

Brian

On May 17, 2010, at 5:26 PM, Jeff Squyres wrote:

> On May 15, 2010, at 4:39 PM, Ralph Castain wrote:
> 
>> So, to ensure I understand, you are proposing that we completely eliminate 
>> the paffinity framework and commit to hwloc in its place?
> 
> I think there's 2 issues here:
> 
> - topology information
> - binding
> 
> hwloc supports both.  paffinity mainly supports binding; it also supports 
> some minor socket/core mapping information stuff, but mainly as a means to 
> support binding better.  hwloc's topology information is far more complete 
> than paffinity's.
> 
> How about this?  (and this is very half-baked)
> 
> - commit hwloc to opal/hwloc; the entire tree can call it
>  - it's still TBD how to compile this out (e.g., for embedded environments)
>  - it *may* need something like #if OPAL_HAVE_HWLOC
> - split paffinity into two frameworks (because some OS's support one and not 
> the other):
>  - binding: just for getting and setting processor affinity
>  - hwmap: just for mapping (board, socket, core, hwthread) <--> OS processor 
> ID
> 
> In this way, if hwloc ever dies, we can still have OS-specific plugins for 
> these two things, and the #if OPAL_HAVE_HWLOC will be 0.
> 
> hwloc provides a very rich API for traversing the topology information; I 
> don't think the main OPAL/ORTE/OMPI code base necessarily needs all of that 
> functionality for the general case -- i.e., the binding/hwmap information 
> (e.g., just want to bind a process to (board 1, socket 3, core 2, hwthread 
> 1)).  
> 
> Anything that needs the detailed hwloc information (e.g., tuning the sm btl 
> based on cache sizes reported by hwloc) can use #if OPAL_HAVE_HWLOC to 
> protect itself.
> 
> -- 
> Jeff Squyres
> jsquy...@cisco.com
> For corporate legal information go to:
> http://www.cisco.com/web/about/doing_business/legal/cri/
> 
> 
> _______________________________________________
> devel mailing list
> de...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/devel
> 

--
  Brian W. Barrett
  Dept. 1423: Scalable System Software
  Sandia National Laboratories





Reply via email to