Jeff Squyres wrote:
WHAT: Deprecate the "rankfile" component in the v1.5 series; remove it in the
1.7 series
WHY: It's old, creaky, and difficult to maintain. It'll require maintenance
someday soon, when we support hardware threads / hyperthreads in OMPI.
WHERE: svn rm orte/mca/rmaps/rank_file
WHEN: Deprecate in 1.5, remove in 1.7.
TIMEOUT: Teleconf on Tue, 27 April 2010
-----
Now that we have nice paffinity binding options, can we deprecate rankfile in
the 1.5 series and remove it in 1.7?
I only ask because it's kind of a pain to maintain. It's not a huge deal, but
Ralph and I were talking about another paffinity issue today and we both
groaned at the prospect of extending rank file to support hyperthreads (and/or
boards). Perhaps it should just go away...?
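(For reference, a rankfile spells out an explicit per-rank placement; roughly, per the mpirun man page, something like:

    rank 0=aa slot=1:0-2
    rank 1=bb slot=0:0,1
    rank 2=cc slot=1-2

i.e., rank 0 goes on host aa bound to socket 1, cores 0-2, and so on, launched with "mpirun -rf myrankfile ...". That's the functionality on the chopping block.)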
Pro: less maintenance, especially since the original developers no longer
maintain it
Con: it's the only way to have completely custom affinity bindings (i.e., outside of our
pre-constructed "bind to socket", "bind to core", etc. options).
Arguably, someone could always hack something together themselves with
numactl, processor_bind(), scripts, etc.
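A minimal sketch of that DIY route (all assumptions on my part: Linux's sched_setaffinity(), plus the launcher setting OMPI_COMM_WORLD_LOCAL_RANK, which recent OMPI does; other MPIs use different variable names):

    /* bindwrap.c: bind to one core chosen from the local rank, then
     * exec the real program.  Naive round-robin over cores; no notion
     * of sockets, caches, or hyperthreads. */
    #define _GNU_SOURCE
    #include <sched.h>
    #include <stdio.h>
    #include <stdlib.h>
    #include <unistd.h>

    int main(int argc, char **argv)
    {
        if (argc < 2) {
            fprintf(stderr, "usage: %s program [args...]\n", argv[0]);
            return 1;
        }

        const char *lr = getenv("OMPI_COMM_WORLD_LOCAL_RANK");
        int local_rank = lr ? atoi(lr) : 0;
        int ncores = (int) sysconf(_SC_NPROCESSORS_ONLN);
        if (ncores < 1) {
            ncores = 1;
        }

        cpu_set_t set;
        CPU_ZERO(&set);
        CPU_SET(local_rank % ncores, &set);
        if (sched_setaffinity(0, sizeof(set), &set) != 0) {
            perror("sched_setaffinity");
        }

        execvp(argv[1], &argv[1]);   /* run the real MPI application */
        perror("execvp");
        return 1;
    }

Run as "mpirun -np 4 ./bindwrap ./a.out".  It works, but it's exactly the kind of thing rankfile was supposed to save users from writing.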
...do any other MPIs offer completely custom binding options?
Yes, "they all do." Okay, I haven't actually checked exhaustively, but
from what I can tell every MPI (now that OMPI has joined their ranks)
supports specification of process-bind mappings both via policy (a la
"bysocket") and via customized list. Specifically, I've checked:
*) MVAPICH2 (which uses PLPA! God help them)
http://mvapich.cse.ohio-state.edu/support/user_guide_mvapich2-1.4rc1.html#x1-360006.8
*) Scali/Platform
*) Intel MPI (the full-blown set of options is very extensive)
http://software.intel.com/file/15429
*) HP MPI
http://docs.hp.com/en/B6060-96022/ch03s07.html
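For the "customized list" flavor in particular, the knobs look roughly like the following (syntax from memory; trust the links above over me):

    Intel MPI:  I_MPI_PIN_PROCESSOR_LIST=0,2,4,6
    MVAPICH2:   MV2_CPU_MAPPING=0:2:4:6

i.e., an explicit processor list per rank, not just a policy name.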
I.e., do any real users care?
I think so. Users end up wanting fine controls on all kinds of stuff.
I think I first ran into users wanting to bind processes about 15 years
ago. Issues include spreading/gathering processes on multicore nodes,
squelching migration, avoiding or targeting particular processors,
dealing with asymmetries within a node, etc. It just seems to keep
coming up over and over and over again. There's been a history of
problems (like excessive migration, poor initial placement, etc.) and
clever developers fixing these problems, but the constant has been the
existence of some subset of users who just want a stable workaround to
carry them through the broken/fixed cycles of history.
Note that various MPI implementations (also OMP, more on that in a
second) have spent a lot of time on process binding. Intel has a very
rich set of options. I don't know if that's because of demanding users
or because of developers drinking too much Red Bull, but their options
are much richer than ours. So, imagine a socket that has two caches,
with two cores per cache. Intel allows you to control placement on this
architecture via policy. We don't. So, what do our users do? Allowing
custom mappings remains the hammer that hammers every nail.
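To make that concrete: on a hypothetical node where cores 0-1 share one cache and cores 2-3 share the other (purely illustrative numbering; it varies by machine), spreading four ranks across the two caches today means writing the mapping out by hand, e.g.:

    rank 0=nodeA slot=0:0
    rank 1=nodeA slot=0:2
    rank 2=nodeA slot=0:1
    rank 3=nodeA slot=0:3

There's no "bycache" policy to ask for; the rankfile is the hammer.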
With regard to your two questions (what do others do? do users care?),
I believe (FWIW) that OpenMP implementations tend to support customized
mappings. E.g., Oracle Solaris Studio (formerly Sun Studio), GNU
libgomp, PGI OMP, and Intel OMP. I think the OMP ARB is considering a
standardized interface for binding threads, but I don't remember if it
was for policies, customized mappings, or both. Anyhow, custom mappings
go even beyond just MPI.
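(A concrete taste of the OMP side, again from memory: GNU libgomp takes an explicit CPU list in an environment variable, e.g.

    GOMP_CPU_AFFINITY="0 3 1-2"

and Intel's runtime has its own, much more elaborate, KMP_AFFINITY syntax.)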
I'm sympathetic to the maintenance-cost issue. And the current
functionality is kind of awkward. (Even just the name "rank file" is a
puzzler.) And it's astounding that there is such extensive
diversification of interfaces out there among MPI and OMP
implementations (with only Intel making any attempt to have MPI and OMP
play well with each other). But there is no question that customized
binding mappings are commonplace.