Re: [hwloc-devel] release status

Jeff Squyres Mon, 5 Oct 2009 08:27:32 -0400

On Oct 3, 2009, at 8:21 AM, Fawzi Mohamed wrote:

Ok you are right that storing in the struct might be overkill, and about performance I fully agree, space not so much, especially if you really want to cache all the cpuset for all objects, this still grows quadratically, and allocates a lot of objects.

I'm still not sure that I agree -- I still think we're just quibbling over a few bytes here. It's commonplace to have 2GB RAM per core these days; that number certainly isn't going to go down -- it's likely that it's even going to go up.

So yes, if every process running on every core has a cpuset, you multiply (for example) a 4k cpuset data structure times 1,000 processors (cores): 4MB. But consider that each of those 1,000 processors will have 2GB or more of RAM. That's 2 terabytes; who cares about 4MB when you have 2TB? That's 6 orders of magnitude difference; put differently, 4MB is 0.0002 percent of 2TB.

I agree that we shouldn't be wasteful, but the difference we're talking about here is in the noise.
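For anyone who wants to check the arithmetic above, here is a minimal sketch in C. The figures (4 kB per cpuset, 1,000 cores, 2 GB of RAM per core) are the illustrative ones from this message, taken in decimal units; the function names are made up for the example and are not part of hwloc.

```c
#include <assert.h>
#include <stdint.h>

/* Illustrative figures from the message, in decimal units (assumption). */
#define CPUSET_BYTES       UINT64_C(4000)        /* "4k" cpuset structure */
#define RAM_PER_CORE_BYTES UINT64_C(2000000000)  /* 2 GB per core */

/* Total bytes spent on cpusets if each of ncores holds one. */
uint64_t total_cpuset_bytes(uint64_t ncores)
{
    return ncores * CPUSET_BYTES;
}

/* Total RAM across ncores cores. */
uint64_t total_ram_bytes(uint64_t ncores)
{
    return ncores * RAM_PER_CORE_BYTES;
}

/* Cpuset storage as a percentage of total RAM. */
double cpuset_percent_of_ram(uint64_t ncores)
{
    return 100.0 * (double)total_cpuset_bytes(ncores)
                 / (double)total_ram_bytes(ncores);
}
```

With ncores = 1000 this gives 4,000,000 bytes of cpusets against 2,000,000,000,000 bytes of RAM. Note that the percentage does not depend on the core count in this model, since both totals scale linearly with it.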

That was the reason I was advocating having a function returning the cpuset from an object (sparse cpuset would also be a solution).
Anyway the real issue here is the API I think.
I would say that the best solution is
- keep cpuset a structure (not just void*), so it can be just a void* or something more complex in the future without API changes

I'm not sure I parsed the above sentence properly -- I read it as advocating 2 different things. Can you explain?

- add functions to allocate/deallocate/copy it, and make it clear that these should be called on the cpusets returned by other functions (i.e. clarify ownership transfers).

Such functions would be necessary only if there are non-public members of the struct or if you want to deep copy the struct, right? They would also apply if we return opaque handles, not public structures.

- functions that are possibly inlined are ok (obviously changing them breaks the binary compatibility), but recompilation fixes them, and other languages can still use the non inline function that is part of the lib

The usual reason for inlining is a need for performance -- and I honestly think that we don't need it. So if the usual question for inlining is "why not?", I turn that question around and ask "if not for performance, why?". :-)

- macros I don't like, they make binding to other languages more difficult, as one has to write either a thin glue layer, or duplicate the macro, which will not stay in sync with lib changes automatically (cpuset has some macros, but the structure is so simple that I just used another bit compatible type when binding to D).


Agreed.  Macros = evil; should only be used where absolutely necessary.

To make the release quickly I think that just adding the requested functions (alloc/dealloc would be noops at the moment) would be good. Then in the future one can switch to dynamic or sparse cpuset without user visible changes (apart from recompilation).

Agreed; that is a good goal (switch to a new back-end type without needing to change user code).
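To make the proposal concrete, here is a rough sketch of what such an interface might look like. This is not the hwloc API: every name below (demo_cpuset_t, demo_cpuset_init, and so on) is hypothetical, and the fixed-size bitmask is just one possible current back end. The point is that init/destroy do almost nothing today because storage is inline, yet callers who always go through them would only need a recompile if the struct later gained a dynamic or sparse back end. Everything is a plain function rather than a macro, so other languages can bind to it directly.

```c
#include <assert.h>
#include <limits.h>
#include <string.h>

/* Hypothetical names throughout; sizes are arbitrary for the sketch. */
#define DEMO_CPUSET_WORDS  4
#define DEMO_BITS_PER_WORD (CHAR_BIT * sizeof(unsigned long))
#define DEMO_CPUSET_NBITS  (DEMO_CPUSET_WORDS * DEMO_BITS_PER_WORD)

/* A struct rather than a bare void*, so its members can change later. */
typedef struct demo_cpuset {
    unsigned long bits[DEMO_CPUSET_WORDS];
} demo_cpuset_t;

/* Storage is inline today, so there is nothing to allocate; just clear.
 * A dynamic back end would allocate here and could fail. */
int demo_cpuset_init(demo_cpuset_t *set)
{
    memset(set, 0, sizeof *set);
    return 0;
}

/* No-op today; a dynamic back end would release memory here. */
void demo_cpuset_destroy(demo_cpuset_t *set)
{
    (void)set;
}

/* Copy src into dst; the caller owns dst and must destroy it. */
int demo_cpuset_copy(demo_cpuset_t *dst, const demo_cpuset_t *src)
{
    *dst = *src;
    return 0;
}

/* Mark one CPU as a member; returns -1 if cpu is out of range. */
int demo_cpuset_set(demo_cpuset_t *set, unsigned cpu)
{
    if (cpu >= DEMO_CPUSET_NBITS)
        return -1;
    set->bits[cpu / DEMO_BITS_PER_WORD] |= 1UL << (cpu % DEMO_BITS_PER_WORD);
    return 0;
}

/* Returns 1 if cpu is a member, 0 otherwise (including out of range). */
int demo_cpuset_isset(const demo_cpuset_t *set, unsigned cpu)
{
    if (cpu >= DEMO_CPUSET_NBITS)
        return 0;
    return (int)((set->bits[cpu / DEMO_BITS_PER_WORD]
                  >> (cpu % DEMO_BITS_PER_WORD)) & 1UL);
}
```

A caller would init a set, set or copy as needed, and destroy it when done. Code written that way never touches the bits member directly, which is what would let the back end change underneath it.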


--
Jeff Squyres
jsquy...@cisco.com
