Re: [OMPI devel] memcpy MCA framework

Bogdan Costescu Mon, 18 Aug 2008 11:29:13 -0400

We don't really need a finer grain knowledge about the processor atcompile time.

There are some other open-source projects which have already donesomething very similar if not identical; one of them is the mediaplayer mplayer (http://www.mplayerhq.hu/). Why not using these asstarting points ?

The second question is how and when to figure out which of theavailable memcpy functions give the best performance.

This depends a lot on whether the job has the nodes all by itself orthe nodes are shared with other jobs - if so, the data transferbetween CPU and RAM while benchmarking can be significantly skewed.

On a homogeneous architecture, this might be a one node selection [Idon't imagine using the modex to spread this information]

Hmm, doesn't sound nice to have n-1 nodes waiting while 1 node doesthe test. Maybe run it on all nodes and compare results ? And warn theuser if different mempcy versions would be chosen..

The really annoying thing here, is that in the best case [in aperfect world] this should be done once per cluster.

... and, in the view of node sharing pointed above, when thebenchmarking can have the nodes all by itself. This sounds very muchlike the collectives tuning, with MCA params to give the admin or userview of how the best performance can be achieved.


--
Bogdan Costescu

IWR, University of Heidelberg, INF 368, D-69120 Heidelberg, Germany
Phone: +49 6221 54 8869/8240, Fax: +49 6221 54 8868/8850
E-mail: bogdan.coste...@iwr.uni-heidelberg.de

Re: [OMPI devel] memcpy MCA framework

Reply via email to