Yes. If you look at the README, gridmix-env, and the generateData script, you should be able to alter the job mix to match your requirements. In particular, you probably want to look closely at the number of small, medium, and large jobs for each run. For a three-node cluster, you might want to try running only the small jobs (possibly the medium jobs). Note that you don't have to generate the entropy dataset if you don't plan on running any large jobs (what it tests is not interesting on three nodes anyway).

Note that the "real" dataset is 1000 times larger than what generateData produces by default; a smaller dataset may let you keep the total number of jobs up, though you should also be wary of the load on the submitting node (see submissionScripts/sleep_if_too_busy). Keep in mind that each node may also store (possibly uncompressed) copies of the datasets as intermediate map outputs, so budgeting for local disk space will also be important while gridmix runs, particularly for "medium" jobs.

Good luck. -C
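To make the advice above concrete, a scaled-down run amounts to editing a few settings in gridmix-env before invoking generateData and the submission scripts. The variable names below are illustrative placeholders, not confirmed GridMix settings — check your own copy of gridmix-env for the actual names your version uses:

```shell
# Sketch of the kind of edits described above (a config fragment,
# sourced by the gridmix scripts; variable names are hypothetical).

# Shrink the generated dataset further. generateData's default is
# already 1/1000th of the "real" dataset; a small cluster can go lower.
export COMPRESSED_DATA_BYTES=$((2 * 1024 * 1024 * 1024))   # ~2 GB; hypothetical knob

# Run only small (and perhaps a few medium) jobs on a 3-node cluster.
# Setting large jobs to zero also means the entropy dataset is not needed.
export NUM_OF_SMALL_JOBS=8     # hypothetical name
export NUM_OF_MEDIUM_JOBS=2    # hypothetical name
export NUM_OF_LARGE_JOBS=0     # hypothetical name
```

With the large-job count at zero, the entropy-dataset step of generateData can be skipped entirely, and local-disk budgeting only has to cover the small/medium intermediate map outputs.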

On Sep 17, 2008, at 3:27 PM, Joel Welling wrote:

Hi folks;
 I'd like to try the gridmix benchmark on my small cluster (3 nodes at
8 cores each, Lustre with IB interconnect).  The documentation for
gridmix suggests that it will take 4 hours on a 500 node cluster, which
suggests it would take me something like a week to run. Is there a way
to scale the problem size back?  I don't mind the file size too much,
but the running time would be excessive if things scale linearly with
the number of nodes.

Thanks,
-Joel
