Re: [gt-user] How to set cores-per-node in WS job submission?

Stuart Martin Tue, 06 May 2008 11:50:03 -0700

Jan,

I think that was well put.

GridWay would be another option here. Keeps a familiar batch queueinteraction for users, but uses the Globus tools under the covers:MDS, GRAM, delegation service, RFT, GridFTP, ... It can be used as a"global queue" for one or more users for submitting jobs on grids.


-Stu

On May 6, 2008, at May 6, 1:32 PM, Jan Ploski wrote:

Steve White wrote:
Jan,
I agree with your assessment that the need to adjust the memory useper
process is a general one in cluster job submission, and that it is in
some way implemented by any underlying job management system, andthat
these extensions ought not to be PBS-specific.
I also looked at your "messy solution". (The code looks veryprofessional,really.) It won't do for my purposes, because I need to present aminimal,
easily understood solution.
Let me explain my situation:
None of the compute resources is under my control.  I can point out
problems to admins, that is all.
I have been assigned two jobs.
I and our users are familiar with doing conventional cluster jobsubmission. One job was to bring them into the grid fold, showingthem the advantagesof globusrun-ws. If it can be shown to be really a cross-platformsolution, giving them the ability to (almost) effortlessly switchbetween grid clusters, the effort will be a success.My other job is to write a report on practical MPI job submissionover
the grid.
We have come a long way, but still have to deal with a couple ofpracticaldetails. At this point, it looks like both of them will end up aswork-arounds to incomplete implementation of a job submissioninterface
in Globus.
If with a future release of Globus, these issues can be dealt with,grid
job submission will look very attractive to real researchers.
Hi,
Based on my experience with Globus, you might be following a wrongroute (the route to disenchantment). I view Globus more as amiddleware that has to be adapted (as in: "wrapped around" or"slightly modified") according to your users' needs and which playsan important role behind the scenes, but it probably should not beexposed directly to users as a drop-in replacement for theirfamiliar job submission tools.
There is a reason for that more important than the limitations youhave discovered so far: Globus doesn't ship with command-line jobmanagement commands on par with those of TORQUE/Maui, Condor or SGE.If you let users submit jobs with globus-job-submit, the next thingthey are going to ask you is "how can I see what jobs I havesubmitted", "how can I cancel the job or resubmit it elsewhere", "ismy job running or not", "why is my job not running", "when is my jobgoing to start", etc.
You need something in front of Globus to make your users' lifebearable. Some projects lean toward application-specific web portals(I think that's AstroGrid's approach). In our project, we havedeployed a largely application-agnostic frontend based on Condor-G,but even so there was some customization and some user trainingrequired. The Condor-G approach might be relevant for you because itcovers the scenario of making a transparent transition from a localbatch system to a Grid - the Condor tools for submitting jobs andstatus querying are pretty much the same regardless of whether yourjob goes to a machine from a local pool (equivalent to an SGE or PBS-managed cluster) or to a pool of Globus hosts. (In fact, Condor cansubmit to GT2 [gLite], GT4, Unicore, and some more Grid middlewares.)
The disadvantage of Condor is that it is a rather huge softwareproduct and trying to understand all of it can be daunting. Still, Isuppose you could get the Grid submission piece of it running in acouple of hours if you wish to give it a try (by following ourtutorials and asking questions where necessary).
Regards,
Jan Ploski

Re: [gt-user] How to set cores-per-node in WS job submission?

Reply via email to