On Wed, 14 Sep 2011 10:44:36 -0700, Danny Auble wrote:
Have you had a look at the HTC documentation?
http://schedmd.com/slurmdocs/high_throughput.html
Yes, I have. I was able to improve the scheduling speed by tuning the
configuration (before that, I couldn't even queue 65k jobs before
getting timeouts and abysmal performance). Meanwhile, I will update to
2.2 to get larger job counts, but still that doesn't address all my
concerns. Please be patient :)
Without knowing what your real objective is it is hard to prescribe a
real solution.
From your description it seems strange you would have the script
sbatch is calling call sbatch once again. What are you trying to
accomplish there?
Wouldn't it just be easier to run this script outside of an
allocation?
Ok, I will restate my problem in more practical terms. Please ask if
anything is unclear, or if you have any idea on how to improve the
behavior.
I'm running bioinformatics batches of various kinds on genetic data. A
typical analysis involves running a short job (~ 10 minutes) once for
each polymorphism we have (roughly 100k times in the smallest case). A
perfect candidate for distribution, since every step within a single
stage is independent.
Analyses are usually multi-stage:
- we run "stage 1" (first 100k jobs)
- collect and aggregate data (a single job depending on "stage 1")
- run "stage 2" using collected data (another 100k jobs)
- (repeat)
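For concreteness, the staged workflow above can be sketched with plain
sbatch calls plus a singleton dependency. This is only a hedged sketch:
script names like stage1.sh and collect.sh are made up, and
--dependency=singleton assumes a SLURM version that supports it. The
submit wrapper just echoes, so this is a dry run; swap the echo for the
real sbatch on a cluster:

```shell
#!/bin/sh
# Dry-run wrapper: prints the sbatch command instead of running it.
# Replace "echo sbatch" with plain "sbatch" on a real cluster.
submit() { echo sbatch "$@"; }

# Stage 1: one independent job per polymorphism, all sharing one name.
for i in 1 2 3; do                 # 1..100000 in the real case
    submit --job-name=stage1 stage1.sh "$i"
done

# Aggregation: --dependency=singleton holds this job until every other
# job with the same name and user has terminated.
submit --job-name=stage1 --dependency=singleton collect.sh
```

The same pattern repeats for each further stage, with a new shared name.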
Let's assume queuing ~200k jobs is not a problem with 2.2.
First issue: "squeue" takes forever with more than 5000 jobs. If more
than one user is scheduling a workflow like this, it becomes impossible
to use at all. Managing the queue itself also becomes impossible (e.g.
killing just the "stage 1" jobs). I would like to group the first
100k jobs under a single "id", so that I know that jobs 1-100k belong to
"stage 1".
My impression from reading the docs is that I can create an allocation
and run "steps" to achieve this behavior. sbatch or salloc is the
easiest way, but since queuing that many jobs is itself time-consuming,
running the queuing script on the queue itself seemed a perfect solution
(hence sbatch --jobid within sbatch). This method (using salloc or
sbatch) also seems to work fine if I put in a fat "sleep" to keep the
allocation alive.
Also, consider that eventually I will need to queue jobs from within a
script anyway (the final step of "stage 1" might schedule "stage 2"
itself).
Second issue: job dependencies. If I can use a single job with steps, I
can easily put the dependencies for "stage 2" on a single id and
schedule everything "outside" of slurm. If this is not possible, then I
need a barrier (like "wait" in a script, as you suggested) so that as
soon as "stage 1" finishes I can schedule the next stages from within
the batch itself.
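Failing steps, the same barrier can be built from plain job ids: capture
each submission's id and hand the whole list to --dependency=afterok. A
sketch, assuming sbatch's usual "Submitted batch job N" output line (the
fake echo lines below stand in for real submissions, so it dry-runs):

```shell
#!/bin/sh
# parse_id extracts the numeric id from "Submitted batch job N".
parse_id() { awk '{print $4}'; }

deps=""
for i in 1 2 3; do
    # dry run: a fake submission line stands in for real sbatch output
    jid=$(echo "Submitted batch job 10$i" | parse_id)
    deps="$deps:$jid"
done

echo sbatch --dependency="afterok$deps" collect.sh
# prints: sbatch --dependency=afterok:101:102:103 collect.sh
```

With 100k jobs the dependency string gets long, which is one more reason
the singleton-by-name form may be preferable.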
Right now, to work around these issues, I'm artificially limiting the
job count by scheduling N/Z jobs, where each resulting job runs Z steps
sequentially. This limits parallelism, however. To work around the
dependency issues, I'm looping in a script around "squeue" to see if
a pre-determined stage has finished. Ugly, but having people wait to
schedule more jobs (and thus letting the machines idle) is worse.
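For what it's worth, that polling loop can at least be contained in one
small helper that filters by job name (standard squeue options only; the
60-second interval is a guess, tune to taste). On a machine where squeue
is not in PATH the command substitution is empty and the function
returns immediately, which also makes it easy to dry-run:

```shell
#!/bin/sh
# Block until no job named "$1" (for the current user) remains queued
# or running. -h drops the header, -o %i prints only job ids.
wait_for_stage() {
    while [ -n "$(squeue -h -o %i -n "$1" -u "$USER" 2>/dev/null)" ]; do
        sleep 60
    done
}

# Usage as a barrier between stages:
# wait_for_stage stage1
# echo "stage 1 drained, submitting stage 2"
```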