Good examples:
https://hpc.nih.gov/docs/job_dependencies.html
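
In case it helps, here is a minimal Python sketch of the pattern those examples describe: submit the first job, read its job ID, then submit the dependent job exactly once with --dependency=afterany:<id>. It assumes sbatch is on the PATH and relies on its --parsable option to print the job ID; the helper names and the script names are hypothetical, not something from your setup.

```python
import subprocess


def parse_job_id(parsable_output):
    """Extract the job ID from `sbatch --parsable` output.

    That output is the job ID, optionally followed by ';' and a cluster name.
    """
    return parsable_output.strip().split(";")[0]


def dependent_sbatch_args(parent_job_id, script, dep_type="afterany"):
    """Build the sbatch argument list for a job that waits on parent_job_id."""
    return ["sbatch", "--parsable",
            f"--dependency={dep_type}:{parent_job_id}", script]


def submit_chain(first_script, second_script):
    """Submit first_script, then second_script once, dependent on the first.

    Returns the job ID of the dependent job. Raises CalledProcessError
    if either sbatch call exits with a non-zero status.
    """
    first = subprocess.run(["sbatch", "--parsable", first_script],
                           check=True, capture_output=True, text=True)
    parent_id = parse_job_id(first.stdout)
    second = subprocess.run(dependent_sbatch_args(parent_id, second_script),
                            check=True, capture_output=True, text=True)
    return parse_job_id(second.stdout)
```

Calling submit_chain("step1.sh", "macs2.sh") (both names are placeholders) issues exactly two sbatch calls, so the dependent job should show up only once in squeue, in state PD with reason (Dependency), until the first job ends.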

BR

On 2017-03-15 17:37, Álvaro pc wrote:
> Hi again!
> 
> I would really like to understand the behaviour of the --dependency argument.
> 
> Does nobody know anything about it?
> 
> *Álvaro Ponce Cabrera.*
> 
> 
> 2017-03-14 12:31 GMT+01:00 Álvaro pc <alvarop...@gmail.com>:
> 
>     Hi,
> 
>     I'm having problems launching jobs that depend on another job.
> 
>     I'm using the '--dependency=afterany:Job_ID' argument.
> 
>     The problem happens when the queue is full and the new job, which
>     depends on another one (already running), can't enter the queue
>     and needs to wait.
>     Instead of waiting properly to enter the queue, the job tries to
>     enter thousands of times per minute.
> 
>     All the attempts seem to be waiting to enter the queue. Here is
>     a piece of the queue that shows the problem:
> 
>     20217 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20218 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20219 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20220 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20221 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20222 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20223 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20224 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>     20225 UPO Macs2_DM alvaropc PD        0:00 1 (Dependency)
>      4907 UPO notebook    panos  R 64-01:48:56 1 nodo01
>      6454 UPO valinomy jraviles  R  7-05:45:32 1 nodo10
>      6492 UPO input_ra  rbueper  R 13-08:44:42 1 nodo01
>      6493 UPO input_ra  rbueper  R 13-08:44:42 1 nodo05
>      6823 UPO FELIX-No     said  R 13-09:34:42 1 nodo06
>      7219 UPO input_ra  rbueper  R 13-08:44:42 1 nodo05
> 
> 
> 
>     In addition, I'm getting this error in the log/out file: 'sbatch:
>     error: Slurm temporarily unable to accept job, sleeping and retrying'.
>     The error is repeated thousands of times too, once for each
>     attempt of the job to enter the queue.
> 
>     I just want to launch ONE job that waits until another one ends.
> 
>     Any ideas?
> 
>     Thank you so much.
> 
> 
> 
>     *Álvaro Ponce Cabrera.*
> 
> 
> 

-- 
FSU Jena | JULIELab.de/Staff/Benjamin+Redling.html
vox: +49 3641 9 44323 | fax: +49 3641 9 44321
