Hi, We have users submitting jobs with afterok dependency on. When a node is not working properly (filesystem problems) IO_setup at slurmstepd fails and user program never executed. However the job is reported as completed to slurmctl so jobs dependent on that job are executed.
Probably there's a good reason for this but I can not see it. We are working with slurm-2.1.6 but I have checked out last slurm version and it seems the same. WARNING / LEGAL TEXT: This message is intended only for the use of the individual or entity to which it is addressed and may contain information which is privileged, confidential, proprietary, or exempt from disclosure under applicable law. If you are not the intended recipient or the person responsible for delivering the message to the intended recipient, you are strictly prohibited from disclosing, distributing, copying, or in any way using this message. If you have received this communication in error, please notify the sender and destroy and delete any copies you may have received. http://www.bsc.es/disclaimer.htm
