Have you tried restarting slurmctld? We had this issue back a few revisions, but it seemed to go away with a newer rev ( though I couldn't tell you which one ).

Have you tried setting the state to RESUME for the nodes?

What version of Slurm are you running?

Jerry

Paul Thirumalai wrote:
I am not using epilog while launching these jobs. the jobs are simple python scripts that run the hostname command and put thte output in a file that is provided in teh command line.

I can see that file was written. This tells me that the job completed.
When I login to the node I dont see the process running. However squeue still tells me that the job is in CG state.

I stopped all the slurm daemons and restarted them, but the state of th job is still CG and it shows up in squeue

On Tue, Feb 22, 2011 at 2:45 PM, Paul Thirumalai <[email protected] <mailto:[email protected]>> wrote:

    Actually I just figured out that all jobs seem to be stuck in
    COMPLETING state. I am now
    reading https://computing.llnl.gov/linux/slurm/faq.html#comp
<http://www.google.com/url?sa=D&q=https://computing.llnl.gov/linux/slurm/faq.html%23comp>


    I will continue to trouble shoot. If I run into issues, I will
    repost to this thread.


Reply via email to