Dear Paul,

You will have to change the state of the job in MySQL. It is the only way
I know to deal with this kind of job. Just change the end time and state
of the "running" job. I don't know exactly which state you should set, but
look up some job whose state is cancelled or node fail and use that number.
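
As a rough sketch of what I mean, the snippet below only builds the two SQL
statements as strings; it does not connect to anything. The table name
(<cluster>_job_table) and the columns id_job, state and time_end are my
assumptions about the accounting schema, as is time_end being a Unix
timestamp, so check them against your own database first. The helper names
are made up for illustration.

```python
import re
import time


def _job_table(cluster):
    # Assumed naming scheme: "<cluster>_job_table". Verify in your database.
    if not re.fullmatch(r"[A-Za-z0-9_]+", cluster):
        raise ValueError("unexpected cluster name: %r" % cluster)
    return cluster + "_job_table"


def lookup_state_sql(cluster, reference_job_id):
    """SQL to read the numeric state of a job known to be cancelled or node fail."""
    return "SELECT state FROM {} WHERE id_job = {};".format(
        _job_table(cluster), int(reference_job_id)
    )


def fix_job_sql(cluster, stuck_job_id, new_state, end_time=None):
    """SQL to set the end time and state of the stuck "running" job."""
    if end_time is None:
        # Assumption: time_end holds seconds since the epoch.
        end_time = int(time.time())
    return "UPDATE {} SET state = {}, time_end = {} WHERE id_job = {};".format(
        _job_table(cluster), int(new_state), int(end_time), int(stuck_job_id)
    )
```

You would run the first statement for a job that sacct reports as cancelled
or node fail, then pass the number it returns as new_state for the stuck
job. Dumping the database before running the UPDATE would be prudent.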

Regards,
Carles Fenoy


On Wed, Jun 5, 2013 at 4:48 PM, Paul Edmon <[email protected]> wrote:

>
> Do you mean the node that hosts the slurmdb? Or the node that runs
> slurmctld?  Or are you speaking of the nodes on which that job ran?
>
> -Paul Edmon-
>
> On 06/05/2013 10:45 AM, Sefa Arslan wrote:
> > If possible, rebooting the worker node is the fastest solution.
> >
> >
> > On 06/05/2013 05:10 PM, Paul Edmon wrote:
> >> I have a job which shows up in sacct as Running but does not show up on
> >> squeue or any other probe of the cluster jobs.  I know this job is long
> >> dead but sacct is under the impression it is still running. I suspect
> >> that this is due to me having to rebuild my database while in
> >> production.  However, I've done this before and hadn't seen this issue
> >> crop up.  Is there a way to remove this job from sacct? scancel does not
> >> work on it.
> >>
> >> -Paul Edmon-
>



-- 
--
Carles Fenoy
