Look into this

https://arc.liv.ac.uk/pipermail/gridengine-users/2008-October/020911.html

We had a similar problem and that fixed it for us.


On 5/1/18, 11:25 PM, "users-boun...@gridengine.org on behalf of Simon Matthews" 
<users-boun...@gridengine.org on behalf of simon.d.matth...@gmail.com> wrote:

    After deleting all the jobs, it still won't schedule any new jobs. The
    qmaster "messages" file has this in it:

    05/01/2018 20:19:51|worker|sgemasterU5|E|scheduler tries to schedule
    job 4409436.1 twice
    05/01/2018 20:19:51|worker|sgemasterU5|W|Skipping remaining 156 orders

    But I can't delete 4409436:
    qdel -u build -j 4409436
    The job -j of user(s) build does not exist

    Simon

    On Tue, May 1, 2018 at 2:53 PM, Simon Matthews
    <simon.d.matth...@gmail.com> wrote:
    > A little more info.
    >
    > After moving the spool directory and re-starting the qmaster, I
    > deleted all the jobs with qdel and the qmaster showed no jobs.
    > However, after a further re-start it now shows over 2396 active jobs,
    > with "-2251" available slots. I assume that somehow the history of
    > jobs finishing was lost so the qmaster thinks the jobs are still
    > active. I am trying another delete!
    >
    > Simon
    >
    > On Tue, May 1, 2018 at 2:36 PM, Simon Matthews
    > <simon.d.matth...@gmail.com> wrote:
    >> I am running SoGE 8.1.8, using BDB spooling.
    >>
    >> Last night the spool directory ran out of disk space (I think),
    >> causing a freeze of all jobs.  I moved the spool directory (~4GB at
    >> that time) to another partition, with more space.
    >>
    >> However, jobs are still not running. The qmaster appears to be running
    >> and I think reading from the spool directory.
    >>
    >> I would like to clean out all the jobs (old and current) and start again.
    >>
    >> Is there a safe way to clean out the spool directory when using BDB
    >> spooling? I was not able to backup the configuration because SoGE
    >> doesn't provide a copy of db_dump and versions of this program from
    >> other distributions fail with an error relating to the version of the
    >> database.
    >>
    >> Any suggestions?
    >>
    >> Simon
    _______________________________________________
    users mailing list
    users@gridengine.org
    https://gridengine.org/mailman/listinfo/users



________________________________
 This electronic message is intended for the use of the named recipient only, 
and may contain information that is confidential, privileged or protected from 
disclosure under applicable law. If you are not the intended recipient, or an 
employee or agent responsible for delivering this message to the intended 
recipient, you are hereby notified that any reading, disclosure, dissemination, 
distribution, copying or use of the contents of this message including any of 
its attachments is strictly prohibited. If you have received this message in 
error or are not the named recipient, please notify us immediately by 
contacting the sender at the electronic mail address noted above, and destroy 
all copies of this message. Please note, the recipient should check this email 
and any attachments for the presence of viruses. The organization accepts no 
liability for any damage caused by any virus transmitted by this email.

_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to