That solution requires working "db_dump" and "db_restore" executables,
which don't appear to be available for the SoGE version.

My eventual solution was to remove the "sge_job" file and re-start the qmaster.

Simon


On Wed, May 2, 2018 at 6:40 AM, Luis Huang <lhu...@nygenome.org> wrote:
> Look into this
>
> https://arc.liv.ac.uk/pipermail/gridengine-users/2008-October/020911.html
>
> We had a similar problem and that fixed it for us.
>
>
> On 5/1/18, 11:25 PM, "users-boun...@gridengine.org on behalf of Simon 
> Matthews" <users-boun...@gridengine.org on behalf of 
> simon.d.matth...@gmail.com> wrote:
>
>     After deleting all the jobs, it still won't schedule any new jobs. The
>     qmaster "messages" file has this in it:
>
>     05/01/2018 20:19:51|worker|sgemasterU5|E|scheduler tries to schedule
>     job 4409436.1 twice
>     05/01/2018 20:19:51|worker|sgemasterU5|W|Skipping remaining 156 orders
>
>     But I can't delete 4409436:
>     qdel -u build -j 4409436
>     The job -j of user(s) build does not exist
>
>     Simon
>
>     On Tue, May 1, 2018 at 2:53 PM, Simon Matthews
>     <simon.d.matth...@gmail.com> wrote:
>     > A little more info.
>     >
>     > After moving the spool directory and re-starting the qmaster, I
>     > deleted all the jobs with qdel and the qmaster showed no jobs.
>     > However, after a further re-start it now shows over 2396 active jobs,
>     > with "-2251" available slots. I assume that somehow the history of
>     > jobs finishing was lost so the qmaster thinks the jobs are still
>     > active. I am trying another delete!
>     >
>     > Simon
>     >
>     > On Tue, May 1, 2018 at 2:36 PM, Simon Matthews
>     > <simon.d.matth...@gmail.com> wrote:
>     >> I am running SoGE 8.1.8, using BDB spooling.
>     >>
>     >> Last night the spool directory ran out of disk space (I think),
>     >> causing a freeze of all jobs.  I moved the spool directory (~4GB at
>     >> that time) to another partition, with more space.
>     >>
>     >> However, jobs are still not running. The qmaster appears to be running
>     >> and I think reading from the spool directory.
>     >>
>     >> I would like to clean out all the jobs (old and current) and start 
> again.
>     >>
>     >> Is there a safe way to clean out the spool directory when using BDB
>     >> spooling? I was not able to backup the configuration because SoGE
>     >> doesn't provide a copy of db_dump and versions of this program from
>     >> other distributions fail with an error relating to the version of the
>     >> database.
>     >>
>     >> Any suggestions?
>     >>
>     >> Simon
>     _______________________________________________
>     users mailing list
>     users@gridengine.org
>     https://gridengine.org/mailman/listinfo/users
>
>
>
> ________________________________
>  This electronic message is intended for the use of the named recipient only, 
> and may contain information that is confidential, privileged or protected 
> from disclosure under applicable law. If you are not the intended recipient, 
> or an employee or agent responsible for delivering this message to the 
> intended recipient, you are hereby notified that any reading, disclosure, 
> dissemination, distribution, copying or use of the contents of this message 
> including any of its attachments is strictly prohibited. If you have received 
> this message in error or are not the named recipient, please notify us 
> immediately by contacting the sender at the electronic mail address noted 
> above, and destroy all copies of this message. Please note, the recipient 
> should check this email and any attachments for the presence of viruses. The 
> organization accepts no liability for any damage caused by any virus 
> transmitted by this email.

_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to