> From: Reuti <[email protected]>
> Subject: Re: [gridengine users] Configure gridengine on CentOS 6.3
> Date: Tue, 30 Oct 2012 11:27:49 +0100
>
>> Just use the version you have already in the shared /usr/sge or your
>> particular mountpoint.
> 
> I should probably try this first, at least to verify that it's
> working. But later I would like to migrate to the CentOS on all my
> exechosts and leave the installation to somebody else.

I did this and it worked out fine on the first machine I migrated.
However, on the next set of machines I run into the problem where the
submitted job will cause the queue to go into the error state.

I observe that:

1) It will not be submitted
2) The queue will be marked with the 'E' state
3) I get an e-mail saying
    Shepherd pe_hostfile:
    node 1 queue@node UNDEFINED
4) The node will log the following in the spool/node/messages file:
    11/07/2012 15:33:07|  main|node|E|shepherd of job 48548.1 exited with exit 
status = 11
5) qstat -j jobnumber returns

    error reason    1:          11/07/2012 15:33:06 [555:29681]: unable to find 
job file "/work/gridengine/spool/node/job_scr
    scheduling info:            (Collecting of scheduler job information is 
turned off)

How can I debug this further?

Best regards
//Petter
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to