2 obvious problems

(1) Your queue instances are in error state E which means no jobs will run ever. State E is a peristent error that must be manually cleared by an SGE admin

(2) You have 0 set in the "slots" value for your mat-lab queue. This means that unless slots are inherited by the exec hosts itself your queue is going to apply a value of 0 slots

So quick advice ..

- Clear the error states from the matlab queue instances
- Set some manual # of slots in your matlab queue just to see if it makes a difference

-Chris


Eric Kaufmann wrote:
Here is some of the requested information.

qstat -f -q matlab
queuename                      qtype resv/used/tot. load_avg arch
    states
---------------------------------------------------------------------------------
[email protected]      BIP   0/0/0          0.00     lx26-amd64    E
---------------------------------------------------------------------------------
[email protected]      BIP   0/0/0          0.00     lx26-amd64    E
---------------------------------------------------------------------------------
[email protected]      BIP   0/0/0          0.00     lx26-amd64    E

qconf -sql
Dcradle
all.q
cdt
check
clong
goodson
linda
long
matlab
sapt
schrod
std

qconf -sq matlab
qname                 matlab
hostlist              @matlab
seq_no                0
load_thresholds       NONE
suspend_thresholds    NONE
nsuspend              1
suspend_interval      00:05:00
priority              0
min_cpu_interval      00:05:00
processors            UNDEFINED
qtype                 BATCH INTERACTIVE
ckpt_list             NONE
pe_list               matlabPE
rerun                 FALSE
slots                 0
tmpdir                /tmp
shell                 /bin/bash
prolog                NONE
epilog                NONE
shell_start_mode      posix_compliant
starter_method        NONE
suspend_method        NONE
resume_method         NONE
terminate_method      NONE
notify                00:00:60
owner_list            NONE
user_lists            NONE
xuser_lists           NONE
subordinate_list      NONE
complex_values        NONE
projects              NONE
xprojects             NONE
calendar              NONE
initial_state         default
s_rt                  INFINITY
h_rt                  INFINITY
s_cpu                 INFINITY
h_cpu                 INFINITY
s_fsize               INFINITY
h_fsize               INFINITY
s_data                INFINITY
h_data                INFINITY
s_stack               INFINITY
h_stack               INFINITY
s_core                INFINITY
h_core                INFINITY
s_rss                 INFINITY
h_rss                 INFINITY
s_vmem                INFINITY
h_vmem                INFINITY



Here is the output of

On Mon, Aug 8, 2011 at 4:05 PM, Chris Dagdigian <[email protected]
<mailto:[email protected]>> wrote:


    If you post the output of 'qconf -sq <matlab queue name>' we can
    provide more targeted advice.

    qstat -f -q <queuename> output might be useful as well just so we
    can be sure your nodes are actually up and not in error state

    It sounds as though you have a cluster queue set up without any
    available hosts configured within it? Your hosts entry for the queue
    should say "@allhosts" or "@matlabhosts" however you set it up. It
    should not be "@/" - it has to name or reference an existing and
    real SGE hostgroup. It may be possible you have a hostgroup created
    without any actual hosts defined within it.

    Showing the 'qconf -s ...' output for the PE, queue and hostgroups
    would help


    -Chris






    Eric Kaufmann wrote:

        I am running SGE 6.2 on a Rocks 5.2 cluster. I am trying to add
        a new
        parallel environment and queue for Matlab. I was able to add
        both. The
        queue for Matlab shows zero slots available. I did create a
        matlab host
        group. This shows up in the Hostgroup list. In the Attributes for
        Host/Host Group @/ is listed but that is all.

        I have other queues where machines are also listed in the
        Attributes for
        Host/Host Group.

        What am I missing here? Is there also a way to set up a parallel
        environment so only one que can use it?

        Thanks,

        Eric


_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to