Hello everyone,
I have set up a 3-node cluster (1 head node + 2 client nodes). I am having trouble setting up a batch system to run multiple jobs.
Here is the exact problem, with all the output:
1) When I submit the script with qsub, only one job runs at a time (at this point the Maui scheduler is running and pbs_sched is stopped),
but "pbsnodes -a" shows the same job running on both client nodes.
[EMAIL PROTECTED] PBS]$ qsub ./test
372.cluster
[EMAIL PROTECTED] PBS]$ qsub ./test
373.cluster
[EMAIL PROTECTED] PBS]$ qsub ./test
374.cluster
[EMAIL PROTECTED] PBS]$ qstat -a
cluster:
                                                            Req'd  Req'd   Elap
Job ID          Username Queue    Jobname    SessID NDS TSK Memory Time  S Time
--------------- -------- -------- ---------- ------ --- --- ------ ----- - -----
372.cluster     user1    workq    test          721   1   1     -- 10000 R    --
373.cluster     user1    workq    test           --   1   1     -- 10000 Q    --
374.cluster     user1    workq    test           --   1   1     -- 10000 Q    --
[EMAIL PROTECTED] PBS]$ pbsnodes -a
oscarnode1.djscoe
state = free
np = 2
properties = all
ntype = cluster
jobs = 0/372.cluster
oscarnode2.djscoe
state = free
np = 2
properties = all
ntype = cluster
jobs = 0/372.cluster
2) I then stopped the Maui scheduler, started the PBS scheduler (pbs_sched), and submitted the scripts again. This time qstat shows two jobs running.
However, "pbsnodes -a" shows both of them running on the same node, and that node's state is still reported as free.
Here is the output:
[EMAIL PROTECTED] PBS]$ qsub ./test
375.cluster
[EMAIL PROTECTED] PBS]$ qsub ./test
376.cluster
[EMAIL PROTECTED] PBS]$ qsub ./test
377.cluster
[EMAIL PROTECTED] PBS]$ qstat -a
cluster:
                                                            Req'd  Req'd   Elap
Job ID          Username Queue    Jobname    SessID NDS TSK Memory Time  S Time
--------------- -------- -------- ---------- ------ --- --- ------ ----- - -----
375.cluster     user1    workq    test          780   1   1     -- 10000 R    --
376.cluster     user1    workq    test          837   1   1     -- 10000 R    --
377.cluster     user1    workq    test           --   1   1     -- 10000 Q    --
[EMAIL PROTECTED] PBS]$ pbsnodes -a
oscarnode1.djscoe
state = free
np = 2
properties = all
ntype = cluster
jobs = 0/376.cluster, 0/375.cluster
oscarnode2.djscoe
state = free
np = 2
properties = all
ntype = cluster
Here is the file /var/spool/pbs/server_priv/nodes:
oscarnode1.djscoe all
oscarnode2.djscoe all
Does this mean that whichever scheduler I use, the jobs run on one node only? (oscarnode2 for Maui and oscarnode1 for PBS)
I checked the file maui.cfg. As Jeremy said, it already had these entries:
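To make the two "pbsnodes -a" listings above easier to compare, the jobs on each node can be tallied with a small script. This is only a rough sketch in Python: the function name jobs_per_node is hypothetical, and it assumes the layout shown in this post (a node name on a line of its own, followed by "key = value" lines, with jobs listed as slot/jobid).

```python
def jobs_per_node(pbsnodes_output):
    """Map each node name to the job IDs found on its 'jobs =' line."""
    nodes = {}
    current = None
    for raw in pbsnodes_output.splitlines():
        line = raw.strip()
        if not line:
            continue
        if "=" not in line:
            # Assumption: a line without '=' starts a new node record.
            current = line
            nodes[current] = []
            continue
        key, value = (part.strip() for part in line.split("=", 1))
        if key == "jobs" and current is not None:
            for entry in value.split(","):
                entry = entry.strip()
                if entry:
                    # Entries look like "0/376.cluster" (slot/jobid).
                    nodes[current].append(entry.split("/", 1)[-1])
    return nodes


# The second listing from this post (PBS scheduler running).
SAMPLE = """\
oscarnode1.djscoe
state = free
np = 2
properties = all
ntype = cluster
jobs = 0/376.cluster, 0/375.cluster
oscarnode2.djscoe
state = free
np = 2
properties = all
ntype = cluster
"""

if __name__ == "__main__":
    for node, jobs in jobs_per_node(SAMPLE).items():
        print(node, len(jobs), jobs)
```

Run on the second listing, it reports both jobs on oscarnode1.djscoe and none on oscarnode2.djscoe. On a live system the text could instead come from the real "pbsnodes -a" command.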
NODEACCESSPOLICY DEDICATED
JOBNODEMATCHPOLICY EXACTNODE
I would also like to know which nodes file Maui refers to, because the node that PBS runs the job on and the node that Maui runs the job on are different.
--thanks
