hi
i have setup a 3 node cluster(1 head node+2 client nodes) using oscar version 2.2.1 on a system using RH 7.3
i m facing problems in batch system configuration i.e. i cannot run multiple jobs at the same time (only 1 is run, rest r queued)
the exact problem is as follows:
-i m using only the pbs scheduler. so at first i shut down the maui scheduler(which comes with oscar).
-without submitting any job, if i run the command 'pbsnodes -a' the state of both the nodes are "free"
-when i submit a small shellscript (which just sends the cpu into sleep state for 20 seconds) multiple times using the qsub command, the first job starts running immediately and the rest are queued.
-now the 'pbsnodes -a' command shows state of node1=job-sharing and state of node2=free. however, none of the nodes are configured as time-shared or temporarily-shared nodes. also, after the 1st job is over, the 2nd job starts running but that too on the 1st node only. somehow or the other, the jobs do not get executed on node2.
-i checked the file sched_logs and found the error to be:
"cannot find enough right type of nodes to run jobs"
-however, i (as a root) can run jobs on node2 forcibly by using
"qrun -H node2 432.cluster"
PS:i m using RH 7.3 and i am not planning to upgrade (neither oscar nor Red Hat Linux)
--thanks
-maulin
Contact brides & grooms FREE! Only on www.shaadi.com. Register now! ------------------------------------------------------- This SF.Net email is sponsored by: IBM Linux Tutorials Free Linux tutorial presented by Daniel Robbins, President and CEO of GenToo technologies. Learn everything from fundamentals to system administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users
