Hi, My problem concern maui-3.3, torque-2.5.3 and openmpi-1.4 comunication. Exactly, I have problem with running multicore jobs on multi nodes. I read all topics which are connection with my problem and I couldn't find solution. I think it's a problem with maui-3.3 scheduler because if I disable it and use pbs scheduler everything is fine. I read a lot about ENABLEMULTIREQJOBS and JOBNODEMATCHPOLICY and I know that these variables are necessary to run MPI jobs on clustrer. I set these variables in maui config file, but when I run checkconfig command these variables are not set. Below are my system settings, maui config file and output from showconfig command.
---------- System Settings ---------- [r...@ori1 ~]# uname -a Linux ori1 2.6.18-194.17.4.el5xen #1 SMP Tue Oct 26 12:37:47 CEST 2010 x86_64 x86_64 x86_64 GNU/Linux [r...@ori1 ~]# maui --version Maui version 3.3 Copyright 2000-2010 Cluster Resources, Inc, All Rights Reserved for the latest release, see http://clusterresources.com/maui This software includes the Maui Server Module, Copyright 1996 MHPCC, All Rights Reserved This software utilizes the Moab Scheduling Library, version 3.3 Copyright 2000-2010 Cluster Resources, Inc, All Rights Reserved [r...@ori1 ~]# pbs_server --version version: 2.5.3 [r...@ori1 ~]# /usr/lib64/openmpi/1.4-gcc/bin/mpiexec --version mpiexec (OpenRTE) 1.4 Report bugs to http://www.open-mpi.org/community/help/ [r...@ori1 x86_64]# /usr/lib64/openmpi/1.4-gcc/bin/ompi_info | grep tm MCA memory: ptmalloc2 (MCA v2.0, API v2.0, Component v1.4) MCA ras: tm (MCA v2.0, API v2.0, Component v1.4) MCA plm: tm (MCA v2.0, API v2.0, Component v1.4) ------------------------------------- ---------- maui.cfg ---------- # maui.cfg 3.3 SERVERHOST ori1 ADMIN1 root RMCFG[ori1] TYPE=PBS RMPOLLINTERVAL 00:00:10 SERVERPORT 40559 SERVERMODE NORMAL LOGFILE /var/spool/maui/logs/maui.log LOGFILEMAXSIZE 100000000 LOGLEVEL 7 QUEUETIMEWEIGHT 1 BACKFILLPOLICY FIRSTFIT RESERVATIONPOLICY CURRENTHIGHEST NODEALLOCATIONPOLICY MINRESOURCE ENABLEMULTIREQJOBS TRUE ENABLEMULTINODEJOBS TRUE JOBNODEMATCHPOLICY EXACTNODE NODEACCESSPOLICY SHARED ---------------------------------- ---------- showconfig ---------- [r...@ori1 ~]# showconfig NODELOADPOLICY ADJUSTSTATE JOBNODEMATCHPOLICY[1] JOBMAXSTARTTIME[1] INFINITY METAMAXTASKS[1] 0 NODESETPOLICY[1] [NONE] NODESETATTRIBUTE[1] [NONE] NODESETLIST[1] NODESETDELAY[1] 00:00:00 NODESETPRIORITYTYPE[1] MINLOSS NODESETTOLERANCE[1] 0.00 # Priority Weights XFMINWCLIMIT[1] 00:00:00 RMAUTHTYPE[0] CHECKSUM CLASSCFG[simple] DEFAULT.FEATURES=[NONE] QOSPRIORITY[0] 0 QOSQTWEIGHT[0] 0 QOSXFWEIGHT[0] 0 QOSTARGETXF[0] 0.00 QOSTARGETQT[0] 00:00:00 QOSFLAGS[0] QOSPRIORITY[1] 0 QOSQTWEIGHT[1] 0 QOSXFWEIGHT[1] 0 QOSTARGETXF[1] 0.00 QOSTARGETQT[1] 00:00:00 QOSFLAGS[1] RESDEPTH 24 SCHEDCFG[] MODE=NORMAL SERVER=ori1:40559 # RM MODULES: PBS SSS WIKI NATIVE TYPE=PBS SIMEXITITERATION -1 -------------------------------------- Please help me resolve my problem. Thanks in advance. Best Regards Piotr Brona _______________________________________________ mauiusers mailing list [email protected] http://www.supercluster.org/mailman/listinfo/mauiusers
