Eygene, I tried maui p21 and configured maui.cfg with AUTHTYPE CHECKSUM. However, I got error of checksum does not match. Can you tell what to fix the checksum key OR what necesary step I should do ?
> SERVERHOST cluster-ib-1 > # primary admin must be first in list > ADMIN1 root > > # Resource Manager Definition > > RMCFG[cluster-ib-1] TYPE=WIKI > RMPORT 7321 > RMHOST cluster-ib-1 > RMAUTHTYPE[cluster-ib-1] CHECKSUM ALERT: checksum does not match (7771069aaa6d32a4:37a0c962c3f39618) request 'TS=1225919856 AUTH=slurm DT=SC=0 ARG=4#cluster-ib-1:STATE=Idle;ARCH=ppc64;OS=Linux;CMEMORY=1;CDISK=0;CPROC=1;#cluster-ib' 11/05 15:17:36 ERROR: cannot receive data from server cluster-ib-1:7321 ========== More info from maui.log: 11/05 15:17:36 MRMClusterQuery() 11/05 15:17:36 MWikiClusterLoadInfo(cluster-ib-1,RCount,EMsg,SC) 11/05 15:17:36 MWikiDoCommand(cluster-ib-1,7321,9000000,CHECKSUM,CMD=GETNODES ARG=0:ALL,Data,DataSize,SC) 11/05 15:17:36 MSUSendData(S,9000000,TRUE,FALSE) 11/05 15:17:36 INFO: packet sent (78 bytes of 78) 11/05 15:17:36 INFO: command sent to server 11/05 15:17:36 INFO: message sent: 'CMD=GETNODES ARG=0:ALL' 11/05 15:17:36 MSURecvData(S,9000000,TRUE,SC,EMsg) 11/05 15:17:36 MSURecvPacket(7,BufP,9,NULL,9000000,SC) 11/05 15:17:36 MSURecvPacket(7,BufP,394,NULL,9000000,SC) 11/05 15:17:36 ALERT: checksum does not match (7771069aaa6d32a4:37a0c962c3f39618) request 'TS=1225919856 AUTH=slurm DT=SC=0 ARG=4#cluster-ib-1:STATE=Idle;ARCH=ppc64;OS=Linux;CMEMORY=1;CDISK=0;CPROC=1;#cluster-ib' 11/05 15:17:36 ERROR: cannot receive data from server cluster-ib-1:7321 11/05 15:17:36 MSUDisconnect(S) 11/05 15:17:36 ALERT: cannot get node list from WIKI RM 11/05 15:17:36 ALERT: cannot load cluster resources on RM (RM 'cluster-ib-1' failed in function 'clusterquery') 11/05 15:17:36 WARNING: no resources detected 11/05 15:17:36 MRMWorkloadQuery() 11/05 15:17:36 MWikiWorkloadQuery(cluster-ib-1,JCount,SC) 11/05 15:17:36 MWikiDoCommand(cluster-ib-1,7321,9000000,CHECKSUM,CMD=GETJOBS ARG=0:ALL,Data,DataSize,SC) 11/05 15:17:36 MSUSendData(S,9000000,TRUE,FALSE) 11/05 15:17:36 INFO: packet sent (77 bytes of 77) 11/05 15:17:36 INFO: command sent to server 11/05 15:17:36 INFO: message sent: 'CMD=GETJOBS ARG=0:ALL' 11/05 15:17:36 MSURecvData(S,9000000,TRUE,SC,EMsg) 11/05 15:17:36 MSURecvPacket(7,BufP,9,NULL,9000000,SC) 11/05 15:17:36 MSURecvPacket(7,BufP,301,NULL,9000000,SC) 11/05 15:17:36 ALERT: checksum does not match (47611b22590d32a0:bbe6843806e4b6ae) request 'TS=1225919856 AUTH=slurm DT=SC=0 ARG=1#52888:STATE=Removed;UPDATETIME=1225919502;WCLIMIT=31536000;TASKS=0;DPROCS=1;QUEUE' 11/05 15:17:36 ERROR: cannot receive data from server cluster-ib-1:7321 11/05 15:17:36 MSUDisconnect(S) 11/05 15:17:36 ALERT: cannot get job list from WIKI RM 11/05 15:17:36 ALERT: cannot load cluster workload on RM (RM 'cluster-ib-1' failed in function 'workloadquery') 11/05 15:17:36 WARNING: no workload detected 11/05 15:17:36 MStatClearUsage(node,Active) 11/05 15:17:36 MClusterUpdateNodeState() 11/05 15:17:36 MQueueSelectAllJobs(Q,HARD,ALL,JIList,DP,Msg) Regards, Hien Nguyen Linux Technology Center (Austin) Phone: (512) 838-4140 Tie Line: 678-4140 e-mail: [EMAIL PROTECTED] Eygene Ryabinkin <[EMAIL PROTECTED]> 11/04/2008 11:01 AM To Hien Nguyen/Austin/[EMAIL PROTECTED] cc [email protected] Subject Re: [Mauiusers] question on maui 3.2.6p20: can not get job list from WIKI RM Hien, good day. Tue, Nov 04, 2008 at 08:19:52AM -0600, Hien Nguyen wrote: > I run maui and slurm 1.3.6 . I found that in maui log there are errors and > alerts: > 11/03 23:56:40 ERROR: command 'CMD=GETNODES ARG=0:ALL' SC: -300 > response: 'NONE' > 11/03 23:56:40 ALERT: cannot get node list from WIKI RM > 11/03 23:56:40 ALERT: cannot load cluster resources on RM (RM > 'p6ihopenhpc-ib-3' failed in function 'clusterquery') > 11/03 23:56:40 WARNING: no resources detected > > Can someone tell what's wrong with the config of maui and slurm? > > file maui.cfg: > ------------------------------------- > # maui.cfg 3.2.6p20 > > SERVERHOST p6ihopenhpc-ib-3 > # primary admin must be first in list > ADMIN1 root > > # Resource Manager Definition > > RMCFG[p6ihopenhpc-ib-3] TYPE=WIKI > RMPORT 7321 > RMHOST p6ihopenhpc-ib-3 > RMAUTHTYPE[p6ihopenhpc-ib-3] MUNGE ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ I very much doubt that Maui support Munge authentication. You will want to use 'RMCFG[<host>] TYPE=WIKI PORT=7321 HOST=<host> AUTHTYPE=CHECKSUM' along with the Slurm's wiki.conf carrying the appropriate 'AuthKey' directive. The key itself should contain only digits, it shouldn't be bigger than 2^32 and the key should be the same as one was used during Maui compilation (parameter '--with-key' to the configure script). And you will need the patch mentioned in the list message http://www.clusterresources.com/pipermail/mauiusers/2008-October/003564.html or to use maui-3.2.6p21-snap.1224706197 that was already patched by Brian Christiansen. -- Eygene Ryabinkin, Russian Research Centre "Kurchatov Institute"
attpimgc.dat
Description: Binary data
_______________________________________________ mauiusers mailing list [email protected] http://www.supercluster.org/mailman/listinfo/mauiusers
