Eygene,

I tried maui p21 and configured maui.cfg with AUTHTYPE CHECKSUM. However, 
I got error of checksum does not match.
Can you tell what to fix the checksum key OR what necesary step I should 
do ?

> SERVERHOST            cluster-ib-1
> # primary admin must be first in list
> ADMIN1                root
> 
> # Resource Manager Definition
> 
> RMCFG[cluster-ib-1] TYPE=WIKI
> RMPORT          7321
> RMHOST          cluster-ib-1
> RMAUTHTYPE[cluster-ib-1] CHECKSUM

ALERT:    checksum does not match (7771069aaa6d32a4:37a0c962c3f39618) 
request 'TS=1225919856 AUTH=slurm DT=SC=0 
ARG=4#cluster-ib-1:STATE=Idle;ARCH=ppc64;OS=Linux;CMEMORY=1;CDISK=0;CPROC=1;#cluster-ib'
11/05 15:17:36 ERROR:    cannot receive data from server cluster-ib-1:7321

==========
More info from maui.log:

11/05 15:17:36 MRMClusterQuery()
11/05 15:17:36 MWikiClusterLoadInfo(cluster-ib-1,RCount,EMsg,SC)
11/05 15:17:36 
MWikiDoCommand(cluster-ib-1,7321,9000000,CHECKSUM,CMD=GETNODES 
ARG=0:ALL,Data,DataSize,SC)
11/05 15:17:36 MSUSendData(S,9000000,TRUE,FALSE)
11/05 15:17:36 INFO:     packet sent (78 bytes of 78)
11/05 15:17:36 INFO:     command sent to server
11/05 15:17:36 INFO:     message sent: 'CMD=GETNODES ARG=0:ALL'
11/05 15:17:36 MSURecvData(S,9000000,TRUE,SC,EMsg)
11/05 15:17:36 MSURecvPacket(7,BufP,9,NULL,9000000,SC)
11/05 15:17:36 MSURecvPacket(7,BufP,394,NULL,9000000,SC)
11/05 15:17:36 ALERT:    checksum does not match 
(7771069aaa6d32a4:37a0c962c3f39618)  request 'TS=1225919856 AUTH=slurm 
DT=SC=0 
ARG=4#cluster-ib-1:STATE=Idle;ARCH=ppc64;OS=Linux;CMEMORY=1;CDISK=0;CPROC=1;#cluster-ib'
11/05 15:17:36 ERROR:    cannot receive data from server cluster-ib-1:7321
11/05 15:17:36 MSUDisconnect(S)
11/05 15:17:36 ALERT:    cannot get node list from WIKI RM
11/05 15:17:36 ALERT:    cannot load cluster resources on RM (RM 
'cluster-ib-1'
failed in function 'clusterquery')
11/05 15:17:36 WARNING:  no resources detected
11/05 15:17:36 MRMWorkloadQuery()
11/05 15:17:36 MWikiWorkloadQuery(cluster-ib-1,JCount,SC)
11/05 15:17:36 
MWikiDoCommand(cluster-ib-1,7321,9000000,CHECKSUM,CMD=GETJOBS 
ARG=0:ALL,Data,DataSize,SC)
11/05 15:17:36 MSUSendData(S,9000000,TRUE,FALSE)
11/05 15:17:36 INFO:     packet sent (77 bytes of 77)
11/05 15:17:36 INFO:     command sent to server
11/05 15:17:36 INFO:     message sent: 'CMD=GETJOBS ARG=0:ALL'
11/05 15:17:36 MSURecvData(S,9000000,TRUE,SC,EMsg)
11/05 15:17:36 MSURecvPacket(7,BufP,9,NULL,9000000,SC)
11/05 15:17:36 MSURecvPacket(7,BufP,301,NULL,9000000,SC)
11/05 15:17:36 ALERT:    checksum does not match 
(47611b22590d32a0:bbe6843806e4b6ae)  request 'TS=1225919856 AUTH=slurm 
DT=SC=0 
ARG=1#52888:STATE=Removed;UPDATETIME=1225919502;WCLIMIT=31536000;TASKS=0;DPROCS=1;QUEUE'
11/05 15:17:36 ERROR:    cannot receive data from server cluster-ib-1:7321
11/05 15:17:36 MSUDisconnect(S)
11/05 15:17:36 ALERT:    cannot get job list from WIKI RM
11/05 15:17:36 ALERT:    cannot load cluster workload on RM (RM 
'cluster-ib-1' failed in function 'workloadquery')
11/05 15:17:36 WARNING:  no workload detected
11/05 15:17:36 MStatClearUsage(node,Active)
11/05 15:17:36 MClusterUpdateNodeState()
11/05 15:17:36 MQueueSelectAllJobs(Q,HARD,ALL,JIList,DP,Msg)



Regards,

 Hien Nguyen
Linux Technology Center (Austin)
 Phone: (512) 838-4140            Tie Line: 678-4140
 e-mail: [EMAIL PROTECTED]




Eygene Ryabinkin <[EMAIL PROTECTED]> 
11/04/2008 11:01 AM

To
Hien Nguyen/Austin/[EMAIL PROTECTED]
cc
[email protected]
Subject
Re: [Mauiusers] question on maui 3.2.6p20: can not get job list from WIKI 
RM






Hien, good day.

Tue, Nov 04, 2008 at 08:19:52AM -0600, Hien Nguyen wrote:
> I run maui and slurm 1.3.6 . I found that in maui log there are errors 
and 
> alerts:
> 11/03 23:56:40 ERROR:    command 'CMD=GETNODES ARG=0:ALL'  SC: -300 
> response: 'NONE'
> 11/03 23:56:40 ALERT:    cannot get node list from WIKI RM
> 11/03 23:56:40 ALERT:    cannot load cluster resources on RM (RM 
> 'p6ihopenhpc-ib-3' failed in function 'clusterquery')
> 11/03 23:56:40 WARNING:  no resources detected
> 
> Can someone tell what's wrong with the config of maui and slurm?
> 
> file maui.cfg:
> -------------------------------------
> # maui.cfg 3.2.6p20
> 
> SERVERHOST            p6ihopenhpc-ib-3
> # primary admin must be first in list
> ADMIN1                root
> 
> # Resource Manager Definition
> 
> RMCFG[p6ihopenhpc-ib-3] TYPE=WIKI
> RMPORT          7321
> RMHOST          p6ihopenhpc-ib-3
> RMAUTHTYPE[p6ihopenhpc-ib-3] MUNGE
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

I very much doubt that Maui support Munge authentication.  You will want
to use 'RMCFG[<host>] TYPE=WIKI PORT=7321 HOST=<host> AUTHTYPE=CHECKSUM'
along with the Slurm's wiki.conf carrying the appropriate 'AuthKey'
directive.  The key itself should contain only digits, it shouldn't
be bigger than 2^32 and the key should be the same as one was used
during Maui compilation (parameter '--with-key' to the configure script).

And you will need the patch mentioned in the list message
  
http://www.clusterresources.com/pipermail/mauiusers/2008-October/003564.html

or to use maui-3.2.6p21-snap.1224706197 that was already patched by
Brian Christiansen.
-- 
Eygene Ryabinkin, Russian Research Centre "Kurchatov Institute"

Attachment: attpimgc.dat
Description: Binary data

_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers

Reply via email to