Title: Re: [Oscar-users] RE: 4.2.1b54423 : Post-install : PBS_SERVER and MAUI trouble
I don't see any glaring problems in your log file. Is TORQUE still reporting "Connection refused"?
 
Cheers,
 
Bernard


From: Lefevre Jerome [mailto:[EMAIL PROTECTED]]
Sent: Tue 28/03/2006 20:08
To: Bernard Li; [email protected]
Subject: Re: [Oscar-users] RE: 4.2.1b54423 : Post-install : PBS_SERVER and MAUI trouble


Note: I have the same issue as James Wigdahl ("Maui Problem", oscar-users,
14 Nov 2005).

Please find my maui.log below.


[EMAIL PROTECTED] oscar]# service maui start
Starting MAUI Scheduler:                                   [  OK  ]
[EMAIL PROTECTED] oscar]# service maui status
maui is stopped
[EMAIL PROTECTED] oscar]# tail -n 100 /opt/maui/log/maui.log


03/30 01:19:35 MAMInitialize(NULL)
03/30 01:19:35 MStatInitializeActiveSysUsage()
03/30 01:19:35 MStatClearUsage(NONE,Active)
03/30 01:19:35 ServerUpdate()
03/30 01:19:35 MSysUpdateTime()
03/30 01:19:35 INFO:     starting new day: Thu Mar 30 01:19:35
03/30 01:19:35 MStatOpenFile(1143641975)
03/30 01:19:35 INFO:     starting iteration 0
03/30 01:19:35 MRMGetInfo()
03/30 01:19:35 MClusterClearUsage()
03/30 01:19:35 MRMClusterQuery()
03/30 01:19:35 PBSClusterQuery(base,RCount,SC)
03/30 01:19:35 PBSGetNodeState(Name,Status,PNode)
03/30 01:19:35 INFO:     PBS node editr.cluster.ird.nc set to state Idle (free)
03/30 01:19:35 PBSNodeLoad(editr.cluster.ird.nc,editr.cluster.ird.nc,Idle,0)
03/30 01:19:35 MUGetIndex(STATACTIVETIME,ValList,0)
03/30 01:19:35 MUGetIndex(STATTOTALTIME,ValList,0)
03/30 01:19:35 MUGetIndex(STATUPTIME,ValList,0)
03/30 01:19:35 MNodeUpdateResExpression(editr.cluster.ird.nc)
03/30 01:19:35 INFO:     cannot determine node/frame of host 'editr.cluster.ird.nc'
[000] editr.cluster.ird.nc: (P:2,S:10,M:1,D:1) [Idle][DEFAULT][linux]<0.000000> C:[NONE][DEFAULT] [all] [NONE]
03/30 01:19:35 PBSGetNodeState(Name,Status,PNode)
03/30 01:19:35 INFO:     PBS node node1.cluster.ird.nc set to state Idle (free)
03/30 01:19:35 PBSNodeLoad(node1.cluster.ird.nc,node1.cluster.ird.nc,Idle,0)

HERE => I ran "service maui start"; the log continues below:


03/30 02:06:57 INFO:     starting Maui Scheduler version 3.2.5p2 ##################
03/30 02:06:57 INFO:     new LOGLEVEL value (3)
03/30 02:06:57 OConfigProcessLine(NODEACCESSPOLICY,,DEDICATED)
03/30 02:06:57 MUGetIndex(DEDICATED,ValList,1)
03/30 02:06:57 OConfigProcessLine(NODEALLOCATIONPOLICY,,MINRESOURCE)
03/30 02:06:57 MUGetIndex(MINRESOURCE,ValList,2)
03/30 02:06:57 OConfigProcessLine(QUEUETIMEWEIGHT,,1 )
03/30 02:06:57 OConfigProcessLine(RESERVATIONPOLICY,,CURRENTHIGHEST)
03/30 02:06:57 MUGetIndex(CURRENTHIGHEST,ValList,0)
03/30 02:06:57 OConfigProcessLine(RMPOLLINTERVAL,,00:00:10)
03/30 02:06:57 MUTimeFromString(00:00:10)
03/30 02:06:57 OConfigProcessLine(SERVERHOST,,editr.cluster.ird.nc)
03/30 02:06:57 INFO:     starting scheduler on 'editr.cluster.ird.nc'
03/30 02:06:57 OConfigProcessLine(SERVERMODE,,NORMAL)
03/30 02:06:57 MUGetIndex(NORMAL,ValList,1)
03/30 02:06:57 OConfigProcessLine(SERVERPORT,,42559)
03/30 02:06:57 MUGetIndex(TYPE,ValList,0)
03/30 02:06:57 MUGetIndex(PBS,ValList,0)
03/30 02:06:57 ServerProcessArgs(1,ArgV)
03/30 02:06:57 MUGetOpt(1,ArgV,a:b:B:c:C:dD:f:hH:i:j:l:m:n:N:p:P:r:s:v?-:,OptArg)
03/30 02:06:57 ServerDemonize()
03/30 02:06:57 INFO:     child process in background
03/30 02:06:57 ServerAuthenticate()
03/30 02:06:57 MUFileLock(/opt/maui/,/opt/maui/maui.pid)
03/30 02:06:57 INFO:     executing scheduler from '/opt/maui/' under UID 0 GID 0
03/30 02:06:57 starting 3.2.5p2 version Maui Scheduler (PID: 4365) on Thu Mar 30 02:06:57
03/30 02:06:57 MSysMemCheck()
03/30 02:06:57 MNode[5120]               0.04
03/30 02:06:57 MJob[4096]                0.03
03/30 02:06:57 MJobTraceBuffer[4096]      0.00
03/30 02:06:57 MUser[1792]               0.01
03/30 02:06:57 MGroup[1792]              2.35
03/30 02:06:57 MAcct[1792]               2.34
03/30 02:06:57 MRes[1024]                0.01
03/30 02:06:57 SRes[128]                2.45
03/30 02:06:57 MStatInitialize(P)
03/30 02:06:57 MStatProfInitialize(P)
03/30 02:06:57 MStatOpenFile(1143644817)
03/30 02:06:57 MSUListen(S)
03/30 02:06:57 INFO:     opened service socket on port 42559
03/30 02:06:57 MSUListen(S)
03/30 02:06:57 INFO:     opened service socket on port 42560
03/30 02:06:57 SDRGetSystemConfig()
03/30 02:06:57 MFSInitialize()
03/30 02:06:57 MCPLoad(/opt/maui/maui.ck,ResOnly)
03/30 02:06:57 MRMInitialize()
03/30 02:06:57 PBSInitialize(base,SC)
03/30 02:06:57 INFO:     parent is exiting
03/30 02:06:57 MSUListen(S)
03/30 02:06:57 INFO:     opened service socket on port 15004
03/30 02:06:57 __MPBSSystemQuery(base,RCount,SC)
03/30 02:06:57 INFO:     connected to PBS server :0 on sd 1
03/30 02:06:57 MAMInitialize(NULL)
03/30 02:06:57 MStatInitializeActiveSysUsage()
03/30 02:06:57 MStatClearUsage(NONE,Active)
03/30 02:06:57 ServerUpdate()
03/30 02:06:57 MSysUpdateTime()
03/30 02:06:57 INFO:     starting new day: Thu Mar 30 02:06:57
03/30 02:06:57 MStatOpenFile(1143644817)
03/30 02:06:57 INFO:     starting iteration 0
03/30 02:06:57 MRMGetInfo()
03/30 02:06:57 MClusterClearUsage()
03/30 02:06:57 MRMClusterQuery()
03/30 02:06:57 PBSClusterQuery(base,RCount,SC)
03/30 02:06:57 PBSGetNodeState(Name,Status,PNode)
03/30 02:06:57 INFO:     PBS node editr.cluster.ird.nc set to state Idle (free)
03/30 02:06:57 PBSNodeLoad(editr.cluster.ird.nc,editr.cluster.ird.nc,Idle,0)
03/30 02:06:57 MUGetIndex(STATACTIVETIME,ValList,0)
03/30 02:06:57 MUGetIndex(STATTOTALTIME,ValList,0)
03/30 02:06:57 MUGetIndex(STATUPTIME,ValList,0)
03/30 02:06:57 MNodeUpdateResExpression(editr.cluster.ird.nc)
03/30 02:06:57 INFO:     cannot determine node/frame of host 'editr.cluster.ird.nc'
[000] editr.cluster.ird.nc: (P:2,S:10,M:1,D:1) [Idle][DEFAULT][linux]<0.000000> C:[NONE][DEFAULT] [all] [NONE]
03/30 02:06:57 PBSGetNodeState(Name,Status,PNode)
03/30 02:06:57 INFO:     PBS node node1.cluster.ird.nc set to state Idle (free)
03/30 02:06:57 PBSNodeLoad(node1.cluster.ird.nc,node1.cluster.ird.nc,Idle,0)
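In the excerpt above, both the run that ended at 01:19:35 and the run started at 02:06:57 appear to stop at the same call, PBSNodeLoad for node1.cluster.ird.nc, which may be a useful hint about where the scheduler exits. A minimal sketch, assuming Python 3 is available on the head node, for splitting maui.log into runs and printing the last line of each. The function name summarize_runs and both regular expressions are illustrative and based only on the lines shown in this excerpt, not on a documented Maui log format.

```python
import re

# Patterns taken from lines in the excerpt above (assumed, not a documented format).
START_RE = re.compile(r"starting Maui Scheduler version (\S+)")
PORT_RE = re.compile(r"opened service socket on port (\d+)")


def summarize_runs(lines):
    """Split maui.log lines into runs and summarize each one.

    A new run begins at every "starting Maui Scheduler version" line.
    Lines before the first such marker are treated as the tail of an
    earlier run whose start is not part of the input.
    """
    runs = []
    current = None
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            continue  # skip blank lines
        start = START_RE.search(line)
        if start or current is None:
            current = {
                "version": start.group(1) if start else None,
                "ports": [],
                "last_line": line,
            }
            runs.append(current)
        port = PORT_RE.search(line)
        if port:
            current["ports"].append(int(port.group(1)))
        current["last_line"] = line
    return runs


if __name__ == "__main__":
    # Log path as used in the "tail" command above.
    with open("/opt/maui/log/maui.log", errors="replace") as fh:
        for i, run in enumerate(summarize_runs(fh), 1):
            print(f"run {i}: version={run['version']} ports={run['ports']}")
            print(f"  last line: {run['last_line']}")
```

If every run ends on the same call for the same node, that node's entry in the TORQUE node list is probably the first thing worth checking.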




At 19:56 28/03/2006 -0800, Bernard Li wrote:
>Was MAUI running when that happened (/etc/init.d/maui status) - if it
>wasn't running, can you restart it and see if that helps?
>
>Cheers,
>
>Bernard
>
>
>----------
>From: Lefevre Jerome [mailto:[EMAIL PROTECTED]]
>Sent: Tue 28/03/2006 19:37
>To: Bernard Li; Lefevre Jerome; [email protected]
>Subject: 4.2.1b54423 : Post-install : PBS_SERVER and MAUI trouble
>
>Cluster: 5 x dual-Opteron, Tyan S2885, 3ware SATA RAID
>Switch: 3Com gigabit
>Fedora Core 3 x86_64 fresh install
>Oscar 4.2.1b54423
>
>
>Hi,
>
>Some trouble with Maui and PBS.
>
>Just after the Oscar installation, Maui is running and PBS is up. Ganglia is
>fine too.
>
>But after a reboot, Maui doesn't run and pbs_server.log prints "Connection
>refused (111)".
>This is the same issue I had before with Oscar 4.0 and Fedora Core 2 ...
>
>Any ideas?
>
>Cheers,
>
>Jerome
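
"Connection refused (111)" matches errno 111 on Linux, ECONNREFUSED, which a TCP client gets when nothing is listening on the target port. A minimal sketch, assuming Python 3 on the head node, for checking that directly. The function name probe is illustrative, and the ports are the ones maui.log reports opening (42559, 42560, 15004). Which of them pbs_server actually tries to reach is an assumption to verify, not something this thread confirms.

```python
import errno
import socket


def probe(host, port, timeout=2.0):
    """Try a TCP connection; return "open", "refused", or another error name."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # errno.ECONNREFUSED (111 on Linux): nothing is listening on that port.
        return "refused"
    except OSError as exc:
        # Timeouts, unreachable hosts, name resolution failures, and so on.
        return errno.errorcode.get(exc.errno, str(exc))


if __name__ == "__main__":
    # Ports reported in maui.log; the host is assumed to be the head node itself.
    for port in (42559, 42560, 15004):
        print(f"localhost:{port} -> {probe('localhost', port)}")
```

Running this once right after "service maui start" and again a few seconds later would show whether the scheduler comes up and then goes away, or never listens at all.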

