From: Lefevre Jerome [mailto:[EMAIL PROTECTED]
Sent: Tue 28/03/2006 20:08
To: Bernard Li; [email protected]
Subject: Re: [Oscar-users] RE: 4.2.1b54423 : Post-install : PBS_SERVER and MAUI trouble
Note : I have the same issue as James Wigdahl (Maui Problem,
oscar-users
14-nov-2005)
Please, find my
maui.log
[EMAIL PROTECTED] oscar]# service maui start
Starting MAUI
Scheduler:
[ OK ]
[EMAIL PROTECTED] oscar]# service maui status
maui est
arrêté
[EMAIL PROTECTED] oscar]# tail -n 100
/opt/maui/log/maui.log
03/30 01:19:35 MAMInitialize(NULL)
03/30
01:19:35 MStatInitializeActiveSysUsage()
03/30 01:19:35
MStatClearUsage(NONE,Active)
03/30 01:19:35 ServerUpdate()
03/30 01:19:35
MSysUpdateTime()
03/30 01:19:35 INFO: starting new
day: Thu Mar 30 01:19:35
03/30 01:19:35 MStatOpenFile(1143641975)
03/30
01:19:35 INFO: starting iteration 0
03/30 01:19:35
MRMGetInfo()
03/30 01:19:35 MClusterClearUsage()
03/30 01:19:35
MRMClusterQuery()
03/30 01:19:35 PBSClusterQuery(base,RCount,SC)
03/30
01:19:35 PBSGetNodeState(Name,Status,PNode)
03/30 01:19:35
INFO: PBS node editr.cluster.ird.nc set to state Idle
(free)
03/30 01:19:35
PBSNodeLoad(editr.cluster.ird.nc,editr.cluster.ird.nc,Idle,0)
03/30 01:19:35
MUGetIndex(STATACTIVETIME,ValList,0)
03/30 01:19:35
MUGetIndex(STATTOTALTIME,ValList,0)
03/30 01:19:35
MUGetIndex(STATUPTIME,ValList,0)
03/30 01:19:35
MNodeUpdateResExpression(editr.cluster.ird.nc)
03/30 01:19:35
INFO: cannot determine node/frame of
host
'editr.cluster.ird.nc'
[000] editr.cluster.ird.nc:
(P:2,S:10,M:1,D:1)
[Idle][DEFAULT][linux]<0.000000> C:[NONE][DEFAULT]
[all] [NONE]
03/30 01:19:35 PBSGetNodeState(Name,Status,PNode)
03/30
01:19:35 INFO: PBS node node1.cluster.ird.nc set to
state Idle (free)
03/30 01:19:35
PBSNodeLoad(node1.cluster.ird.nc,node1.cluster.ird.nc,Idle,0)
HERE =>
Service Maui start
03/30 02:06:57 INFO:
starting Maui Scheduler version 3.2.5p2
##################
03/30 02:06:57
INFO: new LOGLEVEL value (3)
03/30 02:06:57
OConfigProcessLine(NODEACCESSPOLICY,,DEDICATED)
03/30 02:06:57
MUGetIndex(DEDICATED,ValList,1)
03/30 02:06:57
OConfigProcessLine(NODEALLOCATIONPOLICY,,MINRESOURCE)
03/30 02:06:57
MUGetIndex(MINRESOURCE,ValList,2)
03/30 02:06:57
OConfigProcessLine(QUEUETIMEWEIGHT,,1 )
03/30 02:06:57
OConfigProcessLine(RESERVATIONPOLICY,,CURRENTHIGHEST)
03/30 02:06:57
MUGetIndex(CURRENTHIGHEST,ValList,0)
03/30 02:06:57
OConfigProcessLine(RMPOLLINTERVAL,,00:00:10)
03/30 02:06:57
MUTimeFromString(00:00:10)
03/30 02:06:57
OConfigProcessLine(SERVERHOST,,editr.cluster.ird.nc)
03/30 02:06:57
INFO: starting scheduler on
'editr.cluster.ird.nc'
03/30 02:06:57
OConfigProcessLine(SERVERMODE,,NORMAL)
03/30 02:06:57
MUGetIndex(NORMAL,ValList,1)
03/30 02:06:57
OConfigProcessLine(SERVERPORT,,42559)
03/30 02:06:57
MUGetIndex(TYPE,ValList,0)
03/30 02:06:57 MUGetIndex(PBS,ValList,0)
03/30
02:06:57 ServerProcessArgs(1,ArgV)
03/30
02:06:57
MUGetOpt(1,ArgV,a:b:B:c:C:dD:f:hH:i:j:l:m:n:N:p:P:r:s:v?-:,OptArg)
03/30
02:06:57 ServerDemonize()
03/30 02:06:57 INFO: child
process in background
03/30 02:06:57 ServerAuthenticate()
03/30 02:06:57
MUFileLock(/opt/maui/,/opt/maui/maui.pid)
03/30 02:06:57
INFO: executing scheduler from '/opt/maui/' under UID
0
GID 0
03/30 02:06:57 starting 3.2.5p2 version Maui Scheduler (PID: 4365)
on Thu
Mar 30 02:06:57
03/30 02:06:57 MSysMemCheck()
03/30 02:06:57
MNode[5120]
0.04
03/30 02:06:57
MJob[4096]
0.03
03/30 02:06:57 MJobTraceBuffer[4096]
0.00
03/30 02:06:57
MUser[1792]
0.01
03/30 02:06:57
MGroup[1792]
2.35
03/30 02:06:57
MAcct[1792]
2.34
03/30 02:06:57
MRes[1024]
0.01
03/30 02:06:57
SRes[128]
2.45
03/30 02:06:57 MStatInitialize(P)
03/30 02:06:57
MStatProfInitialize(P)
03/30 02:06:57 MStatOpenFile(1143644817)
03/30
02:06:57 MSUListen(S)
03/30 02:06:57 INFO: opened
service socket on port 42559
03/30 02:06:57 MSUListen(S)
03/30 02:06:57
INFO: opened service socket on port 42560
03/30
02:06:57 SDRGetSystemConfig()
03/30 02:06:57 MFSInitialize()
03/30
02:06:57 MCPLoad(/opt/maui/maui.ck,ResOnly)
03/30 02:06:57
MRMInitialize()
03/30 02:06:57 PBSInitialize(base,SC)
03/30 02:06:57
INFO: parent is exiting
03/30 02:06:57
MSUListen(S)
03/30 02:06:57 INFO: opened service
socket on port 15004
03/30 02:06:57
__MPBSSystemQuery(base,RCount,SC)
03/30 02:06:57
INFO: connected to PBS server :0 on sd 1
03/30
02:06:57 MAMInitialize(NULL)
03/30 02:06:57
MStatInitializeActiveSysUsage()
03/30 02:06:57
MStatClearUsage(NONE,Active)
03/30 02:06:57 ServerUpdate()
03/30 02:06:57
MSysUpdateTime()
03/30 02:06:57 INFO: starting new
day: Thu Mar 30 02:06:57
03/30 02:06:57 MStatOpenFile(1143644817)
03/30
02:06:57 INFO: starting iteration 0
03/30 02:06:57
MRMGetInfo()
03/30 02:06:57 MClusterClearUsage()
03/30 02:06:57
MRMClusterQuery()
03/30 02:06:57 PBSClusterQuery(base,RCount,SC)
03/30
02:06:57 PBSGetNodeState(Name,Status,PNode)
03/30 02:06:57
INFO: PBS node editr.cluster.ird.nc set to state Idle
(free)
03/30 02:06:57
PBSNodeLoad(editr.cluster.ird.nc,editr.cluster.ird.nc,Idle,0)
03/30 02:06:57
MUGetIndex(STATACTIVETIME,ValList,0)
03/30 02:06:57
MUGetIndex(STATTOTALTIME,ValList,0)
03/30 02:06:57
MUGetIndex(STATUPTIME,ValList,0)
03/30 02:06:57
MNodeUpdateResExpression(editr.cluster.ird.nc)
03/30 02:06:57
INFO: cannot determine node/frame of
host
'editr.cluster.ird.nc'
[000] editr.cluster.ird.nc:
(P:2,S:10,M:1,D:1)
[Idle][DEFAULT][linux]<0.000000> C:[NONE][DEFAULT]
[all] [NONE]
03/30 02:06:57 PBSGetNodeState(Name,Status,PNode)
03/30
02:06:57 INFO: PBS node node1.cluster.ird.nc set to
state Idle (free)
03/30 02:06:57
PBSNodeLoad(node1.cluster.ird.nc,node1.cluster.ird.nc,Idle,0)
A
19:56 28/03/2006 -0800, Bernard Li a écrit :
>Was MAUI running when that
happened (/etc/init.d/maui status) - if it
>wasn't running, can you
restart it and see if that
helps?
>
>Cheers,
>
>Bernard
>
>
>----------
>From:
Lefevre Jerome [mailto:[EMAIL PROTECTED]]
>Sent:
Tue 28/03/2006 19:37
>To: Bernard Li; Lefevre Jerome;
[email protected]
>Subject: 4.2.1b54423 : Post-install :
PBS_SERVER and MAUI trouble
>
>Cluster 5 Dual-Opteron tyan S2885
3Ware Sata raid
>Switch 3Com gigabit
>Fedora Core 3 x86_64 fresh
install
>Oscar 4.2.1b54423
>
>
>Hi,
>
>Some
trouble with Maui and PBS.
>
>Just after Oscar Installation Maui is
running and PBS is up. Ganglia is
>fine too.
>
>But after a
reboot, Maui don't run and pbs_server.log print "Connection
>Refused
(111)".
>I have the same issue as before with Oscar 4.0 and Fedora Core 2
...
>
>Any Idea
?
>
>Cheers,
>
>Jerome
