Hi, Berit
 I had the same issue.As you can see when maui updates previous jobs status
using MStatUpdateActiveJobUsage() function segmentation fault
occurs.Toresolve this try restartting the pbs server with option
'pbs_server -type
cold' to remove previous jobs.Then start maui.Remember to start maui after
you have started the pbs server.If the previous jobs does'nt get deleted
with above option, try using pbs_sched and remove all the jobs, then restart
the server and then start maui.

Hope this will
Regards--
Rishi Pathak

On 12/13/06, Berit Hinnemann <[EMAIL PROTECTED]> wrote:

Hi all,

I am new to installing Torque PBS and Maui. My system is a one
dual-processor dual-core server for testing purposes, where I try things out
before getting the actual cluster. I have installed both Torque PBS and this
seems to work fine. Then I installed Maui and used the file maui.cfg as
below, aside from telling that the queue system is PBS I did not change
anything.

Now the behavior is that I can start the 'maui' demon, issue 'showq' and
see the queue, but when I submit a job, the maui demon seems to stop by
itself. Then, when I issue "showq" I get

[EMAIL PROTECTED] 1proc]$ showq
ERROR:    cannot send request to server localhost.localdomain:42559
(server may not be running)
ERROR:    cannot request service (status)

I have appended the lines generated in maui.log below.
The job runs fine and I can also submit several jobs, which are just done
in the order submitted. I can also restart maui and repeat this procedure.

Does anybody have an idea where I should be looking to figure out what is
wrong? I would be grateful on any hints on how to get started.
Best, Berit

--------------------------------------
Berit Hinnemann
Research Scientist
Haldor Topsøe A/S
---------------------------------------

-------------------------------------------------------------------------------------------------------------------------------------
output from maui.log upon submitting a job
12/13 16:23:35 INFO:     scheduling complete.  sleeping 30 seconds
12/13 16:24:06 ServerProcessRequests()
12/13 16:24:06 INFO:     not rolling logs (585245 < 10000000)
12/13 16:24:06 MResAdjust(NULL,0,0)
12/13 16:24:06 MStatInitializeActiveSysUsage()
12/13 16:24:06 MStatClearUsage([NONE],Active)
12/13 16:24:06 ServerUpdate()
12/13 16:24:06 MSysUpdateTime()
12/13 16:24:06 INFO:     starting iteration 7
12/13 16:24:06 MRMGetInfo()
12/13 16:24:06 MClusterClearUsage()
12/13 16:24:06 MRMClusterQuery()
12/13 16:24:06 MPBSClusterQuery(localhost.localdomain,RCount,SC)
12/13 16:24:06 __MPBSGetNodeState(Name,State,PNode)
12/13 16:24:06 INFO:     PBS node localhost.localdomain set to state Busy
(job-exclusive)
12/13 16:24:06 INFO:     node 'localhost.localdomain' changed states from
Idle to Busy
12/13 16:24:06 ALERT:    unexpected node transition on node '
localhost.localdomain'  Idle -> Busy
12/13 16:24:06 MPBSNodeUpdate(localhost.localdomain,localhost.localdomain
,Busy,localhost.localdomain)
12/13 16:24:06 INFO:     node localhost.localdomain has joblist
'0/10.localhost.localdomain, 1/10.localhost.localdomain,
2/10.localhost.localdomain, 3/10.localhost.localdomain'
12/13 16:24:06 ALERT:    cannot locate PBS job '10.localhost.localdomain'
(running on node localhost.localdomain)
12/13 16:24:06 ALERT:    cannot locate PBS job '10.localhost.localdomain'
(running on node localhost.localdomain)
12/13 16:24:06 ALERT:    cannot locate PBS job '10.localhost.localdomain'
(running on node localhost.localdomain)
12/13 16:24:06 ALERT:    cannot locate PBS job '10.localhost.localdomain'
(running on node localhost.localdomain)
12/13 16:24:06 MPBSLoadQueueInfo(localhost.localdomain,
localhost.localdomain,SC)
12/13 16:24:06 INFO:     queue 'batch' started state set to True
12/13 16:24:06 INFO:     class to node not mapping enabled for queue
'batch' adding class to all nodes
12/13 16:24:06 INFO:     1 PBS resources detected on RM
localhost.localdomain
12/13 16:24:06 INFO:     resources detected: 1
12/13 16:24:06 MRMWorkloadQuery()
12/13 16:24:06 MPBSWorkloadQuery(localhost.localdomain,JCount,SC)
12/13 16:24:06 MPBSJobLoad(10,10.localhost.localdomain,J,TaskList,0)
12/13 16:24:06 MReqCreate(10,SrcRQ,DstRQ,DoCreate)
12/13 16:24:06 INFO:     processing node request line '1:ppn=4'
12/13 16:24:06 MJobSetCreds(10,behi,behi,)
12/13 16:24:06 INFO:     default QOS for job 10 set to DEFAULT(0)
(P:DEFAULT,U:[NONE],G:[NONE],A:[NONE],C:[NONE])
12/13 16:24:06 INFO:     default QOS for job 10 set to DEFAULT(0)
(P:DEFAULT,U:[NONE],G:[NONE],A:[NONE],C:[NONE])
12/13 16:24:06 INFO:     default QOS for job 10 set to DEFAULT(0)
(P:DEFAULT,U:[NONE],G:[NONE],A:[NONE],C:[NONE])
12/13 16:24:06 MResJCreate(10,MNodeList,-00:00:10,ActiveJob,Res)
12/13 16:24:06 MStatUpdateActiveJobUsage(10)

---------------------------------------------------------------------------------------------------------------------------------------
maui.cfg
# maui.cfg 3.2.6p18

SERVERHOST            localhost.localdomain
# primary admin must be first in list
ADMIN1                root

# Resource Manager Definition

RMCFG[localhost.localdomain] TYPE=PBS

# Allocation Manager Definition

AMCFG[bank]  TYPE=NONE

# full parameter docs at
http://supercluster.org/mauidocs/a.fparameters.html
# use the 'schedctl -l' command to display current configuration

RMPOLLINTERVAL        00:00:30

SERVERPORT            42559
SERVERMODE            NORMAL

# Admin: http://supercluster.org/mauidocs/a.esecurity.html


LOGFILE               maui.log
LOGFILEMAXSIZE        10000000
LOGLEVEL              3

# Job Priority: http://supercluster.org/mauidocs/5.1jobprioritization.html

QUEUETIMEWEIGHT       1

# FairShare: http://supercluster.org/mauidocs/6.3fairshare.html

#FSPOLICY              PSDEDICATED
#FSDEPTH               7
#FSINTERVAL            86400
#FSDECAY               0.80

# Throttling Policies:
http://supercluster.org/mauidocs/6.2throttlingpolicies.html

# NONE SPECIFIED

# Backfill: http://supercluster.org/mauidocs/8.2backfill.html

BACKFILLPOLICY        FIRSTFIT
RESERVATIONPOLICY     CURRENTHIGHEST

# Node Allocation: http://supercluster.org/mauidocs/5.2nodeallocation.html

NODEALLOCATIONPOLICY  MINRESOURCE

# QOS: http://supercluster.org/mauidocs/7.3qos.html

# QOSCFG[hi]  PRIORITY=100 XFTARGET=100 FLAGS=PREEMPTOR:IGNMAXJOB
# QOSCFG[low] PRIORITY=-1000 FLAGS=PREEMPTEE

# Standing Reservations:
http://supercluster.org/mauidocs/7.1.3standingreservations.html

# SRSTARTTIME[test] 8:00:00
# SRENDTIME[test]   17:00:00
# SRDAYS[test]      MON TUE WED THU FRI
# SRTASKCOUNT[test] 20
# SRMAXTIME[test]   0:30:00

# Creds: http://supercluster.org/mauidocs/6.1fairnessoverview.html

# USERCFG[DEFAULT]      FSTARGET=25.0
# USERCFG[john]         PRIORITY=100  FSTARGET=10.0-
# GROUPCFG[staff]       PRIORITY=1000 QLIST=hi:low QDEF=hi
# CLASSCFG[batch]       FLAGS=PREEMPTEE
# CLASSCFG[interactive] FLAGS=PREEMPTOR


_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers



_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers

Reply via email to