Hi,
I submitted a job on the cluster and it has been deferred. Using
tracejob I get:
03/14/2006 05:06:17 S unable to run job, MOM rejected/rc=1
Using checkjob $PBS_ID I get:
StartDate: -00:06:36 Tue Mar 14 05:06:18
Total Tasks: 1
Req[0] TaskCount: 1 Partition: ALL
Network: [NONE] Memory >= 0 Disk >= 0 Swap >= 0
Opsys: [NONE] Arch: [NONE] Features: [NONE]
IWD: [NONE] Executable: [NONE]
Bypass: 0 StartCount: 2
PartitionMask: [ALL]
Flags: RESTARTABLE
job is deferred. Reason: RMFailure (cannot start job - RM failure,
rc: 15041, msg: 'Execution server rejected request MSG=send failed,
STARTING')
Holds: Defer (hold reason: RMFailure)
PE: 1.00 StartPriority: 1
cannot select job 99950 for partition DEFAULT (job hold active)
Please advise.
Gaurav
_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers