On Mar 10, 2010, at 12:59 AM, Vladimir Mencl wrote:
> Hi,
> 
> I'm installing GRAM5 (GT 5.0.0) on a CentOS 5 x86_64 system with Torque.
> 
> I got gatekeeper going and I can submit simple jobs all fine.
> 
> I've tried to switch to using the Scheduler event generator, but got
> stuck on that:
> 
> I was trying to follow the instructions on
> http://www.globus.org/toolkit/docs/5.0/5.0.0/execution/gram5/admin/#id2545820
> 
> * I've run
>  /opt/globus/setup/globus/setup-seg-pbs.pl --path
> /usr/spool/PBS/server_logs
> * I've edited  $GLOBUS_LOCATION/etc/grid-services/jobmanager-pbs and
> added "-seg-module pbs" to the list of arguments:
> 
> * I've tried running
> $GLOBUS_LOCATION/sbin/globus-job-manager-event-generator -scheduler pbs
> -background -pidfile /opt/globus/var/job-manager-seg-pbs.pid
> but it failed with
>   Error: pbs not configured
> 
> * After looking into the event-generator script, I've added the
> following line to $GLOBUS_LOCATION/etc/globus-job-manager-seg.conf
> 
> pbs_log_path=/usr/spool/PBS/server_logs
> 
> Now, I could run the event-generator without any error.

Do events for jobs started outside of globus show up in the SEG log file when 
you run the SEG?

> However, when I then try running the job with globusrun:
> * the job state does not progress
> * the globus-job-manager process (running under the local mapped user
> account) starts running wild, consuming more and more memory until
> taking the machine down (or being killed)
> 
> In PS output, the process shows as:
>> globus-job-manager -conf /opt/globus/etc/globus-job-manager.conf -type pbs 
>> -seg-module pbs
> 
> There's not much output in /opt/globus/var/globus-gatekeeper.log even
> though I'm running gatekeeper with -debug.

If there any information in the Job Manager Log? That file is going to be in 
$HOME/gram_YYYMMDD.log by default.

The only time I've seen the huge memory use is when there are many job state 
files in $GLOBUS_LOCATION/tmp/gram_job_state for the user due to a bug in how 
the job manager restart code works. Is that the case for you?

Joe

Reply via email to