Hello Martin,

I made it one step further in the meantime.
globusrun-ws -submit -F
https://ubuntu:8443/wsrf/services/ManagedJobFactoryService -Ft PBS -c
/bin/sleep 120
Submitting job...Done.
Job ID: uuid:caab5920-a41a-11dc-b5d9-000c29583dae
Termination time: 12/07/2007 16:46 GMT
Current job state: Failed
Destroying job...Done.
globusrun-ws: Job failed: Error code: 201
Script stderr:
[sudo] password for globus:

The globus container is running under user globus. Now I have an
authentication failure. It looks like that sudo is asking for a password in
a batch script and directly fails, it doesn't even wait for me to type it
in. I have edited sudoers, I even allowed the user globus ALL=NOPASSWD: ALL,
but it didn't help.
The container has this in the log:
2007-12-06 16:46:35,515 INFO
exec.StateMachine[RunQueueThread_3,logJobAccepted:3513] Job
cdf3e390-a41a-11dc-ae70-b55e8731fc0e accepted for local user 'root'
2007-12-06 16:46:37,852 WARN
exec.StateMachine[RunQueueThread_4,createFaultFromErrorCode:3270]
Unhandled fault code 201
2007-12-06 16:46:41,325 INFO
exec.StateMachine[RunQueueThread_4,logJobFailed:3543] Job
cdf3e390-a41a-11dc-ae70-b55e8731fc0e failed
2007-12-06 16:47:31,284 ERROR
monitoring.SchedulerEventGenerator[Thread-9,run:198] SEG Terminated
with Fault: globus_xio: Operation was
canceled
Somehow it looks like it is trying to execute the job as root (huh?) even
though it is running under my globus user.

Petr

On 12/6/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
>
> Petr,
>
> We don't officially support SGE and i don't have
> much experience with SGE integration in GT, but i
> know that there are some people on the list who
> know about it.
>
> About Torque:
> After you installed Torque: did you update your
> GT installation to support PBS (GT does not make
> a difference between PBS and Torque) ?
>
> What's the output of
>    ps -ef | grep globus-scheduler-event-generator
> after you started the GT container
> on the server-side?
>
> Martin
>
> > Hello,
> >
> > I am trying to set up GT4.0.5 with Sun Grid Engine. Installation was
> > successful without any errors or warnings.
> > When I try to submit a job, nothing happens at first and roughly after
> two
> > minutes and error comes:
> > 2007-12-06 10:38:18,604 ERROR
> > monitoring.SchedulerEventGenerator[Thread-9,run:198] SEG Terminated
> > with Fault: globus_xio: Operation was
> > canceled
> > 2007-12-06 10:38:18,961 ERROR
> > monitoring.SchedulerEventGenerator[Thread-13,run:198] SEG Terminated
> > with Fault: globus_xio: Operation was
> > canceled
> > I was searching on google, and this thread came up:
> > http://www.globus.org/mail_archive/gt-user/2006/09/msg00120.html
> > However, it didn't help me at all, as it says this error was fixed in
> > 4.0.3and I have
> > 4.0.5.
> > So I tried to use another scheduling system. I Installed Torque and I am
> > getting the same error.
> > Is there any way to debug what is going wrong?
> >
> > Petr
> >
>
>
>

Reply via email to