Mike,
Can you send us the server-side container logfile?
(typically $GLOBUS_LOCATION/var/container.log)
Thanks, Martin
mike packard wrote:
Good Morning,
I am using Globus 4.0.8 (flavor gcc32) on SUSE Enterprise 9, 32-bit, kernel
2.6.5-7.308-bigsmp.
I installed Globus using the quickstart guide here:
http://www.globus.org/toolkit/docs/4.0/admin/docbook/quickstart.html
But when I try to run globusrun-ws, it hangs at the CleanUp step. I hit ctrl-c
and it hangs again at the "Destroying job..." step. I hit ctrl-c again and it
quits. The same thing happens when I use the SGE jobmanager.
I know that the credentials are OK because I can transfer files via both
globus-url-copy and RFT. I can also submit jobs to another host (which already
had Globus set up) just fine. Some examples are below.
Any help would be greatly appreciated.
Kind Regards,
Mike Packard
UZH
---
Failed job:
globusrun-ws -submit -F https://host01.unizh.ch -Ft Fork -streaming -c /bin/hostname
Delegating user credentials...Done.
Submitting job...Done.
Job ID: uuid:c23483cc-7fd2-11dd-96d7-00142272b1a0
Termination time: 09/12/2008 07:25 GMT
Current job state: Active
Current job state: CleanUp-Hold
host01.unizh.ch
Current job state: CleanUp
"ctrl-c"
Canceling...Canceled.
Destroying job...
"ctrl-c"
Canceling...Failed.
globusrun-ws: Operation was canceled
globusrun-ws: Unable to destroy job: Error destroying job
globus_soap_message_module: Failed receiving response ManagedJobPortType_Destroy.
globus_soap_message_module: SOAP Message transport failed: Error in HTTP response
globus_xio: Operation was canceled
---
Successful job to other host:
globusrun-ws -submit -F https://host02.unizh.ch -Ft Fork -streaming -c /bin/hostname
Delegating user credentials...Done.
Submitting job...Done.
Job ID: uuid:ce733836-7fd2-11dd-a805-00142272b1a0
Termination time: 09/12/2008 07:25 GMT
Current job state: Active
Current job state: CleanUp-Hold
host02.unizh.ch
Current job state: CleanUp
Current job state: Done
Destroying job...Done.
Cleaning up any delegated credentials...Done.
---
Here is the container log for the failed job:
2008-09-11 09:29:56,467 INFO exec.StateMachine [RunQueueThread_0,logJobAccepted:3424] Job 6ec0f7b0-7fd3-11dd-9f10-bf807922599c accepted for local user 'mpackard' for DN '/O=University of Zurich/OU=OCI/OU=Baldridge group/OU=unizh.ch/CN=Mike Packard'
2008-09-11 09:29:56,708 INFO exec.StateMachine [RunQueueThread_6,logJobSubmitted:3437] Job 6ec0f7b0-7fd3-11dd-9f10-bf807922599c submitted with local job ID '6eedf9d6-7fd3-11dd-845b-00142272b1a0:536'
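
In case it helps when pulling out the entries for a single job from a larger container.log, here is a rough Python sketch. It assumes every entry starts with a timestamp in the form seen in the excerpt above and that mail clients may have wrapped an entry across several lines. The names entries_for_job and ENTRY_START are made up for this illustration and are not part of Globus.

```python
import re

# Assumption: each log entry begins with a timestamp like "2008-09-11 09:29:56,467 ".
ENTRY_START = re.compile(r"^\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2},\d{3} ")


def entries_for_job(lines, job_id):
    """Rejoin wrapped lines into whole entries and keep those mentioning job_id."""
    entries, current = [], []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        # A new timestamp closes the entry collected so far.
        if ENTRY_START.match(line) and current:
            entries.append(" ".join(current))
            current = []
        current.append(line)
    if current:
        entries.append(" ".join(current))
    return [entry for entry in entries if job_id in entry]
```

Passing an open file handle for the log and the job UUID (without the "uuid:" prefix, as it appears in the log) should return one string per matching entry.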