Hi Prakashan,

When you run your test with the SEG_SGE_DEBUG level set, what corresponding entries do you see in the reporting file? either 'tail -f' the file and or grep on "job_log" and the job id.

BTW: ARCO's dbwriter does delete the reporting file as it's checkpoint mechanism so that's still an incompatibility with gt4.

thanks, Jeff

Korambath, Prakashan wrote:

Hi,

I am trying to sort out some issues with Integrating Globus ToolKit 4.2 and SGE 6.2 SEG. Some of the issues have already been answered in the mailing list and I have followed those answers and they work correctly, but I am having at least couple of issues.

For example command below

1. globusrun-ws -debug -batch -submit -o job_epr -factory "globushostname" -Ft SGE -f sleep.xml
submits and runs the job ok, but command below


2. globusrun-ws -debug -status -job-epr-file job_epr

This command always return status unsubmitted even when job is long gone.

Current job state: Unsubmitted

I checked the $SGE_ROOT/$SGE_CELL/common/reporting file and the file. I found this file disappearing when SGE's ARCO dbwriter is also running. For testing purpose I stopped the postgresql and stopped ARCO from doing anything to that file. So now that file is there, but still SEG is not getting updates like pending, finished etc. Everything is fine with Fork, so there is some problem with SGE-SEG.

I also set

export SEG_SGE_DEBUG=3 and ran
/home/globus/gt4.2.1/libexec/globus-scheduler-event-generator -s sge -t 1225815907


globus_l_sge_split_into_fields()
globus_l_sge_split_into_fields(): exit success
New event: job 28 now pending
freeing fields
globus_l_sge_parse_events() exits
globus_l_sge_clean_buffer() called
globus_l_sge_split_into_fields()
globus_l_sge_split_into_fields(): exit success
New event: job 28 now completed
freeing fields
globus_l_sge_split_into_fields()
globus_l_sge_split_into_fields(): exit success


So the scheduler event generator seems to get the status. My suspicion is that something is missing in the file seg_sge_module.c. I already have changes mentioned here
http://www.globus.org/toolkit/docs/4.2/4.2.0/execution/gram4/developer/scheduler-tutorial-seg.html

I wonder what else is missing.


Prakashan



Reply via email to