I had properly configured GRAM for GT4.0.5. Now, trying to integrate with a PBS/MAUI cluster, it looks like work, but it doesn't.
The master node of my cluster is the machine pdclab-05. I run the next command: [EMAIL PROTECTED] ~]$ globusrun-ws -Ft PBS -submit -S -f a.rsl Delegating user credentials...Done. Submitting job...Done. Job ID: uuid:adc621f8-4481-11dc-8fe6-00163e6ad4e0 Termination time: 08/08/2007 01:01 GMT Current job state: StageIn Current job state: Pending Current job state: Active Current job state: CleanUp Current job state: Done Destroying job...Done. Cleaning up any delegated credentials...Done. hmmm!!! no, no... [EMAIL PROTECTED] ~]$ cat stderr Host key verification failed. /bin/touch: cannot touch `/home/josanabr/.globus/ae1e6610-4481-11dc-9e98-ba518469aa4c/exit.0': No such file or directory /var/spool/torque/mom_priv/jobs/25.pdclab-0.SC: line 54: /home/josanabr/.globus/ae1e6610-4481-11dc-9e98-ba518469aa4c/exit.0: No such file or directory /var/spool/torque/mom_priv/jobs/25.pdclab-0.SC: line 55: [: too many arguments Now, from pdclab-01 machine, I execute the next command: [EMAIL PROTECTED] ~]$ globusrun-ws -submit -S -Ft PBS -F https://pdclab-05.ece.uprm.edu:8443/wsrf/services/ManagedJobFactoryService -f a.rsl Delegating user credentials...Done. Submitting job...Done. Job ID: uuid:f949514a-4481-11dc-bef7-00163e2fd1ce Termination time: 08/08/2007 01:03 GMT Current job state: StageIn Current job state: Pending Current job state: Active Current job state: CleanUp Current job state: Done Destroying job...Done. Cleaning up any delegated credentials...Done. I don't have stderr file in pdclab-01, but it is in pdclab-05. [EMAIL PROTECTED] ~]$ cat stderr Host key verification failed. /bin/touch: cannot touch `/home/josanabr/.globus/e9c90e90-4481-11dc-9e98-ba518469aa4c/exit.0': No such file or directory /var/spool/torque/mom_priv/jobs/26.pdclab-0.SC: line 54: /home/josanabr/.globus/e9c90e90-4481-11dc-9e98-ba518469aa4c/exit.0: No such file or directory /var/spool/torque/mom_priv/jobs/26.pdclab-0.SC: line 55: [: too many arguments I look to the "my_pbs_submit_script" and "my_pbs_job", files created as is suggested in "http://www.mail-archive.com/[email protected]/msg00553.html", in its content is append to the end of this e-mail. I think the problem is that the directory is not created by the perl's script ${GLOBUS_LOCATION}/lib/perl/Globus/GRAM/JobManager/pbs.pm. I look for a line to contains "mkdir" command to create a directory corresponding to ${GLOBUS_USER}/.globus/hash_key, and I can't found. Two questions: Are there something missing in this script related to directory creation? If there is, someone has provide a solution for that issue? thanks a lot. regards. -------------------- my_pbs_submit_script ---------------------------- #! /bin/sh # PBS batch job script built by Globus job manager # #PBS -S /bin/sh #PBS -m n #PBS -o /home/josanabr/stdout #PBS -e /home/josanabr/stderr #PBS -l nodes=1 GLOBUS_LOCATION="/opt/gt"; export GLOBUS_LOCATION; X509_CERT_DIR="/etc/grid-security/certificates"; export X509_CERT_DIR; X509_USER_PROXY=""; export X509_USER_PROXY; X509_USER_CERT=""; export X509_USER_CERT; X509_USER_KEY=""; export X509_USER_KEY; HOME="/home/josanabr"; export HOME; LOGNAME="josanabr"; export LOGNAME; SCRATCH_DIRECTORY="/home/josanabr/.globus/scratch"; export SCRATCH_DIRECTORY; JAVA_HOME="/opt/jdk/jre"; export JAVA_HOME; GLOBUS_GRAM_JOB_HANDLE="https://136.145.116.82:8443/wsrf/services/ManagedExecuta bleJobService?e9c90e90-4481-11dc-9e98-ba518469aa4c"; export GLOBUS_GRAM_JOB_HANDLE; LD_LIBRARY_PATH=; export LD_LIBRARY_PATH; #Change to directory requested by user cd /home/josanabr hosts=`cat $PBS_NODEFILE`; counter=0 while test $counter -lt 1; do for host in $hosts; do if test $counter -lt 1; then /usr/bin/ssh $host "/bin/sh /home/josanabr/.globus/e9c90e90-4481-11d c-9e98-ba518469aa4c/scheduler_pbs_cmd_script; echo \$? > /home/josanabr/.globus/ e9c90e90-4481-11dc-9e98-ba518469aa4c/exit.$counter" < /dev/null & counter=`expr $counter + 1` else break fi done done wait counter=0 exit_code=0 while test $counter -lt 1; do /bin/touch /home/josanabr/.globus/e9c90e90-4481-11dc-9e98-ba518469aa4c/exit. $counter; read tmp_exit_code < /home/josanabr/.globus/e9c90e90-4481-11dc-9e98-ba518469 aa4c/exit.$counter if [ $exit_code = 0 -a $tmp_exit_code != 0 ]; then exit_code=$tmp_exit_code fi counter=`expr $counter + 1` done exit $exit_code --------------------------------------------------------------- ------------------- my_pbs_job ------------------------- #!/bin/sh cd /home/josanabr GLOBUS_LOCATION="/opt/gt"; export GLOBUS_LOCATION; X509_CERT_DIR="/etc/grid-security/certificates"; export X509_CERT_DIR; X509_USER_PROXY=""; export X509_USER_PROXY; X509_USER_CERT=""; export X509_USER_CERT; X509_USER_KEY=""; export X509_USER_KEY; HOME="/home/josanabr"; export HOME; LOGNAME="josanabr"; export LOGNAME; SCRATCH_DIRECTORY="/home/josanabr/.globus/scratch"; export SCRATCH_DIRECTORY; JAVA_HOME="/opt/jdk/jre"; export JAVA_HOME; GLOBUS_GRAM_JOB_HANDLE="https://136.145.116.82:8443/wsrf/services/ManagedExecuta bleJobService?e9c90e90-4481-11dc-9e98-ba518469aa4c"; export GLOBUS_GRAM_JOB_HANDLE; LD_LIBRARY_PATH=; export LD_LIBRARY_PATH; ./my_echo "Hello" "World!" -------------------------------------------------------------- http://ece.uprm.edu/~s047267 http://del.icio.us/josanabr http://blog-grid.blogspot.com
