I'm still trying to get through step 7 of the OSCAR installation
process.  Some of the problems are cleared up, but some remain.
I'm not sure if there are 1 or 2 problems.  An excerpt from the
log is shown below.

1.  I have been digging everywhere I can think to find gexec_cluster(),
but it has evaded me.  The XML_ParseBuffer error is mysterious.
It happens 6 times, although there are only 4 nodes.

2.  The file /var/spool/pbs/server_priv/nodes does not exist.
I haven't been able to figure out where it is supposed to be created.
I tried to make it by hand, but it didn't help.  Could this be a
result of error #1?

I have deleted the many irritating copies of
  sh: module: line 1: syntax error: unexpected end of file
  sh: error importing function definition for `module'

-------------  Begin log excerpt  -------------------------------

--> About to run /var/lib/oscar/packages/opium/api-post-deploy for opium
image:
$VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1:
no element found

cpush returned -1 on subcluster galpropimage
image:
$VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1:
no element found

cpush returned -1 on subcluster galpropimage
image:
$VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1:
no element found

cpush returned -1 on subcluster galpropimage
image:
$VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1:
no element found

cpush returned -1 on subcluster galpropimage
image:
$VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1:
no element found

cpush returned -1 on subcluster galpropimage
image:
$VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1:
no element found

cpush returned -1 on subcluster galpropimage


--> About to run /var/lib/oscar/packages/switcher/api-post-deploy for 
switcher
Checking if the OPKG has to be excluded...
OPKG switcher: Analysing default values
--> About to run /var/lib/oscar/packages/torque/api-post-deploy for torque
[torque] Updating pbs_server nodes
/opt/pbs/bin/pbsnodes: Server has no node list MSG=node list is empty
qmgr obj=compute1.stanford.edu svr=default: Unauthorized Request
create node compute1.stanford.edu np = 48 , properties = all
qmgr obj=compute2.stanford.edu svr=default: Unauthorized Request
create node compute2.stanford.edu np = 48 , properties = all
qmgr obj=compute3.stanford.edu svr=default: Unauthorized Request
create node compute3.stanford.edu np = 48 , properties = all
qmgr obj=compute4.stanford.edu svr=default: Unauthorized Request
create node compute4.stanford.edu np = 48 , properties = all
Shutting down TORQUE Server: [  OK  ]
Starting TORQUE Server: [  OK  ]
[torque] Creating TORQUE workq queue...
qmgr obj=workq svr=default: Unauthorized Request
Max open servers: 4
create queue workq
Configuration of TORQUE queues failed, check the logs at /var/spool/pbs 
at /var/lib/oscar/packages/torque/api-post-deploy line 316
Script /var/lib/oscar/packages/torque/api-post-deploy exitted badly with 
exit code '2' at ./post_install line 49
Couldn't run 'post_install' script for torque at ./post_install line 50
Some of the post install scripts failed, please check your logs for more 
info at ./post_install line 55
--> Step 7: Failed to properly complete the cluster install; please 
check the logs


 From the pbs log:
08/09/2010 15:07:26;0004;PBS_Server;Svr;galprop-test2.stanford.edu;
cannot open node description file '/var/spool/pbs/server_priv/nodes'
in setup_nodes()

------------------------------------------------------------------------------
This SF.net email is sponsored by 

Make an app they can't live without
Enter the BlackBerry Developer Challenge
http://p.sf.net/sfu/RIM-dev2dev 
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to