I'm still trying to get through step 7 of the OSCAR installation process. Some of the problems are cleared up, but some remain. I'm not sure if there are 1 or 2 problems. An excerpt from the log is shown below.
1. I have been digging everywhere I can think to find gexec_cluster(), but it has evaded me. The XML_ParseBuffer error is mysterious. It happens 6 times, although there are only 4 nodes.

2. The file /var/spool/pbs/server_priv/nodes does not exist. I haven't been able to figure out where it is supposed to be created. I tried to make it by hand, but it didn't help. Could this be a result of error #1?

I have deleted the many irritating copies of

sh: module: line 1: syntax error: unexpected end of file
sh: error importing function definition for `module'

------------- Begin log excerpt -------------------------------
--> About to run /var/lib/oscar/packages/opium/api-post-deploy for opium
image: $VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1: no element found
cpush returned -1 on subcluster galpropimage
image: $VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1: no element found
cpush returned -1 on subcluster galpropimage
image: $VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1: no element found
cpush returned -1 on subcluster galpropimage
image: $VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1: no element found
cpush returned -1 on subcluster galpropimage
image: $VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1: no element found
cpush returned -1 on subcluster galpropimage
image: $VAR1 = 'galpropimage';
---------------
gexec_cluster() XML_ParseBuffer() error at line 1: no element found
cpush returned -1 on subcluster galpropimage
--> About to run /var/lib/oscar/packages/switcher/api-post-deploy for switcher
Checking if the OPKG has to be excluded...
OPKG switcher: Analysing default values
--> About to run /var/lib/oscar/packages/torque/api-post-deploy for torque
[torque] Updating pbs_server nodes
/opt/pbs/bin/pbsnodes: Server has no node list MSG=node list is empty
qmgr obj=compute1.stanford.edu svr=default: Unauthorized Request
create node compute1.stanford.edu np = 48 , properties = all
qmgr obj=compute2.stanford.edu svr=default: Unauthorized Request
create node compute2.stanford.edu np = 48 , properties = all
qmgr obj=compute3.stanford.edu svr=default: Unauthorized Request
create node compute3.stanford.edu np = 48 , properties = all
qmgr obj=compute4.stanford.edu svr=default: Unauthorized Request
create node compute4.stanford.edu np = 48 , properties = all
Shutting down TORQUE Server: [ OK ]
Starting TORQUE Server: [ OK ]
[torque] Creating TORQUE workq queue...
qmgr obj=workq svr=default: Unauthorized Request
Max open servers: 4
create queue workq
Configuration of TORQUE queues failed, check the logs at /var/spool/pbs at /var/lib/oscar/packages/torque/api-post-deploy line 316
Script /var/lib/oscar/packages/torque/api-post-deploy exitted badly with exit code '2' at ./post_install line 49
Couldn't run 'post_install' script for torque at ./post_install line 50
Some of the post install scripts failed, please check your logs for more info at ./post_install line 55
--> Step 7: Failed to properly complete the cluster install; please check the logs

From the pbs log:

08/09/2010 15:07:26;0004;PBS_Server;Svr;galprop-test2.stanford.edu; cannot open node description file '/var/spool/pbs/server_priv/nodes' in setup_nodes()
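
For reference, here is a minimal sketch of the nodes file contents that the failed "create node" commands above would correspond to. It assumes the usual TORQUE layout of one host per line, followed by np=<count> and any property names; that layout is my assumption and is not something the log confirms. The hostnames, the np value of 48, and the "all" property are taken from the qmgr lines in the log. The function name nodes_file_lines is made up for illustration, and the script only prints the lines; it does not write anything under /var/spool/pbs.

```python
# Hostnames, np count, and the "all" property come from the qmgr
# "create node" lines in the log above. The one-host-per-line layout
# ("<host> np=<n> <property> ...") is an assumption about the file format.
NODES = [f"compute{i}.stanford.edu" for i in range(1, 5)]


def nodes_file_lines(hosts, np=48, properties=("all",)):
    """Return one candidate nodes-file line per host (hypothetical helper)."""
    props = " ".join(properties)
    # rstrip() avoids a trailing space when no properties are given.
    return [f"{host} np={np} {props}".rstrip() for host in hosts]


if __name__ == "__main__":
    # Print only, so the output can be compared with a hand-made file.
    print("\n".join(nodes_file_lines(NODES)))
```

Comparing this output with the file I made by hand would at least rule out a formatting mistake as the reason the hand-made file did not help.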