Thanks
Jeremy for your second reply.
I have
tried your suggestion, stop the pbs_server,
start
with create and run step 7 again.
Still
got /opt/pbs/bin/pbsnodes: Server has no node list
qmr
obj=node2.gome.xxx.yyy.zzz svr=default; Unauthorzed Request
Creating pbs workq queue...
qmgr
obj=workq svr=default: Unauthorized Request
Attached is the log.
Any further ideas?
-----Original Message-----Chiu-
From: Jeremy Enos [mailto:[EMAIL PROTECTED]
Sent: 15 October 2003 18:32
To: Chiu, PCM (Peter) ; '[EMAIL PROTECTED]'
Subject: RE: [Oscar-users] Queries on Oscar
Try starting w/ a clean PBS db by doing the following:
service pbs_server stop/opt/pbs/sbin/pbs_server -t createThen try step 7 again.Jeremy
At 03:37 AM 10/15/2003, Chiu, PCM (Peter) wrote:
Hi Jeremy,
On the server node, I have this in /etc/hosts
127.0.0.1 localhost.localdomain localhost
192.168.0.1 master.aaa.bbb.ccc master oscar_server nfs_oscar pbs_oscar
130.246.nnn.mmm ecp.ddd.bbb.ccc
#These entries are managed by SIS, please don't modify them.
192.168.0.2 node2.aaa.bbb.ccc node2
"ecp.ddd.bbb.ccc" is the normal name for this server configured on interface eth0
for external access. This name will come up in response to the "hostname" command.
On interface eth1, the machine is configured as 192.168.0.1, with the name "master.aaa.bbb.ccc".
This is the interface used to communicate with the compute node "node2".
Am I missing something?
Peter
- -----Original Message-----
- From: Jeremy Enos [mailto:[EMAIL PROTECTED]]
- Sent: 14 October 2003 22:39
- To: Chiu, PCM (Peter) ; Chiu, PCM (Peter) ; '[EMAIL PROTECTED]'
- Subject: Re: [Oscar-users] Queries on Oscar
- Peter-
- PBS setup is sensitive to having a proper hostname on the server. Have you set one?
- thx-
Jeremy
- At 02:06 PM 10/14/2003, Chiu, PCM (Peter) wrote:
Dear all, I am trying to set up a test cluster of two RH 9.0 nodes using oscar 2.3.1. The compute node appears to start up okay with its own local system disk. Some questions: a. at the last phase of completing the cluster setup, the log reveals **** oscar_cluster **** ----- node2.xxx.yyy.zzz ---- Setting loghost to 192.168.0.1 Starting system logginer: [ OK] Staring kernel logger: [ok] Some of the post install scripts failed, please check your logs for more info at ./post_install line 48 ---> Step 7: Failed to properly complete the cluster install; please check the logs Where is the detailed log ? b. During the test cluster setup, it complains pbs server has no node list. I then found and ran this script ./packagespbs/scripts/post_install --default it complains: create node node2.xxx.yyy.zzz = 1; properties = all qmgr obj=node2.xxx.yyy.zzz svr=default; Unauthorized Request It appears that the batch system is not quite setup properly.
Any ideas?
PBS_Server ecp.ag.rl.ac.uk: Create mode and server database exists, do you wish to continue y/(n)?y
=============================================================================
== Running step 7 of the OSCAR wizard: Complete cluster setup
=============================================================================
--> Step 7: Running: ./post_install
Gathering processor count from node2.gome.rl.ac.uk.
Warning: No xauth data; using fake authentication data for X11 forwarding.
Updating database for machine node2.gome.rl.ac.uk.
Warning: No xauth data; using fake authentication data for X11 forwarding.
building file list ... done
hosts
wrote 78 bytes read 42 bytes 240.00 bytes/sec
total size is 362 speedup is 3.02
--> About to run /opt/oscar/packages/switcher/scripts/post_install for switcher
Setting default for tag mpi ("lam-7.0")
Attribute successfully set; new attribute setting will be effective for
future shells
Warning: No xauth data; using fake authentication data for X11 forwarding.
building file list ... done
switcher.ini
wrote 239 bytes read 42 bytes 562.00 bytes/sec
total size is 189 speedup is 0.67
--> About to run /opt/oscar/packages/pfilter/scripts/post_install for pfilter
(re)starting the pfilter firewall service on this server
/etc/init.d/pfilter restart
Restarting pfilter: [ OK ]
pushing out the clients pfilter firewall configuration file
. /etc/profile.d/c3.sh && cpush /etc/pfilter.conf.clients /etc/pfilter.conf
Warning: No xauth data; using fake authentication data for X11 forwarding.
building file list ... done
wrote 59 bytes read 20 bytes 52.67 bytes/sec
total size is 856 speedup is 10.84
(re)starting the pfilter firewall service on the clients
. /etc/profile.d/c3.sh && cexec /etc/init.d/pfilter restart
************************* oscar_cluster *************************
--------- node2.gome.rl.ac.uk---------
Warning: No xauth data; using fake authentication data for X11 forwarding.
Restarting pfilter:[ OK ]
--> About to run /opt/oscar/packages/pbs/scripts/post_install for pbs
Warning: No xauth data; using fake authentication data for X11 forwarding.
PBS mom config file updated with clienthost: ecp.ag.rl.ac.uk
Pushing config file to clients...
building file list ... done
config
wrote 79 bytes read 42 bytes 242.00 bytes/sec
total size is 105 speedup is 0.87
Sending SIGHUP to all moms...
************************* oscar_cluster *************************
--------- node2.gome.rl.ac.uk---------
Warning: No xauth data; using fake authentication data for X11 forwarding.
Shutting down PBS Server: [ OK ]
Starting PBS Server: [ OK ]
Updating pbs_server nodes
/opt/pbs/bin/pbsnodes: Server has no node list
qmgr obj=node2.gome.rl.ac.uk svr=default: Unauthorized Request
create node node2.gome.rl.ac.uk np = 1 , properties = all
Creating pbs workq queue...
qmgr obj=workq svr=default: Unauthorized Request
Max open servers: 4
create queue workq
Configuration of PBS queues failed at /opt/oscar/packages/pbs/scripts/post_install
line 260
Script /opt/oscar/packages/pbs/scripts/post_install exitted badly with exit code '2'
at ./post_install line 42
Couldn't run 'post_install' script for pbs at ./post_install line 43
--> About to run /opt/oscar/packages/opium/scripts/post_install for opium
Warning: No xauth data; using fake authentication data for X11 forwarding.
building file list ... done
wrote 46 bytes read 20 bytes 132.00 bytes/sec
total size is 543 speedup is 8.23
Warning: No xauth data; using fake authentication data for X11 forwarding.
building file list ... done
wrote 51 bytes read 20 bytes 47.33 bytes/sec
total size is 189 speedup is 2.66
Warning: No xauth data; using fake authentication data for X11 forwarding.
building file list ... done
passwd
wrote 81 bytes read 54 bytes 90.00 bytes/sec
total size is 1869 speedup is 13.84
Warning: No xauth data; using fake authentication data for X
building file list ... done
group
wrote 78 bytes read 42 bytes 240.00 bytes/sec
total size is 659 speedup is 5.49
Warning: No xauth data; using fake authentication data for X11 forwarding.
building file list ... done
shadow
wrote 81 bytes read 48 bytes 86.00 bytes/sec
total size is 1190 speedup is 9.22
--> About to run /opt/oscar/packages/ntpconfig/scripts/post_install for ntpconfig
************************* oscar_cluster *************************
--------- node2.gome.rl.ac.uk---------
Warning: No xauth data; using fake authentication data for X11 forwarding.
Shutting down ntpd: [FAILED]
ntpd: Synchronizing with time server: [ OK ]
Starting ntpd: [ OK ]
--> About to run /opt/oscar/packages/loghost/scripts/post_install for loghost
************************* oscar_cluster *************************
--------- node2.gome.rl.ac.uk---------
Warning: No xauth data; using fake authentication data for X11 forwarding.
oscar_loghost already set
Some of the post install scripts failed, please check your logs for more info at
./post_install line 48
--> Step 7: Failed to properly complete the cluster install; please check the logs
