Try starting w/ a clean PBS db by doing the following:
service pbs_server stop
/opt/pbs/sbin/pbs_server -t createThen try step 7 again.
At 03:37 AM 10/15/2003, Chiu, PCM (Peter) wrote:
Hi Jeremy,
On the server node, I have this in /etc/hosts
127.0.0.1 localhost.localdomain localhost
192.168.0.1 master.aaa.bbb.ccc master oscar_server nfs_oscar pbs_oscar
130.246.nnn.mmm ecp.ddd.bbb.ccc
#These entries are managed by SIS, please don't modify them.
192.168.0.2 node2.aaa.bbb.ccc node2
"ecp.ddd.bbb.ccc" is the normal name for this server configured on interface eth0
for external access. This name will come up in response to the "hostname" command.
On interface eth1, the machine is configured as 192.168.0.1, with the name "master.aaa.bbb.ccc".
This is the interface used to communicate with the compute node "node2".
Am I missing something?
Peter
- -----Original Message-----
- From: Jeremy Enos [mailto:[EMAIL PROTECTED]]
- Sent: 14 October 2003 22:39
- To: Chiu, PCM (Peter) ; Chiu, PCM (Peter) ; '[EMAIL PROTECTED]'
- Subject: Re: [Oscar-users] Queries on Oscar
- Peter-
- PBS setup is sensitive to having a proper hostname on the server. Have you set one?
- thx-
Jeremy
- At 02:06 PM 10/14/2003, Chiu, PCM (Peter) wrote:
Dear all, I am trying to set up a test cluster of two RH 9.0 nodes using oscar 2.3.1. The compute node appears to start up okay with its own local system disk. Some questions: a. at the last phase of completing the cluster setup, the log reveals **** oscar_cluster **** ----- node2.xxx.yyy.zzz ---- Setting loghost to 192.168.0.1 Starting system logginer: [ OK] Staring kernel logger: [ok] Some of the post install scripts failed, please check your logs for more info at ./post_install line 48 ---> Step 7: Failed to properly complete the cluster install; please check the logs Where is the detailed log ? b. During the test cluster setup, it complains pbs server has no node list. I then found and ran this script ./packagespbs/scripts/post_install --default it complains: create node node2.xxx.yyy.zzz = 1; properties = all qmgr obj=node2.xxx.yyy.zzz svr=default; Unauthorized Request It appears that the batch system is not quite setup properly.
Any ideas?
