I'm stuck here... my only guess is that qmgr has some problem with an address that long. You can try just creating the nodes file manually... it's just an ASCII file of the form:
NODENAME np=NUM_PROCS arbitrary_propertyA arbitrary_propertyB etc
/var/spool/pbs/server_priv/nodes is the filename. Restart the pbs_server for settings to take effect. Not sure that will work because queues may not have been defined... you can find that out by entering:
qmgr -c "print server"
While the server is running of course.
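A minimal sketch of the two suggestions above, under some assumptions: the processor count is written np=N, as in the OpenPBS nodes file syntax (ppn= is the form used in qsub resource requests); np=1 and the property "all" are placeholder values; list_queues is a hypothetical helper, not part of PBS; and the sample input is illustrative rather than captured from this cluster. The script writes to a scratch path so it can be tried without touching the live file.

```shell
#!/bin/sh
# Sketch only. Writes to a scratch path; on the server the real file
# is /var/spool/pbs/server_priv/nodes (edit as root, then restart pbs_server).
NODES_FILE="${NODES_FILE:-./nodes.example}"

# One line per compute node. node2.aaa.bbb.ccc is the node from this thread;
# np=1 and the property "all" are assumed values.
printf '%s\n' 'node2.aaa.bbb.ccc np=1 all' > "$NODES_FILE"

# list_queues (hypothetical helper): read `qmgr -c "print server"` output
# on stdin and print the name of every queue it creates.
list_queues() {
    awk '$1 == "create" && $2 == "queue" { print $3 }'
}

# Real use, while pbs_server is running:
#   qmgr -c "print server" | list_queues
# Illustrative input, not output captured from this cluster:
sample='create queue workq
set queue workq queue_type = Execution
set server scheduling = True'

queues=$(printf '%s\n' "$sample" | list_queues)
echo "nodes file: $(cat "$NODES_FILE")"
echo "queues defined: ${queues:-none}"
```

If list_queues prints nothing against the real server, no queues exist yet and workq would still have to be created before jobs can run.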
good luck-
At 11:00 AM 10/16/2003, Chiu, PCM (Peter) wrote:
Thanks Jeremy for your second reply.
I have tried your suggestion: stopped the pbs_server,
started it with "-t create" and ran step 7 again.
Still got /opt/pbs/bin/pbsnodes: Server has no node list
qmgr obj=node2.gome.xxx.yyy.zzz svr=default: Unauthorized Request
Creating pbs workq queue...
qmgr obj=workq svr=default: Unauthorized Request
Attached is the log.
Any further ideas?
- -----Original Message-----
- From: Jeremy Enos [mailto:[EMAIL PROTECTED]]
- Sent: 15 October 2003 18:32
- To: Chiu, PCM (Peter) ; '[EMAIL PROTECTED]'
- Subject: RE: [Oscar-users] Queries on Oscar
- Chiu-
- Try starting w/ a clean PBS db by doing the following:
- service pbs_server stop
- /opt/pbs/sbin/pbs_server -t create
- Then try step 7 again.
- Jeremy
- At 03:37 AM 10/15/2003, Chiu, PCM (Peter) wrote:
- Hi Jeremy,
- On the server node, I have this in /etc/hosts
- 127.0.0.1 localhost.localdomain localhost
- 192.168.0.1 master.aaa.bbb.ccc master oscar_server nfs_oscar pbs_oscar
- 130.246.nnn.mmm ecp.ddd.bbb.ccc
- #These entries are managed by SIS, please don't modify them.
- 192.168.0.2 node2.aaa.bbb.ccc node2
- "ecp.ddd.bbb.ccc" is the normal name for this server configured on interface eth0
- for external access. This name will come up in response to the "hostname" command.
- On interface eth1, the machine is configured as 192.168.0.1, with the name "master.aaa.bbb.ccc".
- This is the interface used to communicate with the compute node "node2".
- Am I missing something?
- Peter
- -----Original Message-----
- From: Jeremy Enos [mailto:[EMAIL PROTECTED]]
- Sent: 14 October 2003 22:39
- To: Chiu, PCM (Peter) ; Chiu, PCM (Peter) ; '[EMAIL PROTECTED]'
- Subject: Re: [Oscar-users] Queries on Oscar
- Peter-
- PBS setup is sensitive to having a proper hostname on the server. Have you set one?
- thx-
- Jeremy
- At 02:06 PM 10/14/2003, Chiu, PCM (Peter) wrote:
Dear all,

I am trying to set up a test cluster of two RH 9.0 nodes using oscar 2.3.1. The compute node appears to start up okay with its own local system disk.

Some questions:

a. At the last phase of completing the cluster setup, the log reveals:

**** oscar_cluster ****
----- node2.xxx.yyy.zzz ----
Setting loghost to 192.168.0.1
Starting system logger: [ OK ]
Starting kernel logger: [ OK ]
Some of the post install scripts failed, please check your logs for more info at ./post_install line 48
---> Step 7: Failed to properly complete the cluster install; please check the logs

Where is the detailed log?

b. During the test cluster setup, it complains that the pbs server has no node list. I then found and ran this script:

./packagespbs/scripts/post_install --default

It complains:

create node node2.xxx.yyy.zzz = 1; properties = all
qmgr obj=node2.xxx.yyy.zzz svr=default: Unauthorized Request

It appears that the batch system is not quite set up properly.
Any ideas?
