On Thu, 16 Oct 2003, Chiu, PCM (Peter) wrote:
> Still got /opt/pbs/bin/pbsnodes: Server has no node list > qmr obj=node2.gome.xxx.yyy.zzz svr=default; Unauthorzed Request > Creating pbs workq queue... > qmgr obj=workq svr=default: Unauthorized Request > > Attached is the log. > > Any further ideas?
I've seen the above errors several times before. The symptoms that seem to be present when it occurs is the lack of a valid dns entry for the entry in /var/spool/pbs/server_name, which is pbs_oscar by default (for example, type 'host pbs_oscar' and getting a host not found).
Often times (but not 100%), I've been able to correct this by changing the entry in /var/spool/pbs/server_name to match the dns entry for the head node. If the nodes can resolve the IP for the head node, you're in luck.
I've always (100% of the time) been able to correct this by writing my own DNS authority zones for the cluster, and resolving all of the oscar-specific hosts in /etc/hosts manually through bind (eg. pbs_oscar*). Afterwards, rerun the "complete cluster setup" and the test-cluster should pull through without a complete reinstall.
I'd provide some easy zone examples here if you're interested, but hopefully someone will interject here about why this isn't the "Right Thing to Do(tm)".
Anecdotal observations: Most of the time I've seen this issue when the cluster is being installed without an external network connection or when there is no DNS entry on the local LAN's dns for the IP of the head node.
I'd appreciate it if anyone could chime in here with steps that I could take so that this step isn't necessary.
ADVthanksANCE
(*) Keep in mind though, that underscores are not valid characters for hostnames (ref: RFC 952 and 1123), so that alternate DNS servers (pdnsd, for example, which is more light weight and secure than bind) will refuse to honor these hosts.
Paul Gray -o) 323 Wright Hall /\\ University of Northern Iowa _\_V Message void if penguin violated ... Don't mess with the penguin No one ever says "Hey, I can't read that ASCII e-mail ya sent me."
Let me know if this corresponds to your scenario or not...
Jeremy
At 08:52 AM 5/7/2004, Chad Dupuis wrote:
Hello,
I'm hoping that someone can offer advice on this error we are getting during Step 7. It seems that the requests going to pbs do not have the correct permissions. Below is the log:
Shutting down PBS Server: [ OK ]
Starting PBS Server: [ OK ]
Updating pbs_server nodes
/opt/pbs/bin/pbsnodes: Server has no node list
qmgr obj=mitacis3.oscardomain svr=default: Unauthorized Request
create node mitacis3.oscardomain np = 2 , properties = all
qmgr obj=mitacis4.oscardomain svr=default: Unauthorized Request
create node mitacis4.oscardomain np = 2 , properties = all
qmgr obj=mitacis5.oscardomain svr=default: Unauthorized Request
create node mitacis5.oscardomain np = 2 , properties = all
qmgr obj=mitacis6.oscardomain svr=default: Unauthorized Request
create node mitacis6.oscardomain np = 2 , properties = all
qmgr obj=mitacis7.oscardomain svr=default: Unauthorized Request
create node mitacis7.oscardomain np = 2 , properties = all
qmgr obj=mitacis8.oscardomain svr=default: Unauthorized Request
create node mitacis8.oscardomain np = 2 , properties = all
qmgr obj=mitacis9.oscardomain svr=default: Unauthorized Request
create node mitacis9.oscardomain np = 2 , properties = all
Creating pbs workq queue...
Max open servers: 4
qmgr obj=workq svr=default: Unauthorized Request
create queue workq
Configuration of PBS queues failed at /var/lib/oscar/packages/torque/scripts/post_install line 297
Script /var/lib/oscar/packages/torque/scripts/post_install exitted badly with exit code '2' at ./post_install line 42
Couldn't run 'post_install' script for torque at ./post_install line 43
Some of the post install scripts failed, please check your logs for more info at ./post_install
line 48
--> Step 7: Failed to properly complete the cluster install; please check the logs
-- Cheers,
Chad J. Dupuis MIT Academic Computing
------------------------------------------------------- This SF.Net email is sponsored by Sleepycat Software Learn developer strategies Cisco, Motorola, Ericsson & Lucent use to deliver higher performing products faster, at low TCO. http://www.sleepycat.com/telcomwpreg.php?From=osdnemail3 _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users
------------------------------------------------------- This SF.Net email is sponsored by Sleepycat Software Learn developer strategies Cisco, Motorola, Ericsson & Lucent use to deliver higher performing products faster, at low TCO. http://www.sleepycat.com/telcomwpreg.php?From=osdnemail3 _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users