|
Hi Bernard, The the address 10.0.1.10 was for my intranet network card (eth1) and I used ./install_cluster eth1 for the installation. BTW, I'd like to make it a bit more clear for the meaning of " hostname for the network should be the same in the PBSserver file". My understanding is that the hostname (one can get from %echo $HOSTNAME) should be on the same line as the name of the PBSserver (pbs_oscar) in the file /etc/hosts. In the example below my $HOSTNAME was abc.ntu.edu.tw. Thanks to you for your suggestions as well. Shiang-Tai Bernard Li wrote: Hi Shiang-Tai: Good work! This may be a potential bug that we might have to look into.One question though, assuming that 10.0.1.10 is the address of your eth1 interface, have you always been running: ./install_cluster eth1 ? Have you ever run install_cluster on eth0? Thanks, Bernard-----Original Message----- From: Shiang-Tai Lin [mailto:[EMAIL PROTECTED]] Sent: Tuesday, January 11, 2005 20:53 To: Yu Chen Cc: Bernard Li; [email protected] Subject: Re: [Oscar-users] PBS configuration failure during post_install (OSCAR4+FC2) Hi, I finally figure out the problem with the Torque post_install failure. The problem (at least for me) was that the hostname for the network is not the same in the PBSserver file. That is, the name of the PBSserver, pbs_oscar (defined in /var/spool/pbs/server_name), is not defined in the file /etc/hosts. This should always be true if one has only one network card installed but may be setup incorrectly if there are additional network cards on the server node. To be more clear, I list my /etc/hosts file here (the xx are numbers I do not wish to disclose for security reasons) # Do not remove the following line, or various programs # that require network functionality will fail. 10.0.1.10 abc.ntu.edu.tw abc oscar_server nfs_oscar pbs_oscar 140.112.xx.xx def.ntu.edu.tw def # These entries are managed by SIS, please don't modify them. 10.0.1.1 node1.abc.ntu.edu.tw node1 10.0.1.2 node2.abc.ntu.edu.tw node2 Originally I had "10.0.1.10 def.ntu.edu.tw def oscar_server nfs_oscar pbs_oscar" in the /etc/hosts so the post_install always failed. I also want to note that it is not necessary to edit the nodes file /var/spool/pbs/server_priv/nodes. The post_install script can still find the nodes without this file. Finally, I'd like to point out that I figured this out after reading the following paragraph I found in http://www.mail-archive.com/[email protected]/msg01387.html |
- Re: [Oscar-users] PBS configuration failure during pos... Shiang-Tai Lin
- RE: [Oscar-users] PBS configuration failure during post_ins... Bernard Li
- RE: [Oscar-users] PBS configuration failure during post_ins... Bernard Li
- Re: [Oscar-users] PBS configuration failure during pos... Shiang-Tai Lin
- RE: [Oscar-users] PBS configuration failure during post_ins... Bernard Li
- FW: [Oscar-users] PBS configuration failure during post_ins... Bernard Li
- RE: [Oscar-users] PBS configuration failure during post_ins... Bernard Li
- RE: [Oscar-users] PBS configuration failure during post_ins... Bernard Li
