Title: Message
Thanks Jeremy for your second reply.
 
I have tried your suggestion, stop the pbs_server,
start with create and run step 7 again.
 
Still got /opt/pbs/bin/pbsnodes: Server has no node list
qmr obj=node2.gome.xxx.yyy.zzz svr=default; Unauthorzed Request
Creating pbs workq queue...
qmgr obj=workq svr=default: Unauthorized Request
 
Attached is the log.
 
Any further ideas?
 
-----Original Message-----
From: Jeremy Enos [mailto:[EMAIL PROTECTED]
Sent: 15 October 2003 18:32
To: Chiu, PCM (Peter) ; '[EMAIL PROTECTED]'
Subject: RE: [Oscar-users] Queries on Oscar

Chiu-
Try starting w/ a clean PBS db by doing the following:
service pbs_server stop
/opt/pbs/sbin/pbs_server -t create

Then try step 7 again.
        Jeremy

At 03:37 AM 10/15/2003, Chiu, PCM (Peter) wrote:
Hi Jeremy,
 
On the server node, I have this in  /etc/hosts
127.0.0.1               localhost.localdomain localhost
192.168.0.1            master.aaa.bbb.ccc  master oscar_server nfs_oscar pbs_oscar
130.246.nnn.mmm  ecp.ddd.bbb.ccc
#These entries are managed by SIS, please don't modify them.
192.168.0.2  node2.aaa.bbb.ccc node2
 
"ecp.ddd.bbb.ccc" is the normal name for this server configured on interface eth0
for external access. This name will come up in response to the "hostname" command.
 
On interface eth1, the machine is configured as 192.168.0.1, with the name "master.aaa.bbb.ccc". 
This is the interface used to communicate with the compute node "node2".
 
Am I missing something?
 
Peter
-----Original Message-----
From: Jeremy Enos [mailto:[EMAIL PROTECTED]]
Sent: 14 October 2003 22:39
To: Chiu, PCM (Peter) ; Chiu, PCM (Peter) ; '[EMAIL PROTECTED]'
Subject: Re: [Oscar-users] Queries on Oscar

Peter-
PBS setup is sensitive to having a proper hostname on the server.  Have you set one?
thx-

        Jeremy

At 02:06 PM 10/14/2003, Chiu, PCM (Peter) wrote:

    Dear all, I am trying to set up a test cluster of two RH 9.0 nodes using oscar 2.3.1. The compute node appears to start up okay with its own local system disk. Some questions: a. at the last phase of completing the cluster setup, the log reveals **** oscar_cluster **** ----- node2.xxx.yyy.zzz ---- Setting loghost to 192.168.0.1 Starting system logginer: [ OK] Staring kernel logger: [ok] Some of the post install scripts failed, please check your logs for more info at ./post_install line 48 ---> Step 7: Failed to properly complete the cluster install; please check the logs Where is the detailed log ? b. During the test cluster setup, it complains pbs server has no node list. I then found and ran this script ./packagespbs/scripts/post_install --default it complains: create node node2.xxx.yyy.zzz = 1; properties = all qmgr obj=node2.xxx.yyy.zzz svr=default; Unauthorized Request It appears that the batch system is not quite setup properly.

    Any ideas?
    PBS_Server ecp.ag.rl.ac.uk: Create mode and server database exists, 
    do you wish to continue y/(n)?y
    
    =============================================================================
    == Running step 7 of the OSCAR wizard: Complete cluster setup
    =============================================================================
    
    --> Step 7: Running: ./post_install
    Gathering processor count from node2.gome.rl.ac.uk.
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    Updating database for machine node2.gome.rl.ac.uk.
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    building file list ... done
    hosts
    wrote 78 bytes  read 42 bytes  240.00 bytes/sec
    total size is 362  speedup is 3.02
    --> About to run /opt/oscar/packages/switcher/scripts/post_install for switcher
    Setting default for tag mpi ("lam-7.0")
    Attribute successfully set; new attribute setting will be effective for
    future shells
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    building file list ... done
    switcher.ini
    wrote 239 bytes  read 42 bytes  562.00 bytes/sec
    total size is 189  speedup is 0.67
    --> About to run /opt/oscar/packages/pfilter/scripts/post_install for pfilter
    (re)starting the pfilter firewall service on this server
    /etc/init.d/pfilter restart
    Restarting pfilter:                                        [  OK  ]
    pushing out the clients pfilter firewall configuration file
    . /etc/profile.d/c3.sh && cpush /etc/pfilter.conf.clients /etc/pfilter.conf
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    building file list ... done
    wrote 59 bytes  read 20 bytes  52.67 bytes/sec
    total size is 856  speedup is 10.84
    (re)starting the pfilter firewall service on the clients
    . /etc/profile.d/c3.sh && cexec /etc/init.d/pfilter restart
    ************************* oscar_cluster *************************
    --------- node2.gome.rl.ac.uk---------
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    Restarting pfilter:[  OK  ]
    --> About to run /opt/oscar/packages/pbs/scripts/post_install for pbs
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    PBS mom config file updated with clienthost: ecp.ag.rl.ac.uk
    Pushing config file to clients...
    building file list ... done
    config
    wrote 79 bytes  read 42 bytes  242.00 bytes/sec
    total size is 105  speedup is 0.87
    Sending SIGHUP to all moms...
    ************************* oscar_cluster *************************
    --------- node2.gome.rl.ac.uk---------
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    Shutting down PBS Server:                                  [  OK  ]
    Starting PBS Server:                                       [  OK  ]
    Updating pbs_server nodes
    /opt/pbs/bin/pbsnodes: Server has no node list
    qmgr obj=node2.gome.rl.ac.uk svr=default: Unauthorized Request 
    create node node2.gome.rl.ac.uk np = 1 , properties = all
    Creating pbs workq queue...
    qmgr obj=workq svr=default: Unauthorized Request 
    Max open servers: 4
    create queue workq
    Configuration of PBS queues failed at /opt/oscar/packages/pbs/scripts/post_install 
    line 260
    Script /opt/oscar/packages/pbs/scripts/post_install exitted badly with exit code '2' 
    at ./post_install line 42
    Couldn't run 'post_install' script for pbs at ./post_install line 43
    --> About to run /opt/oscar/packages/opium/scripts/post_install for opium
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    building file list ... done
    wrote 46 bytes  read 20 bytes  132.00 bytes/sec
    total size is 543  speedup is 8.23
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    building file list ... done
    wrote 51 bytes  read 20 bytes  47.33 bytes/sec
    total size is 189  speedup is 2.66
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    building file list ... done
    passwd
    wrote 81 bytes  read 54 bytes  90.00 bytes/sec
    total size is 1869  speedup is 13.84
    Warning: No xauth data; using fake authentication data for X
    building file list ... done
    group
    wrote 78 bytes  read 42 bytes  240.00 bytes/sec
    total size is 659  speedup is 5.49
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    building file list ... done
    shadow
    wrote 81 bytes  read 48 bytes  86.00 bytes/sec
    total size is 1190  speedup is 9.22
    --> About to run /opt/oscar/packages/ntpconfig/scripts/post_install for ntpconfig
    ************************* oscar_cluster *************************
    --------- node2.gome.rl.ac.uk---------
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    Shutting down ntpd: [FAILED]
    ntpd: Synchronizing with time server: [  OK  ]
    Starting ntpd: [  OK  ]
    --> About to run /opt/oscar/packages/loghost/scripts/post_install for loghost
    ************************* oscar_cluster *************************
    --------- node2.gome.rl.ac.uk---------
    Warning: No xauth data; using fake authentication data for X11 forwarding.
    oscar_loghost already set
    Some of the post install scripts failed, please check your logs for more info at 
    ./post_install line 48
    --> Step 7: Failed to properly complete the cluster install; please check the logs
    
    

    Reply via email to