From: Bernard Li
Sent: Tue 22/02/2005 8:23 PM
To: ullas nil; [EMAIL PROTECTED]
Subject: RE: [Oscar-users] errors during installation(erroneous parts of log file included)
Sent: Tue 22/02/2005 8:11 PM
To: Bernard Li
Subject: RE: [Oscar-users] errors during installation(erroneous parts of log file included)
Hi,

Thanks for the response.

Yes, we network-booted the client nodes and watched them install the image.
After the installation completed successfully, we booted the nodes from their
local disks, and they hung with an error like "VP_IDE: Unknown VIA SouthBridge,
disabling DMA".
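
The step 7 log quoted further down shows "No route to host" on port 22 for oscarnode1, so it may help to confirm each node accepts SSH connections before running that step. A minimal sketch in Python, assuming plain TCP reachability on port 22 is a good enough signal; the function names `ssh_reachable` and `unreachable_nodes` are made up for illustration and are not part of OSCAR:

```python
import socket


def ssh_reachable(host, port=22, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the name and connects; any failure
        # (DNS error, refused, no route, timeout) is raised as an OSError.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def unreachable_nodes(hosts, port=22, timeout=3.0):
    """Return the hosts that did not accept a TCP connection on the port."""
    return [h for h in hosts if not ssh_reachable(h, port, timeout)]
```

For example, `unreachable_nodes(["oscarnode1"])` returns a list containing `"oscarnode1"` while the node is down or hung at boot, and an empty list once it accepts connections on port 22. This only tests that the port is open; it does not check that login or rsync will actually work.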
----- Original Message -----
From: "Bernard Li" <[EMAIL PROTECTED]>
To: "ullas nil" <[EMAIL PROTECTED]>, [email protected]
Subject: RE: [Oscar-users] errors during installation(erroneous parts of log file included)
Date: Tue, 22 Feb 2005 12:22:45 -0800
>
> Have your nodes been imaged and booted before you run the 'Complete
> Cluster Install' step?
>
> Cheers,
>
> Bernard
>
> > -----Original Message-----
> > From: [EMAIL PROTECTED]
> > [mailto:[EMAIL PROTECTED]] On Behalf Of
> > ullas nil
> > Sent: Tuesday, February 22, 2005 2:04
> > To: [email protected]
> > Subject: [Oscar-users] errors during installation(erroneous parts
> > of log file included)
> >
> > hi
> > thx for previous response
> >
> > At all steps we are getting a success popup window. Members, please clarify.
> >
> > Errors and warnings......................................
> >
> > Running OSCAR wizard_prep script
> > =============================================================================
> >
> > --> Running OSCAR wizard prerequisites
> > Running prereq setup script packman (in order)
> > Checking for packman/depman RPMs...
> > Running prereq setup script update-rpms (in order)
> > Installing /opt/oscar/share/prereqs/update-rpms/RPMS/update-rpms-1.1.14-20.noarch.rpm
> > Preparing... ##################################################
> > package update-rpms-1.1.14-20 is already installed
> > WARNING: Appears oscar-httpd was not installed. Defaulting to local filesystem.
> >
> > --> Successfully ran wizard_prep
> >
> > =============================================================================
> > == Prerequisites installed
> > =============================================================================
> >
> > --> OSCAR version: 4.0
> > --> Command line invocation: ./install_cluster eth0
> > --> Hostname: oscarserver ****(domain name set using domainname command and also using
> >     **** setdomain system call, still domain name not displayed).
> > --> Domainname:
> > --> Network interface: eth0
> >
> > --> Linux distribution: redhat 9
> > --> Kernel version: 2.4.20-8
> > --> Architecture: i686
> > --> Running in directory: /opt/oscar
> > --> PATH: /usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin:/usr/X11R6/bin:/root/bin:/bin:/usr/bin:/sbin:/usr/sbin:/usr/local/bin:/usr/local/sbin
> > --> Running: "./oscar_wizard"
> >
> > =============================================================================
> > == Running step 2 of the OSCAR wizard: Configure selected OSCAR packages
> > =============================================================================
> >
> > no errors and warnings
> >
> > =============================================================================
> > == Running step 3 of the OSCAR wizard: Install OSCAR server packages
> > =============================================================================
> >
> > WARNING: OSCAR does not know how to configure sendmail yet.
> > WARNING: Please bug the OSCAR developers to finish the disable-servics package!
> > WARNING: There will be no mail service running on the client nodes!
> >
> > --> Finished server_prep script
> > --> Step 3: Successfully installed OSCAR server
> >
> > =============================================================================
> > == Running step 4 of the OSCAR wizard: Build OSCAR client image
> > =============================================================================
> >
> > warning: /tftpboot/rpm/redhat-release-9-3.i386.rpm: V3 DSA signature: NOKEY, key ID db42a60e
> > --> Step 4: Identified distro of clients: redhat 9
> >
> > 17: 2005-1-22 9:33:30 [SystemInstaller::Package :: Line 216] Installing with module SystemInstaller::Package::Rpm
> > 18: 2005-1-22 9:33:30 [SystemInstaller::Package::Rpm :: Line 223] Performing RPM stage 1 install, command is:
> > 19: 2005-1-22 9:33:30 [SystemInstaller::Package::Rpm :: Line 224] cd /tftpboot/rpm;rpm -ir /var/lib/systemimager/images/oscarimage -v --percent filesystem-2.2.1-3.i386.rpm setup-2.5.25-1.noarch.rpm basesystem-8.0-2.noarch.rpm glibc-common-2.3.2-11.9.i386.rpm glibc-2.3.2-11.9.i686.rpm libtermcap-2.0.8-35.i386.rpm termcap-11.0.1-16.noarch.rpm
> > warning: filesystem-2.2.1-3.i386.rpm: V3 DSA signature: NOKEY, key ID db42a60e
> > warning: raidtools-1.00.3-2.i386.rpm: V3 DSA signature: NOKEY, key ID db42a60e
> > 20: Preparing packages for installation...
> > 21: setup-2.5.25-1
> >
> > awk: cmd. line:2: fatal: cannot open file `/etc/fstab' for reading (No such file or directory)
> > ls: readlink:: No such file or directory
> > ls: file: No such file or directory
> > ls: expected: No such file or directory
> > awk: cmd. line:2: fatal: cannot open file `/etc/fstab' for reading (No such file or directory)
> > awk: cmd. line:2: fatal: cannot open file `/etc/fstab' for reading (No such file or directory)
> > awk: cmd. line:2: fatal: cannot open file `/etc/fstab' for reading (No such file or directory)
> > 261: telnet-server-0.17-25
> > 262: cyrus-sasl-plain-2.1.10-4
> > 263: postfix-1.1.11-11
> > 264: vconfig-1.6-2
> > 265: pc
> >
> > --> Set package: c3
> > --> Set package: base
> > --> Done marking installed bits in ODA
> > --> Step 4: Completed successfully
> >
> > =============================================================================
> > == Running step 5 of the OSCAR wizard: Define OSCAR clients
> > =============================================================================
> >
> > --> Step 5: Running: ./post_clients
> > No ipchains on this computer
> > No modification to do for ipchains
> > --> About to run /opt/oscar/packages/switcher/scripts/post_clients for switcher
> > --> About to run /opt/oscar/packages/sis/scripts/post_clients for sis
> > using ODA to read the OSCAR database for node and adapters information ...
> > re
> >
> > --> About to run /opt/oscar/packages/maui/scripts/post_clients for maui
> > Shutting down MAUI Scheduler: [FAILED]
> > Starting MAUI Scheduler: [  OK  ]
> > --> About to run /opt/oscar/packages/kernel_picker/scripts/post_clients for kernel_picker
> > --> About to run /opt/oscar/packages/c3/scripts/post_clients for c3
> > --> Step 5: Successfully ran: ./post_clients
> > --> Step 5: Completed successfully
> >
> > =============================================================================
> > == Running step 6 of the OSCAR wizard: Setup networking
> > =============================================================================
> >
> > --> Step 6: Assigned 00:11:2F:51:32:5A to oscarnode1
> >
> > --> Step 6: Successfully setup network boot
> > tcpdump: no process killed
> > --> Step 6: Stopped listening to network
> > --> Step 6: Completed successfully
> >
> > =============================================================================
> > == Running step 7 of the OSCAR wizard: Complete cluster setup
> > =============================================================================
> >
> > --> Step 7: Running: ./post_install
> > Gathering processor count from oscarnode1.oscardomain.
> > ssh: connect to host oscarnode1.oscardomain port 22: No route to host
> > Improper count (0) returned from machine oscarnode1.oscardomain at ./post_install line 76
> > main::get_numproc() called at ./post_install line 33
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > --> About to run /opt/oscar/packages/oda/scripts/post_install for oda
> > generating the /etc/odaserver file on all oscar clients
> > . /etc/profile.d/c3.sh && cexec 'echo oscar_server > /etc/odaserver'
> > ************************* oscar_cluster *************************
> > --------- oscarnode1---------
> > ssh: connect to host oscarnode1 port 22: No route to host
> > --> About to run /opt/oscar/packages/torque/scripts/post_install for torque
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > PBS mom config file updated with clienthost: oscarserver
> > Pushing config file to clients...
> > Sending SIGHUP to all moms...
> > ************************* oscar_cluster *************************
> > --------- oscarnode1---------
> > ssh: connect to host oscarnode1 port 22: No route to host
> > Shutting down PBS Server: [FAILED]
> > Starting PBS Server: [  OK  ]
> > Updating pbs_server nodes
> >
> > /opt/pbs/bin/pbsnodes: Server has no node list
> > Shutting down PBS Server: [  OK  ]
> > Starting PBS Server: [  OK  ]
> > Creating pbs workq queue...
> > Unknown Host.
> > qmgr: cannot connect to server localhost
> > Max open servers: 4
> > create queue workq
> > set queue workq queue_type = Execution
> > set queue workq resources_max.cput = 10000:00:00
> > set queue workq resources_max.walltime = 10000:00:00
> > set queue workq resources_min.cput = 00:00:01
> > set queue workq resources_min.ncpus = 1
> > set queue workq resources_min.nodect = 1
> > set queue workq resources_min.walltime = 00:00:01
> > set queue workq resources_default.cput = 10000:00:00
> > set queue workq resources_default.ncpus = 1
> > set queue workq resources_default.nodect = 1
> > set queue workq resources_default.walltime = 10000:00:00
> > set queue workq enabled = True
> > set queue workq started = True
> > set server scheduling = True
> > set server default_queue = workq
> > set server mail_from = adm
> > set server query_other_jobs = True
> > set queue workq resources_max.ncpus = 0
> > set queue workq resources_max.nodect = 0
> >
> > set queue workq resources_available.nodect = 0
> > set server resources_available.ncpus = 0
> > set server resources_available.nodect = 0
> > set server resources_available.nodes = 0
> > set server resources_max.ncpus = 0
> > set server resources_max.nodes = 0
> > set server scheduler_iteration = 60
> > set server log_events = 64
> > Shutting down MAUI Scheduler: [  OK  ]
> > Starting MAUI Scheduler: [  OK  ]
> > --> About to run /opt/oscar/packages/switcher/scripts/post_install for switcher
> > Setting default for tag mpi ("lam-7.0.6")
> > Attribute successfully set; new attribute setting will be effective for future shells
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > --> About to run /opt/oscar/packages/pfilter/scripts/post_install for pfilter
> > (re)starting the pfilter firewall service on this server
> > /etc/init.d/pfilter restart
> > Restarting pfilter: [  OK  ]
> > pushing out the clients pfilter firewall configuration file
> > . /etc/profile.d/c3.sh && cpush /etc/pfilter.conf.clients /etc/pfilter.conf
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > (re)starting the pfilter firewall service on the clients
> > . /etc/profile.d/c3.sh && cexec /etc/init.d/pfilter restart
> > ************************* oscar_cluster *************************
> > --------- oscarnode1---------
> > ssh: connect to host oscarnode1 port 22: No route to host
> > --> About to run /opt/oscar/packages/opium/scripts/post_install for opium
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > --> About to run /opt/oscar/packages/ntpconfig/scripts/post_install for ntpconfig
> > ************************* oscar_cluster *************************
> > --------- oscarnode1---------
> > ssh: connect to host oscarnode1 port 22: No route to host
> > --> About to run /opt/oscar/packages/loghost/scripts/post_install for loghost
> > ************************* oscar_cluster *************************
> > --------- oscarnode1---------
> > ssh: connect to host oscarnode1 port 22: No route to host
> > --> About to run /opt/oscar/packages/ganglia/scripts/post_install for ganglia
> > ssh: connect to host oscarnode1 port 22: No route to host
> > rsync: connection unexpectedly closed (0 bytes read so far)
> > rsync error: error in rsync protocol data stream (code 12) at io.c(150)
> > Shutting down GANGLIA gmond: [  OK  ]
> > Shutting down GANGLIA gmetad: [  OK  ]
> > ************************* oscar_cluster *************************
> > --------- oscarnode1---------
> > ssh: connect to host oscarnode1 port 22: No route to host
> > Starting GANGLIA gmond: [  OK  ]
> > ************************* oscar_cluster *************************
> >
> > ssh: connect to host oscarnode1 port 22: No route to host
> > Starting GANGLIA gmond: [  OK  ]
> > ************************* oscar_cluster *************************
> > --------- oscarnode1---------
> > ssh: connect to host oscarnode1 port 22: No route to host
> > Starting GANGLIA gmetad: [  OK  ]
> > --> About to run /opt/oscar/packages/disable-services/scripts/post_install for disable-services
> > ************************************ WARNING ************************************
> > OSCAR could not set up the configuration for any mailing service
> > on the server.
> > The current version of the disable-services packages in OSCAR
> > only supports the Postfix mail transfer agent (MTA).
> > It looks like you have another MTA installed (e.g, sendmail or
> > exim); as such, please be aware that OSCAR will not automatically
> > configure it.
> > ************************************ WARNING ************************************
> > Cluster setup complete!
> > --> Step 7: Successfully completed the cluster install
> >
> > =============================================================================
> > == Running step 8 of the OSCAR wizard: Test cluster setup
> > =============================================================================
> >
> > --> Step 8: Running tests: cd /opt/oscar/testing && xterm -sl 500 -e
