Stephen, Please stop to send the report over and over again, it just creates pollution on the mailing list.
Regards, ----- Mail Original ----- De: "Stephen Shaw" <s.j.s...@swansea.ac.uk> À: oscar-users@lists.sourceforge.net Envoyé: Vendredi 27 Mars 2009 07h34:11 GMT -05:00 USA/Canada - États de l'Est Objet: [Oscar-users] 6.0.1 and Centos 5.2 problems I've had several unsuccessful attempts at installing 6.0.1 from http://bear.csm.ornl.gov/repos/rhel-5-i386 on a 10 node cluster running a fresh installation of Centos 5.2 The main issues are:- 1. configurator package is not installed automatically 2. At step 6 I get: ERROR: Impossible to get the network interface id we have to use on compute nodes (oscar_server, eth0) at /usr/bin/setup_pxe line 181. 3. After nodes are imaged and I try to run step 7 I get prompted to add host key to known_hosts file and for root password which is not set. I think that the second and third problems are connected to the fact that the oscar mysql database is not being populated with all the information required. I'll try and summarise the problems I've had and include log extracts where appropriate. PROBLEM 1 Firstly, running oscar-config --setup-distro centos-5-i386 sets the file /tftpboot/oscar/rhel-5-i386.url to point to the unstable repository http://bear.csm.ornl.gov/repos/unstable/rhel-5-i386 not the stable one. I need to modify this by hand. Then, when I run oscar-config --bootstrap command it fails to install the configurator package: Prereqs status (/usr/share/oscar/prereqs/Configurator): configurator: needs to be installed [ http://bear.csm.ornl.gov/repos/rhel-5-i386 http://isoredirect.centos.org/centos/5/os/i386/ ] Array: 2 element(s) smartly installing: configurator ERROR during execution 512 package configurator is not installed !!! Some packages were NOT installed !!! ERROR: impossible to install Configurator ( /usr/bin/install_prereq --smart /usr/share/oscar/prereqs/Configurator). Checking the repository explains this as the package simply isn't there so I grabbed it from the unstable repository and installled it manually with yum localinstall ~/configurator-1.0-1.noarch.rpm Bootstap then completes (after manually restarting mysql). Next I run oscar_wizard install and am able to create an image with the following errors ============================================================================= == Running step 4 of the OSCAR wizard: Build OSCAR client image ============================================================================= 0: centos-5-i386 Selection: centos-5-i386 [add_image] Starting... Executing command: mksiimage -A --name oscarimage --distro centos-5-i386 --filename /usr/share/oscar/oscarsamples/rhel-5-i386.rpmlist --arch i386 --path /var/lib/systemimager/images/oscarimage --force ERROR during execution 512 ERROR during execution 512 ERROR during execution 512 ERROR during execution 512 WARNING: Impossible to install glibc-kernheaders (1, ) ERROR during execution 512 WARNING: Impossible to install slocate (1, ) ERROR during execution 512 WARNING: Impossible to install utempter (1, ) Image build finished. Added Disk Table for oscarimage based on /usr/share/oscar/oscarsamples/scsi.gg.disk Running: si_mkautoinstallscript -quiet --autodetect-disks -image oscarimage -force -ip-assignment static -post-install reboot ... done PROBLEM 2 At step 6 in oscar_wizard there seems to be an issue with the database not knowing the compute nodes' network interface. ERROR: Impossible to get the network interface id we have to use on compute nodes (oscar_server, eth0) at /usr/bin/setup_pxe line 181 ERROR: at /usr/lib/perl5/vendor_perl/5.8.8/OSCAR/GUI_MAC.pm line 784 Possibly related to this is the the warning received when I run system-sanity ------------------------------------------------------ WARNING: A valid NIC must be specified for the cluster private network. Valid NICs: eth0, eth1 You tried to use: <None>. This may be normal if this is the first time you execute OSCAR. ------------------------------------------------------ Also at step 5 I always get the error error opening file: /etc/c3.conf The /tftpboot files for network boot were not created at step 6 until about the seventh attempt at installing oscar. When they were finally created it was after I changed the file /etc/oscar/oscar.conf to include ODA_TYPE = file rather than ODA_TYPE = db. This is a bad idea I guess and may or may not have been the reason for the files to be finally created. PROBLEM 3 After finally persuading the wizard to report success at every step up to 6, I network booted the nodes and imaged them with out obvious problems, however when they boot there are messages about SSH RSA keys being created (which shouldn't happen right?). When I try to run step 7 I get prompted to add host key to known_hosts file and for the root password (which isn't set). Again, this shouldn't happen? Inspecting the mysql oscar database seems to reveal the problem i.e missing information about the nodes. mysql> use oscar Reading table information for completion of table and column names You can turn off this feature to get a quicker startup with -A Database changed mysql> select * from Nodes; +------------+---------+-----------+----------------+------+--------------+---------------------+----+----------+-----------+--------------+------+------+-------+---------+ | cluster_id | cpu_num | cpu_speed | dns_domain | fqdn | group_name | hostname | id | image_id | installer | name | ram | swap | units | virtual | +------------+---------+-----------+----------------+------+--------------+---------------------+----+----------+-----------+--------------+------+------+-------+---------+ | 1 | NULL | NULL | oscardomain.gg | NULL | oscar_server | head.oscardomain.gg | 1 | 0 | NULL | oscar_server | NULL | NULL | NULL | NULL | +------------+---------+-----------+----------------+------+--------------+---------------------+----+----------+-----------+--------------+------+------+-------+---------+ 1 row in set (0.00 sec) mysql> select * from Nics; Empty set (0.00 sec) I can set the root passwords on the nodes by booting them in single user mode and running pwconv then passwd. Step 7 will then report success (after typing in the password 9 times, one for each node) . I can not run test cluster though and I get the error: Use of uninitialized value in concatenation (.) or string at /usr/bin/oscar_wizard line 879. This would appear to be because $OSCAR_HOME is not set which it should be? Thanks for looking, I hope someone has some suggestions. ------------------------------------------------------------------------------ _______________________________________________ Oscar-users mailing list Oscar-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/oscar-users ------------------------------------------------------------------------------ _______________________________________________ Oscar-users mailing list Oscar-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/oscar-users