Ok I booted into a node via rescue mode from a Red Hat 7.2 installation
CD. I noticed that there are certain files missing.

There is no
/etc/lilo.conf
/boot/map

And no ramdisk

Where does OSCAR keep these files?  They are also not in the
/var/lib/systemimager/images/oscarimage/ tree on the master node.

This seems odd because I get the LIL- error message and that seems to me
that it is loading a lilo.conf file but I cannot find that file anywhere
on the nodes.

        |-----Original Message-----
        |From: Keller, Gregory W xNON-EMPLOYEEx 
        |[mailto:[EMAIL PROTECTED] 
        |Sent: Wednesday, March 19, 2003 12:50 PM
        |To: '[EMAIL PROTECTED]'
        |Cc: '[EMAIL PROTECTED]'
        |Subject: Re: OSCAR cluster dead after simple changes
        |
        |
        |Chris,
        |The change to the switch is irrelevent.  Until LILO 
        |bootstraps the kernel your network card isn't a factor 
        |- so unless your diskless or using PXE to boot over 
        |the network, this is an install problem on the local disks.
        |
        |Check out this link: 
        |http://user.fundy.net/cyclist/linux/troubleshoot-LILO.html
        |
        |Here is an excerpt that will point us in the right direction:
        |  ()  No part of LILO has been loaded. LILO either 
        |isn't installed 
        |    or the partition on which its boot sector is 
        |located isn't active. 
        |   L  ...   The first stage boot loader has been 
        |loaded and started, 
        |    but it can't load the second stage boot loader. 
        |The two-digit error 
        |    codes indicate the type of problem. (See also 
        |section "Disk error 
        |    codes".) This condition usually indicates a media 
        |failure or a geometry 
        |    mismatch (e.g. bad disk parameters, see section 
        |"Disk geometry"). 
        |   LI   The first stage boot loader was able to load 
        |the second stage boot 
        |    loader, but has failed to execute it. This can 
        |either be caused by a 
        |    geometry mismatch or by moving /boot/boot.b 
        |without running the map 
        |    installer. 
        |   LIL   The second stage boot loader has been 
        |started, but it can't load 
        |    the descriptor table from the map file. This is 
        |typically caused by a 
        |    media failure or by a geometry mismatch. 
        |   LIL?   The second stage boot loader has been loaded 
        |at an incorrect 
        |    address. This is typically caused by a subtle 
        |geometry mismatch or by 
        |    moving /boot/boot.b without running the map installer. 
        |   LIL-   The descriptor table is corrupt. This can 
        |either be caused by a 
        |    geometry mismatch or by moving /boot/map without 
        |running the map 
        |    installer. 
        |   LILO   All parts of LILO have been successfully loaded. 
        |
        |Let us know what you find - perhaps the installer is 
        |having trouble with the map installer.
        |
        |Keep smiling,
        |Greg
        |
        |
        |       Message: 2
        |       From: "Chris Oubre" <[EMAIL PROTECTED]>
        |       To: <[EMAIL PROTECTED]>
        |       Date: Tue, 18 Mar 2003 17:24:18 -0600
        |       Subject: [Oscar-users] OSCAR cluster dead after 
        |simple changes
        |
        |       We are currently working on a nasty problem 
        |with our OSCAR cluster.
        |
        |       I am running OSCAR 2.1 on RedHat 7.2 with a 
        |modified kernel to
        |       accommodate for my e1000 gigabit cards and 
        |mylex hardware RAID.
        |
        |       First the problem.
        |       We recently received some new nodes for our 
        |Beowulf cluster.  We 
        |       brought the cluster down to install them.  We 
        |installed an additional 
        |       network card to the switch, and manually moved 
        |the position of the 
        |       old nodes in the rack.  When we rebooted up the 
        |cluster (master and 
        |       old nodes), ALL of the old nodes do not boot 
        |up.  They stop at a 
        |       screen that says 
        |
        |
        |       LIL-
        |
        |       I have removed the additional card from the 
        |switch and tried rebooting
        |       everything.  No Joy
        |       I am able to ping the master node from the 
        |switch and vice-versa.
        |
        |       If you could give some ideas of where to look?  
        |Or do I need to
        |       completely reinstall?  I'd rather avoid the 
        |reinstall because I do not
        |       know what cause the crash, and thus would be 
        |susceptible to a
        |       reoccurrence.
        |
        |
        |
        |****************************************************
        |Christopher D. Oubre                               *
        |email: [EMAIL PROTECTED]                     *
        |research: http://cmt.rice.edu/~coubre              *
        |Web: http://www.angelfire.com/la2/oubre            *
        |Hangout: http://pub44.ezboard.com/bsouthterrebonne *
        |Phone:(713)348-3541  Fax:   (713)348-4150          *
        |Rice University                                    *
        |Department of Physics, M.S. 61                     *
        |6100 Main St.                       ^-^            *
        |Houston, Tx  77251-1892, USA       (O O)           *
        |-= Phlax=-                         ( v )           *
        |************************************m*m*************
        |
        |
        |


-------------------------------------------------------
This SF.net email is sponsored by: Does your code think in ink? 
You could win a Tablet PC. Get a free Tablet PC hat just for playing. 
What are you waiting for?
http://ads.sourceforge.net/cgi-bin/redirect.pl?micr5043en
_______________________________________________
Oscar-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to