These files are generated automatically. Did the clients install completely and successfully? If no lilo.conf was created, there must have been some error message near the end of the install. Since that message would be long gone, try reinstalling one. Did the machines ever boot as clients after being installed the first time? If not, reinstalling may be the only answer. If they did, then there may be some other problem at hand.

Jason

At 04:31 PM 3/19/2003 -0600, Chris Oubre wrote:
Ok I booted into a node via rescue mode from a Red Hat 7.2 installation
CD. I noticed that there are certain files missing.

There is no
/etc/lilo.conf
/boot/map

And no ramdisk

Where does OSCAR keep these files?  They are also not in the
/var/lib/systemimager/images/oscarimage/ tree on the master node.

This seems odd because I get the LIL- error message and that seems to me
that it is loading a lilo.conf file but I cannot find that file anywhere
on the nodes.

        |-----Original Message-----
        |From: Keller, Gregory W xNON-EMPLOYEEx
        |[mailto:[EMAIL PROTECTED]
        |Sent: Wednesday, March 19, 2003 12:50 PM
        |To: '[EMAIL PROTECTED]'
        |Cc: '[EMAIL PROTECTED]'
        |Subject: Re: OSCAR cluster dead after simple changes
        |
        |
        |Chris,
        |The change to the switch is irrelevent.  Until LILO
        |bootstraps the kernel your network card isn't a factor
        |- so unless your diskless or using PXE to boot over
        |the network, this is an install problem on the local disks.
        |
        |Check out this link:
        |http://user.fundy.net/cyclist/linux/troubleshoot-LILO.html
        |
        |Here is an excerpt that will point us in the right direction:
        |  ()  No part of LILO has been loaded. LILO either
        |isn't installed
        |    or the partition on which its boot sector is
        |located isn't active.
        |   L  ...   The first stage boot loader has been
        |loaded and started,
        |    but it can't load the second stage boot loader.
        |The two-digit error
        |    codes indicate the type of problem. (See also
        |section "Disk error
        |    codes".) This condition usually indicates a media
        |failure or a geometry
        |    mismatch (e.g. bad disk parameters, see section
        |"Disk geometry").
        |   LI   The first stage boot loader was able to load
        |the second stage boot
        |    loader, but has failed to execute it. This can
        |either be caused by a
        |    geometry mismatch or by moving /boot/boot.b
        |without running the map
        |    installer.
        |   LIL   The second stage boot loader has been
        |started, but it can't load
        |    the descriptor table from the map file. This is
        |typically caused by a
        |    media failure or by a geometry mismatch.
        |   LIL?   The second stage boot loader has been loaded
        |at an incorrect
        |    address. This is typically caused by a subtle
        |geometry mismatch or by
        |    moving /boot/boot.b without running the map installer.
        |   LIL-   The descriptor table is corrupt. This can
        |either be caused by a
        |    geometry mismatch or by moving /boot/map without
        |running the map
        |    installer.
        |   LILO   All parts of LILO have been successfully loaded.
        |
        |Let us know what you find - perhaps the installer is
        |having trouble with the map installer.
        |
        |Keep smiling,
        |Greg
        |
        |
        |       Message: 2
        |       From: "Chris Oubre" <[EMAIL PROTECTED]>
        |       To: <[EMAIL PROTECTED]>
        |       Date: Tue, 18 Mar 2003 17:24:18 -0600
        |       Subject: [Oscar-users] OSCAR cluster dead after
        |simple changes
        |
        |       We are currently working on a nasty problem
        |with our OSCAR cluster.
        |
        |       I am running OSCAR 2.1 on RedHat 7.2 with a
        |modified kernel to
        |       accommodate for my e1000 gigabit cards and
        |mylex hardware RAID.
        |
        |       First the problem.
        |       We recently received some new nodes for our
        |Beowulf cluster.  We
        |       brought the cluster down to install them.  We
        |installed an additional
        |       network card to the switch, and manually moved
        |the position of the
        |       old nodes in the rack.  When we rebooted up the
        |cluster (master and
        |       old nodes), ALL of the old nodes do not boot
        |up.  They stop at a
        |       screen that says
        |
        |
        |       LIL-
        |
        |       I have removed the additional card from the
        |switch and tried rebooting
        |       everything.  No Joy
        |       I am able to ping the master node from the
        |switch and vice-versa.
        |
        |       If you could give some ideas of where to look?
        |Or do I need to
        |       completely reinstall?  I'd rather avoid the
        |reinstall because I do not
        |       know what cause the crash, and thus would be
        |susceptible to a
        |       reoccurrence.
        |
        |
        |
        |****************************************************
        |Christopher D. Oubre                               *
        |email: [EMAIL PROTECTED]                     *
        |research: http://cmt.rice.edu/~coubre              *
        |Web: http://www.angelfire.com/la2/oubre            *
        |Hangout: http://pub44.ezboard.com/bsouthterrebonne *
        |Phone:(713)348-3541  Fax:   (713)348-4150          *
        |Rice University                                    *
        |Department of Physics, M.S. 61                     *
        |6100 Main St.                       ^-^            *
        |Houston, Tx  77251-1892, USA       (O O)           *
        |-= Phlax=-                         ( v )           *
        |************************************m*m*************
        |
        |
        |


------------------------------------------------------- This SF.net email is sponsored by: Does your code think in ink? You could win a Tablet PC. Get a free Tablet PC hat just for playing. What are you waiting for? http://ads.sourceforge.net/cgi-bin/redirect.pl?micr5043en _______________________________________________ Oscar-users mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/oscar-users



-------------------------------------------------------
This SF.net email is sponsored by: Does your code think in ink? You could win a Tablet PC. Get a free Tablet PC hat just for playing. What are you waiting for?
http://ads.sourceforge.net/cgi-bin/redirect.pl?micr5043en
_______________________________________________
Oscar-users mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to