RE: Lilo RAID-1 woes

Martin Munt Wed, 10 May 2000 03:05:12 -0700
        Rich/Chance,

        I had EXACTLY the same problem. This post prompted me to fix it.
        Thanks a lot.

        My setup is basic Redhat 6.0 but with clean 2.2.14 kernel sources,
        mingo's 2.2.14-B1 patch & raidtools-19990824-0_90_tar.gz. After an
        unsuccessful attempt to boot my kernel from a raid-1 array using
        Michael Robinton's excellent root-raid howto and RH6.0's lilo, I
built
        the RH6.2 lilo source RPM (LILO-0_21-15_SRC.RPM).
        The RPM has the following patches:

        Patch0: lilo-0.21-include.patch
        Patch1: lilo-0.21-1.1.patch
        Patch2: http://www.insync.net/~frantzc/lilo.patch
        Patch3: lilo.raid1
        Patch4: lilo-0.21-loopdev.patch
        Patch5: lilo-0.21-second.patch

        I (mindlessly) applied all the Redhat patches and now lilo works OK
- with
        the exception of the behaviour that Chance describes below.

        Does anyone have any ideas why lilo-0.21.15 messes up writing both
        partitions when given "boot=/dev/mdN"??

        I have not attempted to apply Richard Bollinger's 'lilo.raid1.also'
patch yet.
        But I will now!

        Martin Munt
        [EMAIL PROTECTED] or [EMAIL PROTECTED]

> -----Original Message-----
> From: Richard Bollinger [SMTP:[EMAIL PROTECTED]]
> Sent: Monday, May 08, 2000 9:59 PM
> To:   Chance Reschke; [EMAIL PROTECTED]
> Subject:      Re: Lilo RAID-1 woes
> 
> I'm not sure how to fix it, but I'm suspicious that some portion of the
> data
> which LILO should have written to both mirrors is only being written to
> the
> "first" drive in the mirror; which in your case is your second physical
> drive.
        [MM] I'm sure you are right.
        I have the same thing with a pair of IDE disks. 
        In /etc/raidtab, /dev/md5 is made up of /dev/hdc1and /dev/hda1 - in
that 
        order - and, sure enough, I have the same problem on boot. 
        If I physically remove hda, lilo boots up OK on (what is now) the
"first" 
        disk. This supports your idea that the second physical disk has the
boot
        info. I did not test this by physically removing hda - I wish I had
now!
>   Note that LILO says its working on /dev/sdb1 first, then /dev/sda1.
> On my test system, the boot disk is listed first in the array and it works
> fine as advertised. [MM]  My lilo lists in hdc hda order too. 
> 
> Its a bit scary, but you can change the order of the drives in
> /etc/raidtab
> and rerun the mkraid command to change the order to what you'd like it to
> be; with /dev/sda1 as raid_disk 1 and /dev/sdb1 as raid_disk 2, without
> affecting any of the data actually stored on the drives. [MM]  This fixed
> it
        for me (Thanks, Rich!). I did: raidstop the array (in my case
/dev/md5);
        change the order in raidtab to hda1, hdc1 and marked hdc1 as a
        "failed-disk"; mkraid /dev/md5 - it runs degraded on /dev/hda1 only;

        raidhotadd /dev/hdc1- it reconstructs; run lilo again; reboot. If
you need more
> explicit instructions we can take it off of the list. [MM]  Please include
> me
        in any off-list discussion. I would like to follow this one up. 

> Rich B
> ----- Original Message -----
> From: "Chance Reschke" <[EMAIL PROTECTED]>
> To: <[EMAIL PROTECTED]>
> Cc: <[EMAIL PROTECTED]>
> Sent: Monday, May 08, 2000 4:25 PM
> Subject: Lilo RAID-1 woes
> 
> 
> > Hi,
> >
> > I've been trying to get Lilo to work with Linux software RAID-1.  I'm
> > using the patches from Richard Bollinger's post from 5/4/2000, and
> > things almost work.  I can use lilo to update the boot sector and
> > maps, but lilo will not boot unless I enter a valid kernel label at
> > the boot time lilo prompt.
> >
> > When I run lilo it appears to update the boot sector and maps;
> > however, after rebooting the lilo prompt appears, followed after the
> > timeout period by the message "Error 0x80".  The lilo prompt then
> > reappears, and the pattern repeats.  At any point while the lilo
> > prompt is being displayed I can enter a valid lilo label and the
> > corresponding kernel will boot.  Pressing <TAB> at the lilo prompt
> > does not list the configured labels.  Running 'lilo -R labelname'
> > before rebooting doesn't help.
> >
> > While I'm aware that the 0x80 error is supposed to refer to "Disk
> > timeout" 
> 
        [MM]  I assumed this meant "an error occurred reading disk 0x80"
        (i.e. first bios disk) - silly me!

> I'm confident both disks in my array are up and running.
> >
> > My lilo.conf:
> >
> >    boot=/dev/md0
> >    root=/dev/md0
> >    install=/boot/boot.b
> >    prompt
> >    timeout=50
> >    image=/boot/vmlinuz-2.2.14smp
> >         label=2.2.14
> >         read-only
> >
> > I'm using the raid-2.2.14-B1 patches to a clean 2.2.14 source tree,
> > and lilo version 21 with patches from:
> >
> >    http://www.elliott-turbo.com/lilo/lilo.raid1
> >    http://www.elliott-turbo.com/lilo/lilo.raid1.also
> >
> > Running 'lilo -v -v' I get:
> >
> >    LILO version 21, Copyright 1992-1998 Werner Almesberger
> >
> >    boot = /dev/sdb, map = /boot/map.0811
> >    Reading boot sector from /dev/sdb
> >    Merging with /boot/boot.b
> >    Secondary loader: 8 sectors.
> >    Boot image: /boot/vmlinuz-2.2.14smp
> >    Setup length is 4 sectors.
> >    Mapped 1298 sectors.
> >    Added 2.2.14 *
> >    /boot/boot.0810 exists - no backup copy made.
> >    Map file size: 10240 bytes.
> >    Writing boot sector.
> >    boot = /dev/sda, map = /boot/map.0801
> >    Reading boot sector from /dev/sda
> >    Merging with /boot/boot.b
> >    Secondary loader: 8 sectors.
> >    Boot image: /boot/vmlinuz-2.2.14smp
> >    Setup length is 4 sectors.
> >    Mapped 1298 sectors.
> >    Added 2.2.14
> >    /boot/boot.0800 exists - no backup copy made.
> >    Map file size: 10240 bytes.
> >    Writing boot sector.
> >
> > Any help fixing this would be great.
> >
> > Thanks,
> > Chance
> >
> > --
> > Chance Reschke
> > UW Astronomy
> >
> >
RE: Lilo RAID-1 woes

Reply via email to