Re: bootcode capable of booting both UFS and ZFS? (Amazon/ec2)

Julian Elischer Sat, 06 May 2017 21:05:02 -0700

On 6/5/17 4:01 am, Toomas Soome wrote:

On 5 May 2017, at 22:07, Julian Elischer <jul...@freebsd.org> wrote:
Subject says it all really, is this an option at this time?
We'd like to try to boot the main zfs root partition and then fall back to a small UFS-based recovery partition. Is that possible?
I know we could use grub but I'd prefer to keep it in the family.
It is, sure, but there is a compromise to be made for it.
Let's start with what I have done in the illumos port, as the idea there is exactly about having binaries that are as “universal” as possible (just the binaries are listed below, to show the sizes):
-r-xr-xr-x   1 root     sys       171008 apr 30 19:55 bootia32.efi
-r-xr-xr-x   1 root     sys       148992 apr 30 19:55 bootx64.efi
-r--r--r--   1 root     sys         1255 okt 25  2015 cdboot
-r--r--r--   1 root     sys       154112 apr 30 19:55 gptzfsboot
-r-xr-xr-x   1 root     sys       482293 mai  2 21:10 loader32.efi
-r-xr-xr-x   1 root     sys       499218 mai  2 21:10 loader64.efi
-r--r--r--   1 root     sys          512 okt 15  2015 pmbr
-r--r--r--   1 root     sys       377344 mai  2 21:10 pxeboot
-r--r--r--   1 root     sys       376832 mai  2 21:10 zfsloader
The loader (bios/efi) is built with the full complement: zfs, ufs, dosfs, cd9660, nfs, tftp + gzipfs. The cdboot starts zfsloader (that's a trivial string change).
The gptzfsboot in the illumos case is only built with zfs, dosfs and ufs, as it only has to support disk-based media to read out the loader. Also, I am building gptzfsboot with libstand and libi386 to get as much shared code as possible, which has both good and bad sides, as usual ;)
The gptzfsboot size means that with ufs a dedicated boot partition is needed (freebsd-boot); with zfs the illumos port always uses the 3.5MB boot area after the first 2 labels (as there is no geli, illumos does not need a dedicated boot partition with zfs).
As freebsd-boot is currently created at 512k, the size is not an issue. Using common code also allows the generic partition code to be used, so GPT/MBR/BSD (VTOC in the illumos case) labels are not a problem.
So, even with just cd boot (iso) starting zfsloader (which in fbsd has ufs, zfs etc. built in), you can already get rescue capability.
Now, just by adding a ufs reader to gptzfsboot, we can use gpt + freebsd-boot and a ufs root but load zfsloader on a usb image, so it can be used for both live/install and rescue, because zfsloader itself has support for all file systems + partition types.
I have kept myself a bit away from the freebsd gptzfsboot for a simple reason: the older setups have a smaller size for freebsd boot, and not everyone is necessarily happy about size changes :D Also, in the freebsd case there is another factor called geli. It most certainly does contribute some bits, but it also needs to be properly addressed in the IO call stack (as we have seen with the zfsbootcfg bits). But then again, here also the shared code can help to reduce the complexity.
Yea, the zfsloader/loader*.efi in the listing above is actually built with framebuffer code and a compiled-in 8x16 default font (lz4-compressed ascii + box drawing, basically; because zfs has lz4, the decompressor is always there), and ficl 4.1, so that's a bit of a difference from the fbsd loader.
Also note that we can still build the smaller dedicated blocks like boot2, just that we cannot use those blocks for more universal cases, and eventually those special cases will diminish.


Thanks for that.

So, here's the exact problem I need to solve:
FreeBSD 10 (or newer) on Amazon EC2.

We need to have a plan for recovering from the scenario where something goes wrong (e.g. during an upgrade) and we are left with a system where the default zpool rootfs points to a dataset that doesn't boot. It is possible that maybe the entire pool is unbootable into multi-user. Maybe somehow it filled up? Who knows. It's hard to predict future problems.

There is no console access at all, so there is no possibility of human intervention. So all recovery paths that start "enter single user mode and...." are unusable.

The customers who own the Amazon account are not crazy about giving us the keys to the kingdom as far as all their EC2 instances go, so taking a root drive off a 'sick' VM and grafting it onto a freebsd instance to 'repair' it becomes a task we don't really want to have to ask them to do. They may not have the in-house expertise to do it confidently.

This leaves us with automatic recovery, or at least automatic methods of getting access to that drive from the network.

Since the regular root is zfs, my gut feeling is that, to reduce the chances of confusion during recovery, I'd like the (recovery) system itself to be running off a UFS partition, and potentially with a memory root filesystem. As long as it can be reached over the network we can then take over.


We'd also like to have boot environment support in the bootcode.
So, what would be the minimum set we'd need?

UFS support, ZFS support, BE support, and support for selecting a completely different boot procedure after some number of boot attempts that fail to get all the way to multi-user.
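
To make the last requirement concrete, here is a rough sketch of the attempt-counting fallback, in Python purely for illustration. Everything in it is hypothetical: the names `BootState`, `choose_boot_target` and `mark_boot_successful`, the target labels, and the threshold of 3 are assumptions, not an existing loader or bootcode feature. Real bootcode would also have to keep the counter somewhere persistent that it can write to.

```python
from dataclasses import dataclass

# Hypothetical threshold: how many boots may start without reaching
# multi-user before we give up on the normal root.
MAX_ATTEMPTS = 3


@dataclass
class BootState:
    """Stand-in for a counter the bootcode would keep in persistent storage."""
    attempts: int = 0  # boots started since the last successful multi-user boot


def choose_boot_target(state: BootState, max_attempts: int = MAX_ATTEMPTS) -> str:
    """Count this boot attempt and pick which root to try."""
    state.attempts += 1
    if state.attempts > max_attempts:
        # Too many boots never reached multi-user: use the recovery partition.
        return "ufs-recovery"
    return "zfs-root"


def mark_boot_successful(state: BootState) -> None:
    """Would be called once the system reaches multi-user; resets the counter."""
    state.attempts = 0
```

With these assumed values, three consecutive boots that never call `mark_boot_successful` still try the ZFS root, and the fourth selects the UFS recovery target.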

How does that come out size-wise? And what do I need to configure to get that?

The current EC2 instances have a 64kB boot partition, but I have a window to convince management to expand that if I have a good enough argument (since we are doing a repartition on the next upgrade, which is "special": it's our upgrade to 10.3 from 8.0). Being able to self-heal, or at least 'get at' a sick instance, might be a good enough argument and would make the EC2 instances the same as all the other versions of the product.
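
For a rough sense of the size question, the check below compares the gptzfsboot size from the listing earlier in this thread against the two partition sizes mentioned. It is only an illustrative sketch: the 154112-byte figure is the illumos build with zfs, dosfs and ufs, so a FreeBSD build with BE support and geli may well come out differently, and the helper name `fits` is made up for this example.

```python
# Size of gptzfsboot taken from the illumos listing in this thread (bytes).
GPTZFSBOOT_BYTES = 154112

# Boot partition sizes mentioned in the thread, in bytes.
PARTITION_SIZES = {
    "64 kB (current EC2 instances)": 64 * 1024,
    "512 kB (freebsd-boot as currently created)": 512 * 1024,
}


def fits(bootcode_bytes: int, partition_bytes: int) -> bool:
    """Return True if the boot code is no larger than the partition."""
    return bootcode_bytes <= partition_bytes


for label, size in PARTITION_SIZES.items():
    verdict = "fits" if fits(GPTZFSBOOT_BYTES, size) else "does not fit"
    print(f"gptzfsboot ({GPTZFSBOOT_BYTES / 1024:.1f} KiB) {verdict} in {label}")
```

On those numbers the illumos gptzfsboot is about 150.5 KiB, so it would not fit in the existing 64kB partition but would fit comfortably in a 512k one.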

/me has a thought.. I wonder if the EC2 instance BIOS has enough network support to allow PXE-like behaviour? Or at least to be able to receive packets..?


rgds,
toomas


_______________________________________________
freebsd-current@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscr...@freebsd.org"
