Re: [PATCH 00/62] initrd: remove classic initrd support

Rob Landley Mon, 15 Sep 2025 09:48:06 -0700

On 9/12/25 17:38, Askar Safin wrote:

Intro
====
This patchset removes classic initrd (initial RAM disk) support,
which was deprecated in 2020.

Still useful for embedded systems that can memory map flash, but it'sgetting harder to find embedded developers who consider new kernels animprovement over older ones...

Initramfs still stays, and RAM disk itself (brd) still stays, too.

While you're at it, could you fix static/builtin initramfs so PID 1 hasa valid stdin/stdout/stderr?

A static initramfs won't create /dev/console if the embedded initramfsimage doesn't contain it, which a non-root build can't mknod, so thekernel plumbing won't see it dev in the directory we point it at unlesswe build with root access. This means the open("/dev/console") fails, soinit starts with no error reporting and we have to get far enough tomount our own devtmpfs or similar and open our own stdout/stderr beforewe can see any error output from init, which is kinda brittle.

I posted various patches to make CONFIG_DEVTMPFS_MOUNT work for initmpfsrepeatedly since 2017, which also addressed it, but the kernelcommunity's been hermetically sealed against outside intrusion for awhile now...


https://lkml.iu.edu/hypermail/linux/kernel/2005.1/09399.html

https://lkml.iu.edu/2302.2/05597.html

init/do_mounts* and init/*initramfs* are listed in VFS entry in
MAINTAINERS, so I think this patchset should go through VFS tree.
This patchset touchs every subdirectory in arch/, so I tested it
on 8 (!!!) archs in Qemu (see details below).


Oh hey, somebody using mkroot. Cool. :)

My current "passes basic automated smoketests" list for 6.16 is:

aarch64 armv4l armv5l armv7l i486 i686 m68k mips64 mipsel mips powerpcpowerpc64le powerpc64 riscv32 riscv64 s390x sh4 x86_64

I'm assuming that's your 8: arm, x86, m68k, mips, ppc, riscv, s390x,superh. (The variants are mostly 32/64 bit and bit/little endian, couplearchitecture generations in there. The old ones go out of patent first,you can always tell patents are about to expire and get generic cloneswhen corporate shills start insisting that support for something REALLYNEEDS TO GO AWAY RIGHT NOW...)

The or1k, microblaze, and sh4eb targets mostly work: sh4eb has brokeneithernet (never tracked down whether it's kernel or qemu that's wrong Ijust know they disagree), or1k doesn't know how to exit alahttps://lists.gnu.org/archive/html/qemu-devel/2024-11/msg04522.html andmicroblaze never wired up -hda to their hard drive emulationhttps://lists.nongnu.org/archive/html/qemu-devel/2025-01/msg01149.htmlbut I haven't had the spoons to argue with IBM Hat developers aboutprocedure compliance auditing.

I need to track down a decent qemu emulation for armv7m, last time Itried with vanilla was https://landley.net/notes-2023.html#23-02-2023which was not promising, I downloaded a pic32 qemu fork last week, buthaven't had the spoons to follow up on that either. Or to ship a newtoybox/mkroot release: I've had 6.16 kernel patches since the week itcame out, unbreaking powerpc and adding fdpic support to sh4-mmu, buthobbyist friendly this community ain't. Sigh, I should get back on the(beating a dead) horse...

I had hexagon userspace working for a while ("qemu-hexagon ls -l") butno kernel for it: Taylor Simpson said he was going to post aqemu-system-hexagon patchset with a comet board emulation, but thatarchitecture has no gcc support (there was a gcc fork on code aurora butthey abandoned it when the FSF went gplv3) so it needs an llvm-onlytoolchain build with a non-vanilla musl libc fork... Honestly theproblem is compiler-rt sucks rocks: I should cycle back around tohttps://landley.net/notes-2021.html#28-07-2021 but just haven't.

(Although part of the "Just haven't" is that I posted a patch to lkmlmaking generic $CROSS_COMPILE prefixes automatically work whether yourtoolchain was gcc or llvm, and the response was literally "we decided tomanually specify LLVM= on the command line so you must always do thatand we're refusing your two line fix to NOT need to do that". No really:https://lkml.iu.edu/2302.2/08170.html

Warning: this patchset renames CONFIG_BLK_DEV_INITRD (!!!) to CONFIG_INITRAMFS
and CONFIG_RD_* to CONFIG_INITRAMFS_DECOMPRESS_* (for example,
CONFIG_RD_GZIP to CONFIG_INITRAMFS_DECOMPRESS_GZIP).
If you still use initrd, see below for workaround.


Which will break existing configs for what benefit?

I'm not convinced the churn improves matters. Presumably the kernelcommand line paremeter is still rdinit= and grub still uses the "initrd"command to load an external cpio.gz.

But I bisect to find breakage like that every release so I assume theother embedded linux developers... are mostly shipping 10+ year oldkernels that use half the memory of today's.

Details
====
I not only removed initrd, I also removed a lot of code, which
became dead, including a lot of code in arch/.

Still I think the only two architectures I touched in non-trivial
way are sh and 32-bit arm.

Also I renamed some files, functions and variables (which became misnomers) to 
proper names,
moved some code around, removed a lot of mentions of initrd
in code and comments. Also I cleaned up some docs.

Now that lkml.iu.edu is back up (yay!) all the links inramfs-rootfs-initramfs.txt can theoretically be fixed just by switchingthe domain name.

For example, I renamed the following global variables:

__initramfs_start
__initramfs_size


That already said initramfs, and you renamed it.

phys_initrd_start
phys_initrd_size
initrd_start
initrd_end

Which is data delivered through grub's "initrd" command. Here's how I'vebeen explaining it to people for years:


1) initrd is the external blob from the bootloader's initrd= option.

2) initramfs is the extractor plumbing, _init code that gets discarded.

3) rootfs is (for some reason) the name of the mounted filesystem in/proc/mounts (because letting it say "ramfs" or "tmpfs" like normal in/proc/mounts would be consistent and immediately understandable, so theycouldn't have that).

(No I don't know why it's called rootfs. Having things like df not showovermounted filesystems isn't special case logic, why...? The argumentto special case this because you can't unmount it is like saying PID 1shouldn't have a number because it can't exit. I would happily call thewhole thing initramfs... but it's already not.)

to:

__builtin_initramfs_start
__builtin_initramfs_size
phys_external_initramfs_start
phys_external_initramfs_size
virt_external_initramfs_start
virt_external_initramfs_end

Do you believe people will understand what the slightly longer names arewithout looking them up?

I'm all for removing obsolete code, but a partial cleanup that stillleaves various sharp edges around isn't necessarily a net improvement.Did you remove the NFS mount code from init/do_mounts.c? Part of theinitramfs justification back in 2005 was "you can have a tiny initramfsset up our root filesystem so most of the init special casing can go"...and then they added CONFIG_DEVTMPFS_MOUNT but made it ONLY apply to thefallback root after the system has decided NOT to stay on rootfs, andignored my patches to at least make it consistent.

The one config symbol that really seems to bite people in this area isBLK_DEV_INITRD because a common thing people running from initramfs wantto do is yank the block layer entirely (CONFIG_BLOCK=n) and useinitramfs instead, and needing to enable CONFIG_BLK_DEV_INITRD while

And the INSANE part is they generally want a static initrd to do it sothey're not using the external loader, but Kconfig has INITRAMFS_SOURCEunder CONFIG_BLK_DEV_INITRD and it's a mess. Renaming THAT symbol wouldbe good.

But then, CONFIG_BLOCK is hidden under CONFIG_EXPERT which selectsDEBUG_KERNEL (INCREASING KERNEL SIZE!!!) and thus everybody who doesthis patches the kconfig plumbing to be less stupid anyway. So theproblem isn't JUST renaming the symbol...

(Oh CONFIG_EXPERT is SO STUPID. It's got a menu under it, butCONFIG_BLOCK isn't in that menu, it's at the top of menuconfig betweenloadable module support and executable file formats, just invisibleunless you go down into a menu and switch on a setting and then back outto go find it. WHY WOULD YOU DO THAT?)

New names precisely capture meaning of these variables.

To you. I'm not entirely sure what virt_external means. (Yes I could goread the code. No I don't want to. I EXPECT to need context andrefreshing stuff, but having it change out from under me since the LASTtime I did that is annoying when it's "same thing, new name, because".)

It makes more sense to YOU because you changed it to smell like you.Meanwhile 35 years of installed base expertise in other people's headshas been discarded and developed version skew for anyone maintaining anexisting system. (That's not a "never do this", that's a "be awarehumans consistently have the wrong weightings in our heads for this".)

Personally I usually have to look it up either way. And am spending moreand more of my time poking at older kernels rather because newer stuffhas either removed support for things I need or grown dependencies. (Andbecause there's 20 years of installed base still in various stages ofuse, I'm personally likely to spend more time looking at the old namesthan the new ones.)

This will break all configs out there (update your configs!).
Still I think this is okay,


Because you don't have to clean up after it.

because config names never were part of stable API.

I can forward everyone who asks me questions to you, or just agree whenthey tell me it's yet another reason not to upgrade.

Other user-visible changes:

- Removed kernel command line parameters "load_ramdisk" and
"prompt_ramdisk", which did nothing and were deprecated


Sure.

- Removed kernel command line parameter "ramdisk_start",
which was used for initrd only (not for initramfs)

Some bootloaders appended that to the kernel command line to specifywhere in memory they've loaded the initrd image, which could be acpio.gz once upon a time. No idea what regressions happened since though.

(Last new bootloader I was involved with that had to make it work usedsome horrible hack editing a dtb at a fixed offset, like the old "rdev"trick but more brittle. Because "device tree better" than human readabletextual mechanism. Fixing ramdisk_start to work right sounded like amore sane approach to me, but...)

I tested my patchset on many architectures in Qemu using my Rust
program, heavily based on mkroot [1].


You rewrote a 400 line bash script in rust.

Yeah, that's a rust developer. (And it smells like you now...)

I used the following cross-compilers:

aarch64-linux-musleabi
armv4l-linux-musleabihf
armv5l-linux-musleabihf
armv7l-linux-musleabihf
i486-linux-musl
i686-linux-musl
mips-linux-musl
mips64-linux-musl
mipsel-linux-musl
powerpc-linux-musl
powerpc64-linux-musl
powerpc64le-linux-musl
riscv32-linux-musl
riscv64-linux-musl
s390x-linux-musl
sh4-linux-musl
sh4eb-linux-musl
x86_64-linux-musl

or1k and microblaze work, they just don't pass the full smoketest forreasons that shouldn't affect initramfs testing.

I'm still waiting for Rich to ship the next musl release to do newtoolchains...


https://www.openwall.com/lists/musl/2025/08/04/1

Workaround
====
If "retain_initrd" is passed to kernel, then initramfs/initrd,
passed by bootloader, is retained and becomes available after boot
as read-only magic file /sys/firmware/initrd [3].

Common use case for eg romfs is memory mapped flash or rom, so theaddress range in question isn't actually ram anyway. Mostly on mmusystems you just don't want the mapping to go away, so the kernel canstill reach out and read it.

This is even better than classic initrd, because:
- You can use fs not supported by classic initrd, for example erofs

Network block device was the most recent one I saw used, but it had atiny initramfs to set up and switch_root into it...

(Network block device != network filesystem. I have a todo item tointegrate nbd-server into mkroot/testroot.sh but "-hda works" is one ofthe things it's testing...)

- One copy is involved (from /sys/firmware/initrd to some file in /)
as opposed to two when using classic initrd

Embedded developers have always been reaching out and using mappableflash directly. Vitaly Wool's ELC talk in 2015 (about running Linux in256k of sram, yes one quarter of one megabyte) described the process:


https://elinux.org/images/9/90/Linux_for_Microcontrollers-_From_Marginal_to_Mainstream.pdf

Rob

Re: [PATCH 00/62] initrd: remove classic initrd support

Reply via email to