On 12/29/23 10:39, Stefan Berger wrote:> On 12/21/23 17:58, Askar Safin wrote:
Hi, Rob. And Stefan.
First of all, this patch got to linux-next ( https://git.kernel.org/pub/scm/linux/kernel/git/next/linux-next.git/log/?qt=... ), so it seems it soon will be in mainline.
On Thu, Dec 21, 2023 at 12:24 PM Rob Landley rob@landley.net wrote:
Can you build tmpfs on a nommu system? Last I checked the plumbing expects swap, but it's been a while...
Okay, I agree, let's not remove ramfs.
Still, I don't like this (already applied) patch. init= and rdinit= are two different options,
Because they control two different things which are often used at the same time. (Debian has an initramfs that hands off to the final root filesystem, for example. Hence the initramfs-tools package that runs every time apt-get updates the kernel.)
So being able to specify rdinit= to intercept the ramfs layer or init= to intercept the root= layer made sense, because they did different things.
But the only reason to specify anything nontrivial for the initramfs _filesystem_ mount properties is because you intend to stay there. They don't get used together.
and this is good.
Eh, not really. Strange legacy decision we're now stuck with. The kernel only ever runs one init task per boot. If init= was _also_ checked to see which file to run out of initramfs (and the plumbing still justs silently fails and moves on if it's not found) then the debian script would have been forced to do INIT= or similar to override the overmounted root's init task separately from initrd's init task, making it clear a script (not the kernel) is making that decision.
But that would have been a user-visible change, and when initramfs was going in they were trying to avoid user-visible changes that would force sysadmins to learn new stuff because the plumbing changed out from under them. (Like the change you're proposing now would.)
So, I think we should have two different options. Analogously they should be rootfstype= and rdrootfstype=.
You can't have a root= type of initramfs or tmpfs. The specified values can't overlap. The plumbing I wrote responds to specific values but otherwise leaves it for later users.
https://salsa.debian.org/kernel-team/initramfs-tools/-/blob/cf964bfb4362019f...
As you can see, this shell script parses /proc/cmdline and assumes that rootfstype= always applies to real root.
The script is running _in_ the initramfs, which is already loaded and running at that point. Meaning the _kernel_ will not parse root= at that point, userspace has to do it.
So, if someone sets rootfstype= to tmpfs or ramfs, this will likely break this script.
Which was the same 10 years ago?
The script is running in a context where initramfs is not persistent, so overriding it to be a tmpfs has no benefit. (I mean you _can_... Nobody does, because we're gonna switch_root off of it.)
And once code _is_ running in initramfs, the kernel's internal root= automounter will never run. The initramfs code can parse /proc/cmdline to use the same arguments as the kernel, or it could much more easily use the "any unrecognized arguments get set as environment variables in PID 1" and use ROOT= or similar, like many scripts do.
Modifying kernel code that NEVER RUNS in the case you're pointing out seems silly to me.
That said, the code I wrote is doing a strstr to see if the argument's there, but doesn't care what ELSE is there, so it could easily be "rootfstype=tmpfs,ext4" and have the userspace script also filter the argument for just what it's interested in, since at that point it's NOT THE KERNEL DOING IT.
Setting the kernel boot command line option rootfstype= to tmpfs or ramfs was possible so far and that's what the documentation and code supported so far as well. The bug surfaced when root= was provided, in which case it was ignored.
No, as I explained when I wrote the initmpfs code in 2013 when you say root= you are explicitly requesting the kernel mount a second file system over rootfs (that's what root= MEANS), and thus don't bother making it a (more expensive) tmpfs because it's not sticking around.
I did that to NOT invent new arguments, and work with existing systems. The new feature I added triggered when you _requested_ it.
Yes, this will slightly break compatibility. I. e. this will make Linux always choose tmpfs if rdrootfstype= is not present. But I think
You may find someone who doesn't like this change, either, ...
Yeah, me.
there is nothing wrong with it. If a user cares, he will set rdrootfstype= . And early boot code will become a lot more clean and logical.
Rob, do you agree? Stefan, do you agree? Then I will write a patch,
I think it's a bad idea.
If a documentation change is needed, maybe clarify what root= means? That argument explicitly requests the kernel mount a second filesystem over rootfs after failing to launch init out of initramfs, so that rootfs will NOT be the / visible to PID 1.
This is what it has ALWAYS meant. Back before initramfs was invented in the old initrd days, you didn't need to say root= if you provided an initial ramdisk with an init executable in it (except the old mechanism didn't want /init it wanted /linuxrc for no apparent reason), and if you DID say root= when the kernel launched PID 1 out of initrd the superfluous argument was ignored (never parsed) because the kernel setup code never made it far enough to check it.
I also note that I never wired up "rootflags=" (see init/do_mounts.c root_data_setup) so it gets passed through to initmpfs, meaning you can't specify size= and thus it defaults to 50% of available memory as the enforced limit. That would be actually useful. (And no I don't really care if it's the original argument or a different one because it can't have any existing users to break yet, although once again the kernel CAN'T do both at once so having two is SILLY. The reason they're NOT wired up yet is ramfs takes no arguments, so only passing rootflags to tmpfs when you didn't specify a root= or when you tell it rootfstype=tmpfs makes sense, because then what you're passing better not be ext4 arguments because you're staying on initramfs and the kernel's root= plumbing doesn't run.)
That's the actual missing feature here.
Rob