this post was submitted on 13 Dec 2023
995 points (99.1% liked)
linuxmemes
21172 readers
853 users here now
Hint: :q!
Sister communities:
Community rules (click to expand)
1. Follow the site-wide rules
- Instance-wide TOS: https://legal.lemmy.world/tos/
- Lemmy code of conduct: https://join-lemmy.org/docs/code_of_conduct.html
2. Be civil
- Understand the difference between a joke and an insult.
- Do not harrass or attack members of the community for any reason.
- Leave remarks of "peasantry" to the PCMR community. If you dislike an OS/service/application, attack the thing you dislike, not the individuals who use it. Some people may not have a choice.
- Bigotry will not be tolerated.
- These rules are somewhat loosened when the subject is a public figure. Still, do not attack their person or incite harrassment.
3. Post Linux-related content
- Including Unix and BSD.
- Non-Linux content is acceptable as long as it makes a reference to Linux. For example, the poorly made mockery of
sudo
in Windows.
- No porn. Even if you watch it on a Linux machine.
4. No recent reposts
- Everybody uses Arch btw, can't quit Vim, and wants to interject for a moment. You can stop now.
Please report posts and comments that break these rules!
founded 1 year ago
MODERATORS
Can a linux/systemd nerd explain what the error is? I know it's a shutdown sequence, but I'm curious on the fault
It is actually a boot failure. Normally the kernel reads some config from the initrd (the bootloader loads initrd and passes it to the kernel - thanks dan) and then does a bunch of setup stuff, and then it mounts the actual root filesystem, and then switches to using that. In this case, the root filesystem has failed to mount.
Hardware failure is most likely the cause, but misconfiguration can also make this happen. Probably hardware though.
If its misconfiguration, an admin can reattempt to mount the root drive on /new_root, and then ctrl-d to get the init system to try again
ELI5: couldnt open C:/ drive
Edit: clarified what loads the initrd - as per dans comment.
The bootloader (GRUB) loads the initrd, not the kernel. The kernel accesses stuff from the initrd, but it's already loaded by that point.
You are correct. Ill add an edit. Thanks!
Thanks for that!
Switching to Linux and actually being able to see real time logs made me actually curious how it works, so that's one gear out of the machine demistified
The root filesystem mounted fine. That's why the init is starting with all the services on the root disk.
Not necessarily. I've seen failures like this if the boot partition works, but fails to mount the root partition. systemd then fails to proceed, and shuts down the running services.
systemd daemons are configured via /etc/systemd, and systemd itself lives in /usr/lib/systemd/systemd. How can systemd run or start the configured services without the root disk mounted? The initrd (from the boot partition) only contains enough of an environment to call the entrypoint for the init system, not contain the entirety of systemd (or the configured services).
Initrd contains the systemd binary and enough libraries, services, and kernel modules to get booted this far. The system failed at switch root which is where the real root disk is mounted. Initrd can contain as much or as little as needed to get a working system which can be a lot of you are using a network filesystem as a root for instance.
Those are all hardware management services (as far as I can tell), and are configured before the root is mounted.
I have hit this exact error before, that is what failing to mount the root disk looks like. A bunch of services will start, and then you get dropped into a shell (with a login).
If you want to see it for yourself, change /etc/fstab such that /root is now pointing to the wrong device, and then rebuild your initrd. When you reboot you'll see exactly that output. To fix it, login to the shell and mount your root on /new_root, and ctrl-d to continue the boot (from memory it has a message telling you to do that anyway). When your system boots you can fix fstab and rebuild initrd. Its reversable, but maybe test on a machine you dont care about to be safe :)
Oh interesting! I suppose I have just been very careful with /etc/fstab and I haven't seen systemd fail this way. TIL! Thanks for letting me know!
These kinds of public errors are almost always a hard drive failure.
Using an actual hard drive for an embedded system like this would be a failure in and of itself.
Unless it literally has to store several hours' worth of HD video content, no reason the entire system couldn't fit on an SD card.
It's been my experience that SD cards are almost always what causes a failure on a SBC. Given the cost of the screens, i'd probably choose something that could boot off nvme storage. Or at least tape a new, configured SD card to the case of the SBC for when this inevitably happens.
An SD card is MUCH less reliable than a good hdd unless it's read only.
As someone who works on embedded devices: HDDs are used for media storage and can be easily replaced. Any NAND as a limited life span and good embedded software will try very hard to minimise writes. Though in my particular area, there's additional security constraints on the OS, which preclude any removable flash storage from being used.
They probably expect the signage to change a lot and don't want a hardware failure when they do it too much, or didn't use an external drive in this case and the SD card failed because they wrote to it too much (which would happen eventually anyway).
Even better: Three SD cards with a ZFS mirror and failure notifications
Bah humbug, just hook it up to the cloud, WCGW?
You don't need an internet connection for failure notifications
You may be surprised to learn that these stores use machines that are occasionally more than a year old and also use inexpensive tech like enterprise spinny disk.
A spinny disk will work in this space, and you know they'll be deciding based on cost.
Systemd has a feature to shorten lines too long for the display, which is a pretty stupid idea, as you can see here.
The service failing here would be initrd-switch-root.service.
So the weird block character in the "see... for details" line is replacing "nitrd-switch-roo" just to shorten the line? That's what I was trying to figure out.
Yeah, that'd be the Unicode ellipsis character (…) rendered on a system without a Unicode font on the terminal.