this post was submitted on 30 Jan 2026
50 points (98.1% liked)

Selfhosted


Not the containers and the data, but the images. The point would be reproducibility in case a remote registry no longer contains a certain image. Do you do that, and if so, how?

top 35 comments
[–] ada@lemmy.blahaj.zone 39 points 3 days ago (2 children)

Nope. Data directories and compose files only.
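
(For anyone unfamiliar with that pattern, a minimal sketch of what it looks like in practice; the service name, image, and paths below are made up.)

# A hypothetical layout: one compose file plus a bind-mounted data directory.
mkdir -p /apps/myapp/data
cat > /apps/myapp/docker-compose.yml <<'EOF'
services:
  myapp:
    image: nginx:1.27                        # example image only
    volumes:
      - ./data:/usr/share/nginx/html         # all state lives in ./data
EOF

# "Data directories and compose files only" then means backing up just this:
tar czf /backups/myapp-$(date +%F).tar.gz -C /apps myapp/docker-compose.yml myapp/data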

[–] Bakkoda@lemmy.world 3 points 3 days ago

This is how I do it. I can delete everything, re-pull, and redeploy like nothing ever happened.

I have one exception, and that's my calibre-web container. I have all my books backed up separately, and I'll be nuking that whole setup eventually.

[–] Onomatopoeia@lemmy.cafe 4 points 3 days ago (3 children)

Curious, have you tested rebuilding using this approach?

It makes sense to me, if the compose files are up-to-date, it should just work.

I'd only be concerned about making sure I capture any changes made to the container after the initial build.

[–] teawrecks@sopuli.xyz 3 points 2 days ago

I feel like if that's something you're doing, you're using containers wrong. At least docker ones. I expect a container to have no state from run to run except what is written to mounted volumes. I should always be able to blow away my containers and images, and rebuild them from scratch. Afaik docker compose always implicitly runs with --rm for this reason.
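
(Concretely, that blow-away-and-rebuild workflow is just two standard compose commands; --rmi all removes the images the compose file references, and bind-mounted data survives either way.)

# Remove the project's containers and images, then rebuild everything from scratch;
# only what lives in mounted volumes / bind mounts carries over.
docker compose down --rmi all
docker compose up -d --build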

[–] surewhynotlem@lemmy.world 7 points 3 days ago

That's the secret. I never change the container.

If you absolutely must do some config on the container, then have a container creation script that creates a new container with the right settings and use that.

#pipelines
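
(A rough sketch of such a creation script; every name, port, path, and setting here is a placeholder.)

#!/usr/bin/env bash
# Recreate the container from a known-good definition instead of ever
# modifying a running container in place.
set -euo pipefail

docker rm -f myapp 2>/dev/null || true

docker run -d \
  --name myapp \
  --restart unless-stopped \
  -e TZ=UTC \
  -v /apps/myapp/data:/data \
  -p 8080:8080 \
  nginx:1.27   # placeholder image and settings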

[–] enchantedgoldapple@sopuli.xyz 4 points 3 days ago

I can attest to it. Recently my server needed to be repaired and had its entire disk wiped. Fortunately I had backed up my compose files and bind mounts. After reinstalling the OS and transferring the backed-up contents, I ran 'docker compose up' and eventually everything came back up exactly as it was at backup time. Even the Postgres database backing Joplin was recovered purely from its bind mount.

[–] ptz@dubvee.org 27 points 3 days ago (2 children)

Yep. I've got a bunch of apps that work offline, so I back up the currently deployed version of the image in case of hardware or other failure that requires me to re-deploy it. I also have quite a few custom-built images that take a while to build, so having a backup of the built image is convenient.

I structure my Docker-based apps into dedicated folders with all of their config and data directories inside a main container directory so everything is kept together. I also make an images directory which holds backup dumps of the images for the stack.

  • Backup: docker save {image}:{tag} | gzip -9 > ./images/{image}-{tag}-{arch}.tar.gz
  • Restore: docker load < ./images/{image}-{tag}-{arch}.tar.gz

It will back up/restore with the image name and tag used during the save step. The load step accepts a gzipped tar, so you don't even need to decompress it first. My older dumps don't have the architecture in the filename, but I've started adding it now that I have a mix of amd64 and arm64.
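
(If you want to dump every image a stack references in one go, something along these lines should work; docker compose config --images is a Compose v2 command that lists the images a project uses, and the filename mangling here is simplified and skips the arch suffix.)

# Dump every image referenced by the current compose project (Compose v2).
mkdir -p ./images
for img in $(docker compose config --images); do
    docker save "$img" | gzip -9 > "./images/$(echo "$img" | tr '/:' '--').tar.gz"
done

# Restore them all later:
for f in ./images/*.tar.gz; do
    docker load < "$f"
done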

[–] 4am@lemmy.zip 3 points 3 days ago (1 children)

Ah, cool. Seems like way less work than hosting a local registry. But would I have to docker load every single one when needed, for example if I had to rebuild the Docker host? Is it faster to just front-load that work and build a local registry to pull through?

[–] ptz@dubvee.org 3 points 3 days ago* (last edited 3 days ago)

I also run (well, ran) a local registry. It ended up being more trouble than it was worth.

Would you have to docker load them all when rebuilding a host?

Only if you want to ensure you bring the replacement stack back up with the exact same version of everything, or need to bring it up while you're offline. I'm bad about using the :latest tag, so this is my way of version-controlling. I've had things break (cough Authelia cough) when I moved them to another server and they pulled a newer image that had breaking config changes.

For me, it's about having everything I need on hand to quickly move a service or restore it from a backup. It also depends on what your needs are and the challenges you're trying to overcome. When I started doing this style of deployment, I had slow, unreliable, and heavily data-capped satellite internet. Even when my connection was up, pulling a bunch of images was time-consuming and ate away at my measly data cap. Being able to rebuild stuff offline was a hard requirement back then. That's no longer a limitation, but I like the way this works, so I've stuck with it.

Everything a service (or stack of services) needs is in its deploy directory, which looks like this:

/apps/{app_name}/
    docker-compose.yml
    .env
    build/
        Dockerfile
        {build assets}
    data/
        {app_name}
        {app2_name}  # If there are multiple applications in the stack
        ...
    conf/                   # If separate from the app data
        {app_name}
        {app2_name}
        ...
    images/
        {app_name}-{tag}-{arch}.tar.gz
        {app2_name}-{tag}-{arch}.tar.gz

When I run backups, I tar.gz the whole base {app_name} folder, which includes the deploy file, data, config, and dumps of its services' images, and pipe that over SSH to my backup server (rsync also works for this). The only ones I handle differently are stacks with in-stack databases that need a consistent snapshot.
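
(The tar-over-SSH step might look roughly like this; the hostnames and paths are placeholders.)

# Stream one app directory (compose file, data, config, image dumps)
# straight to the backup host without writing a local archive first.
tar czf - -C /apps myapp | ssh backup.example.com "cat > /backups/myapp-$(date +%F).tar.gz"

# rsync works just as well if incremental copies are preferred:
rsync -a --delete /apps/myapp/ backup.example.com:/backups/myapp/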

When I pull new images to update the stack, I set the old image dumps aside and docker save the now-current ones. The old dumps get deleted once the update is considered successful (usually within 3-5 days).

A local registry would work, but you would have to re-tag all of the pre-made images for your registry (e.g. docker tag library/nginx docker.example.com/nginx) in order to push them to it. That makes updates more involved and was a frequent cause of me running 2+ year-old versions of some images.

Plus, you'd need the registry server and any infrastructure it depends on (DNS, file server, reverse proxy, etc.) before you could bootstrap anything else. And if you're deploying your stack to an environment outside your own, your registry server might not be reachable at all.

Bottom line: I'm a big fan of using Docker to make my complex stacks easy to port around, back up, and restore. There are many ways to do that, but this is what works best for me.

[–] bonenode@piefed.social 4 points 3 days ago

Thanks for sharing this!

[–] HelloRoot@lemy.lol 14 points 3 days ago (2 children)

I self-host a pull-through Docker registry, so every image I use is cached there and can be pulled while offline.

[–] marv99@feddit.org 1 points 2 days ago (1 children)

Can you please say which self-hosted solution you're using?

[–] HelloRoot@lemy.lol 2 points 2 days ago* (last edited 2 days ago) (1 children)

AFAIK I'm on an older version of https://github.com/distribution/distribution/pkgs/container/distribution

My compose file somehow says nothing about it ... I should have made more notes.
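
(For reference, a rough sketch of a pull-through cache with the distribution registry; the hostnames are placeholders, and the proxy and registry-mirrors options are the upstream-documented ones.)

# Minimal pull-through cache for Docker Hub using the distribution registry.
cat > config.yml <<'EOF'
version: 0.1
storage:
  filesystem:
    rootdirectory: /var/lib/registry
http:
  addr: :5000
proxy:
  remoteurl: https://registry-1.docker.io
EOF

docker run -d --name registry-mirror \
  -p 5000:5000 \
  -v "$PWD/config.yml:/etc/docker/registry/config.yml" \
  -v registry-cache:/var/lib/registry \
  registry:2

# Then point the Docker daemon at it (mirrors only apply to Docker Hub pulls):
# /etc/docker/daemon.json: { "registry-mirrors": ["http://mirror.example.lan:5000"] }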

[–] marv99@feddit.org 2 points 2 days ago

Thank you, I will give it a try :)

[–] 4am@lemmy.zip 3 points 3 days ago

This is the way

[–] wersooth@lemmy.ml 14 points 3 days ago (2 children)

If there's a custom image, e.g. additional stuff I added on top of an existing image, then I back up the Dockerfile. You can rebuild the image anytime, and the Dockerfile is much smaller than the binary image.
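
(A sketch of the kind of Dockerfile being described: an upstream image plus a few additions. The base image and package are only examples.)

# The Dockerfile is the thing worth keeping: an upstream image plus a few additions.
cat > Dockerfile <<'EOF'
FROM nginx:1.27
RUN apt-get update \
    && apt-get install -y --no-install-recommends curl \
    && rm -rf /var/lib/apt/lists/*
EOF

# Rebuilding the customized image later is a single command:
docker build -t nginx-custom:local .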

[–] bravesilvernest@lemmy.ml 6 points 3 days ago

Came here to say that: the most economical solution is to back up the Dockerfiles themselves, though that comes with the caveat that any installs within the build steps might depend on external resources that could also be dropped.

There are ways to add a local caching layer between your system and the outside world, which might be what OP is after, but that means investing in the additional space needed to store the cached images.

[–] cactus@sh.itjust.works 3 points 3 days ago

good point, I might do that over the weekend :D

Yes. I run Harbor and pull all images through its proxy cache.
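
(With a Harbor proxy-cache project, assumed here to be named dockerhub-proxy on a placeholder host, pulls look something like this.)

# Pull an upstream Docker Hub image through Harbor's proxy-cache project;
# Harbor fetches and stores it on the first pull, later pulls hit the cache.
docker pull harbor.example.com/dockerhub-proxy/library/nginx:1.27

# Compose files then reference the proxied path instead of the upstream one:
#   image: harbor.example.com/dockerhub-proxy/library/nginx:1.27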

[–] kumi@feddit.online 2 points 2 days ago* (last edited 2 days ago)

Just a small number of base images (ubuntu:, alpine:, debian:) are routinely synced; anything else is built in CI from Containerfiles, and those are backed up. So as long as the backups are intact, I can recover from losing the image store even without internet.

I also have a two-tier container image setup anyway, which gives some redundancy, but that's more of a side effect of workarounds. The "source of truth" docker-registry that gets pushed to is only exposed internally, to whoever needs to do an authenticated push and to the second layer of pull-through caches that the internal servers actually pull from. So, backups aside, images in active use already exist in at least three copies (push registry, pull registry, and whoever's running them). The mirrored public images are a separate chain altogether.

[–] HumanPerson@sh.itjust.works 4 points 3 days ago

I used to, but then I switched out the server I was backing up to and have been thinking "I'll get to it later" for many months. If anything goes wrong, I'm screwed. I'll get to it later ¯\_(ツ)_/¯

[–] just_another_person@lemmy.world 4 points 3 days ago (1 children)

I mean...you have the container right there on your machine. If you're concerned, just run your own registry and push copies there when needed. This of course is all unnecessary, as you only need the Dockerfile to build a clean image from scratch, and it will obviously work if it's already been published.
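
(The "run your own registry and push copies there" route is only a few commands; the host and port below are placeholders.)

# A plain private registry (no pull-through caching)...
docker run -d --name registry -p 5000:5000 -v registry-data:/var/lib/registry registry:2

# ...into which local copies get re-tagged and pushed:
docker tag nginx:1.27 registry.example.lan:5000/nginx:1.27
docker push registry.example.lan:5000/nginx:1.27

# A registry served without TLS must be listed under "insecure-registries" in daemon.json.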

[–] 4am@lemmy.zip 3 points 3 days ago (1 children)

As long as the internet works and the image is still available.

Which is kind of the whole point, homie.

[–] just_another_person@lemmy.world 1 points 3 days ago* (last edited 3 days ago)

Again, it's nearly impossible to scrub the entire internet of the code, so you can just run a single command and rebuild a Docker image of whatever you were running yourself. And if you have the image locally, you can push it anywhere you want.

[–] realitaetsverlust@piefed.zip 2 points 2 days ago (1 children)

I'm kinda confused by all the people here doing that, tbh.

The entire point of Dockerfiles is to have them produce the same image over and over again. Meaning I can take the Dockerfile, spin it up on any machine on God's green earth, and have it run there in the exact same state as anywhere else, minus any configs or files that need to be mounted.

Now, if I'm worried about an image disappearing from a remote registry, I just download the Dockerfile and store it locally somewhere. But backing up the entire image seems seriously weird to me and kinda goes against the spirit of Docker.

[–] crater2150@feddit.org 2 points 2 days ago

A lot of Dockerfiles start by installing dependencies via the base image's package manager without specifying exact versions (which isn't always possible anyway, as most distros don't keep the full history of every package in their repos). So all your dependencies may end up at different versions when you build again.
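
(For contrast, a sketch of what pinning looks like when the repos do still carry the versions; the digest and package version below are placeholders, not real values.)

# Pinning the base image by digest and packages by exact version;
# the digest and version strings are placeholders, not real values.
cat > Dockerfile <<'EOF'
FROM debian:12@sha256:0000000000000000000000000000000000000000000000000000000000000000
RUN apt-get update \
    && apt-get install -y --no-install-recommends curl=7.88.1-10+deb12u5 \
    && rm -rf /var/lib/apt/lists/*
EOF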

[–] irmadlad@lemmy.world 3 points 3 days ago

I back up everything.

[–] eskuero@lemmy.fromshado.ws 3 points 3 days ago* (last edited 3 days ago)

Yes I do. I cooked up a small Python script that runs at the end of every daily backup:

import subprocess
import json
import os

# Output directory
OUTPUT_DIR = "/data/dockerimages"
os.makedirs(OUTPUT_DIR, exist_ok=True)

# Grab all the docker images. Each line is a JSON string describing one image.
imagenes = subprocess.Popen(
    ["docker", "images", "--format", "json"],
    stdout=subprocess.PIPE,
    stderr=subprocess.DEVNULL,
).communicate()[0].decode().split("\n")

for imagen in imagenes[:-1]:
    datos = json.loads(imagen)
    # ID of the image to save
    imageid = datos["ID"]
    # Compose the output name like this:
    # ghcr.io-immich-app-immich-machine-learning:release:2026-01-28:3c42f025fb7c.tar
    outputname = f"{datos['Repository']}:{datos['Tag']}:{datos['CreatedAt'].split(' ')[0]}:{imageid}.tar".replace("/", "-")
    # If the file already exists, skip it
    if not os.path.isfile(f"{OUTPUT_DIR}/{outputname}"):
        print(f"Saving {outputname}...")
        subprocess.run(["docker", "save", imageid, "-o", f"{OUTPUT_DIR}/{outputname}"])
    else:
        print(f"Already exists {outputname}")

[–] tofu@lemmy.nocturnal.garden 3 points 3 days ago

No. They're cached on the hosts; that's enough for me.

[–] r0ertel@lemmy.world 1 points 2 days ago

I've been looking to do this, but I haven't found a good, easy-to-use pull-through proxy that covers Docker Hub, ghcr.io, and some other registries. Most support Docker Hub only.

This one looks promising but overly complicated to set up.

A few times now, I've gone to restart a container and the repo has been moved, archived, or paywalled. Other times, I'm running a few versions behind and the maintainer decided to drop support for it, but upgrading would mean a complete overhaul of my Helm values file. Ugh!

I was considering running a docker registry on a separate port for each upstream registry I'd like to proxy/cache.
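
(A rough sketch of that idea as a compose file, using one distribution instance per upstream since each instance can only proxy a single remote; the ports and upstreams are examples, and private upstream images would additionally need proxy credentials.)

# One distribution instance per upstream registry, each on its own port.
cat > docker-compose.yml <<'EOF'
services:
  mirror-dockerhub:
    image: registry:2
    ports: ["5000:5000"]
    environment:
      REGISTRY_PROXY_REMOTEURL: https://registry-1.docker.io
    volumes: ["./cache/dockerhub:/var/lib/registry"]

  mirror-ghcr:
    image: registry:2
    ports: ["5001:5000"]
    environment:
      REGISTRY_PROXY_REMOTEURL: https://ghcr.io
    volumes: ["./cache/ghcr:/var/lib/registry"]
EOF

# Pulls then go through the matching mirror, e.g.:
#   docker pull mirror.example.lan:5001/<owner>/<image>:<tag>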

[–] panda_abyss@lemmy.ca 1 points 3 days ago

I do run my own Forgejo container registry, and I mirror the images I need there.

But I don't back up my containers, just the data directories I mount into them.
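
(Mirroring into a Forgejo instance's container registry is the usual tag-and-push dance; the host, user, and image here are placeholders, and it assumes the registry feature is enabled.)

# Log in to the Forgejo container registry, re-tag, and push a copy.
docker login forgejo.example.com
docker pull nginx:1.27
docker tag nginx:1.27 forgejo.example.com/myuser/nginx:1.27
docker push forgejo.example.com/myuser/nginx:1.27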

[–] Decronym@lemmy.decronym.xyz -4 points 3 days ago* (last edited 2 days ago) (1 children)

Acronyms, initialisms, abbreviations, contractions, and other phrases which expand to something larger, that I've seen in this thread:

Fewer Letters  More Letters
DNS            Domain Name Service/System
Git            Popular version control system, primarily for code
HTTP           Hypertext Transfer Protocol, the Web
IP             Internet Protocol
SSH            Secure Shell for remote terminal access
SSL            Secure Sockets Layer, for transparent encryption
VPN            Virtual Private Network
nginx          Popular HTTP server

[Thread #44 for this comm, first seen 30th Jan 2026, 12:50] [FAQ] [Full list] [Contact] [Source code]

[–] officermike@lemmy.world 5 points 3 days ago (2 children)

While I don't necessarily have an issue with the intent of this bot, literally none of the initialisms listed were in the OP or any of the comments, as far as I can see.

[–] bravesilvernest@lemmy.ml 4 points 3 days ago

It found "IP" in subprocess.PIPE 😂

[–] ea6927d8@lemmy.ml -1 points 3 days ago

The bot is a vibe-coded interface to an "AI".