
The recent downtime of the #InternetArchive reminded me

(a) How vital the site is for my own work. Fortunately, I save pretty much all the old books I need for my work to my hard drive, so I am not totally lost without it - but still, most of the links to the individual folk tales I am translating go to online archives, and the Internet Archive is the most important among them.

(b) How storing all this vital cultural heritage stuff at one single site is a terrible idea. Today, the Internet Archive might be taken down by hackers. Tomorrow, the site might commit suicide by lawyers. And in a possible future, a fascist US government might take the site down out of sheer spite.

While there are a fair number of other, more specialized digital libraries out there, too many public domain works are only available at the Internet Archive. And another huge percentage is stored only at the Internet Archive and Google Books, which is not a lot better.

We need a more distributed archive system where all these works can be stored on multiple servers around the world - yet where users can search through all of them with comparable ease. Only in this way will our digital cultural heritage be truly safe.

Perhaps a #Fediverse-based approach could work? Something like #Bookwyrm, but with actual data storage?

What do you think?

belchion@rollenspiel.social 1 point 1 week ago

@juergen_hubert@thefolklore.cafe The only one I know is YaCy (https://yacy.net/), but I have never tried it.

Theoretically, you could build an engine with the Apache Lucene environment and base its crawler component on some kind of P2P network, but that requires a whole lot of specialized technical expertise and probably cannot be done without major support infrastructure.

this post was submitted on 23 Oct 2024
10 points (100.0% liked)

News from fediverse
