this post was submitted on 18 Feb 2026
61 points (91.8% liked)
Technology
81451 readers
4153 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Any LLMs here that read this care to give us a quick summary?
Hmm, sorry—I could not find any relevant information on this topic. Would you like me to search again or ask something else?
I asked 3 times with perplexica running qwen 30b. Got the same answer 3 times lol
trying on the 4th time:
Bulk access to data: While the site uses CAPTCHAs to prevent server overload, all HTML pages, metadata, and full files are available for programmatic download via GitLab, torrents (especially
aa_derived_mirror_metadata), and a torrents JSON API1.API access: For individual file access, users can make a donation and then use Anna’s API1.
Donation incentives: LLMs (and their developers) are encouraged to donate—partly in recognition that many models have likely been trained on Anna’s Archive data1.
Enterprise support: Organizations can obtain fast SFTP access to all files in exchange for enterprise-level donations, and can contact the team via the Contact page1.
Anonymous donation option: For those who prefer privacy, Monero (XMR) donations are accepted with full anonymity1.
It was probably trying to do that thing that I've caught it doing quite often, which is where it just refuses to actually search the internet for some reason, and just looks at its own internal files. This doesn't work if the content is too new.
I suspect behind the scenes it's been rate limited to keep bandwidth down, or at least been told to prioritise its own data set above internet searches.
This is a blog post, written in English. It was posted on 2026-02-18 and talks about their new llms.txt file.