this post was submitted on 18 Aug 2024
74 points (76.8% liked)

Privacy

40070 readers
454 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Related communities

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] CynicusRex@lemmy.ml 8 points 11 months ago (1 children)

#TL;DR:

User-agent: GPTBot
Disallow: /
User-agent: ChatGPT-User
Disallow: /
User-agent: Google-Extended
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: Amazonbot
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Omgilibot
Disallow: /
User-Agent: FacebookBot
Disallow: /
User-Agent: Applebot
Disallow: /
User-agent: anthropic-ai
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: Claude-Web
Disallow: /
User-agent: Diffbot
Disallow: /
User-agent: ImagesiftBot
Disallow: /
User-agent: Omgilibot
Disallow: /
User-agent: Omgili
Disallow: /
User-agent: YouBot
Disallow: /
[–] mox@lemmy.sdf.org 6 points 11 months ago (1 children)

Of course, nothing stops a bot from picking a user agent field that exactly matches a web browser.

[–] JackbyDev@programming.dev 3 points 11 months ago (1 children)

Nothing stops a bot from choosing to not read robots.txt

[–] mox@lemmy.sdf.org 2 points 11 months ago* (last edited 11 months ago)

Indeed, as has already been said repeatedly in other comments.