this post was submitted on 17 Aug 2025
705 points (99.7% liked)

Technology

468 readers
279 users here now

Share interesting Technology news and links.

Rules:

  1. No paywalled sites at all.
  2. News articles has to be recent, not older than 2 weeks (14 days).
  3. No videos.
  4. Post only direct links.

To encourage more original sources and keep this space commercial free as much as I could, the following websites are Blacklisted:

More sites will be added to the blacklist as needed.

Encouraged:

Misc:

Relevant Communities:

founded 3 months ago
MODERATORS
 

Comments

Source.

you are viewing a single comment's thread
view the rest of the comments
[–] MonkderVierte@lemmy.zip 18 points 1 week ago* (last edited 1 week ago) (4 children)

I just thought that having a client side proof-of-work (or even only a delay) bound to the IP might deter the AI companies to choose to behave instead (because single-visit-per-IP crawlers get too expensive/slow and you can just block normal abusive crawlers). But they already have mind-blowing computing and money ressources and only want your data.

But if there was a simple-to-use integrated solution and every single webpage used this approach?

[–] witten@lemmy.world 12 points 1 week ago

Believe me, these AI corporations have way too many IPs to make this feasible. I've tried per-IP rate limiting. It doesn't work on these crawlers.

[–] explodicle@sh.itjust.works 2 points 1 week ago (1 children)

What if we had some protocol by which the proof-of-work is transferable? Then not only would there be a cost to using the website, but also the operator would receive that cost as payment.

[–] Taldan@lemmy.world 4 points 1 week ago* (last edited 1 week ago)

It's theoretically viable, but every time that has been tried has failed

There are a lot of practical issues, mainly that it's functionally identical to a crypto miner malware

[–] Taldan@lemmy.world 2 points 1 week ago (1 children)

Are you planning to just outright ban IPv6 (and thus half the world)?

Any IP based restriction is useless with IPv6

[–] strict0768@lemmy.world 3 points 1 week ago (1 children)

Not really true, you can block ranges.

[–] Taldan@lemmy.world 1 points 1 week ago

Okay, but how does that help? Or are you suggesting just wholesale banning entire ISPs?