this post was submitted on 25 Aug 2023
134 points (95.3% liked)

Technology

73801 readers
4479 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

Major websites like Amazon and the New York Times are increasingly blocking OpenAI's web crawler GPTBot::Companies like Amazon and The New York Times are rushing to prevent ChatGPT from collecting their data.

top 8 comments
sorted by: hot top controversial new old
[–] Fiivemacs@lemmy.ca 82 points 2 years ago (3 children)

Ohhh these companies don't like their data collected but have zero issue collecting ours and selling it to whoever?

Get fucked, I hope more bots spring up and start collecting their shit

[–] lobut@lemmy.ca 18 points 2 years ago (3 children)

I mean, I'm no fan of these companies and their data collection either which also likely led to Trump being in power for 4 years.

At the same time, I'm not a fan of the AI companies taking all this data either. They're not doing good for good either. They're money hungry monsters too.

[–] BURN@lemmy.world 3 points 2 years ago

Also the inability to opt your content out of being used for AI training by is a major issue for individuals.

[–] uriel238@lemmy.blahaj.zone 2 points 2 years ago

So the problem here isn't hostile AI but capitalism.

To be fair, the AI community notes that capitalist interests will drive earlier releases of dangerous AI products.

[–] Jaded@lemmy.dbzer0.com -4 points 2 years ago

Many AI companies open source their models, their isn't only openai.

[–] alienanimals@lemmy.world 14 points 2 years ago

"HEY WE STOLE THAT DATA FAIR AND SQUARE. YOU CAN'T TAKE IT" -Amazon

[–] uriel238@lemmy.blahaj.zone 4 points 2 years ago

Oh they'll multiply like spiders.

The internet behaves like an organic ecosystem with countless arms races between crypsis and detection, pursuit and evasion, encryption and crack.

And the beautiful thing is we get the technology as it leaks so shortly after Amazon protects its data from the scrapers, we protect our data from Amazon.

[–] Aopen@discuss.tchncs.de 5 points 2 years ago

Banning GPTBot doesnt make any harm to them. What with Bard though? Does he use Google Bot or his own web crawler?