this post was submitted on 23 May 2024
5 points (100.0% liked)

TechTakes

1751 readers
52 users here now

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

founded 2 years ago
MODERATORS
 

Source

I see Google's deal with Reddit is going just great...

you are viewing a single comment's thread
view the rest of the comments
[–] dumbass@leminal.space 0 points 10 months ago (3 children)

Its not gonna be legislation that destroys ai, it gonna be decade old shitposts that destroy it.

[–] match@pawb.social 1 points 10 months ago

Well now I'm glad I didn't delete my old shitposts

[–] MalachaiConstant@lemmy.world 1 points 10 months ago (1 children)

Everyone who neglected to add the "/s" has become an unwitting data poisoner

[–] anton@lemmy.blahaj.zone 3 points 9 months ago

Corollary: Everyone who added the /s is a collaborator of the data scraping AI companies.

[–] jonhendry@iosdev.space 0 points 10 months ago (1 children)

@dumbass @db0

I suppose we should be glad that they aren’t training on old 4chan/8chan posts.

[–] harrys_balzac@lemmy.dbzer0.com 0 points 10 months ago (1 children)
[–] jonhendry@iosdev.space 0 points 10 months ago (1 children)

@harrys_balzac

Posts there are expired and deleted over time, so unless someone's made an effort to archive them, they're gone.

Of course, the AI people could hoover up new horrible posts.

[–] nickwitha_k@lemmy.sdf.org 0 points 10 months ago (1 children)

I would be surprised if someone hasn't been scraping it for years.

[–] Irelephant@lemm.ee 2 points 2 weeks ago

There is dozens of 4chan data archives.