this post was submitted on 08 Aug 2025
229 points (99.6% liked)

Privacy

2358 readers
156 users here now

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

founded 2 years ago
MODERATORS
 

Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

you are viewing a single comment's thread
view the rest of the comments
[–] FaceDeer@fedia.io 1 points 3 weeks ago (1 children)

There was someone else in this thread responding to me that didn't understand how ActivityPub or the law worked, my explanations certainly were not "pointless" for them. They could have learned some things from what I said. Whether they did or not, who knows, that's up to them.

You're not bringing anything to the conversation because you can't even tell what the conversation is about, apparently.

You don't get to decide what the conversation is about, it's a collaborative thing. All that OP opened with is "look, Facebook is training AIs off of Fediverse content" and I responded to that with my own take on what this meant. My comments have been on-topic and haven't broken any instance or community rules that I can see.

Feel free to not respond to my comments, or even to block me if you really prefer not to see what I have to say. User blocks are better implemented on the Fediverse than back on Reddit, they don't wreck the flow of conversation for everyone else so they're a better option here.

[–] Feyd@programming.dev 1 points 3 weeks ago (1 children)

You opened by saying that somehow, using a federated social media site naturally means someone also supports using that site to train AI. My whole point in this entire thread is that you are drawing a false conclusion, clearly, because there are plenty of people that clearly don't agree.

You just spew the same unrelated junk over and over because you can't back up your ridiculous assertion.

I don't see any point in continuing because you're clearly tripling down, but you really should actually respond to what is actually being said to you if you're going to respond at all.

[–] FaceDeer@fedia.io 1 points 3 weeks ago

I was being sarcastic. It just boggles me that people are surprised by this, it should be obvious that the Fediverse is an even better source of training material (in practical terms if not in volume) than Reddit and such because there are no API restrictions or big corporations willing to throw lawsuits around.

If you don't want your posts and comments to be used to train AI then posting on the Fediverse is the very last thing you should be doing.