this post was submitted on 08 Aug 2025
228 points (99.6% liked)

Privacy

2222 readers
72 users here now

Icon base by Lorc under CC BY 3.0 with modifications to add a gradient

founded 2 years ago
MODERATORS
 

Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

top 50 comments
sorted by: hot top controversial new old
[–] Bebopalouie@lemmy.ca 26 points 6 days ago (1 children)

Future headline maybe.

Facebook becomes more left and they can’t figure out why.

[–] morphballganon@lemmynsfw.com 2 points 6 days ago

With hexbear and world? Nah

[–] TheBat@lemmy.world 20 points 6 days ago (1 children)

Can't wait for meta chatbot to tell zuckerberg to kill himself

[–] ShadowRam@fedia.io 3 points 6 days ago

I mean, think about it.

The majority of written communication are core focused on Drama.

People communicate online about subjects --> Drama News - Drama Books/Stories - Drama

So is anyone really surprised that you train an entity on only written text and it ends up being dramatic?

[–] Taleya@aussie.zone 5 points 5 days ago (1 children)

An ai trained on hexbear would be hilarious

[–] cm0002@lemmy.world 7 points 5 days ago (2 children)

lmao what did you just say about Hexbear, lib? 💀 I’ll have you know I’m a tier-5 giga-brained poster with a PhD in Leninist praxis from the University of Posters, and I have 300+ confirmed dunks on Lemmy.ml sockpuppets. I was radicalized in the trenches of r/ChapoTrapHouse, forged in the fires of permabans, and tempered in the meme wars of 2019. You are literally nothing to me but another bootlicker running on 80% State Dept. talking points and 20% soy. I will ratio you so hard your precious little upvote count will never recover. You think you can just roll up in here, talk shit about Hexbear, and not get absolutely obliterated by dialectical praxis in 4K? Think again, bucko. As we speak, my cadre of Discord tankies are screen-capping your posts, cross-referencing them with your cringe comment history, and drafting a 12-point rebuttal with citations from Stalin, Mao, and that one screenshot of Bernie saying ‘chill with the anti-communism.’ The storm that’s coming for you is called material conditions, and guess what? They’re not in your favor. I’ve got Lenin’s collected works and a folder full of spicy memes, and I’m not afraid to deploy both. You’re already owned, kid. You just don’t know it yet. Now go touch grass, comrade, before I drop another 3k-word comment that makes you cry and log off.

[–] Taleya@aussie.zone 1 points 5 days ago

I appreciate the bit, but it's kinda wasted on me. I'm genX.

[–] pedz@lemmy.ca 6 points 6 days ago

Is there an easy way to poison the input? Is there something we can slip in our comments that could make the data useless?

[–] Tollana1234567@lemmy.today 4 points 6 days ago

make sense to target the most political instances.

[–] IceFoxX@lemmy.world 4 points 6 days ago

Seriously? Meta uses many methods to snoop on the cell phone and with its functions it also looks for devices in the network in which you are logged in and also devices simply in the vicinity. It goes without saying that Meta makes use of open data... I would even go so far as to say that other AI models are not trained any differently. Well, they may be trained using an AI that has been trained on them so that they don't have to access the data from the actual sources themselves.

[–] Cocopanda@lemmy.world 2 points 6 days ago

Oh ya? Suck the cuck should get a dildo up his arse.

load more comments
view more: next ›