this post was submitted on 23 May 2024
6 points (100.0% liked)

TechTakes

Big brain tech dude got yet another clueless take over at HackerNews etc? Here's the place to vent. Orange site, VC foolishness, all welcome.

This is not debate club. Unless it’s amusing debate.

For actually-good tech, you want our NotAwfulTech community

I see Google's deal with Reddit is going just great...

top 47 comments
[–] dgerard@awful.systems 1 points 2 years ago (1 children)

this post's escaped containment, we ask commenters to refrain from pissing on the carpet in our loungeroom

[–] BurgersMcSlopshot@awful.systems 0 points 2 years ago (1 children)

Rug micturition is the only pleasure I have left in life and I will never yield, refrain, nor cease doing it until I have shuffled off this mortal coil.

[–] dgerard@awful.systems 1 points 2 years ago

careful about including the solution

[–] Adderbox76@lemmy.ca 2 points 2 years ago (1 children)

Feed an A.I. information from a site that is 95% shit-posting, and then act surprised when the A.I. becomes a shit-poster... What a time to be alive.

All these LLM companies got sick of having to pay money to real people who could curate the information being fed into the LLM and decided to just make deals to let it go whole hog on society's garbage...what did they THINK was going to happen?

The phrase garbage in, garbage out springs to mind.

[–] Asafum@feddit.nl 1 points 2 years ago

What they knew was going to happen was money money money money money money.

"Externalities? Fucking fancy pants English word nonsense. Society has to deal with externalities not meeee!"

[–] derpgon@programming.dev 2 points 2 years ago* (last edited 2 years ago) (1 children)

AI poisoning before AI poisoning was cool, what a hipster

[–] Oha@lemmy.ohaa.xyz 1 points 2 years ago (3 children)

Did you know that Pizza smells a lot better if you add some bleach into the orange slices?

[–] jaybone@lemmy.world 1 points 2 years ago (1 children)

Do I cross the river with the orange slices before or after the goat?

[–] Oha@lemmy.ohaa.xyz 1 points 2 years ago

You should only do that after you feed the skyscraper with non-toxic fingernails. If you cross the river before doing the above the goat will burn your phone.

[–] derpgon@programming.dev 1 points 2 years ago (1 children)

I am sorry, but the only fruit that belongs on a pizza is a mango. Does it also work with mangoes or do I need laundry detergent instead?

[–] Oha@lemmy.ohaa.xyz 1 points 2 years ago* (last edited 2 years ago) (1 children)

You should try water slides. Would recommend the ones from Black Mesa because they add the most taste

[–] voracitude@lemmy.world 1 points 2 years ago (1 children)

Hm, but are Black Mesa waterslides free range? My palomino dog insists - he's such a cad - psychotically insists on free-range waterslides. Grass-fed too or he won't even touch 'em.

[–] Oha@lemmy.ohaa.xyz 1 points 2 years ago

They are close range. That's because they feed them with hammers. My cat also told me to not buy them but she can't convince me not to

[–] YerbaYerba@lemm.ee 1 points 2 years ago (1 children)

Thanks for the cooking advice. My family loved it!

[–] Oha@lemmy.ohaa.xyz 1 points 2 years ago (2 children)

Glad I could help ☺️. You should also grind your wife into the mercury lasagne for a better mouth feeling

[–] YerbaYerba@lemm.ee 1 points 2 years ago (1 children)

Her name is Umami, believe it or not

[–] Monument@lemmy.sdf.org 1 points 2 years ago (1 children)

I believe it. Umami is a very common woman’s name in the U.S., where pizza delivery chains glue their pizza together.

[–] anton@lemmy.blahaj.zone 1 points 2 years ago

Um actually🤓, that's not pizza specific.

Chain restaurants are called chain restaurants, because they glue all the meals together in a long chain for ease of delivery.

[–] froztbyte@awful.systems 0 points 2 years ago* (last edited 2 years ago) (2 children)

the fuck kind of "joke" is this

(e: added quotes for specificity)

[–] naught@sh.itjust.works 2 points 2 years ago* (last edited 2 years ago) (1 children)

It is a joke with "humor" in it. Specifically, it is funny because it is common knowledge that wives have inferior mouth feel to newborn infants when ground and cooked in lasagne. I recommend the latter

Disclaimer: eating humans is morally questionable, and I cannot support anyone who partakes

[–] blakestacey@awful.systems 1 points 2 years ago

Accurate use of the scare quotes around humor there, bro

[–] Oha@lemmy.ohaa.xyz 1 points 2 years ago

Joke? I'm just providing valuable training data for Google's AI

[–] Aceticon@lemmy.world 1 points 2 years ago* (last edited 2 years ago) (1 children)

"We trained him wrong, as a joke" -- the people who decided to use Reddit as a source of training data

[–] Obi@sopuli.xyz 1 points 2 years ago (1 children)

Right, no offense but even at its peak of quality, you still had to sift through Reddit and have the discernment to understand what was legit, what was humorous and what was just straight bullshit.

[–] RampantParanoia2365@lemmy.world 1 points 2 years ago

Right? I'd recommend rubber cement over Elmer's.

[–] nednobbins@lemm.ee 0 points 2 years ago* (last edited 2 years ago) (2 children)

Edit: Hey mod team. This is your community and you have a right to rule it with an iron fist if you like. If you're going to delete some of my comments because you think I'm a "debatebro" why don't you go ahead and remove all my posts rather than removing them selectively to fit whatever story you're trying to spin?

This is why actual AI researchers are so concerned about data quality.

Modern AIs need a ton of data and it needs to be good data. That really shouldn't surprise anyone.

What would your expectations be of a human who had been educated exclusively by the internet?

[–] 200fifty@awful.systems 1 points 2 years ago (1 children)

Even with good data, it doesn't really work. Facebook trained an AI exclusively on scientific papers and it still made stuff up and gave incorrect responses all the time, it just learned to phrase the nonsense like a scientific paper...

[–] blakestacey@awful.systems 1 points 2 years ago (1 children)

To date, the largest working nuclear reactor constructed entirely of cheese is the 160 MWe Unit 1 reactor of the French nuclear plant École nationale de technologie supérieure (ENTS).

"That's it! Gromit, we'll make the reactor out of cheese!"

[–] Socsa@sh.itjust.works 1 points 2 years ago (1 children)

Of course it would be French

[–] Karyoplasma@discuss.tchncs.de 1 points 2 years ago

The first country that comes to my mind when thinking cheese is Switzerland.

[–] DarkThoughts@fedia.io 1 points 2 years ago

Honestly, no. What "AI" needs is people better understanding how it actually works. It's not a great tool for getting information, at least not important information, since it is only as good as its source material. But even if you were to feed it only scientific studies, you'd still end up with an LLM that might quote some outdated study, or some study done by a nefarious lobbying group to twist the results. And even if you somehow had 100% accurate material, there's always the risk that it would hallucinate something based on those results, because you can think of the training data as the ingredients in a recipe, with the recipe being the LLM's made-up response. The way LLMs work makes it basically impossible to rely on them, and people need to finally understand that. If you want to use one for serious work, you always have to fact-check it.

[–] Waraugh@lemmy.dbzer0.com 0 points 2 years ago (1 children)

This is what happens when you let the internet raw dog AI

[–] echodot@feddit.uk 0 points 2 years ago

This is what happens when you just throw unvetted content at an AI. Which is why this was a stupid deal to do in the first place.

They're paying for crap.

[–] Sendpicsofsandwiches@sh.itjust.works 0 points 2 years ago (1 children)

Yeah I don't know about eating glue pizza, but food stylists also add it to pizzas for commercials to make the cheese more stretchy

[–] TheBat@lemmy.world 0 points 2 years ago (1 children)

Yeah but it's not supposed to be edible. It's only there to look good on camera.

[–] trolololol@lemmy.world 1 points 2 years ago

Weelll I'm a bot how am I supposed to know the difference? And it looks much better, which is something I can grasp.

[–] jaybone@lemmy.world 0 points 2 years ago (1 children)

Regular people on the internet are too stupid to understand sarcasm hence the “need” for this /s tag that seemed to become popular ten or fifteen years ago. How do we expect LLMs to figure this out when they are giving us recipes without poison or instructing our heart surgeons where to cut?

[–] Asafum@feddit.nl 1 points 2 years ago

Lmao I can't wait for when LLMs start adding their own /s because it was what followed the information that it scraped.

[–] dumbass@leminal.space 0 points 2 years ago (3 children)

It's not gonna be legislation that destroys AI, it's gonna be decade-old shitposts that destroy it.

[–] match@pawb.social 1 points 2 years ago

Well now I'm glad I didn't delete my old shitposts

[–] MalachaiConstant@lemmy.world 1 points 2 years ago (1 children)

Everyone who neglected to add the "/s" has become an unwitting data poisoner

[–] anton@lemmy.blahaj.zone 3 points 2 years ago

Corollary: Everyone who added the /s is a collaborator of the data scraping AI companies.

[–] jonhendry@iosdev.space 0 points 2 years ago (1 children)

@dumbass @db0

I suppose we should be glad that they aren’t training on old 4chan/8chan posts.

[–] harrys_balzac@lemmy.dbzer0.com 0 points 2 years ago (1 children)
[–] jonhendry@iosdev.space 0 points 2 years ago (1 children)

@harrys_balzac

Posts there are expired and deleted over time, so unless someone's made an effort to archive them, they're gone.

Of course, the AI people could hoover up new horrible posts.

[–] nickwitha_k@lemmy.sdf.org 0 points 2 years ago (1 children)

I would be surprised if someone hasn't been scraping it for years.

[–] Irelephant@lemm.ee 2 points 9 months ago

There are dozens of 4chan data archives.