CEO of Google Says It Has No Solution for Its AI Providing Wildly Incorrect Information (futurism.com)

submitted 2 years ago by Stopthatgirl7@lemmy.world to c/technology@lemmy.world

You know how Google's new feature called AI Overviews is prone to spitting out wildly incorrect answers to search queries? In one instance, AI Overviews told a user to use glue on pizza to make sure the cheese won't slide off (pssst...please don't do this.)

Well, according to an interview at The Verge with Google CEO Sundar Pichai published earlier this week, just before criticism of the outputs really took off, these "hallucinations" are an "inherent feature" of AI large language models (LLMs), which are what drive AI Overviews, and this feature "is still an unsolved problem."

[–] vrighter@discuss.tchncs.de 18 points 2 years ago (1 children)

no, the truth is it's impossible even then. If the result involves randomness at its most fundamental level, then it's not reliable whatever you do.

[–] MacNCheezus@lemmy.today -3 points 2 years ago* (last edited 2 years ago) (2 children)

Sure, the AI is never going to understand what it's doing or why, but training it on better datasets certainly WILL improve the results.

Garbage in, garbage out.

[–] joneskind@lemmy.world 10 points 2 years ago (1 children)

You can train an LLM on the best possible set of data without a single false statement and it will still hallucinate. And there’s nothing to be done against that.

Without understanding of the context everything can be true or false.

“The acceleration due to gravity is equal to 9.81 m/s²” True or False?

An LLM basically works like this: given the previous words written and their order, it picks the most probable next word of the sentence.
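To make the "most probable next word" idea concrete, here is a minimal sketch. It is a toy bigram counter, not how a real LLM is built: it assumes the next word depends only on the single previous word, whereas an actual model conditions on the whole context through a neural network. The corpus and the function names are made up for illustration.

```python
from collections import Counter, defaultdict

def train_bigram(sentences):
    """Count how often each word follows each previous word."""
    counts = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts

def next_word_probs(counts, prev):
    """Turn raw follow-counts into probabilities for the next word."""
    followers = counts.get(prev.lower(), Counter())
    total = sum(followers.values())
    return {word: n / total for word, n in followers.items()}

def most_probable_next(counts, prev):
    """Pick the single most likely next word, or None if prev was never seen."""
    probs = next_word_probs(counts, prev)
    return max(probs, key=probs.get) if probs else None

# Hypothetical toy corpus, chosen only for illustration.
corpus = [
    "the cheese is on the pizza",
    "the cheese is melted",
    "the glue is on the shelf",
]
model = train_bigram(corpus)

print(next_word_probs(model, "is"))     # "on" gets 2/3, "melted" gets 1/3
print(most_probable_next(model, "is"))  # on
```

Nothing in there checks whether the resulting sentence is true; the only thing the model knows is which word tends to follow which.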

[–] MacNCheezus@lemmy.today -3 points 2 years ago (1 children)

Well yes, I've seen those examples of ChatGPT citing scientific research papers that turned out to be completely made up, but at least it seems to be a step up from straight up shitposting, which is what you get when you train it on a dataset full of shitposts.

[–] joneskind@lemmy.world 3 points 2 years ago

Well, it's definitely true that you'll have a hard time getting true things from garbage. But funnily enough, the model might hallucinate true things :)

[–] Aceticon@lemmy.world 4 points 2 years ago (1 children)

The problem is that, because the way they combine things is determined by probability, even if you train it on the greatest, bestest data, the LLM is still going to hallucinate: it's combining multiple sources word by word (roughly), guided only by probabilities derived from language, not logic.
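A rough way to see the point above: take a tiny corpus where every sentence is true (at sea-level pressure), let a word-by-word chain recombine it, and list what comes out. This is a deliberately simplified sketch, assuming a chain that only looks at the previous word; real LLMs are far more sophisticated, but the recombination problem is the same in spirit. All names here are hypothetical.

```python
from collections import defaultdict

def build_transitions(sentences):
    """Map each word to the set of words that follow it somewhere in the corpus."""
    transitions = defaultdict(set)
    for sentence in sentences:
        words = ["<s>"] + sentence.split() + ["</s>"]
        for prev, nxt in zip(words, words[1:]):
            transitions[prev].add(nxt)
    return transitions

def all_generations(transitions, max_len=12):
    """Enumerate every sentence the word-by-word chain could produce."""
    results = set()
    stack = [["<s>"]]
    while stack:
        path = stack.pop()
        for nxt in transitions.get(path[-1], ()):
            if nxt == "</s>":
                results.add(" ".join(path[1:]))
            elif len(path) <= max_len:  # guard against loops in larger corpora
                stack.append(path + [nxt])
    return results

# Training data with no false statements in it.
true_statements = [
    "water boils at 100 degrees celsius",
    "water freezes at 0 degrees celsius",
]
chain = build_transitions(true_statements)
generated = all_generations(chain)
invented = sorted(generated - set(true_statements))

print(invented)
# ['water boils at 0 degrees celsius', 'water freezes at 100 degrees celsius']
```

Half of the four possible outputs are false, even though the chain never saw a single false sentence. Every individual word-to-word step was learned from true data; it's the combination that goes wrong.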

[–] MacNCheezus@lemmy.today 1 points 2 years ago (1 children)

Yes, I understand that. But I'm fairly certain the quality of the data will still have a massive influence over how much and how egregiously that happens.

Basically, what I'm saying is, training your AI on a corpus of shitposts instead of factual information seems like a good way to increase the frequency and magnitude of such hallucinations.

[–] Aceticon@lemmy.world 2 points 2 years ago (1 children)

Yeah, true.

If you train your LLM exclusively on Nazi literature (to pick a wild example), don't expect it to end up, by chance, making points similar to Marx's Das Kapital.

(Personally I think what might be really funny - in the sense of laughter inducing - would be to purposefully train an LLM exclusively on a specific kind of weird material).

[–] MacNCheezus@lemmy.today 3 points 2 years ago (2 children)

Yeah, I mean that’s basically what GPT4Chan did, which someone else already mentioned ITT.

Basically, this guy took a dataset of several gigabytes' worth of archived posts from /pol/ and trained a model on that, then hooked it up to a chatbot and let it loose on the board. You can see the results in this video.

[–] Aceticon@lemmy.world 2 points 2 years ago

That was hilarious!

Thanks for the link.

[–] PipedLinkBot@feddit.rocks 1 points 2 years ago

Here is an alternative Piped link(s):

in this video

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I'm open-source; check me out at GitHub.