this post was submitted on 27 May 2024
1098 points (98.0% liked)
Technology
It's quite simple. Garbage in, garbage out. Data they use for training needs to be curated. How to curate the entire internet, I have no clue.
The real answer would be "don't". Have a decent whitelist of reliable sources for training data. Don't just add every orifice of the internet (like Reddit) to the training data. Limitations would be good in this case.
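A whitelist-based filter for training data could look something like this sketch. The domain names and the `(url, text)` corpus shape are illustrative assumptions, not any real pipeline's format:

```python
from urllib.parse import urlparse

# Hypothetical allowlist of trusted source domains (illustrative only).
ALLOWED_DOMAINS = {"arxiv.org", "docs.python.org", "en.wikipedia.org"}

def is_allowed(url: str) -> bool:
    """Return True only if the URL's host is an allowlisted domain or a subdomain of one."""
    host = urlparse(url).netloc.lower()
    return any(host == d or host.endswith("." + d) for d in ALLOWED_DOMAINS)

def filter_corpus(docs):
    """docs: iterable of (url, text) pairs; yield only documents from allowlisted domains."""
    for url, text in docs:
        if is_allowed(url):
            yield url, text
```

The point is that the decision is made purely on provenance, before any content-quality scoring: anything not on the list simply never enters the corpus.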
It's worse than Reddit: they've been pulling data from The Onion.
Is that for real?
It's been quoting some Onion articles verbatim, so either they pulled from The Onion directly or from somewhere that re-posts Onion articles.
Just train it on linux help forum replies, because everyone there is always 100% right.
Having a curated whitelist would definitely be a good idea, but if it only shows information from a limited list of websites, that would make it a terrible search engine incapable of searching most of the web.
They already have a curated data set. It's called Google Scholar.