this post was submitted on 09 Jun 2025

Technology

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.

[–] technocrit@lemmy.dbzer0.com 3 points 14 hours ago* (last edited 14 hours ago)

“I came up with a problem which experts in my field would recognize as an open question in number theory—a good Ph.D.-level problem,” he says. He asked o4-mini to solve the question. Over the next 10 minutes, Ono watched in stunned silence as the bot unfurled a solution in real time, showing its reasoning process along the way.

OK, cool story, hypeman. Well, what was this problem then? Has it shaken up the world of number theory? lol these grifters. jfc.

[–] samc@feddit.uk 9 points 1 day ago

Most of their quotes come from this Ono guy...

Ono, who is also a freelance mathematical consultant for Epoch AI.

Ahhh, there it is.

[–] lps2@lemmy.ml 10 points 1 day ago (1 children)

Shit, I used AI to suggest a summer cocktail recipe and it thought bourbon was something you sliced and used as a garnish - I think we're safe for now

[–] NigelFrobisher@aussie.zone 2 points 1 day ago

They’re great biscuits.

[–] Vendetta9076@sh.itjust.works 6 points 1 day ago

This shit's so stupid. LLMs aren't good at math. They aren't meant to be good at math. Stop trying to trick us into thinking they're good at math. Use them for what they're meant to do. Good Christ.

[–] Finch9678@europe.pub 6 points 1 day ago

So a company that is known for cheating on benchmarks to look better organises a benchmark and, surprisingly, passes it flawlessly... I am shocked!

[–] psycho_driver@lemmy.world 7 points 1 day ago

Earlier today I read an article about ChatGPT getting trounced by Chess on the Atari 2600 at the beginner's setting.

[–] drspod@lemmy.ml 4 points 1 day ago

And not a single research paper was linked.