this post was submitted on 07 Mar 2026

Over the past few weeks, several US banks have pulled back from lending to Oracle for the expansion of its AI data centres, according to a report.

Not_mikey@lemmy.dbzer0.com 0 points 14 hours ago

Here's the source. It's from OpenAI, but it is peer reviewed. Here's another source that uses it as a baseline to compare relative scores; according to its tables, in 2023 it got a 610, putting it around the 75th percentile. That's just for math, which the OpenAI study showed it did about 5% worse on than its average, so ~80th percentile for a total score. Again, this is among students, who are usually more prepared for the SAT than the general population, so it's still probably in the 90th percentile for the general population.

Again, the car wash example is not declarative knowledge. Like the pizza glue, it is knowledge derived from experience and reasoning, which I've said LLMs aren't the best at. The fact that they had to make a riddle for the AI to trip it up, if anything, shows how good it is. If it were as bad as you say it is, then anyone could easily trip it up and get it to give a wrong answer, and a study like that wouldn't be relevant. Seriously, if you think the LLM is so inaccurate, come up with your own test to stump it; it should be easy, by the way you talk about them.

CileTheSane@lemmy.ca 1 points 14 hours ago

The fact that they had to make a riddle for the AI to trip it up

"I want to take my car to the car wash, should I walk or drive?" is not a riddle. It requires a basic understanding of what is being asked.