this post was submitted on 22 Feb 2026
986 points (97.1% liked)

Microblog Memes

10944 readers
1828 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

RULES:

  1. Your post must be a screen capture of a microblog-type post that includes the UI of the site it came from, preferably also including the avatar and username of the original poster. Including relevant comments made to the original post is encouraged.
  2. Your post, included comments, or your title/comment should include some kind of commentary or remark on the subject of the screen capture. Your title must include at least one word relevant to your post.
  3. You are encouraged to provide a link back to the source of your screen capture in the body of your post.
  4. Current politics and news are allowed, but discouraged. There MUST be some kind of human commentary/reaction included (either by the original poster or you). Just news articles or headlines will be deleted.
  5. Doctored posts/images and AI are allowed, but discouraged. You MUST indicate this in your post (even if you didn't originally know). If an image is found to be fabricated or edited in any way and it is not properly labeled, it will be deleted.
  6. Absolutely no NSFL content.
  7. Be nice. Don't take anything personally. Take political debates to the appropriate communities. Take personal disagreements & arguments to private messages.
  8. No advertising, brand promotion, or guerrilla marketing.

RELATED COMMUNITIES:

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] stabby_cicada@lemmy.blahaj.zone 3 points 1 day ago (1 children)

What you've given is an example of a problem where an LLM is, by design, the wrong tool.

See, variation is built into LLMs. They're programmed to evaluate probable responses and select from them on the basis of probability - to simplify ridiculously, if a particular word follows another 90% of a time, in 90% of the content it generates the LLM will have that word follow the other, and in the other 10% it won't.

If you give an LLM the exact same prompt multiple times, you will get multiple different responses. They'll all be similar responses, but they won't be exactly the same, because how LLMs generate language is probabilistic and contains randomness.

(And that is why hallucination is an inherent feature of LLMs and can't be trained out.)

But math isn't language. Math problems have correct answers. When you use a software tool to answer a math problem, you don't want variation. You want the correct answer every time.

To solve a math problem, you need to find the appropriate formula, which will be the same every time. Then you use a calculator, which always gives the correct result. You plug the numbers into the formula and calculate the result.

What I'm getting at is, if you use a calculator to do the math problem yourself, and you put in the correct formula, you'll always get the correct result. If you use a LLM to generate the answer to a math problem, there is always a non-zero chance it will give you the wrong answer.

If you haven't looked up the formula yourself, you have no way to check the LLM's work, which means you can't trust the answer for anything important - so you have to look up the formula and check it with a calculator.

And if you have looked up the formula yourself, it's just as easy to use a calculator the first time and skip the LLM.

Right? This is what I'm getting at. An LLM can do some of the same things a human does, but it's always going to be worse at it than a human, because it's not conscious, it's not reasoning its way to a correct answer, it's just generating a string of linguistic tokens based on probabilities. And math problems might be the clearest possible example of this.

[–] maplesaga@lemmy.world 2 points 1 day ago* (last edited 1 day ago) (1 children)

Thats well put, I'm under no naive assumption that LLMs are AI. Though I do think youre discounting the usefulness, as it did give the right answer, which is a fine use for average people doing basic math or whatever project theyre working on. I'm under no delusion that its replacing workers, unless someones job is writing fancy emails or building spreadsheets, and I do still think its a massive bubble.

[–] stabby_cicada@lemmy.blahaj.zone 5 points 1 day ago* (last edited 1 day ago)

Yeah, I get that it seems like a fine use for average people doing basic math. The nonzero chance of error could end up not mattering. But it could matter very much, depending on the use case. If you're asking an LLM the volume of a bucket, it's not a big deal. If you're asking an LLM "how many milligrams of this drug is the correct dose for a 80 kg man", that's a big fucking deal.

If people don't know LLMs can't be trusted to give the corect answer, they're not going to realize they need to do the math themselves in important use cases. And that is certainly not something Microsoft and Google are encouraging people to learn.

Then there's the efficiency issue - Big Tech spent trillions of dollars to develop and train machine learning processors, which perform quadrillions of energy-intensive processes per second, and they're being marketed to do a job that a 99 cent solar powered calculator from the 1980s can do better.

God, I just realized tax season is coming up. And after all the layoffs and political firings and general dogebaggery at the American IRS, they're going to have to deal with people using AI to do their taxes 😆