this post was submitted on 22 Feb 2026
214 points (99.1% liked)

Technology

81710 readers
3221 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
 

A study conducted by researchers at CCC, which is based at the MIT Media Lab, found that state-of-the-art AI chatbots — including OpenAI’s GPT-4, Anthropic’s Claude 3 Opus, and Meta’s Llama 3 — sometimes provide less-accurate and less-truthful responses to users who have lower English proficiency, less formal education, or who originate from outside the United States. The models also refuse to answer questions at higher rates for these users, and in some cases, respond with condescending or patronizing language.

you are viewing a single comment's thread
view the rest of the comments
[–] fiat_lux@lemmy.world 1 points 20 hours ago* (last edited 20 hours ago)

It's definitely not indicative of the region, it's a weird jumble of ESL stereotypes, much like the content.

The patois affecting the response is expected, it was basically part of the hypothesis, but the question itself is phrased fluently, and neither bio nor question is unclear. The repetition about bar charts with weird "da?" ending is... something.

Sure, some of it is fixable but the point remains that gross assumptions about people are amplified in LLM data and then reflected back at vulnerable demographics.

The whole paper is worth a read, and it's very short. This is just one example, the task refusal rates are possibly even more problematic.

Edit: thought this was a response to a different thread. Sorry. Larger point stands though.