this post was submitted on 31 Mar 2026
269 points (99.6% liked)

Technology

84643 readers
3705 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
top 50 comments
sorted by: hot top controversial new old
[–] Encephalotrocity@feddit.online 197 points 1 month ago (14 children)

Perhaps the most discussed technical detail is the "Undercover Mode." This feature reveals that Anthropic uses Claude Code for "stealth" contributions to public open-source repositories.

The system prompt discovered in the leak explicitly warns the model: "You are operating UNDERCOVER... Your commit messages... MUST NOT contain ANY Anthropic-internal information. Do not blow your cover."

Laws should have been put in place years ago to make it so that AI usage needs to be explicitly declared.

[–] merc@sh.itjust.works 88 points 1 month ago (1 children)

The system prompt discovered in the leak explicitly warns the model: "You are operating UNDERCOVER... Your commit messages... MUST NOT contain ANY Anthropic-internal information. Do not blow your cover."

This is so incredibly stupid.

You've tried security.

You've tried security through obscurity.

Now try security through giving instructions to an LLM via a system prompt to not blow its cover.

[–] pemptago@lemmy.ml 7 points 1 month ago

Haven't read the article and have a limited knowledge of ai, but I wonder if they do this for reinforcement learning: So OSS PR responses can be used to label different weights and models. Using even more free labor to train their models.

[–] a4ng3l@lemmy.world 7 points 1 month ago

In Europe we have the AI act which, as of August, will introduce some form of transparency obligations. Not perfect obviously but a start. Probably will not be followed by the rest of the world though so like GDPR it will be forcibly eroded by other’s interests through lobbying but at least we try.

[–] JohnEdwa@sopuli.xyz 2 points 1 month ago (1 children)

With how massive of a computer science field artificial intelligence is and how much of it already is or is getting added to every piece of software that exists, a label like that would be equally useless as the California prop 65 cancer warnings.

Do you use a mobile keyboard that supports swipe typing and has autocorrect? Remember to mark everything you write as being AI assisted.

[–] mrbutterscotch@feddit.org 2 points 1 month ago

Well yes, if you let autocorrect write code contribution, I think you should lable that contribution as AI.

load more comments (10 replies)
[–] rimu@piefed.social 80 points 1 month ago* (last edited 1 month ago) (2 children)

If you installed or updated Claude Code via npm on March 31, 2026, between 00:21 and 03:29 UTC, you may have inadvertently pulled in a malicious version of axios (1.14.1 or 0.30.4) that contains a Remote Access Trojan (RAT). You should immediately search your project lockfiles (package-lock.json, yarn.lock, or bun.lockb) for these specific versions or the dependency plain-crypto-js. If found, treat the host machine as fully compromised, rotate all secrets, and perform a clean OS reinstallation.

Lol 😂

[–] ellen.kimble@piefed.social 13 points 1 month ago (1 children)

This is because if an unrelated hack on npm’s latest build. Anyone with this version of npm is affected

[–] criss_cross@lemmy.world 4 points 1 month ago

That axios supply chain attack was a bitch. There were extensions compromised from that shit.

load more comments (1 replies)
[–] CorrectAlias@piefed.blahaj.zone 53 points 1 month ago (3 children)

Be careful not to introduce security vulnerabilities such as command injection, XSS, SQL injection, and other OWASP top 10 vulnerabilities. If you notice that you wrote insecure code, immediately fix it.

Lmao. I'm sure that will solve the problem of it writing insecure slop code.

[–] filcuk@lemmy.zip 25 points 1 month ago (2 children)

It doesn't fix it, but as stupid as it looks, it should actually improve the chances.
If you've seen how the reasoning works, they basically spit out some garbage, then read it again and think whether it's garbage enough or not.
They do try to 'correct their errors', so to say.

[–] underisk@lemmy.ml 5 points 1 month ago (2 children)

That’s not enabled by default afaik and it burns through way more tokens looping its output through several times. It also adds a bunch more context which will bring you that much closer to context collapse.

I didn't turn it on, and I see it doing it all the time. In my case though the mistakes are often absurd. I often feel like claude is a very junior programmer that has a hard time remembering the original requirements.

[–] fuzzzerd@programming.dev 4 points 1 month ago

While true, the latest opus model has 1m token context. Which is a lot more than the previous 200k limit. Hard to fill that up with regular work, but easy if you try to oneshot a whole product.

load more comments (1 replies)
[–] _stranger_@lemmy.world 6 points 1 month ago

Sounds exactly like half the managers I've ever worked with.

[–] raspberriesareyummy@lemmy.world 3 points 1 month ago

That sounds like written by some dumbass vibe-coder who actually believes their LLM is "smart".

[–] lmr0x61@lemmy.ml 50 points 1 month ago (2 children)

Normally, I’d be reading about NPM security breaches and AI security breaches separately, but now I can get them in the same article! Truly amazing how technology has progressed.

[–] return2ozma@lemmy.world 6 points 1 month ago

Fun times ahead!

load more comments (1 replies)
[–] jivandabeast@lemmy.browntown.dev 41 points 1 month ago
[–] spez@sh.itjust.works 20 points 1 month ago* (last edited 1 month ago) (7 children)

I mean it's not that big a deal. However, it would another thing if the model itself leaked. Now that would be something.

edit: Like I thought, it turns out to be a TS wrapper with more internal prompts. The fireship video is really funny, they use regex to detect if the user is angry 😭

[–] MangoCats@feddit.it 7 points 1 month ago

As they tell it, Claude Code is over 80% written by the models anyway...

[–] lexiw@lemmy.world 6 points 1 month ago

The harness is as important as the model

load more comments (5 replies)
[–] NocturnalMorning@lemmy.world 16 points 1 month ago (4 children)

By 4:23 am ET, Chaofan Shou (@Fried_rice), an intern at Solayer Labs, broadcasted the discovery on X (formerly Twitter).

Ha, by an intern

[–] djmikeale@feddit.dk 5 points 1 month ago

Nice. One of the ways to write Chaofan in Chinese is 炒饭, which means fried rice. Amazing to be able to get that Twitter handle

load more comments (3 replies)
[–] pelespirit@sh.itjust.works 16 points 1 month ago* (last edited 1 month ago) (1 children)

Like a healthy brain. And just like a healthy brain, it'll still hallucinate and make mistakes probably:

The leaked source reveals a sophisticated, three-layer memory architecture that moves away from traditional "store-everything" retrieval.

As analyzed by developers like @himanshustwts, the architecture utilizes a "Self-Healing Memory" system.

[–] Semi_Hemi_Demigod@lemmy.world 9 points 1 month ago (3 children)

We’re gonna make AGI and realize that being stupid sometimes and making mistakes is integral to general intelligence.

[–] Didntdoit71@feddit.online 7 points 1 month ago (1 children)

Actually, the people in the know...already knew this. We've known for years. Mistakes are required for learning.

load more comments (1 replies)
[–] MangoCats@feddit.it 3 points 1 month ago

being stupid sometimes and making mistakes is integral to general intelligence.

Smart people figured this out a long time ago.

https://www.amazon.com/s?k=nassim+taleb+antifragile&adgrpid=187118826460

https://www.goodreads.com/en/book/show/18378002-intuition-pumps-and-other-tools-for-thinking

[–] a4ng3l@lemmy.world 1 points 1 month ago

That’s what makes us humans at least…

[–] Fmstrat@lemmy.world 14 points 1 month ago

At its core is MEMORY.md, a lightweight index of pointers (~150 characters per line) that is perpetually loaded into the context. This index does not store data; it stores locations.

Actual project knowledge is distributed across "topic files" fetched on-demand, while raw transcripts are never fully read back into the context, but merely "grep’d" for specific identifiers.

This "Strict Write Discipline"—where the agent must update its index only after a successful file write—prevents the model from polluting its context with failed attempts.

For competitors, the "blueprint" is clear: build a skeptical memory. The code confirms that Anthropic’s agents are instructed to treat their own memory as a "hint," requiring the model to verify facts against the actual codebase before proceeding.

Interesting to see if continue.dev takes advantage of this methodology. My only complaint has been context with it.

[–] WhyJiffie@sh.itjust.works 12 points 1 month ago (1 children)

In this mode, the agent performs "memory consolidation" while the user is idle. The autoDream logic merges disparate observations, removes logical contradictions, and converts vague insights into absolute facts.

this blog post reads like a marketing piece

[–] nightlily@leminal.space 7 points 1 month ago

Pretty sure it’s a bad LLM „analysis“ of the code. It has that flavour to it.

[–] captcha_incorrect@lemmy.world 11 points 1 month ago
[–] Dentzy@sh.itjust.works 10 points 1 month ago

I was like "Ha, ha nice April's fools"... Then I keep reading the comments and... WTF‽

[–] itisileclerk@lemmy.world 5 points 1 month ago

The best learning method is from your own mistakes. So, Claude is still learning.

[–] Wooki@lemmy.world 5 points 1 month ago (1 children)

Best part of the leak, they use regex matches for sentiment lol

[–] hactar42@lemmy.ml 6 points 1 month ago (2 children)

I think saw one of the keywords was dumbass. And another looked for you calling it a piece of shit

[–] Buddahriffic@lemmy.world 4 points 1 month ago

Lmao, so the LLM framework falls back to similar shit to what ALICE used?

[–] smeenz@lemmy.nz 4 points 1 month ago* (last edited 1 month ago) (1 children)

Something in a song on my car radio triggered my phone to wake google yesterday and I casually told it to fuck off, and it replied "I'm sorry you're upset. You can send feedback"

[–] partofthevoice@lemmy.zip 3 points 1 month ago

Adversarial audio, but just occurring by chance? Wild stuff. I was just looking into how to do that.

[–] OhVenus_Baby@lemmy.ml 2 points 1 month ago

Vote people. There's town and city votes everyday or often. Vote!

load more comments
view more: next ›