Memes

Based if true (lemmy.ml)

submitted 3 days ago by yogthos@lemmy.ml to c/memes@lemmy.ml

[–] HiddenLayer555@lemmy.ml 2 points 1 day ago* (last edited 1 day ago)

Even if true, just block them wtf? Vibe coders don't understand server side access controls.

[–] Meron35@lemmy.world 18 points 3 days ago (2 children)

Anthropic is making a lot of noise about being the victim of large-scale distillation attacks (i.e. other AI firms, usually Chinese, copying/scraping their model), but people have pointed out the hypocrisy, since Anthropic itself seems to have copied DeepSeek.

If you bypass the system prompt and ask Claude what model it is (e.g. via OpenRouter), it'll reply that it's DeepSeek.

(Also I know, eww Reddit and X)

Claude sonnet 4.6 says it’s DeepSeek when system prompt is empty : r/DeepSeek - https://www.reddit.com/r/DeepSeek/comments/1rd5jw7/claude_sonnet_46_says_its_deepseek_when_system/

Claude Sonnet 4.6 distilled DeepSeek? : r/DeepSeek - https://www.reddit.com/r/DeepSeek/comments/1r9se7p/claude_sonnet_46_distilled_deepseek/

https://x.com/i/status/2026130112685416881

[–] HiddenLayer555@lemmy.ml 1 points 1 day ago* (last edited 1 day ago)

Interestingly, I've gotten the local 32B DeepSeek R1 model to say it's Claude as well (in English).

IDK if it's indicative of distillation or "model theft" though (how do you distill a closed model like Claude?). And why would training on Claude's responses carry the name over, unless Claude says its name every time? My uneducated guess is that it's more an indication that LLMs don't actually know anything: the data they're trained on mentions every AI model, so if you don't tell them what they are, they just "pick" one of the most commonly mentioned ones. DeepSeek is a common Chinese model, so it's probably associated with questions about model names in Chinese, and the same goes for Claude and English.

[–] yogthos@lemmy.ml 12 points 3 days ago

I think the reason they're making noise is that they want to make a case for banning Chinese models entirely. Their problem right now is that Chinese models are open, so anybody can download and run their own copy. That directly undermines the whole business model of providing models as a service. I bet they're going to try to argue that since DeepSeek and other Chinese companies stole their IP, these models are now illegal and can't be used in the US.

[–] funkajunk@lemmy.world 17 points 3 days ago

Oh no, they used the thing I built using stolen data

[–] ExotiqueMatter@lemmygrad.ml 4 points 2 days ago (1 children)

Almost surely false. I seriously doubt it's possible to train a modern multi-billion-parameter LLM with fewer than two dozen million prompts, even if it's 16M each, let alone if it's 16M combined.

[–] yogthos@lemmy.ml 2 points 2 days ago

yeah I'm very skeptical here as well

[–] pineapple@lemmy.ml 8 points 3 days ago

Robin Hood moment.

[–] ShinkanTrain@lemmy.ml 6 points 3 days ago (1 children)

Please tell me these 16 million prompts cost Anthropic a lot of money

[–] yogthos@lemmy.ml 8 points 3 days ago

It would've been through the API access, so they'd get paid.