Memes

54806 readers

1459 users here now

Rules:

Be civil and nice.
Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.

founded 6 years ago

MODERATORS

gary_host_laptop@lemmy.ml

cyclohexane@lemmy.ml

cypherpunks@lemmy.ml

Based if true (lemmy.ml)

submitted 3 days ago by yogthos@lemmy.ml to c/memes@lemmy.ml

10 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] Meron35@lemmy.world 18 points 3 days ago (2 children)

Anthropic making a lot of noise of being the victim of large scale distillation attacks (ie other AI firms, usually Chinese copying/scraping their model), but people have pointed out the hypocrisy that Anthropic themselves seems to have copied DeepSeek.

If you bypass the system prompt and ask Claude what model it is (e.g. via Open router), it'll reply that it's DeepSeek.

(Also I know, eww Reddit and X)

Claude sonnet 4.6 says it’s DeepSeek when system prompt is empty : r/DeepSeek - https://www.reddit.com/r/DeepSeek/comments/1rd5jw7/claude_sonnet_46_says_its_deepseek_when_system/

Claude Sonnet 4.6 distilled DeepSeek? : r/DeepSeek - https://www.reddit.com/r/DeepSeek/comments/1r9se7p/claude_sonnet_46_distilled_deepseek/

https://x.com/i/status/2026130112685416881

[–] HiddenLayer555@lemmy.ml 1 points 1 day ago* (last edited 1 day ago)

Interestingly I've gotten the 32B local Deepseek R1 model to say it's Claude as well (in English).

IDK if it's indicative of distillation or "model theft" though (how do you distill a closed model like Claude?) And why would them training on prompt responses from Claude also carry the name over unless Claude says its name every time? From my uneducated guess it's probably more of an indication that LLMs don't actually know anything and the information they're trained on mentions every AI model so they just randomly "pick" the most commonly mentioned ones for what they think they are if you don't tell them. Deepseek is a common Chinese model so it's probably associated with asking names of models in Chinese, same with Claude and English.

[–] yogthos@lemmy.ml 12 points 3 days ago

I think the reason they're making noise is cause they want to make a case to ban Chinese models entirely. Right now they have a problem that Chinese models are open and anybody can download and run their own version. That directly undermines the whole business model of providing them as a service. I bet they're going to try and argue that since DeepSeek and other Chinese companies stole their IP, these models are now illegal and can't be used in the US.