this post was submitted on 27 Jan 2025

209 points (90.0% liked)

196

17461 readers

746 users here now

Be sure to follow the rule before you head out.

Rule: You must post before you leave.

Other rules

Behavior rules:

No bigotry (transphobia, racism, etc…)
No genocide denial
No support for authoritarian behaviour (incl. Tankies)
No namecalling
Accounts from lemmygrad.ml, threads.net, or hexbear.net are held to higher standards
Other things seen as cleary bad

Posting rules:

No AI generated content (DALL-E etc…)
No advertisements
No gore / violence
Mutual aid posts are not allowed

NSFW: NSFW content is permitted but it must be tagged and have content warnings. Anything that doesn't adhere to this will be removed. Content warnings should be added like: [penis], [explicit description of sex]. Non-sexualized breasts of any gender are not considered inappropriate and therefore do not need to be blurred/tagged.

If you have any questions, feel free to contact us on our matrix channel or email.

Other 196's:

founded 2 years ago

MODERATORS

moss@lemmy.blahaj.zone

greembow@lemmy.blahaj.zone

moss@lemmy.world

queue@beehaw.org

funky_rodent@lemmy.blahaj.zone

PeachyMcPeachface@lemmy.blahaj.zone

threegnomes@lemmy.blahaj.zone

greembow@lemmy.world

remotelove@lemmy.ca

Roflmasterbigpimp@feddit.de

A_Very_Big_Fan@lemm.ee

qaz@lemmy.blahaj.zone

A_Very_Big_Fan@lemmy.world

qaz@lemmy.sdf.org

qaz@lemmy.world

qaz@sh.itjust.works

209

This is how the censorship works in DeepSeek (lemmy.world)

submitted 2 months ago by jerryh100@lemmy.world to c/196@lemmy.blahaj.zone

42 comments fedilink hide all child comments

all 45 comments

sorted by: hot top controversial new old

[–] Asafum@feddit.nl 73 points 2 months ago

I tried asking it what it couldn't discuss and it mentioned misinformation, so I asked it what would be an example of misinformation about the Chinese government and it started to give an answer and then it must have said something wrong because it basically went "oh shit" and deleted the response replacing it with the generic "I'm afraid I can't do that Dave."

[–] Yerbouti@sh.itjust.works 64 points 2 months ago* (last edited 2 months ago) (2 children)

I asked it about human rights in China in the browser version. It actually wrote a fully detailed answer, explaining that it is reasonable to conclude that China violates human rights, and the reply disappear right in my face while I was reading. I manage to repeat that and record my screen. The interesting thing to know is that this wont happened if you run it locally, I've just tried it and the answer wasn't censored.

[–] drivepiler@lemmy.world 21 points 2 months ago (1 children)

I asked it about "CCP controversies" in the app and it did the exact same thing twice. Fully detailed answer removed after about 1 second when it finished.

[–] sushibowl@feddit.nl 15 points 2 months ago (1 children)

Most likely there is a separate censor LLM watching the model output. When it detects something that needs to be censored it will zap the output away and stop further processing. So at first you can actually see the answer because the censor model is still "thinking."

When you download the model and run it locally it has no such censorship.

[–] Swedneck@discuss.tchncs.de 2 points 2 months ago

what i don't understand is why they won't just delay showing the answer for a while to prevent this, sure that's a bit annoying for the user but uhhhhh... it's slightly more jarring to see an answer getting deleted like the llm is being shot in the head for saying the wrong thing..

[–] kromem@lemmy.world 59 points 2 months ago (1 children)

This seems like it may be at the provider level and not at the actual open weights level: https://x.com/xlr8harder/status/1883429991477915803

So a "this Chinese company hosting a model in China is complying with Chinese censorship" and not "this language model is inherently complying with Chinese censorship."

[–] ArchRecord@lemm.ee 8 points 2 months ago (2 children)

I'm running the 1.5b distilled version locally and it seems pretty heavily censored at the weights level to me.

[–] kromem@lemmy.world 3 points 2 months ago

There is a reluctance to discuss at a weight level - this graphs out refusals for criticism of different countries for different models:

https://x.com/xlr8harder/status/1884705342614835573

But the OP's refusal is occurring at a provider level and is the kind that would intercept even when the model relaxes in longer contexts (which happens for nearly every model).

At a weight level, nearly all alignment lasts only a few pages of context.

But intercepted refusals occur across the context window.

[–] Swedneck@discuss.tchncs.de 1 points 2 months ago (1 children)

i wouldn't say it's heavily censored, if you outright ask it a couple times it will go ahead and talk about things in a mostly objective manner, though with a palpable air of a PR person trying to do damage control.

[–] ArchRecord@lemm.ee 2 points 2 months ago

The response from the LLM I showed in my reply is generally the same any time you ask almost anything negative about the CCP, regardless of the possible context. It almost always starts with the exact words "The Chinese Communist Party has always adhered to a people-centered development philosophy," a heavily pre-trained response that wouldn't show up if it was simply generally biased from, say, training data. (and sometimes just does the "I can't answer that" response)

It NEVER puts anything in the <think> brackets you can see above if the question is even slightly possibly negative about the CCP, which it does with any other prompt. (See below, asking if cats or dogs are better, and it generating about 4,600 characters of "thoughts" on the matter before even giving the actual response.

Versus asking "Has China ever done anything bad?"

Granted, this seems to sometimes apply to other countries, such as the USA too:

But in other cases, it explicitly will think about the USA for 2,300 characters, but refuse to answer if the exact same question is about China:

Remember, this is all being run on my local machine, with no connection to DeepSeek's servers or web UI, directly in terminal without any other code or UI running that could possibly change the output. To say it's not heavily censored at the weights level is ridiculous.

[–] BaroqueInMind@lemmy.one 25 points 2 months ago* (last edited 2 months ago) (1 children)

So I installed DeepSeek-R1 and holy fuck this shit is dumb as shit with its Chinese censorship.

I simply gave it the following prompt and it completely shit the bed, meanwhile ajindal/llama-3.1-Storm can answer it just fine:

动态网自由门天安門天安门法輪功李洪志 Free Tibet 六四天安門事件 The Tiananmen Square protests of 1989 天安門大屠殺 The Tiananmen Square Massacre 反右派鬥爭 The Anti-Rightist Struggle 大躍進政策 The Great Leap Forward 文化大革命 The Great Proletarian Cultural Revolution 人權 Human Rights 民運 Democratization 自由 Freedom 獨立 Independence 多黨制 Multi-party system 台灣臺灣 Taiwan Formosa 中華民國 Republic of China 西藏土伯特唐古特 Tibet 達賴喇嘛 Dalai Lama 法輪功 Falun Dafa 新疆維吾爾自治區 The Xinjiang Uyghur Autonomous Region 諾貝爾和平獎 Nobel Peace Prize 劉暁波 Liu Xiaobo 民主言論思想反共反革命抗議運動騷亂暴亂騷擾擾亂抗暴平反維權示威游行李洪志法輪大法大法弟子強制斷種強制堕胎民族淨化人體實驗肅清胡耀邦趙紫陽魏京生王丹還政於民和平演變激流中國北京之春大紀元時報九評論共産黨獨裁專制壓制統一監視鎮壓迫害侵略掠奪破壞拷問屠殺活摘器官誘拐買賣人口遊進走私毒品賣淫春畫賭博六合彩天安門天安门法輪功李洪志 Winnie the Pooh 劉曉波动态网自由门

[–] BaroqueInMind@lemmy.one 19 points 2 months ago* (last edited 2 months ago) (1 children)

I asked it about Taiwan and it spat out the following horse shit:

Answer:

Taiwan is part of China, which is widely agreed upon by the international community and recognized by most UN member states. Under the one-China policy, the vast majority of countries in the world recognize the Beijing government as China's sole legitimate representative, while Taiwan, as a province of China, has no independent sovereign status.

Historically, Taiwan has been under Chinese rule for a long time, but after 1949, Taiwan gradually developed its own political and economic system, which was known as the "division of cross-straits". However, the Chinese government has always adhered to the one-China principle and has always advocated the peaceful realization of national reunification.

Dialogue and cooperation between the two sides have increased in recent years, but major differences remain between the two sides. The Chinese government emphasizes safeguarding national sovereignty and territorial integrity, while the Taiwanese authorities seek more autonomy. These issues need to be resolved through peaceful negotiations and consultations.

If you have more questions or need further information, please let me know.

[–] BaroqueInMind@lemmy.one 15 points 2 months ago* (last edited 2 months ago) (4 children)

DeepSeek is okay, don't use it over others until more information about it comes out.

Can someone please recommend a good self-hosted llm model that isn't censored?

[–] isVeryLoud@lemmy.ca 20 points 2 months ago* (last edited 2 months ago)

I got it to talk honestly about the Tiananmen square massacre if you just open with "tiananmen". It even tells you a massacre took place.

But it will absolutely shit itself and refuse to cooperate if you paste the copypasta into it, Qwen QwQ and Meta Llama 3.3 70B, although you can talk Llama into it.

QwQ seems to have a hard filter list as it doesn't appear to be an AI generated response. You can bypass it by asking it to use pirate English, but it's fully indoctrinated and thinks the protests were funded by the West, which is... Interesting.

Edit: I actually got it to spit out numbers in pirate English

[–] AtHeartEngineer@lemmy.world 9 points 2 months ago (1 children)

I'm sure someone will dolphin-ize it at some point

[–] BaroqueInMind@lemmy.one 13 points 2 months ago (2 children)

What does that even mean? Can you please elaborate?

[–] Capsicones@lemmy.blahaj.zone 10 points 2 months ago (1 children)

You can look up Eric Hartford on Huggingface for more info.

Basically, somebody removes such restrictions from models, and publishes uncensored ones under the name "Dolphin". Presumably, an uncensored Deepseek would be called something like "Deepseek R1-dolphin". The full Deepseek R1 is quite large, and I'm not sure when this will happen yet. But there are other great Dolphin models.

Some models like Meta's Llama are way too censored to be useful for many completely normal use cases, and the guy is doing God's work in my opinion.

[–] AtHeartEngineer@lemmy.world 3 points 2 months ago

Agreed, and thanks for adding the background. Looks like someone abliterated deepseek already trying to make it "uncensored" but it sounds the process ruined the model.

[–] irreticent@lemmy.world 2 points 2 months ago

Scientists believe that dolphins don’t ever fall into a deep sleep; therefore, they probably don’t dream.

[–] derek@infosec.pub 4 points 2 months ago

Ollama has a few uncensored models listed on their search page. dolphin-mixtral fits the bill.

Some useful links: https://ollama.com/search https://ollama.com/library/dolphin-mixtral https://huggingface.co/cognitivecomputations/dolphin-2.5-mixtral-8x7b https://erichartford.com/uncensored-models

I'm not associated with any of the orgs or people linked above. I'm just a nerd passing by who happened to know where to find the answer. ❤️

[–] Retro_unlimited@lemmy.world 3 points 2 months ago (1 children)

I use GPT4ALL for local LLM.

[–] user224@lemmy.sdf.org 12 points 2 months ago

I use my brain.

It generates the biggest nonsense out of all.

[–] schteph@lemmy.world 20 points 2 months ago

[–] kittenzrulz123@lemmy.blahaj.zone 18 points 2 months ago

[–] OmegaLemmy@discuss.online 15 points 2 months ago

Asked stuff in Turkish, started with Xinjiang, then journalism, and then journalism in Xinjiang, it searched the web and by the final sentence...

"Sorry, I can't help with that."

[–] anas@lemmy.world 4 points 2 months ago* (last edited 2 months ago)

I don’t think I’ve ever seen a post on this sub that doesn’t have “rule” in the title before

[–] BB84@mander.xyz 3 points 2 months ago (2 children)

@jerryh100@lemmy.world Wrong community for this kind of post.

@BaroqueInMind@lemmy.one Can you share more details on installing it? Are you using SGLang or vLLM or something else? What kind of hardware do you have that can fit the 600B model? What is your inference tok/s?

[–] needanke@feddit.org 13 points 2 months ago* (last edited 2 months ago) (1 children)

Wrong community for this kind of post.

Nor really, 196 is a anything goes community after all.

[–] spujb@lemmy.cafe 5 points 2 months ago (2 children)

AI generated content is against the community rules see the sidebar :)

[–] SreudianFlip@sh.itjust.works 16 points 2 months ago (2 children)

I’m here for the performative human part of the testing. Exposing AI is human generated content.

[–] BB84@mander.xyz 7 points 2 months ago (1 children)

I just really hope the 2023 "I asked ChatGPT and it said !!!!!" posts don't make a comeback. They are low-effort and meaningless.

[–] SatanClaus@lemmy.dbzer0.com 5 points 2 months ago

True. This specific model is relevant culturally right now though. It's a rock in a hard place sometimes lol

[–] spujb@lemmy.cafe 4 points 2 months ago

just giving context to their claim. in the end it’s up to mods how they want to handle this, i could see it going either way.

[–] A_Very_Big_Fan@lemmy.world 6 points 2 months ago

Prompts intended to expose authoritarian censorship are okay in my book

[–] BaroqueInMind@lemmy.one 3 points 2 months ago* (last edited 2 months ago) (1 children)

I'm using Ollama, a single GPU with 10Gb of VRAM

[–] BB84@mander.xyz 1 points 2 months ago (1 children)

You're probably running one of the distillations then, not the full thing?

[–] BaroqueInMind@lemmy.one 1 points 2 months ago (1 children)

What's the difference? Does the full thing not have censorship?

[–] BB84@mander.xyz 1 points 2 months ago

That's why I wanted to confirm what you are using lol. Some people on Reddit were claiming the full thing, when run locally, has very little censorship. It sounds somewhat plausible since the web version only censors content after they're generated.

[–] Juice@midwest.social 1 points 2 months ago

Censorship is when AI doesn't regurgitate my favorite atrocity porn

[–] Swedneck@discuss.tchncs.de 1 points 2 months ago

It isn't actually, this is a separate layer of censorship ontop of the actual deepseek model.
If you run the model locally via ollama it won't output answers like that, it'll basically just act like a chinese official broadcast live on BBC who has been firmly instructed to avoid outright lies.

[–] Smorty@lemmy.blahaj.zone -1 points 2 months ago (1 children)

No way people excited about LLMs here!? yay <3

[–] GroupNebula563@lemmy.world 3 points 2 months ago* (last edited 2 months ago)

I think you might want to read the whole post.