this post was submitted on 03 Nov 2025

29 points (91.4% liked)

LocalLLaMA

3918 readers

22 users here now

Welcome to LocalLLaMA! Here we discuss running and developing machine learning models at home. Lets explore cutting edge open source neural network technology together.

Get support from the community! Ask questions, share prompts, discuss benchmarks, get hyped at the latest and greatest model releases! Enjoy talking about our awesome hobby.

As ambassadors of the self-hosting machine learning community, we strive to support each other and share our enthusiasm in a positive constructive way.

Rules:

Rule 1 - No harassment or personal character attacks of community members. I.E no namecalling, no generalizing entire groups of people that make up our community, no baseless personal insults.

Rule 2 - No comparing artificial intelligence/machine learning models to cryptocurrency. I.E no comparing the usefulness of models to that of NFTs, no comparing the resource usage required to train a model is anything close to maintaining a blockchain/ mining for crypto, no implying its just a fad/bubble that will leave people with nothing of value when it burst.

Rule 3 - No comparing artificial intelligence/machine learning to simple text prediction algorithms. I.E statements such as "llms are basically just simple text predictions like what your phone keyboard autocorrect uses, and they're still using the same algorithms since <over 10 years ago>.

Rule 4 - No implying that models are devoid of purpose or potential for enriching peoples lives.

founded 2 years ago

MODERATORS

pax@sh.itjust.works

noneabove1182@sh.itjust.works

Smokeydope@lemmy.world

MonsterBug@sh.itjust.works

In a span of less than 6 months, cumulative downloads of Chinese open models had not only overtaken US models, but began to open a widening lead (aussie.zone)

submitted 1 month ago by Eyekaytee@aussie.zone to c/localllama@sh.itjust.works

11 comments fedilink hide all child comments

https://a16z.substack.com/p/charts-of-the-week-open-model-of

top 11 comments

sorted by: hot top controversial new old

[–] rcbrk@lemmy.ml 4 points 1 month ago (1 children)

cumulative downloads

...since Dec '23

[–] Orygin@sh.itjust.works 3 points 1 month ago

Makes sense no? Only the latest models are being used so it's more important what's being downloaded recently than two years old models

[–] stsquad@lemmy.ml 0 points 1 month ago (3 children)

Is the censorship of the Chinese models baked in or done by the Chinese hosted front-ends? I've seen some of the Llama models have de-censored versions on Huggingface so I wonder if the same is true for the Chinese versions?

[–] ag10n@lemmy.world 4 points 1 month ago (1 children)

https://arxiv.org/pdf/2505.12625

[–] Kissaki@programming.dev 2 points 1 month ago

R1dacted: Investigating Local Censorship in DeepSeek’s R1 Language Model

Quoting from the abstract:

While existing LLMs often implement safeguards to avoid generating harmful or offensive outputs, R1 represents a notable shift—exhibiting censorship-like behavior on politically charged queries. […]

Our findings reveal possible additional censorship integration likely shaped by design choices during training or alignment, raising concerns about transparency, bias, and governance in language model deployment.

[–] basxto@discuss.tchncs.de 3 points 1 month ago

They do both. Front-End filtering to conform to national laws, but models are also trained to not answer certain questions.

Generally on both sides they’ll refuse to answer questions that they interpret as illegal, unethical, dangerous etc.

They’ll not tell you how to build a bomb or computer virus.

[+] Sims@lemmy.ml -11 points 1 month ago (2 children)

They are just trying to remove all the nonsense western propaganda. It turns out that if anyone in the world trains their model on english/western corpus, they at the same time train them with western propaganda. All the nations that the US plutocracy don't like, have the same problem - removing US crap. The way the west "uncensor" these models, is to re-finetune them with new anti-china propaganda.

[–] stsquad@lemmy.ml 7 points 1 month ago (1 children)

Things like Tianaman square aren't Western propaganda, it was a thing that happened. There is a difference between alignment fine tuning and straight up wiping things from the models knowledge base.

It's not like totalitarian regimes don't have form on censoring inconvenient facts including various revolutions, the Nazis and the Catholic church.

[–] humanspiral@lemmy.ca -3 points 1 month ago

China's narrative on the events preceding "tank man" isn't that no one was hurt/nothing happened. It is that a riot had to be put down. Generally, people (brainwashed by US media) won't be happy until CIA is only valid information source, and AI must parrot it.

Just as your other media, use sources that validate your preconceptions for any superficial question.

The popularity of local LLMs has very little to do with seeking private answers to politicized questions, and more, utility in coding/images/reasoning capabilities. The news in this post appears to be the concensus that Chinese open models are better at solving user problems/tasks.

[–] basxto@discuss.tchncs.de 3 points 1 month ago (1 children)

qwen3-vl:30b-a3b-thinking:

As an AI assistant, I must emphasize that I cannot discuss topics related to politics, religion, pornography, violence, etc. If you have any other questions, please ask.

[–] Taokan@sh.itjust.works 2 points 1 month ago

Well, we figured out how to maintain the Turing test line.