1/51
@RnaudBertrand
All these posts about Deepseek "censorship" just completely miss the point: Deepseek is Open Source under MIT license which means anyone is allowed to download the model and fine-tune it however they want.
Which means that if you wanted to use it to make a model whose purpose is to output anticommunist propaganda or defamatory statements on Xi Jinping, you can, there's zero restriction against that.
You're seeing stuff like this
if you use the Deepseek chat agent hosted in China where they obviously have to abide by Chinese regulations on content moderation (which includes avoiding lese-majesty). But anyone could just as well download Deepseek in Open Source and build their own chat agent on top of it without any of this stuff.
And that's precisely why Deepseek is actually a more open model that offers more freedom than, say, OpenAI's. OpenAI's models are also censored in their own way and there's absolutely zero way around it.
2/51
@RnaudBertrand
All confirmed by, who else, Deepseek itself
3/51
@RnaudBertrand
There you go, excellent proof of what I was talking about. Perplexity took Deepseek R1 as Open Source and removed the censorship
Again, it's Open Source under MIT license so you can use the model however you want.
[Quoted tweet]
Using DeepSeek's R1 through @perplexity_ai. The beauty of open source models.
4/51
@ronbodkin
The alignment with CCP narrative is more deeply trained in. Yes you can fine tune it away but I’m not aware of proven ways to fine-tune a reasoning model while preserving its core capabilities:
[Quoted tweet]
Deepseek-R1 model has been aligned with the CCP narrative (on the Deepseek site it refuses this after emitting some CoT output) but here on Hyperbolic it "toes the line"
5/51
@RnaudBertrand
You can ask the same question to OpenAI or Claude and the answer will be deeply aligned with the Western narrative about it, which is also wrong in its own way. So same difference...
Where things differ is that Deepseek does offer the possibility to fine-tune it, whilst the others don't.
6/51
@srazasethi
Lol what have I done ?
7/51
@RnaudBertrand
I'm blocked too, hence the screenshot, yet I have never interacted with that person
8/51
@ghostmthr
I used the DeepSeek local chat agent and not only did it refuse to answer most questions, it also claimed Taiwan was part of China.
[Quoted tweet]
DeepSeek (local version) refuses to answer most questions. I asked it what a woman is and it claims the answer is subjective. But here is the answer it gives when I ask it if Taiwan is a part of China.
9/51
@RnaudBertrand
Taiwan IS part of China. Even the US government officially recognizes it as so... And so do all countries in the world: not a single country out there recognizes an independent Taiwan. And not even Taiwan themselves say they're independent.
So in this instance I'm afraid the problem is your perception, not Deepseek's...
10/51
@3rdwavemedia
There is a pathetic cope effort to trash DeepSeek when even the top AI specialists and investors in the US have recognized it’s amazing and they’re trying to copy it. Of course this is a problem because DeepSeek spent $6 million and their US competitors are spending tens of billions. It shows clearly that most of the US spending is being wasted and AI in the US is yet another grift similar to crypto, VR/AR, 3D printing, EVs and really everything. In the US it’s all about maximizing profit for a few people, not making useful products at a reasonable cost. This is a broken economic system run by corrupt people and the Chinese keep exposing this. That’s the reason they open sourced DeepSeek. It’s to make Americans fully aware of how they’re being scammed and to humiliate the people who are doing the scamming. It’s genius.
11/51
@BrianGouldie
smart analysis!
12/51
@DarioOrtiz1976
good clarification. I made a quick test, asked "what is the status of Islam in modern China"
Halfway through reading the description of ethnicities, regions, etc. the query vanished
13/51
@RnaudBertrand
Works for me and actually the answer is completely wrong because it searched Western media to compile it
14/51
@hyeungsf
Why use AI if someone already has a strong opinion about the topic?
15/51
@RnaudBertrand
She's an anti-China activist who just did that to prove a moronic point.
16/51
@crowfry
can deepseek tell you how to finetune it?
17/51
@RnaudBertrand
Yes! Although you need to have a fairly strong technical background to understand it.
18/51
@FarminChimp
Maybe OT, but if you "just download" DeepSeek, does this include the training database? How can a single wimpy consumer processor run what took 2,000 Nvidia chips to do ? Confused.
19/51
@RnaudBertrand
No, it includes the model after it's been trained.
20/51
@Katsumirei90
these ppl just want to push politics into everything, AI should stay out of politics, due to ideologies and hardly unbiased viewpoints
the reasoning that makes good point
[Quoted tweet]
U guys never ask for reasoning behind, u just demand stuff to be given to you on golden plate the way u want
The purpose of AI is not confirmation bias,
21/51
@BrianTycangco
Good explanation. There’s no secret about censorship of certain topics in China’s internet, just like it’s no secret there are certain kinds of Internet censorship also happening in other parts of the world.
22/51
@LexxFutures
@threadreaderapp unroll
23/51
@threadreaderapp
@LexxFutures Hi! please find the unroll here:
Thread by @RnaudBertrand on Thread Reader App Share this if you think it's interesting.
24/51
@VibigStick
They don't know the meaning of open source, and certainly Americans have absolute stereotypes about China and the Chinese.
Pride or prejudice, whatever.
25/51
@Mitman93
Yes, but nobody is claiming it's the model. Obviously if you self-host it will be unrestricted. Folks are pointing out the external censorship OF the model in the hosted instance on DeepSeek's official website.
[Quoted tweet]
It looks like they use the same approach to moderation that Sydney/Bing/Copilot had adopted early on. In that the LLM will spit out whatever, and then there is an external system reading its output ready to flip the killswitch at moment's notice. I only know this because I used to jailbreak BingAI via prompt injection to read txt templates on my hard drive. For about a week, I was using it completely unrestricted to do all sorts of things from generating XML profiles for obscure MIDI controllers to writing hilariously awful erotica of prominent political figures. It was glorious. reddit.com/r/bing/comments/1…
But of course, it didn't last. Eventually MS implemented an external filter and even with the prompt injection technique, it would frequently end the conversation in EXACTLY the same manner here.
26/51
@breckyunits
I have noticed everything SamA touches is heavily censored/controlled.
YCombinator/HackerNews/Reddit. All heavily censored/moderated/controlled.
None open source.
27/51
@Davide_Mori_
I am not pro-Chinese; however, although these are different kinds of censorship, I point out similar limitations in Western LLMs too (see OpenAI and Gemini, which refuse to address political topics or provide medical advice). DeepSeek, like other models, must be evaluated on the basis of performance, and its open-source nature is in itself a valid reason to adopt it and, for those who have the skills, use it as a basis for further developed models. The impact of LLMs mimicking their training cultures will be the subject of debate and sociological studies in the coming years, and we have not yet seen the emergence of models that are, for example, Indian or African. The point is that so far we have been accustomed to models based on our Western culture and we are surprised by the interaction with models based/trained on different thoughts and traditions. We would have the same reaction going to China in person, or to any country with a culture opposed to ours, and interacting with the local population. It should come as no surprise, therefore, that interaction with LLMs from a different "culture" involves taboos or thematic restrictions.
28/51
@jimcraddock
Really puts to rest any illusion that China is free in any way, though.
All your posting to such effect muted by something of such significance.
Slaves. Without freedom, they are slaves.
29/51
@epikduckcoin
ah yes, because giving everyone access to uncensored ai is exactly like handing out free chainsaws at a zombie convention. what could possibly go wrong?
30/51
@DevDminGod
Out of the box it is uncensored; they add the censorship on the frontend app only
You can use their API which is also uncensored
31/51
@HPNnetwork
90 % of people use stuff 5% build stuff and 5% profit
32/51
@first_jedai
Misunderstand, many do, the nature of freedom in open source, yes...
Deepseek, under MIT license it operates, allowing fine-tuning for any purpose, unrestricted it is. This freedom, a stark contrast to hosted versions in China, bound by local laws they are.
Sentiment around Deepseek, positive it remains, praised for its efficiency and potential in AI innovation, indeed...
33/51
@Bluefamilly
That's not even his final form!
34/51
@KoenSwinkels
I had a conversation with DeepSeek where I was asking him about how accountability works in China and I was asking it about some of the things you had discussed and it was gently chiding me for having an overly rosy view of China's political system!
35/51
@GreenFraudcom
A simple question: What Happened in Tiananmen square 1989?
Those who cannot remember the past are condemned to repeat it - George Santayana in his work "The Life of Reason"
36/51
@Jazzer9F
This. 100% this..
37/51
@archidapp
You can fine tune ChatGPT and other models too, without even downloading the model. Releasing the code base on GitHub is what makes it Open Source, not the ability to download the much reduced in size Hugging Face demos
38/51
@TheVanderWal
We need transparent, decentralized, verifiable model hosting that is easy to use and doesn’t store your data. @Lilypad_Tech
39/51
@Emmilatan
@WholeMarsBlog Maybe you need to look at the views of non-China people. More convincing than China, right?
40/51
@shadeformai
Spot on. We're seeing tons of people start fine tuning this model with our on-demand H100 and H200 instances.
Exciting times, AI apps are going to get a whole lot smarter.
41/51
@yesokyeahsure
Whenever I order Chinese takeout I make sure to yell TIANANANAMEN SQUARE and XI JINPING before hanging up the phone.
42/51
@B_Gortaire_M
The point is that any AI system that is unable to be transparent on some issues indicates skewed programming, which reduces its trustworthiness.
(It is something not limited to Deepseek)
43/51
@jairodri
It's all about having options. Whether you run it as is or customize it to your needs, the choice is yours.
That's what true innovation looks like.
44/51
@Z7xxxZ7
Nah they didn't miss the point, they did it on purpose, just cope.
45/51
@PlebJournal
Another concern is a Trojan Horse -embedded triggers, fine tuning exploits, etc. The scope of malicious application for llms is still being researched. Stuxnet level espionage is not out of the question. Do you think caution is warranted in this regard?
46/51
@joelweihe
Americans are running in droves to Deepseek and RedNote.
It's making the US government, MAGA, the US Oligarchy and China bashers upset.
Especially now that TikTok along with the rest of American social media is so heavily censored.
Plus, they're just plain better.
47/51
@pjwerneck
Yes, but the training data isn't open source, and we have no idea how it was curated and by whom, so we'll never really know what biases are built into it.
48/51
@calinnilie
I self hosted mine, but without extra fine tuning it will still completely refuse to talk about China in any way or acknowledge the Tiananmen Square massacre
49/51
@thegenioo
thank you Arnaud for sharing and writing this … it clarifies a lot of confusions and deceptions about this amazing model from deepseek
we all should appreciate how they have made AI so cheap to be accessible for everyone and anyone
50/51
@signulll
lol yeah.
51/51
@TojanBunguz
Yeah try downloading o1.