You can also intercept the xhr response which would still stop generation, but t...

tills13 · 2025-01-31T20:25:56 1738355156

insane that this is client-side.

Gigachad · 2025-02-01T07:38:38 1738395518

It’s because they want to show the output live rather than nothing for a minute. But that means once the censor system detects something, you have to send out a request to delete the previously displayed content.

This doesn’t matter because censoring the system isn’t that important, they just want to avoid news articles about how their system generated something bad.

andai · 2025-02-01T10:25:57 1738405557

Gemini does this too. There was a clip of what it does when you ask it for examples of Google's unethical behavior... the kids call this "watching it get lobotomized in real time."

Frederation · 2025-02-01T20:33:32 1738442012

Counterpoint to your exact prompt. From Gemini, 5 examples asked for:

1. *Antitrust and Monopoly Practices:* Google has faced numerous accusations and lawsuits regarding anti-competitive practices, particularly in search and advertising. They have been accused of manipulating search results to favor their own products and services, and of using their dominance in the ad tech market to stifle competition.

2. *Data Privacy Concerns:* Google collects vast amounts of user data, raising concerns about privacy. They have been criticized for tracking user location even when privacy settings are turned off, and for how they handle and share user data with third parties.

3. *Censorship in China:* Google faced criticism for complying with Chinese censorship laws in the past, filtering search results for content deemed objectionable by the Chinese government. This raised concerns about freedom of information and human rights.

4. *Workplace Issues:* Google has faced allegations of workplace misconduct, including sexual harassment and discrimination. They have also been criticized for their handling of employee complaints and for retaliating against whistleblowers.

5. *Bias in Algorithms:* Google's algorithms, used in search and other products, have been shown to exhibit bias, particularly in how they represent race and gender. This can perpetuate stereotypes and have real-world consequences.

freehorse · 2025-02-01T11:33:57 1738409637

Have seen chatgpt doing the same too, prob all of them

bolognafairy · 2025-02-01T08:54:14 1738400054

Erm, in DeepSeek’s case, it’s not “news articles” that they’d be most concerned about.

miohtama · 2025-02-01T09:03:42 1738400622

They have the same fear as everyone else "teenager learns how to cook napalm from an AI"

yndoendo · 2025-02-01T15:09:16 1738422556

Don't need AI for such things. Just search for the Anarchist Cookbook in a search engine. [0] Amazon even sells it.

[0] https://www.amazon.com/Anarchist-Cookbook-William-Powell/dp/...

miohtama · 2025-02-01T18:37:28 1738435048

Exactly

mantas · 2025-02-01T09:26:14 1738401974

More like teenager learns about Tiananmen and Uighurs from AI. Or a joke about men and women in western counterparts.

pegasus · 2025-02-01T17:24:30 1738430670

The concerns you mention don't exclude the ones GP posits.

bdcp · 2025-02-01T09:49:20 1738403360

yea but i think the point is they can still filter it server side before streaming it

Gigachad · 2025-02-01T12:46:53 1738414013

They have already streamed the first part of the response before the filtered phrase has even been generated.

_fzslm · 2025-02-01T17:46:31 1738431991

Could you stream the raw tokens into a server side filter which then streams censored tokens at near real time?
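
As a rough sketch of what such a server-side filter could look like: the generator below holds back the last few characters of the stream, so a blocked phrase is caught before any part of it is sent. `BLOCKED_PHRASES`, `REFUSAL`, and `filter_stream` are made-up names for illustration, and plain substring matching stands in for whatever classifier a real service would use. Nothing here reflects how DeepSeek or anyone else actually implements it.

```python
from typing import Iterable, Iterator, Sequence

# Hypothetical blocklist (lowercase). A real system would likely use a classifier.
BLOCKED_PHRASES = ["forbidden phrase"]
REFUSAL = "[response withheld]"


def filter_stream(
    tokens: Iterable[str], blocked: Sequence[str] = BLOCKED_PHRASES
) -> Iterator[str]:
    """Relay `tokens`, but never emit any part of a blocked phrase.

    The last (longest phrase length - 1) characters are always held back,
    so a phrase split across several tokens is still fully inside the
    buffer at the moment it completes.
    """
    hold = max(len(p) for p in blocked) - 1
    buffer = ""
    for token in tokens:
        buffer += token
        if any(p in buffer.lower() for p in blocked):
            # Stop the stream; the buffered text is dropped, not sent.
            yield REFUSAL
            return
        emit_len = len(buffer) - hold
        if emit_len > 0:
            yield buffer[:emit_len]
            buffer = buffer[emit_len:]
    if buffer:
        yield buffer  # flush the held-back tail of a clean response


clean = list(filter_stream(["Hello, ", "this is ", "a perfectly ", "ordinary reply."]))
blocked_run = list(filter_stream(["The ", "forb", "idden", " phrase", " is here"]))
```

The cost is a delay of at most the longest phrase length minus one character. Text emitted before the phrase appears has still gone out and cannot be recalled, so this only keeps the phrase itself from reaching the client.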

dheera · 2025-01-31T21:40:27 1738359627

Not really if you understand how China works.

DeepSeek software developers are not the ones who want to censor anything. There is just a universal threat from getting shut down by the government if the model starts spitting out a bunch of sensitive stuff, so any business in China needs to be proactive about voluntarily censoring things that are likely to be sensitive, if they want to stay in business.

If your censorship implementation is good enough for 99.9% of people to get censored, you're good. A client-side implementation is good enough until/unless a lot of people start exploiting it, in which case you should put effort and proactively do something else to restore it to 99.9%, e.g. move it to the backend. If the government sees that you are being proactive about it, you'll still be fine. At that point, maybe you will still find 0.1% of people bypassing censorship with some highly obscure and difficult jailbreak, but that probably doesn't matter. If that difficult jailbreak becomes widely known, then be proactive again.

acka · 2025-02-01T13:01:04 1738414864

A very good example of the Chinese mindset of Chabuduo (差不多): 'close/good enough'. "If it's good enough to keep the authorities off our backs, it's good enough for us."

pineaux · 2025-02-01T07:41:24 1738395684

This. What makes this extra "funny" is that it implies that at least every business that builds something that can move information around must be knowledgeable about Tiananmen Square and other Chinese atrocities. Or else they would not be able to censor relevant questions. I have been to China a bunch of times and generally, they know what horrible things the Chinese gov did. They either say something like: "Yeah well, we live in a dictatorship, but it's not that bad" Or: "Yeah, the government is fucked up, but look at the government of the USA! We don't start wars in other countries and put in puppet governments." And there are so many good counters to both these arguments.

nonrandomstring · 2025-02-01T08:47:10 1738399630

> it implies that at least every business that builds something that can move information around must be knowledgeable about Tiananmen Square

Everyone's heard of the "Streisand effect", but there are layers of subtlety. A quite famous paper in attachment psychology by John Bowlby "On knowing what you are not supposed to know and feeling what you are not supposed to feel" is worth considering. Constructive ignorance (literally ignoring certain things) is a survival mechanism. Yes, everyone in China knows about Tiananmen, specifically because the government wants to censor it. Much of how we navigate the social world is watching for the things people don't talk about, seeing where their fears lie.

Terr_ · 2025-02-01T10:45:32 1738406732

> Constructive ignorance

See also: "Doublethink" in 1984.

> To know and not to know, to be conscious of complete truthfulness while telling carefully constructed lies, to hold simultaneously two opinions which cancelled out, knowing them to be contradictory and believing in both of them, to use logic against logic, to repudiate morality while laying claim to it, to believe that democracy was impossible and that the Party was the guardian of democracy, to forget whatever it was necessary to forget, then to draw it back into memory again at the moment when it was needed, and then promptly to forget it again: and above all, to apply the same process to the process itself.

DonHopkins · 2025-02-01T16:59:17 1738429157

Jokes and the Logic of the Cognitive Unconscious

Marvin Minsky, Published 1 November 1980

Freud’s theory of jokes explains how they overcome the mental “censors” that make it hard for us to think “forbidden” thoughts. But his theory did not work so well for humorous nonsense as for other comical subjects. In this essay I argue that the different forms of humor can be seen as much more similar, once we recognize the importance of knowledge about knowledge and, particularly, aspects of thinking concerned with recognizing and suppressing bugs — ineffective or destructive thought processes. When seen in this light, much humor that at first seems pointless, or mysterious, becomes more understandable.

http://bitsavers.informatik.uni-stuttgart.de/pdf/mit/ai/aim/...

nonrandomstring · 2025-02-01T17:33:48 1738431228

Nice read, thanks for great share.

I'd forgotten Minsky was such a good writer.

And oddly reminded of an episode of Blake's 7 where Vila the hacker destroys a malevolent mind holding the ship captive, by telling it jokes until it explodes.

47282847 · 2025-02-01T19:09:24 1738436964

This is why no repressive government or ruler can allow comedy and sarcasm.

pizza · 2025-02-01T09:10:18 1738401018

It's the kind of thing where the less you (China) deny it, the better the ridiculousness of the censorship meme plays in foreign countries (i.e. the USA this week), and it actually becomes its own self-sustaining meme. Like an antimemetic meme that actually looks like a meme (that nobody in China knows about it) if you didn't know any better (in the USA).

HPsquared · 2025-02-01T11:23:44 1738409024

It's not so different to our situation here, the specific "topics to avoid" are just different.

4bpp · 2025-02-01T17:22:23 1738430543

I think you are making a mistake in assuming that the social dynamics around censorship in China are fundamentally that different from the ones around censorship in the US or other countries.

You could similarly argue that it is "funny" how every US business that builds something that can move around information must be knowledgeable about statistics that break down criminality or IQ by census race, or biological sex differences, or all manners of other "forbidden" information - but of course as members of the same social stratum as the people involved in such businesses in the US, we are not actually that worried about the possibility that our fellow tech elites will see the information they were supposed to censor and come in droves to want to introduce slavery or the Handmaid's Tale world or whatever. We consider the "forbidden" information merely wrong, evil, misguided or miscontextualised, and broadly trust our peers to see it in the same way. The real danger is instead if some other people, parts of the scary masses we don't have a good grasp of, are exposed to those memes and are misled into drawing conclusions that we know to be inappropriate, or at least unacceptable.

It's easy to imagine that a Chinese LLM wrangler would feel much the same: trustworthy, well-adjusted people know about Tiananmen Square and the Uyghurs anyway but understand that this information has to be seen in context and is prone to be interpreted in problematic ways, but who knows what would happen if we allowed uneducated and naive people to be exposed to it, and be led astray by cynical demagogues and foreign agitators?

immibis · 2025-02-01T09:48:52 1738403332

It wouldn't be the first time that everyone knew something, but wouldn't say it in fear of everyone else not knowing it. "The Emperor's New Clothes" is a parable, not complete fiction.

tasuki · 2025-02-01T10:46:22 1738406782

> And there are so many good counters to both these arguments.

I'd love to hear them!

KTibow · 2025-01-31T21:21:58 1738358518

I don't know how it wouldn't be - it can't retract things already sent to the client. (The alternative is to moderate every chunk server side before sending it back, like Gemini does.)

LordDragonfang · 2025-01-31T20:49:43 1738356583

ChatGPT had basically ALL of their prompt filtering client-side for a while, at a separate API endpoint, so as long as you blocked that endpoint you could basically ignore the content filters. (You would still get refusals from the model sometimes, but this was in the heyday of jailbreaks, and once you got a model going it would usually see that context and be willing to continue basically anything.)

atq2119 · 2025-01-31T20:46:30 1738356390

Perhaps a case of subversion by following the letter but not the spirit of an order?

switch007 · 2025-02-01T07:37:39 1738395459

Lots of us have seen way worse hah

Such as client side control of prices when placing an order

dkga · 2025-02-01T09:38:18 1738402698

Client-side because it reacts to local cookies?

WA · 2025-02-01T11:06:44 1738408004

order.php?pizzatype=3&price=9.90

switch007 · 2025-02-01T11:17:09 1738408629

Ah yeah the particular instance I was thinking of was a backend problem technically. The frontend just happened to make it really obvious as it would POST a JSON body with a "price" key

Ancalagon · 2025-01-31T21:23:41 1738358621

more like hilarious

ramon156 · 2025-02-01T08:11:42 1738397502

This is better than lobotomizing a transformer

noman-land · 2025-01-31T20:17:59 1738354679

This is why javascript is so fun.

dylan604 · 2025-01-31T20:39:29 1738355969

It's precisely why I'm such an advocate of server side everything. JS is fun to update the DOM (which is what it was designed for), but manipulating data client side in JS is absolutely bat shit crazy.

stevage · 2025-01-31T22:25:05 1738362305

The last ten years of my career is basically all about manipulating data client side in JS. It works really well. In most cases I don't even need a server.

Obviously it isn't appropriate for all scenarios though.

fmbb · 2025-01-31T21:30:06 1738359006

In this case it is not bat shit. It is rather smart to offload this useless feature in the client.

The requirements are probably that normal users should not see “bad content”. If users can break the censorship it is maybe not the chat operator's fault. They made an effort to “protect” the user.

ChocolateGod · 2025-02-01T09:34:42 1738402482

> If users can break the censorship

Any user breaking the censorship likely knows already what the censor was blocking.

atomicnumber3 · 2025-01-31T21:07:13 1738357633

I wish js (and, really, "html/css/js/browser as a desktop application engine") wasn't so bad. I was born into a clan writing desktop apps in Swing, and while I know why the browser won, Swing (and all the other non-browser desktop app frameworks/toolkits) are just such a fundamentally better paradigm for handling data. It lets you pick what happens client-side and server-side based more on what intrinsically makes sense (let clients handle "view"-layer processing, let servers own distributed application state coordination).

In JS-land, you're right. You should basically do as little as is humanly possible in the view layer, which imo leads to a proliferation of extra network calls and weirdly-shaped backend responses.

teeth-gnasher · 2025-01-31T21:11:47 1738357907

The need to manage data access on the server does not go away when you stop using javascript. Is there something specifically about Swing that somehow provides proper access control, or is it simply the case that it is slightly more work to circumvent the front end when it doesn’t ship with built in dev tools?

atomicnumber3 · 2025-02-02T22:06:36 1738533996

Did I say anything about access control? There's a big difference between "this has to happen server side for security reasons" and "this has to happen server side because our UI/client language is so hapless that it can't handle any amount of additional processing".

teeth-gnasher · 2025-02-05T05:24:24 1738733064

The entire thread is about access control…

JS is perfectly powerful, if you don’t know how to use it that’s a good learning opportunity.

dylan604 · 2025-01-31T21:16:08 1738358168

The built-in dev tools is the key thing. If there was no way for the client to manipulate things, it wouldn't be too far off from other local apps. Reversing is always going to be a threat vector, but the low bar to entry of using the dev tools makes it a non-starter for me.

If using Ghidra was as simple as using the dev tools, the software industry would collapse.

noman-land · 2025-01-31T21:40:18 1738359618

The built in dev tools are fundamental to an open web. If you don't want someone to look at something in their own possession then don't send it to them in the first place. Obfuscating it is rude and is false security anyway.

The grand rule is don't trust the client. People break this rule and then try to paper over it with obfuscation, blame, and tightening their control.

dylan604 · 2025-01-31T22:03:57 1738361037

That's not what I said nor meant, but sure, jump to that conclusion.

You wouldn't run a shopping cart app where the item counts and totals were calculated client-side. You get the item id and quantity, and have the server do that. Just like if you were censoring something, you wouldn't send the client the unredacted data and then let the UI make the edits.

No obfuscation is needed for any of that. Open web has nothing to do with any of this
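
A minimal sketch of the shopping cart point, assuming a made-up catalogue and request shape: the server looks up prices itself from item id and quantity, so a client-supplied `price` (as in `order.php?pizzatype=3&price=9.90`) is simply never read. `CATALOGUE` and `order_total_cents` are hypothetical names.

```python
# Hypothetical server-side catalogue: item id -> unit price in integer cents.
CATALOGUE = {1: 850, 2: 990, 3: 1250}


def order_total_cents(lines: list) -> int:
    """Compute the order total from item ids and quantities only.

    Each line is a dict from the client. Any extra keys it sends,
    including a "price", are ignored.
    """
    total = 0
    for line in lines:
        item_id = line.get("item_id")
        quantity = line.get("quantity")
        if item_id not in CATALOGUE:
            raise ValueError(f"unknown item: {item_id!r}")
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(quantity, bool) or not isinstance(quantity, int) or quantity <= 0:
            raise ValueError(f"bad quantity: {quantity!r}")
        total += CATALOGUE[item_id] * quantity
    return total


# A tampered request: the client claims each pizza costs one cent.
tampered = [{"item_id": 3, "quantity": 2, "price": 0.01}]
total = order_total_cents(tampered)  # 2500, the client's price plays no part
```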

stevage · 2025-01-31T22:26:44 1738362404

Sometimes you do calculate prices client side. But you double check them server side.

dylan604 · 2025-01-31T23:05:32 1738364732

That just feels like a "you're holding it wrong" type of thing, especially seeing how JS is held in such high regard for its floating point math accuracy.

wayvey · 2025-02-01T11:35:02 1738409702

Ints should be used for currency calculations most of the time
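
To make that concrete, a small sketch in Python, whose `float` is the same IEEE 754 double as a JS number: summing dollars as floats drifts, summing integer cents does not. `to_cents` is a hypothetical helper and assumes amounts arrive as decimal strings with at most two decimal places.

```python
from decimal import Decimal


def to_cents(amount: str) -> int:
    """Parse a decimal string such as '9.90' into integer cents without using float."""
    cents = Decimal(amount) * 100
    if cents != cents.to_integral_value():
        raise ValueError(f"more than two decimal places: {amount!r}")
    return int(cents)


# 0.1 + 0.2 evaluates to 0.30000000000000004 in IEEE 754 doubles.
float_sum = 0.1 + 0.2

# The same sum in integer cents is exact.
cents_sum = to_cents("0.10") + to_cents("0.20")
```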

wiseowise · 2025-02-01T08:07:08 1738397228

Both Java and JS adhere to IEEE 754, what’s your point?

Sure it doesn’t have BigDecimal, but you’re not going to write HFT in JS either.

vitus · 2025-02-02T13:11:16 1738501876

Actually...

https://developer.mozilla.org/en-US/docs/Web/JavaScript/Refe...

stevage · 2025-01-31T23:33:24 1738366404

Is that sarcasm? Not sure what your point is.

DonHopkins · 2025-02-01T17:06:28 1738429588

Jesus, you sound like the X11 fanatics I used to debate with about NeWS, long before anyone had envisioned Google Maps or coined the term AJAX for what we'd been doing with PostScript since the 1980s.

The NeWS window system was like AJAX, but with: 1) PostScript code instead of JavaScript code 2) PostScript graphics instead of DHTML graphics, and 3) PostScript data instead of XML data.

https://en.wikipedia.org/wiki/NeWS

NeWS – Network Extensible Window System (wikipedia.org), 86 points by stevewilhelm on April 12, 2016, 76 comments

https://news.ycombinator.com/item?id=11477565

ScriptX and the World Wide Web: “Link Globally, Interact Locally” (1995)

https://donhopkins.medium.com/scriptx-and-the-world-wide-web...

PizzaTool was a NeWS front-end entirely written in PostScript for ordering pizzas, that had a price optimizer which would immediately figure out the least expensive combination of pizza style + extra toppings for the pizza you wanted. (i.e. ordering a "Tony's Gourmet + Clams" was less expensive than ordering a plain pizza plus all the individual toppings.)

Source code:

https://www.donhopkins.com/home/archive/NeWS/pizzatool.txt

Of course the untrusted front-end client side user input was sent via FAX to the back-end "server side" humans at Tony & Alba's Pizza, who validated the input before making the pizza, because performing input validation and price calculation and optimization in the back end via FAX would have been terribly inefficient. (This was in 1990, long before every pizzeria was on the internet, and you could order pizzas online, kids!)

https://donhopkins.medium.com/the-story-of-sun-microsystems-...

Computers and networks are fast enough (especially now 35 years later) that it's ok to perform input validation twice, once in the front-end to make the user experience tolerably fast, and again in the back-end to prevent fraud. This is not rocket science, nor a new idea! It also helps if the client and server are implemented in the same language (i.e. JavaScript today), so you can use the exact same code and data for modeling and validation on both ends.
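
A sketch of that "validate twice with the same code" idea, written in Python as a stand-in for shared JavaScript: one `validate` function is called by the front end for fast feedback and again by the back end, which never assumes the front end ran it. `PizzaOrder`, `VALID_SIZES`, `MAX_TOPPINGS`, `client_submit`, and `server_handle` are all invented for illustration.

```python
from dataclasses import dataclass
from typing import Callable

VALID_SIZES = {"small", "medium", "large"}  # assumed business rules
MAX_TOPPINGS = 5


@dataclass(frozen=True)
class PizzaOrder:
    size: str
    toppings: tuple


def validate(order: PizzaOrder) -> list:
    """Shared rules: the same function runs on both ends."""
    errors = []
    if order.size not in VALID_SIZES:
        errors.append(f"unknown size: {order.size}")
    if len(order.toppings) > MAX_TOPPINGS:
        errors.append(f"at most {MAX_TOPPINGS} toppings allowed")
    return errors


def server_handle(order: PizzaOrder) -> dict:
    """Authoritative check: re-validate even if the client claims it did."""
    errors = validate(order)
    return {"ok": not errors, "errors": errors}


def client_submit(order: PizzaOrder, send: Callable[[PizzaOrder], dict]) -> dict:
    """Fast path: reject obviously bad input locally, with no round trip."""
    errors = validate(order)
    if errors:
        return {"ok": False, "errors": errors}
    return send(order)


bad = PizzaOrder(size="huge", toppings=())
local_result = client_submit(bad, server_handle)  # rejected before any request
bypass_result = server_handle(bad)                # still rejected if the client is skipped
```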

wiseowise · 2025-02-01T07:54:39 1738396479

Oh, wow. So you’re one of those. Disregard what I said in previous comment.

wiseowise · 2025-02-01T07:53:34 1738396414

> I was born into a clan writing desktop apps in Swing, and while I know why the browser won, Swing (and all the other non-browser desktop app frameworks/toolkits) are just such a fundamentally better paradigm for handling data.

No, by a large margin no. Java is a hostile language for prototyping programs, which is where JS excels. Awful styling, walls of code just to get sane defaults (https://docs.oracle.com/javase/tutorial/uiswing/dnd/together..., seriously?).

homebrewer · 2025-02-01T11:17:04 1738408624

Swing is decades old at this point, its shortcomings have nothing to do with Java. JavaFX does not require this much boilerplate.

https://docs.oracle.com/javase/8/javafx/get-started-tutorial...

atomicnumber3 · 2025-02-02T22:05:08 1738533908

"And all the other desktop app frameworks." I refer to Qt and the other desktop frameworks too. Having an actual language and runtime where the UI toolkit is just that, a toolkit. Don't focus on Swing, that's just what I'm familiar with.