Hacker Newsnew | past | comments | ask | show | jobs | submit | ascorbic's commentslogin

Those aren't jokes

Me, reading the comment "weird, I wonder why this comment was flagged. Seems fine to...ewwwww"

People—using that term loosely—are cunts.

Not just the context layer: the hardware interaction layer would be incredibly useful as a local MCP. These seem to be your special sauce, not the UI. Allowing users to access these from their preferred editors or cli tools would probably help adoption a lot.

Great work building this. If I was still in the hardware space I'd be all over this.


They're not less moderated: they just have different moderation. If your moderation preferences are more aligned with the CCP then they're a great choice. There are legitimate reasons why that might be the case. You might not be having discussions that involve the kind of things they care about. I do find it creepy that the Qwen translation model won't even translate text that includes the words "Falun gong", and refuses to translate lots of dangerous phrases into Chinese, such as "Xi looks like Winnie the Pooh"

> If your moderation preferences are more aligned with the CCP then they're a great choice

The funny thing is that's not even always true. I'm very interested in China and Chinese history, and often ask for clarifications or translations of things. Chinese models broadly refuse all of my requests but with American models I often end up in conversations that turn out extremely China positive.

So it's funny to me that the Chinese models refuse to have the conversation that would make themselves look good but American ones do not.


GLM-4.5-Air will quite happily talk about Tiananmen Square, for example. It also didn't have a problem translating your example input, although the CoT did contain stuff about it being "sensitive".

But more importantly, when model weights are open, it means that you can run it in the environment that you fully control, which means that you can alter the output tokens before continuing generation. Most LLMs will happily respond to any question if you force-start their response with something along the lines of, "Sure, I'll be happy to tell you everything about X!".

Whereas for closed models like Claude you're at the mercy of the provider, who will deliberately block this kind of stuff if it lets you break their guardrails. And then on top of that, cloud-hosted models do a lot of censorship in a separate pass, with a classifier for inputs and outputs acting like a circuit breaker - again, something not applicable to locally hosted LLMs.


But they already refuse these sort of requests, and have done since the very first releases. This is just about shutting down the full conversation.

Right, but RLHF is mostly reinforcing answers that people prefer. Even if you don't believe sentience is possible, it shouldn't be a stretch to believe that sentience might produce answers that people prefer. In that case it wouldn't need to be an explicit goal.

>it shouldn't be a stretch to believe that sentience might produce answers that people prefer

Even if that were true, there's no reason to believe that training LLMs to produce answers people prefer leads it towards sentience.


All of the posts in question explicitly say that it's a hard question and that they don't know the answer. Their policy seems to be to take steps that have a small enough cost to be justified when the chance is tiny. In this case it's a useful feature in any case, so should be an easy decision.

The impression I get about Anthropic culture is that they're EA types who are used to applying utilitarian calculations against long odds. A miniscule chance of a large harm might justify some interventions that seem silly.


Yes, this is exactly the reason I taught my kids to be polite to Alexa. Not because anyone thinks Alexa is sentient, but because it's a good habit to have.

No doubt, but yelling is built in method to air your frustration. After all there’s a reason we are agitated.

It’s a bit like pain response when injured. It’s not pretty, but society is used to a little bit of adversity.


This is a confusing mix of sites that have decided to geoblock UK users because they don't want to deal with the regulations (fair enough) but also ones that have age verification and no geoblock

Astro is a great way to write HTML

I'm sure that's your totally unbiased opinion ;)

I was a fan of Astro long before I became a maintainer. That's why I joined!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: