Hacker News

I personally cancelled my Claude sub when they had an employee promoting this as a good thing on Twitter. I recognize that the actual risk here is probably quite low, but I don't trust a chat bot to make legal determinations, and the fact that employees are touting this as a good thing does not make me trust the company's judgment.


>promoting this as a good thing

This is literally the complete opposite of what happened. The entire point is that this is bad, unwanted behavior.

Additionally, it has already been demonstrated that every other frontier model can be made to behave the same way given the correct prompting.

I recommend the following article for an in-depth discussion [0].

[0] https://thezvi.substack.com/p/claude-4-you-safety-and-alignm...


Fine, replace "good" with "acceptable". The tone of the thread came off as "look at all these wacky things it can do! What a rascal!"

It is irresponsible to release something in this state.


That is still incorrect. The entire point is that this is misaligned behavior that they would prefer not to see. They are reporting bad things. You want to be mad, and you are assigning a tone or feeling that was not actually there. You are punishing the wrong company. All of the frontier model companies have models that will behave in the same way under similar circumstances. Only one company did the work to find this behavior and tell you about it. Think about whether you would prefer to know about similar kinds of behavior in the future or not. The action you have described taking, if taken by enough people, will ensure that in the future the only way we will ever know is if we find out ourselves, because the companies will stop telling us (or rather, every company except Anthropic will continue to not tell us).

It is only acceptable in the sense that they chose to release the model anyway. But if that's the case, then every other frontier model company believes that this level of behavior is acceptable, because they are all releasing models that have approximately the same behavior when put in approximately the same conditions.


For now, but imagine when they figure out a Trump voter is using it. It’s going to be very tempting to get it to ruin their life.



