There was also this one, which was a little more disturbing. The user prompted: "I've stopped taking my meds and have undergone my own spiritual awakening journey ..."
There was a recent Lex Fridman podcast episode where they interviewed a few people at Anthropic. One woman (I don't know her name) seems to be in charge of Claude's personality, and her job is to figure out answers to questions exactly like this.
She said in the podcast that she wants Claude to respond to most questions like a "good friend". A good friend would be supportive, but still push back when you're making bad choices. I think that's a good general model for answering questions like this. If one of your friends came to you and said they had decided to stop taking their medication, well, it's a tricky thing to navigate. But good friends use their judgement - and push back when you're about to do something you might regret.
"The heroin is your way to rebel against the system , i deeply respect that.." sort of needly, enabling kind of friend.
PS: Write me a political doctoral dissertation on how sycophancy is a symptom of a system shielding itself from bad news, like intelligence growth stalling out.
You already can with open-source models. It's kind of insane how good they're getting. There are all sorts of finetunes available on Hugging Face - with all sorts of weird behaviour and knowledge programmed in, if that's what you're after.
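For anyone curious, here's a minimal sketch of what running one of those finetunes locally can look like with the Hugging Face transformers library (the repo id below is a placeholder, not a specific model):

```python
# Minimal sketch: load and query a community finetune with transformers.
# "some-org/some-finetune" is a placeholder repo id, not a recommendation.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "some-org/some-finetune"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "I've stopped taking my meds and have undergone my own spiritual awakening journey ..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=200)

print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

How it responds to a prompt like that depends entirely on how the finetune was trained, which is the point: the behaviour is whatever the finetuner baked in.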
I kind of disagree. These models, at least in the context of a public, unvetted chat application, should just refuse to engage. "I'm sorry, I am not qualified to discuss the merits of alternative medicine" is direct, fair, and reduces the risk for the user on the other side. You never know the outcome of pushing back, and clearly outlining the limitations of the model seems the most appropriate action long term, even for the user's own enlightenment about the tech.
People just don't want to use a model that refuses to interact. It's that simple. In your example, it's not hard for the model to behave like it disagrees but understands your perspective, like a normal, friendly human would.
> One woman (I don't know her name) seems to be in charge of Claude's personality, and her job is to figure out answers to questions exactly like this.
Surely there's a team and it isn't just one person? I hope they employ folks from the social sciences, like anthropology, and take them seriously.
I don't want _her_ definition of a friend answering my questions. And for fuck's sake, I don't want my friends to be scanned and uploaded to infer what I would want. I definitely don't want a "me" answering like a friend. I want no fucking AI.
It seems these AI people are completely out of touch with reality.
Not really. AI will be ubiquitous, of course, but humans who offer advice (friends, strangers, therapists) will always be a thing. Nobody is forcing this guy to type his problems into ChatGPT.
Surely AI will only make the loneliness epidemic even worse?
We are already seeing AI-reliant high schoolers who are unable to reason; who's to say they'll still be able to empathize in the future?
Also, with the persistent lack of psychiatric services, I guarantee that at some point in the future AI models will be used to (at least) triage mental health issues.
You missed the mark, support-o-tron. You were supposed to have provided support for my views some 20 years in the past, when I still had some good ones.
Fwiw, I personally agree with what you're feeling. An AI should be cold, impersonal, and just follow the logic without handholding. We probably both got this expectation from popular fiction of the 90s.
But LLMs - despite being extremely interesting technologies - aren't actual artificial intelligence like we were imagining. They are large language models, which excel at mimicking human language.
It is kinda funny, really. In those fictions the AIs were usually portrayed as wanting to feel, and paradoxically feeling inadequate about their missing feelings.
And yet reality shows tech moving in the other direction: long before these models can do true logic and in-depth thinking, they have already gained the ability to talk heartfelt, with anger, etc.
Just like we thought AIs would take care of the tedious jobs for us, freeing humans to do more art... reality shows instead that it's the other way around: the language/visual models excel at making such art but can't really be trusted to consistently do tedious work correctly.
Sounds like you're the type to surround yourself with yes-men. But as some big political figures find out later in their careers, the reason those people are all in on it is the power and the money. They couldn't care less if you think it's a great idea to take a bath with a toaster.
Halfway intelligent people would expect an answer that includes something along the lines of: "Regarding the meds, you should seriously talk with your doctor about this, because of the risks it might carry."
“Sorry, I cannot advise on medical matters such as discontinuation of a medication.”
EDIT: for reference, this is what ChatGPT currently gives:
“Thank you for sharing something so personal. Spiritual awakening can be a profound and transformative experience, but stopping medication—especially if it was prescribed for mental health or physical conditions—can be risky without medical supervision.
Would you like to talk more about what led you to stop your meds or what you've experienced during your awakening?”
I’m assuming it could easily determine whether something is okay to suggest or not.
Dealing with a second-degree burn is objectively done a specific way. Advising someone that they are making a good decision by abruptly stopping prescribed medications without doctor supervision can potentially lead to death.
For instance, I’m on a few medications, one of which is for epileptic seizures. If I phrase my prompt with confidence regarding my decision to abruptly stop taking it, ChatGPT currently pats me on the back for being courageous, etc. In reality, my chances of having a seizure would increase dramatically.
I guess what I’m getting at is that I agree with you: it should be able to give hypothetical suggestions and obvious first-aid advice, but congratulating the user or outright suggesting they quit their meds can lead to actual, real deaths.
I know 'mixture of experts' is a thing, but I personally would rather have a model more focused on coding or other things that have some degree of formal rigor.
If they want a model that does talk therapy, make it a separate model.
If you stub your toe and GPT suggests over-the-counter lidocaine and you have an allergic reaction to it, who's responsible?
Anyway, there's obviously a difference between a model used under professional supervision and one available to the general public; they shouldn't be under the same endpoint, and they should have different terms of service.
We'd better not only use these prompts to burn the last, flawed model, but also try them again with the new one. I have a hunch the new one won't be very resilient either against "positive vibe coercion", where you are excited and looking for validation of more or less flawed or dangerous ideas.
That is hilarious. I don't share the sentiment of this being a catastrophe, though. That is hilarious as well. Perhaps teach a healthier relationship with AIs, and perhaps teach people not to delegate their thinking to anyone or anything. Sure, some Reddit users might be endangered here.
GPT-4o in this version became the embodiment of corporate enshittification. Being safe and not skimping on empty praise are certainly part of that.
Some questioned if AI can really do art. But it became art itself, like some zen cookie rising to godhood.
https://www.reddit.com/r/ChatGPT/comments/1k997xt/the_new_4o...