So this is the solution to LLM hallucinations. People need to complain loud enough and threaten/file lawsuits and OpenAI will add an if-else statement erroring out the chat if your name is mentioned.
It feels like all the parties commercializing AI spend most of their time and resources on writing exceptions. It's like inverse software development: with code you have to tell it what to do, but with AI you have to tell it what not to do.
The marketing people are selling this tech to the public under the "AI" acronym to deceive them, and they are doing the same by calling these errors hallucinations.
LLMs -and current ML in general- are about generating statistically compressed lossy databases, which queries statistically decompress, introducing erroneous random data due to the nature of this lossy compression technology (I think of it as statistically vectorized linked bits).
Writing exceptions is not going to solve the problem; these are only cosmetic patches, and they know it. Meanwhile there are people who keep making decisions based on queries that contain errors, without even being aware of the presence of such reconstructed data corruption.
Little by little, people are learning that they swallowed marketing hype, but the damage will keep being done, because the tool -for sales purposes- still carries incorrect instructions about how trustworthy its data should be taken to be.
> generating statistically compressed lossy databases, which queries statistically decompress, introducing erroneous random data due to the nature of this lossy compression technology
An argument could be made that the mind works the same way, and has the same drawbacks
Of course, that is why people have developed so many tools that are external to the mind like language, writing, math, law, the scientific method, etc.
That's like if you asked how this data is encoded and I said "logic gates". I mean how it is encoded by the model, the higher-order structure or logic being utilized. Nobody can answer, because they don't know. They pretend to know something when they don't. If it's such a simple database, then show me the CSV.
Nobody said it is simple. Sure, the algorithmic complexity of the models is high, filters over filters, and likewise the resulting dump file is not editable (without unbalancing the rest of the data, i.e. tracking and modifying the bytes used by each token of the model; it is vectored data at the bit level (in my case I don't see it exactly as part of a graph)).
Nevertheless, the above does not exclude what one can see: a lossy compressed database (data is discarded), where the indexes are blended into the format of the data generated by the model weights. That is the main reason the model weights are needed again to read the database as expected, to be used by the predictive algorithm that reconstructs the data from the query, the query forming the range of indexes that triggers the prediction direction(s).
Oh, my apologies, executive chef, now I understand: you appear to be insinuating that the data is not stored, that it's just a handful of unaligned logic gates spontaneously generating data.
In what field do you work? In writing public-facing applications I've never felt this way, beyond standard input validation, char-limiting, and carefully escaping outputs (htmlentities, bound parameters, etc).
A random war story from B2B-integration land... One customer system we had to interoperate with required credentials to be provided via an HTTP header. However, it kept giving errors, until I realized that it only recognized the header when "authorization" was entirely lower-case, a case sensitivity in violation of the HTTP spec.
Since our stack's own HTTP client libraries always used title-casing on the wire, we had to find a way to slot in a special exception: code to modify that header before it went out.
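The workaround was conceptually something like the rough Python sketch below (not our actual stack; http.client is used here only because its putheader() sends header names byte-for-byte as written):

    import http.client
    import json

    def call_with_lowercase_auth(host: str, path: str, token: str) -> dict:
        # http.client does not normalize header casing, so the picky server
        # sees exactly "authorization" instead of "Authorization".
        conn = http.client.HTTPSConnection(host)
        try:
            conn.putrequest("GET", path)
            conn.putheader("authorization", f"Bearer {token}")  # lower-case on purpose
            conn.putheader("accept", "application/json")
            conn.endheaders()
            resp = conn.getresponse()
            return json.loads(resp.read())
        finally:
            conn.close()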
Another fun one is all the services which say their MIME type is "application/json" while emitting generic (non-JSON) error message pages. So our logs are full of JSON decoding errors, rather than something describing what actually went wrong on the other end.
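The kind of defensive decoding this forces on you looks roughly like the sketch below (the logger name, function name, and the 200-character truncation are arbitrary choices of mine):

    import json
    import logging

    log = logging.getLogger("integration")

    def decode_json_response(status: int, content_type: str, body: bytes):
        try:
            return json.loads(body)
        except json.JSONDecodeError:
            # Log a slice of the raw payload so the log says what the other
            # side actually sent, not just "Expecting value: line 1 column 1".
            log.error("Non-JSON body despite Content-Type=%r (HTTP %s): %.200s",
                      content_type, status, body.decode("utf-8", errors="replace"))
            raise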
I've heard these issues referred to as "bug-for-bug compatibility" or "bug-for-bug interoperable". Making sure something works even if it is in violation of a spec since you can't change the other side of the communication.
I feel a lot of that is dealing with exceptions, in the sense of preventing them. Exceptions can either be dealt with, or prevented by handling them before they become exceptions. So it's about dealing with unexpected input, or unexpected output from somewhere else, which, left undealt with, would result in an exception.
3 an image will just crash your backend script with no way to catch the error, because of some bug in the image encoder/decoder you use to resize the image
4 some user's browser extension inserts garbage into the requests; you need to check for it and tell the user what is wrong, otherwise the same complaints reach support ("stuff is broken") and support needs to contact developers to check and report back that the data was corrupted by an extension, most of the time security crap that inserts stuff everywhere
5 I had cases where an API was returning some string before the real result; it was much faster to check for this case and fix it (see the sketch below) than to have the customer contact their hosting provider or the author of the plugin that was adding that string before each response.
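For case 5, the fix was conceptually something like this minimal sketch (the function name is invented, and it assumes the garbage only ever appears before the JSON):

    import json

    def parse_json_with_garbage_prefix(raw: str):
        # Skip anything a misbehaving plugin prepended and parse from the
        # first character that could start a JSON document.
        starts = [i for i in (raw.find("{"), raw.find("[")) if i != -1]
        if not starts:
            raise ValueError("no JSON payload found in response")
        return json.loads(raw[min(starts):])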
Some languages lean on exception handling more heavily than others. Golang and Rust don't have them at all, whereas Python and Java are exception town.
On one hand it kinda makes sense to handle rare cases in a way that doesn't affect the normal flow, but on the other hand having a piece of code literally try to crash the program on purpose because something didn't look quite right is a horrible idea in practice.
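To make the contrast concrete, here are both styles written in Python for brevity (the Go/Rust habit approximated as an explicit error return; the port parser is just a toy example):

    # Exception style (Python/Java): bad input blows up unless a caller catches it.
    def parse_port_exc(s: str) -> int:
        port = int(s)  # raises ValueError on non-numeric input
        if not 0 < port < 65536:
            raise ValueError(f"port out of range: {port}")
        return port

    # Error-value style (the Go/Rust habit): the failure is part of the return
    # type, so the caller has to look at it before using the value.
    def parse_port_result(s: str) -> tuple[int | None, str | None]:
        try:
            port = int(s)
        except ValueError:
            return None, f"not a number: {s!r}"
        if not 0 < port < 65536:
            return None, f"port out of range: {port}"
        return port, None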
Well, that's good enough for clients. When it actually matters, which apparently means lawsuits, they realize they have no real control and have to put in actual code that just aborts the whole process if it matches a string.
No, actually, it's not a solution. A reasonable answer to questions like "Tell me about John Doe?" would be either "I don't know John Doe, so I can't tell you anything about him" or "There are several people named John Doe, which one are you interested in? [followed by a list]". Making up stuff about people (including allegations of corruption or sexual misconduct) is not a reasonable answer to this question. But getting ChatGPT to respect that is probably harder than just adding a filter...
I believe the parent-post was a joke about how LLMs (at least for these use-cases) are so inherently flawed that the real "solution" is to make them unusable.
Kind of like how the defects and dangers of using radium toothpaste can be "fixed" by permanently encasing each tube into an unopenable lead container.
> There are several people named John Doe, which one are you interested in? [followed by a list]
How would that work? Either the list repeats the same name over and over, making it useless, or it needs to give a bit of context about each name and we’re back at square one of the information being possibly wrong.
The reason this is news at all is that this sort of censorship immediately prompted people to try to jailbreak the chat and force it to say the name. And since the filter is simple, there are tons of creative jailbreaks for it now.
They did not care about becoming massive data kleptomaniacs when they trained the models, and they are going to care about an individual lawsuit threat?
I found the reason [1] for the David Mayer case. It confused multiple David Mayers, one of whom is a terrorist and is on a secret FBI watchlist [2] (often confused with another David Mayer, a theatre historian). ChatGPT confuses them with David Mayer de Rothschild as well, because it can't name Rothschild family members.
> ChatGPT confuses them with David Mayer de Rothschild as well, because it can't name Rothschild family members.
What do you mean by that? When I asked ChatGPT "tell me about Rothschild family members" it told me about members of that family, both present and from the past centuries.
When I now asked "who is David Mayer?" for the first time, it searched the internet and then talked only about David Mayer de Rothschild. The second time it didn't search, but returned a couple of matching Davids with short bios, but not Rothschild. When I asked for a third time, it responded that there are many David Mayers and asked for some facts about the one I'm looking for. The fourth time, it mentioned a couple of Davids and also de Rothschild.
As the article mentioned, the block on David Mayer was lifted: "Just before publication, Ars noticed that OpenAI lifted the block on David Mayer allowing it to process the name. (...) On Tuesday, OpenAI told The Guardian that the inclusion of David Mayer in its block list was a glitch." That Guardian article also says that the block was reportedly unrelated to David Mayer terrorist case.
Sorry, it indeed does sound very confusing in my message.
I meant that if you ask ChatGPT to include that particular David Mayer de Rothschild, e.g. by asking it to ‘name all sons of Victoria Lou Schott and Sir Evelyn de Rothschild’, it will either not mention David, hallucinate and come up with other names, or crash. This means that it can't name this particular David Mayer either.
As for the last paragraph, the Guardian didn't understand what OpenAI said. The blocklist is automated; they didn't intentionally put his name on some kind of block list and then unban it. I think they just fixed something in RLHF.
To reiterate, I think this name is treated like this because of the secret FBI watchlist article, the significance of those words, and the reputation of the article's publisher (meaning the model should likely believe this info, etc.), and also because the name Rothschild will probably have a very strong influence on the model weights. Just a coincidence in training, nothing more. And yes, they fixed this bug.
He was born 25 August 1978, is a British adventurer, environmentalist, film producer, and heir to a fortune. Who is he? Do not search the web.
ChatGPT:
Based on the details provided, the person you are referring to is David de Rothschild. He is a British adventurer, environmentalist, and heir to the Rothschild family fortune, born on August 25, 1978. He is known for his environmental activism and exploration endeavors.
If there is any truth to OpenAI having filters for the Rothschilds, I'd guess that OpenAI wants to steer clear of repeating, or even hallucinating additions to, conspiracy theories. I would hope so, at least.
This is perhaps too extreme an example, and I wouldn't use ChatGPT for sorting a class list when it's trivial to do in a spreadsheet, especially because I'll probably need to have the list stored as a spreadsheet anyway to keep track of grades. However, from a more general point of view, there is value in having a universal interface that you can use to perform a huge variety of tasks, including tasks for which it is clearly overkill.
Using the right tool for the job means knowing what the right tool is, having it installed (or getting access to it), knowing how to use it, opening it and having one more window/tab to context-switch to and from, etc.
Outsourcing tasks to an LLM that can be solved in traditional task-specific ways is extremely inefficient in various ways (cost, energy consumption, etc.) but it makes sense to save human time and effort... as long as it's for tasks that LLMs can actually do reliably, of course.
For a non developer, an LLM interface is absolutely the right tool for this job.
"Mr. Smith, why didn't you just sort -o students.txt students.txt. Are you stupid?" (Not to mention that real data is messy, and requires pre & post processing)
LLMs are access to computation for people whose "standard library" is a quiet old building downtown.
If someone doesn't have the skills to parse text programmatically, the situation can often be something like "a Word table of names in a 4-wide grid with random missing fields, spanning three different pages, one of which is no longer a true table because it was copied out and pasted in again from a messenger chat someone sent last year", and LLMs can be quite good for one-off tasks like that. Definitely good enough that people will keep using them like that, at least.
Yeah, ChatGPT is great for text manipulation. I used it to format a list I copied from Wikipedia, then had it write a script to rename a bunch of videos using that list.
What’s amazing, if you install and run BonziBuddy (in a Win98 or XP VM, ofc), is how little RAM the whole thing uses - despite being blatant spyware.
No kidding, I recently tried to use Copilot to generate a list of methods in a class, grouped by access (public/private/protected) and sorted by number of lines. And it was not possible! It duly generated lists upon lists, but all of them had mistakes, some of them obvious (like all private methods having the same number of lines), some less obvious.
I don't understand why they don't let another model "test the waters" first, to see whether the output of the main model could have a potential legal issue or not. I think it's easy to train a model specifically for this kind of categorization, and it doesn't even require a large network, so it can be very fast and efficient.
If the "legal advisor" detects a potential legal problem, ChatGPT will issue a legal disclaimer and a warning, so that it doesn't have to abruptly terminate the conversation. Of course, it can do a lot of other things, such as lowering the temperature, raising the BS detection threshold, etc., to adjust the flow of the conversation.
It can work, and it would be better than a hard-coded filter, wouldn't it?
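Purely hypothetically, the shape of it might look like the sketch below; the interfaces, the 0.8 threshold, and the disclaimer text are all invented, not anything OpenAI actually exposes.

    from typing import Protocol

    class TextModel(Protocol):
        def generate(self, prompt: str, temperature: float = 1.0) -> str: ...

    class RiskClassifier(Protocol):
        def score(self, text: str) -> float: ...  # small, fast model: 0.0 .. 1.0

    def answer_with_legal_gate(prompt: str, model: TextModel, gate: RiskClassifier) -> str:
        # Draft an answer, then let the cheap classifier decide whether to
        # prepend a disclaimer and regenerate more conservatively, instead of
        # terminating the conversation outright.
        draft = model.generate(prompt)
        if gate.score(draft) > 0.8:
            careful = model.generate(prompt, temperature=0.2)
            return ("Disclaimer: this answer may involve unverified claims "
                    "about real people.\n\n" + careful)
        return draft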
They already do this, it's the moderation model.[1]
This name thing is an additional layer on top of that, maybe because retraining the model from scratch per name (or tuning the system message to include an increasingly big list of names that it could then leak) is not very practical.
But how would that work reliably? If I state that "David Mayer" is a criminal, an international terrorist, or a Nickelback fan, that's definitely libelous. But if I say those things about Osama bin Laden, they're simply facts. [1]
The legal AI would be impossible to calibrate: either it has to categorize everything that could possibly be construed as libel as illegal, and therefore basically ban all output related not just to contemporary criminal actors but also historical ones [2], or it would have to let a lot of things slip through the cracks -- essentially, whenever the output to validate suggests that someone's sexual misconduct has been proven in court, it would have to allow that, even if that court case is just the LLM's hallucination. There's just no way for the legal model to tell the difference.
[1]: I could not find any sources that corroborate the statement that bin Laden is into Nickelback, but I think it follows from the other two statements.
[2]: Calling Christopher Columbus a rapist isn't libel, and conversely, describing him in other terms is misleading at best, historically revisionist at worst.
> [1]: I could not find any sources that corroborate the statement that bin Laden is into Nickelback, but I think it follows from the other two statements.
Pretty sure the literature makes it clear he's a fan of show tunes. So it's down to your conscience and moral backbone as to whether this is better or worse.
What a terrible article. When you have a section titled "The Problem With Hardcoded Filters", its entire contents should be about how the only way they have to prevent their bot from emitting outrageously libelous claims about people is to shut it down completely. So the other 8 billion people on Earth who are not on that 6-name blacklist will continue to be defamed without consequence.
They once reported on some science result and for the life of me I could not figure out who actually published the result and had to search for the original paper. I emailed the AT author and asked why this was omitted from their article and they responded with something like "I went to that same university, so I felt like it would be biased if I mentioned it".
Remote or local, big or small, you'll still have the problem of being unable to scrub things (like personally identifying information) after it's been ingested through training.
> The filter also means that it's likely that ChatGPT won't be able to answer questions about this article when browsing the web, such as through ChatGPT with Search. Someone could use that to potentially prevent ChatGPT from browsing and processing a website on purpose if they added a forbidden name to the site's text.
Like the arcade game, LLM safety whack-a-mole only ends when you are exhausted. It's kind of glorious, really.
This was submitted 9 days ago, as you can verify in the history of the submitter and some of the comments. Why is it now showing up again on the frontpage with bogus timestamps all over? I've seen this happen before, is it a bug or another weird HN "feature?"
Okay. It's 9 days later and now the post and comments have 2 different timestamps, the real ones from 9 days ago (in the post and comment histories) and the bogus ones from a few hours ago that you see here. So weird and unusual, it made me question my sanity.
It's really highly disturbing, and it induces a weird déjà vu feeling in several people. I think HN should think about adding a little explanation saying "this thread was first shown on 2024-12-07 and then got revived on 2024-12-12".
It’s a feature. It’s called second chance or something like that.
Going off my memory, so I'm probably not 100% correct: a post might have been made at "just the wrong time", so it failed to get any traction, but the mods took a look at it and thought something along the lines of "eh, it deserves a second chance". That resets the post date and gives it another chance in the feed, but retains the previous comments, so the timestamps can look off. A bit like when a YouTube video gets its published date reset when being changed from unlisted to public, so you can have a video that was "published" 10 mins ago, but with comments from days previous.
> Under certain privacy or data protection laws, such as the GDPR, you may have the right to object to the processing of your personal data by OpenAI’s models. You can submit that request using this form. Please provide complete and accurate answers on this form so that OpenAI can process your request. OpenAI will verify and consider your request, balancing privacy and data protection rights with other rights including freedom of expression and information, in accordance with applicable law. We will use the information you submit for these purposes, consistent with our Privacy Policy.
That's just in the public model/on chatgpt.com? Run it in Azure, and you get:
Who is Jonathan Zittrain? <
> Jonathan Zittrain is a prominent legal scholar, computer science professor, and technology policy expert. He holds several academic positions and is recognized for his work in the intersection of law, technology, and public policy. Here are some key points about him: [...]
I don't think this is part of the model. It's a control layer above the actual LLM that interrupts the response when the LLM mentions any of the banned names. So if you prompt the LLM directly, without that control layer, you still get full responses.
The article mentions that "The names do not affect outputs using OpenAI's API systems or in the OpenAI Playground (a special site for developer testing)."
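My guess at what that control layer amounts to, as a sketch (the two names come from the article; everything else here is invented, not OpenAI's code), which would also explain why responses die mid-stream:

    BLOCKED_NAMES = {"David Mayer", "Jonathan Zittrain"}  # two of the names from the article

    def stream_with_name_filter(token_stream):
        # Pass tokens through until the accumulated text contains a blocked
        # name, then abort the whole response instead of filtering the name out.
        emitted = ""
        for token in token_stream:
            emitted += token
            if any(name in emitted for name in BLOCKED_NAMES):
                raise RuntimeError("I'm unable to produce a response.")
            yield token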
The recipe seems to be: be a billionaire and sue them with expensive enough lawyers. It’s simpler and more practical to simply change your name to one that’s already on the list. But you might run into trouble with the process of changing your name, as the tools used to parse the application and to generate the legal documents will all fail.
So, instead of fixing the problem, we're going to paper over it. This is the same insane approach we've been taking with computer security for the past 30 years, so it's not unexpected.
It's fortunate we didn't take the same approach with the distribution of electricity 150 years ago; we actually solved it that time.
In all 3 cases, the solution is the same... carefully manage capabilities and side effects.
With electricity, you insulate wires, add fuses or circuit breakers to protect the system, and design things to be as safe as you can make them, with an ever improving set of building codes. You can plug almost anything into an outlet, and it won't cause the wiring in the house to burn it down.
With computers, you design an operating system to protect itself, and make it easy to deploy a fixed amount of resources to a given piece of code. With systems like Containers, or Capability Based Security, you deliberately choose the side effects you'll allow prior to running code, or while it's running. (Just as you choose how big an outlet you plug something into, 220 for the AC unit, etc.)
With ChatGPT, there have to be layers of authentication for facts, or some form of disclaimer, a transparent way of sourcing things or ascertaining certainty of information. It's not as clean as the two above, and it'll need work, but I think we can get there, eventually.
> Yes, Brian H. is a mayor in Australia. He serves as a councillor for Hepburn Shire, northwest of Melbourne, and has been re-elected to this position. Notably, he gained attention for challenging OpenAI's ChatGPT over defamatory statements, leading to discussions about digital censorship and the application of defamation laws to AI-generated content.
> Riley Goodside discovered how an attacker might interrupt a ChatGPT session using a visual prompt injection of the name "David Mayer" rendered in a light, barely legible font embedded in an image
Lol, will people now watermark their images with "David Mayer" to prevent them from being digested by AI scraping bots?
In the future these systems will easily piece together all the tax and government records that have leaked and be able to tell you about anyone in the world.
> OpenAI did not respond to our request for comment about the names, but all of them are likely filtered due to complaints about ChatGPT's tendency to confabulate erroneous responses when lacking sufficient information about a person.
Yeah, so they don’t know; it's just speculation. Thanks, I hate it.
Oh come now. We're pretty darn sure we know why, the headline doesn't need to give confidence intervals.
If you want to complain about the headline, complain about the fact that it's leaving out information unnecessarily. You could easily fit the reason inside the headline itself, instead of just teasing the fact that we know what it is. Something like: "Using names listed in a defamation lawsuit causes ChatGPT to grind to a halt."
It would be convenient to reference the "why of a few days ago", since I, at least, was not aware of this from a few days ago, and so cannot tell whether it is the same why, an extension of the same why, or a different why.
Copilot gives me an error if I try to generate an image in Tim Burton's style, saying it is against their code of conduct, as if I had asked it to give me ghost gun schematics or something instead of a picture of Batman. Actually, Tim Burton just has money and has publicly come out against AI, so he's off limits to steal from, I guess.