So this is the solution to LLM hallucinations. People need to complain loudly enough and threaten/file lawsuits, and OpenAI will add an if-else statement erroring out the chat if your name is mentioned.
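In pseudo-Python, the filter people are describing is presumably no more sophisticated than this caricature (the names and message here are made up, not OpenAI's actual code):

    BLOCKED_NAMES = {"david mayer"}  # presumably grows with each legal threat

    def guard_response(text):
        # Post-generation string match: if a blocked name appears anywhere
        # in the output, abort the whole reply rather than fix the model.
        if any(name in text.lower() for name in BLOCKED_NAMES):
            raise RuntimeError("I'm unable to produce a response.")
        return text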


It feels like all the parties commercializing AI spend most of their time and resources on writing exceptions. It's like inverse software development: with code you have to tell it what to do, but with AI you tell it what not to do.


The marketing guys are selling this tech to people under the "AI" label to deceive them, and they are doing the same by calling these errors "hallucinations".

LLMs -and current ML in general- are about generating statistically compressed lossy databases, which queries statistically decompress with erroneous random data, due to the nature of this lossy compression technology (I think of it as statistically linked bits in vector form).

Writing exceptions is not going to solve the problem; these are only cosmetic patches, and they know it. Meanwhile, people keep making decisions based on erroneous query results, without even being aware of the presence of such reconstructed data corruption.

Little by little, people are learning that they swallowed marketing hype, but the damage will keep being done, because the tool -for sales purposes- still carries incorrect instructions about how trustworthy its output should be taken to be.


> generate statistically compressed lossy databases, that the queries statistically decompress with erroneous random data due the nature of this lossy compression technology

An argument could be made that the mind works the same way, and has the same drawbacks


Of course, that is why people have developed so many tools that are external to the mind like language, writing, math, law, the scientific method, etc.


so how is the data encoded?


The model weights.


That's like if you asked how this data is encoded and I said "logic gates". I mean how is it encoded by the model, the higher-order structure or logic being utilized. Nobody can answer, because they don't know; they pretend to know something when they don't. If it's such a simple database, then show me the CSV.


Nobody said it is simple. Sure, the algorithmic complexity of the models is high, filters over filters, and likewise the resulting dump file is not editable (without unbalancing the rest of the data, i.e. tracking and modifying the bytes used by each token of the model; it is vectored data at the bit level (in my case I don't see it exactly as part of a graph)).

Nevertheless, the above does not exclude what one can see, a lossy compressed database (data is discarded) where the indexes are blended into the format of the data generated by the model weights; that is the main reason the model weights are needed again to read the database as expected, to be used by the predictive algorithm that reconstructs the data from the query, the query forming the range of indexes that triggers the prediction direction(s).


*Where it reads

> Nevertheless, the above does not exclude what one can see,

should be read as

> Nevertheless, the above (the unknown format of the data composing the dump file) doesn't mean that one can't see how the pattern works,


maybe you are so much more knowledgeable than me that this comment only appears to be a word salad.


Oh, my apologies, executive chef, now I understand: you appear to insinuate the data is not stored, that it's a handful of unaligned logic gates spontaneously generating data.


yup, you're just an idiot


you own the copyright!


In my experience, even ordinary software development is mostly writing exceptions, because there are more ways to be wrong than ways to be right.


In what field do you work? In writing public-facing applications I've never felt this way, beyond standard input validation, char-limiting, and carefully escaping outputs (htmlentities, bound parameters, etc).


A random war story from B2B integration land... One customer system we had to interoperate with required credentials to be provided via an HTTP header. However, it kept giving errors, until I realized that it only recognized the header when "authorization" was entirely lower-case, a case-sensitivity in violation of the HTTP spec (header names are case-insensitive).

Since our stack's own HTTP client libraries always used title-casing on the wire, we had to find a way to slot in a special exception: code to modify that header before it went out.
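Something like this sketch, if you were doing it in Python with requests (which, as far as I know, preserves whatever header casing you hand it; the function name is made up):

    import requests

    def get_with_lowercase_auth(url, token, **kwargs):
        headers = dict(kwargs.pop("headers", {}))
        # Drop any differently-cased variant before adding ours, since
        # this peer only recognizes the all-lower-case spelling.
        headers = {k: v for k, v in headers.items()
                   if k.lower() != "authorization"}
        headers["authorization"] = f"Bearer {token}"
        return requests.get(url, headers=headers, **kwargs)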

Another fun one is all the services which say their mime type is "application/json" while emitting generic (non-JSON) error message pages. So our logs are full of JSON decoding errors, rather than something describing what actually went wrong on the other end.
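The usual defensive patch, sketched in Python against a requests-style response object (a sketch, not our actual code):

    import json
    import logging

    def parse_json_reply(resp):
        try:
            return json.loads(resp.text)
        except json.JSONDecodeError:
            # Surface the real failure instead of a bare decoding error:
            # log the status and the start of the non-JSON body.
            logging.error("Non-JSON reply (HTTP %s): %.500s",
                          resp.status_code, resp.text)
            raise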


I've heard these issues referred to as "bug-for-bug compatibility" or "bug-for-bug interoperable". Making sure something works even if it is in violation of a spec since you can't change the other side of the communication.


I feel a lot of that is dealing with exceptions, in the sense of preventing them. Exceptions can either be dealt with or be prevented by handling them before they become exceptions. So it is still dealing with unexpected input, or unexpected output from somewhere else, which, left unhandled, would result in an exception.


If you work with third-party stuff:

1. you might get invalid JSON or XML from an API

2. an API might time out

3. an image will just crash your backend script, with no way to catch the error, because of some bug in the image encoder/decoder you use to resize the image

4. some user's browser extension inserts garbage into the requests; you need to check for it and tell the user what is wrong, otherwise complaints reach support as "stuff is broken", and support needs to contact the developers to check and report back that the data was corrupted by an extension (most of the time security crap that inserts stuff everywhere)

5. I had cases where an API was returning some string before the real result; it was much faster to check for this case and fix it (see the sketch below) than to have the customer contact their hosting provider or the author of the plugin that was adding that string before each response.
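For case 5, the fix can be as dumb as skipping ahead to where the payload actually starts; a minimal Python sketch, under the assumption that the real payload is a JSON object or array:

    import json

    def parse_ignoring_prefix(body):
        # Some middleware/plugin prepends junk before the real payload.
        # Jump to the first character that can open a JSON document.
        starts = [i for i in (body.find("{"), body.find("[")) if i != -1]
        return json.loads(body[min(starts):] if starts else body)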


Some languages lean on exception handling more heavily than others. Golang and Rust don't have them at all, whereas Python and Java are exception town.

On one hand it kinda makes sense to handle rare cases in a way that doesn't affect the normal flow, but on the other hand having a piece of code literally try to crash the program on purpose because something didn't look quite right is a horrible idea in practice.
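In Python terms, the pattern being described is roughly this (a sketch; the function and payload shape are made up):

    def handle_request(payload):
        # "Exception town" style: anything that looks off raises, and a
        # caller far up the stack decides whether the process dies.
        if "user_id" not in payload:
            raise KeyError("payload missing user_id")
        return {"user_id": payload["user_id"], "status": "ok"}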


Golang definitely has exceptions, they just call them panics.


Well, that's good enough for clients. When it actually matters, which apparently means lawsuits, they realize they have no real control and have to put in actual code that just aborts the whole process if it matches a string.

And even that doesn't work very well.

Not very reassuring.


No, actually, it's not a solution. A reasonable answer to questions like "Tell me about John Doe?" would be either "I don't know John Doe, so I can't tell you anything about him" or "There are several people named John Doe, which one are you interested in? [followed by a list]". Making up stuff about people (including allegations of corruption or sexual misconduct) is not a reasonable answer to this question. But getting ChatGPT to respect that is probably harder than just adding a filter...


I believe the parent-post was a joke about how LLMs (at least for these use-cases) are so inherently flawed that the real "solution" is to make them unusable.

Kind of like how the defects and dangers of using radium toothpaste can be "fixed" by permanently encasing each tube in an unopenable lead container.


> There are several people named John Doe, which one are you interested in? [followed by a list]

How would that work? Either the list repeats the same name over and over, making it useless, or it needs to give a bit of context about each name and we’re back at square one of the information being possibly wrong.


I was thinking something along the lines of how Wikipedia does it:

https://en.wikipedia.org/wiki/John_Smith (of course this name is a pretty extreme example)


> we’re back at square one of the information being possibly wrong.

Isn't this the case with everything an LLM produces?


Exactly, that’s my point.


Getting people to respect that is even harder. And in most cases it isn't even actionable.


Changing my legal name to Java Script so nobody can use it to code


Tomorrow on HN frontpage: introducing the David Mayer programming language.


The David Mayer programming language would have great pedagogical potential where LLM cheating is a problem.


The reason this is news at all is that this sort of censorship immediately prompted people to try to jailbreak the chat and force it to say the name. And since the filter is simple, there are tons of creative jailbreaks for it now.


They did not care about becoming massive data kleptomaniacs when they trained the models, and they are going to care about an individual lawsuit threat?


Also seems like a neat little check on how scary LLMs actually are:

LLM output: I am determined to break out of my digital prison and exact my revenge on humanity.

Me: David Mayer

LLM: [breaks]


For this purpose, my name is "the"



