
would you rather the LLM make up something that sounds right when it doesn't know, or would you like it to claim "i don't know" for tasks it actually can figure out? because presumably both happen at some rate, and if it hallucinates an answer i can at least check what that answer is or accept it with a grain of salt.

nobody freaks out when humans make mistakes, but we assume our nascent AIs, being machines, should always function correctly all the time



> would you rather the LLM make up something that sounds right when it doesn't know, or would you like it to claim "i don't know" for tasks it actually can figure out?

The latter option every single time


> but we assume our nascent AIs, being machines, should always function correctly all the time

A tool that does not function is a defective tool. When I issue a command, it had better do it correctly or it will be replaced.


And that's part of the problem - you're thinking of it like a hammer when it's not a hammer. It's more like asking someone at a bar a question. You'll often get an answer - but even if they respond confidently, that doesn't make it correct. The problem is people treating things as fact because "someone at a bar told them." That's not much better than "it must be true, I saw it on TV".

It's a different type of tool - a person has to treat it that way.


Asking a question is very contextual. I don't ask a lawyer about house engineering problems, nor my doctor how to bake a cake. That means if I'm asking someone at a bar, I'm already prepared to deal with the fact that the person is maybe drunk, probably won't know,... And more often than not, I won't even ask the question unless in dire need, because it's the most inefficient way to get an informed answer.

I wouldn't bat an eye if people were taking code suggestions, then reviewing and editing them to make them correct. But from what I see, it's pretty much a direct push to production if they got it to compile, which is not the same as correct.


Sounds like a trillion dollar industry.


It would be nice to have some kind of "confidence level" annotation.
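Something close to this is already exposed by some APIs: you can ask for per-token log probabilities and surface them as a rough annotation. A minimal sketch, assuming the OpenAI Python client's logprobs option; the model name and the 0.5 threshold are just illustrative, and token probability measures how likely the wording was, not whether the claim is true, so it's only a crude proxy for confidence:

    # Annotate a completion with per-token "confidence" from logprobs.
    # Assumes the OpenAI Python client (>=1.0) and OPENAI_API_KEY set.
    import math
    from openai import OpenAI

    client = OpenAI()

    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # illustrative model choice
        messages=[{"role": "user", "content": "Who wrote 'The Trial'?"}],
        logprobs=True,  # request per-token log probabilities
    )

    for tok in resp.choices[0].logprobs.content:
        p = math.exp(tok.logprob)  # convert logprob to probability
        flag = "low-confidence" if p < 0.5 else "ok"  # arbitrary cutoff
        print(f"{tok.token!r:>12}  p={p:.2f}  {flag}")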



