On what topics you understand well does GOT-4o or Claude Opus produce garbage?

threeseed · 2024-06-02T22:46:05 1717368365

I do run into the issue where the longer the conversation goes the more inaccurate the information.

But a common situation is that with code generation it will fail to understand the context of where the code belongs and so it's a function that will compile but makes no sense.

fnordpiglet · 2024-06-02T22:51:12 1717368672

Yeah. I often springboard into a new context by having the LLM compose the next prompt based on the discussion and restart the context. Remarkably effective if you ask it to incorporate “prompt engineering” terms from research.

Zenzero · 2024-06-03T14:42:29 1717425749

Anything deeper than surface level in medicine.

Try getting it to properly select crystalloids with proper additives for a patient with a given history and lab results and watch in horror as it confidently gives instructions that would kill the patient.

What is even more irritating is that I had gpt4 debate me on things that it was completely wrong about and it was only when I responded with a stern rebuke that it hit me with the usual "Apologies for the misunderstanding..."

mensetmanusman · 2024-06-03T17:11:03 1717434663

LLMs are not good at answering expert level questions at the forefront of human knowledge.

Zenzero · 2024-06-03T19:19:02 1717442342

Unfortunately it would be considered basic medicine in this case.

mensetmanusman · 2024-06-03T20:08:29 1717445309

Is it basic but not documented? Basic to me means the first google search result is generally correct.

Zenzero · 2024-06-03T22:46:05 1717454765

That's not how medicine operates.

Medical problems are highly contextual, so you are not going to get much valuable information at the level of what a doctor is thinking from the first page of Google. That doesn't mean it isn't a simple within our area of expertise.

mensetmanusman · 2024-06-05T12:17:16 1717589836

In my area of expertise, a well formulated google search can result in a page 1 full of academic articles on the general topic, but there isn’t necessarily consensus. This might be a case of the curse of knowledge :)

mordymoop · 2024-06-03T20:04:55 1717445095

To be fair, I have not found MDs to be particularly reliable for answering basic questions about medicine either.

Zenzero · 2024-06-03T22:47:29 1717454849

OK. I can't speak for what you've experienced. I can only offer what I see from LLMs given what I know.

lupire · 2024-06-03T11:27:53 1717414073

High school math problems.