But note that just because a model says it's using chain-of-thought or tools to come to a certain conclusion, doesn't necessarily mean that it is: https://vgel.me/posts/tools-not-needed/
Yes, I agree. But note that the same logic applies to human beings too. Just because people say they are using chain-of-thought or tools to come to a certain conclusion, doesn't necessarily mean they are. Philosophers have been grappling with this issue for many centuries :-)
The analogy is meant to show that, while it’s possible to raise deep philosophical questions based on superficial or trivial observations, it can also be quite silly to do that.
Yes. I have been using ChatGPT quite a bit for programming tasks. One of the things I've been trying to do is use chain-of-thought prompting to ask the model to review its own code, line by line, evaluating it for the presence of a certain type of bug or some other criterion.
This has been illuminating: as ChatGPT steps through the lines of code, its "analysis" discusses material that _is not present in the line of code_. It then reaches a "conclusion" that is either correct or incorrect, but that has no real relationship to the actual code.
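For anyone who wants to reproduce the experiment, here's roughly the shape of it. This is a minimal sketch, not my actual prompt or code: the buggy snippet is made up, and it assumes the openai 0.x Python package with OPENAI_API_KEY set in the environment.

    import openai  # pip install openai (0.x-style API assumed)

    # Made-up snippet to review; the bug is an unguarded division.
    code = "def mean(xs):\n    return sum(xs) / len(xs)\n"

    prompt = (
        "Review the following Python code one line at a time. For each line, "
        "quote the line, reason step by step about what it does, and say "
        "whether it can raise an unhandled exception.\n\n" + code
    )

    resp = openai.ChatCompletion.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )

    # The interesting part: compare each "step" in the reply against the
    # line it claims to be quoting.
    print(resp["choices"][0]["message"]["content"])

temperature=0 just makes the behavior easier to reproduce from run to run.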
Yes.
There's evidence that you can get these models to write chain-of-thought explanations that are consistent with the instructions in the given text.
For example, take a look at the ReAct paper: https://arxiv.org/abs/2210.03629
and some of the LangChain tutorials that use it (a minimal sketch of the pattern follows the links):
https://langchain.readthedocs.io/en/latest/modules/agents/ge...
https://langchain.readthedocs.io/en/latest/modules/agents/im...
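For reference, the core of those tutorials boils down to a few lines. Rough sketch only, assuming one of the older 0.0.x langchain releases those docs describe, an OpenAI API key in the environment, and the llm-math tool purely as an example:

    from langchain.agents import initialize_agent, load_tools
    from langchain.llms import OpenAI

    llm = OpenAI(temperature=0)
    tools = load_tools(["llm-math"], llm=llm)  # calculator tool backed by the LLM

    # "zero-shot-react-description" wires up the ReAct-style
    # Thought / Action / Observation loop from the paper above.
    agent = initialize_agent(
        tools, llm, agent="zero-shot-react-description", verbose=True
    )
    agent.run("What is 3 raised to the 0.43 power?")

With verbose=True it prints the intermediate Thought / Action / Observation steps, i.e. the chain-of-thought-plus-tool-use transcript being discussed in this thread.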