
That's literally a text search task. That's not what I mean; I mean things like understanding the rules of addition from examples, the rules of logic from examples, or the rules of chess.


According to [1], they trained an LLM on legal Othello moves, and 1) it achieved an error rate of 0.01%; 2) when they analyzed its network, they found a model of an Othello board; 3) when they modified that in-network model, it started generating moves that were legal in the modified board position.

In other words, the LLM did build an internal model that contained the rules of Othello merely from seeing legal moves. It's reasonable to assume the same thing is happening (at least to some degree) with LLMs trained on human language.
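
The "found a model of the board" step in [1] was done by training probes on the network's internal activations. A rough sketch of the idea (not the paper's exact setup: the shapes and the random stand-in data here are hypothetical, and the paper used nonlinear probes where this uses a linear one):

    import torch
    import torch.nn as nn

    # Hypothetical shapes: `acts` would be hidden activations from one layer
    # of the move-prediction model; `board_labels` the true square states
    # (empty / black / white) at the same timesteps. Random stand-ins here.
    N, D, SQUARES, STATES = 10_000, 512, 64, 3
    acts = torch.randn(N, D)
    board_labels = torch.randint(STATES, (N, SQUARES))

    # One linear readout per board square: if a simple probe can recover the
    # board from the activations, the board state is encoded in them.
    probe = nn.Linear(D, SQUARES * STATES)
    opt = torch.optim.Adam(probe.parameters(), lr=1e-3)
    loss_fn = nn.CrossEntropyLoss()

    for _ in range(100):
        logits = probe(acts).view(N, SQUARES, STATES)
        loss = loss_fn(logits.reshape(-1, STATES), board_labels.reshape(-1))
        opt.zero_grad()
        loss.backward()
        opt.step()

    # Probe accuracy far above chance suggests an internal board model.
    with torch.no_grad():
        preds = probe(acts).view(N, SQUARES, STATES).argmax(-1)
        print(f"probe accuracy: {(preds == board_labels).float().mean():.3f}")

The intervention step (3) then goes the other way: edit the activations so the probed board state changes, and check that the model's next-move predictions become legal for the edited board.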

[1] https://thegradient.pub/othello/


It can't search text. It doesn't have access to any text. Whatever it does, it works some other way.

It is sometimes able to do other tasks, but unlike humans (or "AGI") it has a completely fixed compute budget per token and can't pause to think between outputting one token and the next.

(Btw, I tried to get it to derive addition from two 1-digit examples but couldn't.)



