The clue is in the name: it's a large language model.
When I asked ChatGPT "Does ChatGPT have a world model?", I got this output:
"ChatGPT, like other GPT models, does not possess an explicit world model. GPT (Generative Pre-trained Transformer) models, such as ChatGPT, are trained on a large corpus of text data from the internet and do not have direct access to real-time information about the world. They rely solely on the patterns and information they have learned from their training data.
While GPT models can generate contextually relevant and coherent responses based on their training, they do not possess up-to-date knowledge or awareness of current events. Their responses are primarily based on patterns, associations, and statistical regularities present in the text they were trained on, rather than an explicit representation of the world.
It's important to note that GPT models can occasionally generate plausible-sounding but inaccurate or misleading information, especially when prompted with questions or topics beyond the scope of their training data. Therefore, it's always advisable to verify information from reliable sources when seeking accurate and up-to-date information about the world."
When I asked it "Does a large language model infer an understanding of a topic?", it gave this output:
"A large language model, such as ChatGPT, can exhibit the appearance of understanding a topic due to its ability to generate coherent and contextually relevant responses. It can make inferences based on patterns and associations learned from its training data, which allows it to generate text that appears knowledgeable about a particular subject.
However, it's crucial to understand that this understanding is derived solely from statistical patterns present in the training data, and the model does not possess true comprehension or conceptual understanding like humans do. The model does not have access to real-world experiences, common sense, or the ability to reason in the same way as humans.
The responses generated by a language model should be interpreted as a combination of learned patterns and statistical associations rather than as genuine understanding. The model lacks the ability to deeply grasp the meaning, implications, or nuances of a topic beyond what it has learned from the text it was trained on. Therefore, it's essential to approach the responses generated by a language model critically and corroborate information from reliable sources when necessary."
What does this prove? It proves either that someone trained ChatGPT to spit out this information or that this view is very common on the Internet, which doesn't necessarily make it right.
In the same breath: what does saying it must have a world model prove? Why must it? For every occasion where it can impress with seeming reasoning, there are plenty of times where it just falls down.
The more balanced view is that we simply don't know; it's an assumption on both sides of the fence: Schrödinger's world model. If something resembling a world model is emergent from the sheer scale of the data, that's a very interesting idea, but it's clear that it still doesn't understand, in the same way that a physics simulation doesn't understand what it's doing either, yet can still output usable information whilst doing something incredibly complex. Ergo, I'd say it's far more likely that there isn't anything out of the ordinary going on.
If I can't get something as simple as a valid NGINX configuration out of it without hallucinations, despite supplying it with the original Apache .htaccess file, the documentation and the URL rewrites it will require, why would it have a world model? How an NGINX configuration works is far simpler than how the world works with its many systems. Especially when you consider that an NGINX configuration is inherently language-based, which should be exactly what it excels at, pattern recognition and all, yet it's dumb pattern recognition to a fault, just "this commonly follows that" in the training data.
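To make the kind of task concrete, here's a hypothetical rewrite of the sort I mean (the paths are made up, not taken from my actual config). In .htaccess it might look like:

    RewriteEngine On
    # Map /blog/123/some-post to /posts/some-post?id=123, keeping any query string (QSA)
    RewriteRule ^blog/([0-9]+)/([a-z0-9-]+)/?$ /posts/$2?id=$1 [L,QSA]

and the NGINX equivalent is a fairly mechanical translation, roughly:

    # At server-block level; note the leading slash NGINX matches against, unlike .htaccess
    # NGINX appends the original query arguments after id=$1 by default, which mirrors QSA
    rewrite ^/blog/([0-9]+)/([a-z0-9-]+)/?$ /posts/$2?id=$1 last;

Yet getting even that sort of one-to-one translation out of it without errors proved unreliable.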
The world model is very flawed, yes, but it does exist. I don't think that's "out of the ordinary"; if you train something on complex real-world data, you would expect at least a rudimentary world model to develop. A world model doesn't have to be something complicated; it's just having some ability to predict things that could happen in the real world. An ant probably has a world model in the sense that I'm using it.