"Everyone just says scaling hypothesis. Everyone neglects to ask, what are we sc...

waldarbeiter · 2024-09-04T15:28:36 1725463716

The conventional teaching that I am aware of says that you can scale across three dimensions: data, compute, parameters. But Ilya's formulation suggests that there may be more dimensions along which scaling is possible.

trashtester · 2024-09-05T12:29:35 1725539375

That's not how I read it. The scaling may still be those parameters, but the object (the "what" that is subjected to scaling) may need to retain some characteristics as it scales.

In other words, there may be a need to retains some sorts of symmetries or constraints from generation to generation that others understand less well than him (or so he thinks).

slashdave · 2024-09-05T16:43:13 1725554593

You also need to scale data. Since OpenAI has basically exhausted all available text data, there is something to this.