
It's not even been 2 years, and you think things are coming to a halt?


Yes. The models require training data, and they have already been fed the internet.

More and more of the content generated since then is itself LLM-generated and useless as training data.

The models get worse, not better, when fed their own output, and right now they are out of training data.
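(A toy sketch of that effect, purely illustrative and assuming nothing beyond NumPy: fit a distribution to some data, resample the next generation's training data from the fit, repeat. The fitted spread tends to drift toward zero, which is the simplest analogue of the collapse being claimed here; the names and parameters are made up for the example, not taken from any real training pipeline.)

    import numpy as np

    # Toy illustration of "model collapse": a model fit to its own samples
    # progressively loses the spread (tails) of the original human data.
    # Simplified analogue only, not a claim about any specific LLM.
    rng = np.random.default_rng(0)
    data = rng.normal(loc=0.0, scale=1.0, size=20)  # generation 0: "human" data

    for gen in range(1, 51):
        mu, sigma = data.mean(), data.std()              # "train": fit a Gaussian
        data = rng.normal(loc=mu, scale=sigma, size=20)  # "generate": next generation's corpus
        if gen % 10 == 0:
            print(f"gen {gen:2d}: fitted std = {sigma:.3f}")  # typically shrinks over generations

Real pipelines are obviously far more complicated, but that is the direction of the effect being described.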

This is why Reddit just went profitable: AI companies buy its text to train their models because it is at least somewhat human-written.

Of course, even Reddit is crawling with LLM-generated text, so yes. It is coming to a halt.


Data is not the only factor. Architecture improvements, data filtering, etc. matter too.


I know for a fact they are, because the rate _and_ quality of improvement are diminishing exponentially. I keep a close eye on this field as part of my job.



