> They train the LLM directly on the corpus so that the documents are embedded in its weights.

They don't outright say that in the paper, as far as I could tell. I only got it from reading HN comments. I'm just very confused about why they use a nonstandard term like "internalize", which pisses me off because ML is hard enough without people inventing their own terms.