Hacker News new | past | comments | ask | show | jobs | submit login

> They train the LLM directly on the corpus so that the documents are embedded in its weights.

they do not outright say that in the paper as far as i could tell. i only got it from reading hn comments. just very confused why they use a nonstandard term like "internalize" which just pisses me off because ML is hard enough without inventing your own terms




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: