As opposed to, sampling from the model a bunch, getting scores offline, and then fine tuning the model on those offline scored generations.