So what you’re saying is that they not only trained it on text, but also verified the answers and trained it so that it would get negative feedback when it gave a wrong answer?



The model got negative feedback when it gave wrong-sounding answers, which is not the same as giving wrong answers.

I have seen no indication that they had anybody verify the factual content of answers in the training process, and many indications that they did not.



