
Yes, we are getting there. I think the compiler is a bigger problem than unit tests, given that most verticals don't even have one. With unit tests, there would be some reward hacking, but it would be controlled at the model level plus the tests. (This is one of the reasons I don't believe in a transformer-based LLM-as-a-judge for a verifier.)



