Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Anyone have a list of benchmarks that do not release the actual test set?

Anyone else share the suspicion that ML rapidly approaching 100% on benchmarks is sometimes due to releasing the test set?



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: