Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Which benchmarks are not garbage?

I don't consider myself super special. I think it should be doable to create a benchmark that beats me having to test every single new model.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: