Hacker News

Can we really draw any conclusions about LLMs from a single sample? Maybe you've tried multiple times and with different semi-famous people, but in general I see people comparing ML models in this fashion.


Not really. I did try it multiple times with multiple people, and ChatGPT had more issues; I just shared one of the attempts. If someone tests this more systematically, that'd be great.



