Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's not either/or. Generally you finetune when optimized many-shot still doesn't hit your desired quality bar. And it turns out with RL, things like system prompts matter a lot, so searching over prompts is a good idea even when reinforcing the desirable circuits.


I am not an expert in fine tuning, but in the company I work for our fine tuned model didn't do any noticeable difference.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: