
So dynamic quants like the ones I upload are not actually 4-bit! They're a mixture of 4-bit to 8-bit, with important layers kept in higher precision! I wrote about our method here: https://docs.unsloth.ai/basics/unsloth-dynamic-2.0-ggufs
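
To make the "not actually 4-bit" point concrete, here's a rough sketch of how the average bits per weight works out when a few layers are kept at higher precision. The layer names, parameter counts, and bit assignments below are made up for illustration; they are not the actual assignments used in the uploaded quants, and real quant formats also carry some per-block overhead that this ignores.

```python
# Hypothetical layers: (name, parameter count, bits per weight).
# These numbers are invented for illustration only.
layers = [
    ("important_layer", 1_000_000, 8),  # kept in higher precision
    ("mlp_a", 3_000_000, 4),
    ("mlp_b", 4_000_000, 4),
]


def average_bits_per_weight(layers):
    """Parameter-weighted average bit width across all layers."""
    total_params = sum(count for _, count, _ in layers)
    total_bits = sum(count * bits for _, count, bits in layers)
    return total_bits / total_params


avg_bits = average_bits_per_weight(layers)
print(f"{avg_bits:.2f} bits per weight")  # prints "4.50 bits per weight"
```

So even with most weights at 4-bit, keeping one layer at 8-bit pulls the average above 4 in this toy setup.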

