Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The pixtral models are quite good and fast. They might be on par with gemma 3


They are different. Gemma 3 12b excels at natural languages but terrible at long context. Pixtral 12b is better at long context (not stellar), but worse at natural language.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: