Hacker News
Unsloth creators fix universal error with gradient accumulation (unsloth.ai)
4 points by ZQ-Dev8 3 months ago | 2 comments



Article title: Bugs in LLM Training - Gradient Accumulation Fix


seems like something the PyTorch maintainers would want to know about and fix asap...



