Hacker News new | past | comments | ask | show | jobs | submit login

The same issue exists with Jax. XLA compilation can take up quite a bit of time, especially on larger NN models. And theres no persistent compile cache, so even if you don't change the jitted function you need to wait for compilation again as you restart the process.



Jax does actually already support a persistent compilation cache for TPU, and support for caching GPU compiles is being worked on currently.




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: