Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I few people have mentioned dagster and I took a look at that for some machine learning things I was playing with but I found dvc (data version control [1]) and I think it is fantastic. I think it also has more applications than just machine learning but really anything with data. If you have a bunch of shell scripts that write to files to pass data around, then dvc might be a good fit. it will do things like only rerun steps if it needs to. Also for totally non-data stuff, Prefect is great.

[1] https://dvc.org



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: