
this is a huge difference between pandas and Ibis (and really, any dataframe library that works with larger-than-RAM or distributed engines). PySpark, Polars, and Snowpark also, to some extent, do not implicitly order your data

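so in Ibis, you need to manage order explicitly with an `order_by` call at the end. there are a lot of technical reasons why maintaining order is inefficient as you scale up; for example, an engine that processes partitions in parallel would have to track and restore each row's original position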
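
a rough sketch of the same idea one layer down, in plain SQL (this is not Ibis code; it uses Python's stdlib `sqlite3`, and the `events` table and its columns are made up for illustration). the point is only that row order is something you ask for as the last step, not something the engine promises:

```python
import sqlite3

# Hypothetical in-memory table, purely for illustration.
con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE events (name TEXT, amount INTEGER)")
con.executemany(
    "INSERT INTO events VALUES (?, ?)",
    [("carol", 5), ("alice", 3), ("bob", 9), ("alice", 4)],
)

# No ORDER BY: SQL makes no promise about the order of the returned rows,
# so code should not rely on whatever order happens to come back.
unordered = con.execute(
    "SELECT name, SUM(amount) AS total FROM events GROUP BY name"
).fetchall()

# Explicit ordering as the final step of the query.
ordered = con.execute(
    "SELECT name, SUM(amount) AS total FROM events "
    "GROUP BY name ORDER BY total DESC"
).fetchall()

print(ordered)
```

both queries return the same rows; only the second one has a defined order. an `order_by` at the end of an Ibis expression plays the same role as the trailing `ORDER BY` here.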


