I really like R’s library and I’ll use them any chance I get (libraries like lmer are still orders of magnitude more efficient than the same model in Statsmodels).
From my experience the biggest impediment to using R in production is many orgs don’t have a blessed way to run it.
R is my favourite language for data processing, the manual section Computing on the Language[1]is why R is such an ergonomic tool. I had hoped Julia would catch up, but Julia’s macros are not comparable in their depth.
I think pandas is probably the data equivalent of editing files using default vim or processing data with awk.
As a joke, I wrote an Ibis backend (https://github.com/cpcloud/ibish) that processes expressions using shell commands strung together with named pipes. It supports joins using the coreutils join command, projections, filters and some aggregations with awk.
It's faster than pandas in some cases and folks should put it into production immediately!
From my experience the biggest impediment to using R in production is many orgs don’t have a blessed way to run it.
R is my favourite language for data processing, the manual section Computing on the Language[1]is why R is such an ergonomic tool. I had hoped Julia would catch up, but Julia’s macros are not comparable in their depth.
I think pandas is probably the data equivalent of editing files using default vim or processing data with awk.
[1] https://rstudio.github.io/r-manuals/r-lang/Computing-on-the-...