Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I agree with your comment, just FYI you could avoid apply on a pandas Series since it loops over every row in Python space. It's much faster to use vectorized operations directly e.g.:

df["balance"].mul(36902.7).div(100e6)

See [0] for explanations

[0] https://stackoverflow.com/a/52674448



Thanks for the tip! You weren't kidding...

    %time df['balance'].apply(lambda x: x * 36902.7 / 100e6).describe()
    Wall time: 12.6 s

    %time df["balance"].mul(36902.7).div(100e6).describe()
    Wall time: 2.33 s




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: