
Spark is going to try to ingest all the data, and it won't fit in RAM. Wrong tool for the job, basically.


Depends on exactly how you do it, I suppose, but the data shouldn't necessarily have to fit in RAM; Spark processes partitions lazily and can spill to disk. Most Hadoop-style work can also be accomplished in Spark without much fuss.
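
As a minimal sketch of that point, here is a classic Hadoop-style word count written as Spark transformations (PySpark, with hypothetical input/output paths). Because the pipeline is evaluated lazily per partition and the shuffle can spill to disk, the full dataset never has to sit in memory at once:

    from pyspark.sql import SparkSession

    # Hypothetical paths; any large text dataset would do.
    INPUT_PATH = "hdfs:///data/large_logs/*.txt"
    OUTPUT_PATH = "hdfs:///data/wordcount_output"

    spark = SparkSession.builder.appName("wordcount-sketch").getOrCreate()
    sc = spark.sparkContext

    # Hadoop-style map/reduce expressed as Spark transformations.
    # Partitions are processed one at a time, and shuffle data that
    # exceeds available memory spills to disk rather than failing.
    counts = (
        sc.textFile(INPUT_PATH)
          .flatMap(lambda line: line.split())
          .map(lambda word: (word, 1))
          .reduceByKey(lambda a, b: a + b)
    )

    counts.saveAsTextFile(OUTPUT_PATH)
    spark.stop()

Whether this is a good fit still depends on the shuffle size and cluster resources, but "all the data must fit in RAM" isn't a hard requirement.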



