AI probably won't replace me in 2025 (robinlinacre.com)
3 points by RobinL 39 days ago | past | 2 comments

The emerging impact of LLMs on my productivity (robinlinacre.com)
3 points by sebg 61 days ago | past

Super-fast deduplication of large datasets using Splink and DuckDB (robinlinacre.com)
4 points by chuckhend on Jan 20, 2024 | past

Super-fast deduplication of large datasets using Splink and DuckDB (robinlinacre.com)
5 points by RobinL on Jan 19, 2024 | past

Why Probabilistic Linkage Is More Accurate Than Fuzzy Matching for Data Deduping (robinlinacre.com)
4 points by RobinL on Jan 7, 2024 | past

An Interactive Introduction to Probabilistic Data Linkage/Deduplication (robinlinacre.com)
3 points by RobinL on Oct 29, 2023 | past

Why parquet files are my preferred API for bulk open data (robinlinacre.com)
3 points by RobinL on Sept 15, 2023 | past

SQL should be the default choice for data transformation logic (robinlinacre.com)
434 points by RobinL on Jan 30, 2023 | past | 267 comments

Why parquet files are my preferred API for bulk open data (robinlinacre.com)
2 points by kristianp on Jan 10, 2023 | past

Demystifying Apache Arrow (2020) (robinlinacre.com)
197 points by dmlorenzetti on Jan 9, 2023 | past | 47 comments

Why parquet files are my preferred API for bulk open data (robinlinacre.com)
8 points by RobinL on Jan 9, 2023 | past

Demystifying Apache Arrow (robinlinacre.com)
1 point by dmlorenzetti on Oct 27, 2021 | past

The downfall of command and control data leadership (robinlinacre.com)
2 points by RobinL on Nov 8, 2020 | past