niemandhier's favorites

1.		Sleep all comes down to the mitochondria (science.org)
		639 points by A_D_E_P_T 20 days ago \| 308 comments
2.		LLM architecture comparison (sebastianraschka.com)
		418 points by mdp2021 30 days ago \| 24 comments
3.		Smollm3: Smol, multilingual, long-context reasoner LLM (huggingface.co)
		388 points by kashifr 42 days ago \| 79 comments
4.		What I learned gathering nootropic ratings (2022) (troof.blog)
		127 points by julianh65 48 days ago \| 177 comments
5.		Andrej Karpathy: Software in the era of AI [video] (youtube.com)
		1481 points by sandslash 62 days ago \| 783 comments
6.		Ask HN: How do I learn robotics in 2025?
		408 points by srijansriv 78 days ago \| 99 comments
7.		Why DeepSeek is cheap at scale but expensive to run locally (seangoedecke.com)
		328 points by ingve 79 days ago \| 227 comments
8.		AI, Heidegger, and Evangelion (fakepixels.substack.com)
		164 points by jger15 87 days ago \| 83 comments
9.		Peer Programming with LLMs, for Senior+ Engineers (pmbanugo.me)
		213 points by pmbanugo 87 days ago \| 91 comments
10.		Deep Learning Is Applied Topology (theahura.substack.com)
		508 points by theahura 3 months ago \| 183 comments
11.		Mercury: Commercial-scale diffusion language model (inceptionlabs.ai)
		385 points by HyprMusic 3 months ago \| 180 comments
12.		Are polynomial features the root of all evil? (2024) (alexshtf.github.io)
		188 points by Areibman 3 months ago \| 77 comments
13.		Oda Ujiharu: Why the ‘weakest Samurai warlord’ is admired (tokyoweekender.com)
		162 points by cdplayer96 4 months ago \| 65 comments
14.		12-factor Agents: Patterns of reliable LLM applications (github.com/humanlayer)
		475 points by dhorthy 4 months ago \| 78 comments
15.		Numbering should start at zero (1982) (utexas.edu)
		107 points by checkyoursudo 5 months ago \| 295 comments
16.		Block Diffusion: Interpolating between autoregressive and diffusion models (arxiv.org)
		156 points by GaggiX 5 months ago \| 32 comments
17.		Ask HN: Any insider takes on Yann LeCun's push against current architectures?
		385 points by vessenes 5 months ago \| 325 comments
18.		Writing an LLM from scratch, part 8 – trainable self-attention (gilesthomas.com)
		380 points by gpjt 5 months ago \| 31 comments
19.		DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling (github.com/deepseek-ai)
		391 points by mfiguiere 5 months ago \| 67 comments
20.		Part two of Grant Sanderson's video with Terry Tao on the cosmic distance ladder (mathstodon.xyz)
		385 points by ColinWright 5 months ago \| 94 comments
21.		A step-by-step guide to the “World Models” AI paper (applied-data.science)
		261 points by datashrimp on April 17, 2018 \| 37 comments
22.		DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL (arxiv.org)
		1351 points by gradus_ad 6 months ago \| 1056 comments