Hacker Newsnew | past | comments | ask | show | jobs | submit | niemandhier's favoriteslogin
1.Sleep all comes down to the mitochondria (science.org)
639 points by A_D_E_P_T 20 days ago | 308 comments
2.LLM architecture comparison (sebastianraschka.com)
418 points by mdp2021 30 days ago | 24 comments
3.Smollm3: Smol, multilingual, long-context reasoner LLM (huggingface.co)
388 points by kashifr 42 days ago | 79 comments
4.What I learned gathering nootropic ratings (2022) (troof.blog)
127 points by julianh65 48 days ago | 177 comments
5.Andrej Karpathy: Software in the era of AI [video] (youtube.com)
1481 points by sandslash 62 days ago | 783 comments
6.Ask HN: How do I learn robotics in 2025?
408 points by srijansriv 78 days ago | 99 comments
7.Why DeepSeek is cheap at scale but expensive to run locally (seangoedecke.com)
328 points by ingve 79 days ago | 227 comments
8.AI, Heidegger, and Evangelion (fakepixels.substack.com)
164 points by jger15 87 days ago | 83 comments
9.Peer Programming with LLMs, for Senior+ Engineers (pmbanugo.me)
213 points by pmbanugo 87 days ago | 91 comments
10.Deep Learning Is Applied Topology (theahura.substack.com)
508 points by theahura 3 months ago | 183 comments
11.Mercury: Commercial-scale diffusion language model (inceptionlabs.ai)
385 points by HyprMusic 3 months ago | 180 comments
12.Are polynomial features the root of all evil? (2024) (alexshtf.github.io)
188 points by Areibman 3 months ago | 77 comments
13.Oda Ujiharu: Why the ‘weakest Samurai warlord’ is admired (tokyoweekender.com)
162 points by cdplayer96 4 months ago | 65 comments
14.12-factor Agents: Patterns of reliable LLM applications (github.com/humanlayer)
475 points by dhorthy 4 months ago | 78 comments
15.Numbering should start at zero (1982) (utexas.edu)
107 points by checkyoursudo 5 months ago | 295 comments
16.Block Diffusion: Interpolating between autoregressive and diffusion models (arxiv.org)
156 points by GaggiX 5 months ago | 32 comments
17.Ask HN: Any insider takes on Yann LeCun's push against current architectures?
385 points by vessenes 5 months ago | 325 comments
18.Writing an LLM from scratch, part 8 – trainable self-attention (gilesthomas.com)
380 points by gpjt 5 months ago | 31 comments
19.DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling (github.com/deepseek-ai)
391 points by mfiguiere 5 months ago | 67 comments
20.Part two of Grant Sanderson's video with Terry Tao on the cosmic distance ladder (mathstodon.xyz)
385 points by ColinWright 5 months ago | 94 comments
21.A step-by-step guide to the “World Models” AI paper (applied-data.science)
261 points by datashrimp on April 17, 2018 | 37 comments
22.DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL (arxiv.org)
1351 points by gradus_ad 6 months ago | 1056 comments

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: