Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
|
niemandhier's favorites
login
submissions
|
comments
1.
Sleep all comes down to the mitochondria
(
science.org
)
639 points
by
A_D_E_P_T
20 days ago
|
308 comments
2.
LLM architecture comparison
(
sebastianraschka.com
)
418 points
by
mdp2021
30 days ago
|
24 comments
3.
Smollm3: Smol, multilingual, long-context reasoner LLM
(
huggingface.co
)
388 points
by
kashifr
42 days ago
|
79 comments
4.
What I learned gathering nootropic ratings (2022)
(
troof.blog
)
127 points
by
julianh65
48 days ago
|
177 comments
5.
Andrej Karpathy: Software in the era of AI [video]
(
youtube.com
)
1481 points
by
sandslash
62 days ago
|
783 comments
6.
Ask HN: How do I learn robotics in 2025?
408 points
by
srijansriv
78 days ago
|
99 comments
7.
Why DeepSeek is cheap at scale but expensive to run locally
(
seangoedecke.com
)
328 points
by
ingve
79 days ago
|
227 comments
8.
AI, Heidegger, and Evangelion
(
fakepixels.substack.com
)
164 points
by
jger15
87 days ago
|
83 comments
9.
Peer Programming with LLMs, for Senior+ Engineers
(
pmbanugo.me
)
213 points
by
pmbanugo
87 days ago
|
91 comments
10.
Deep Learning Is Applied Topology
(
theahura.substack.com
)
508 points
by
theahura
3 months ago
|
183 comments
11.
Mercury: Commercial-scale diffusion language model
(
inceptionlabs.ai
)
385 points
by
HyprMusic
3 months ago
|
180 comments
12.
Are polynomial features the root of all evil? (2024)
(
alexshtf.github.io
)
188 points
by
Areibman
3 months ago
|
77 comments
13.
Oda Ujiharu: Why the ‘weakest Samurai warlord’ is admired
(
tokyoweekender.com
)
162 points
by
cdplayer96
4 months ago
|
65 comments
14.
12-factor Agents: Patterns of reliable LLM applications
(
github.com/humanlayer
)
475 points
by
dhorthy
4 months ago
|
78 comments
15.
Numbering should start at zero (1982)
(
utexas.edu
)
107 points
by
checkyoursudo
5 months ago
|
295 comments
16.
Block Diffusion: Interpolating between autoregressive and diffusion models
(
arxiv.org
)
156 points
by
GaggiX
5 months ago
|
32 comments
17.
Ask HN: Any insider takes on Yann LeCun's push against current architectures?
385 points
by
vessenes
5 months ago
|
325 comments
18.
Writing an LLM from scratch, part 8 – trainable self-attention
(
gilesthomas.com
)
380 points
by
gpjt
5 months ago
|
31 comments
19.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
(
github.com/deepseek-ai
)
391 points
by
mfiguiere
5 months ago
|
67 comments
20.
Part two of Grant Sanderson's video with Terry Tao on the cosmic distance ladder
(
mathstodon.xyz
)
385 points
by
ColinWright
5 months ago
|
94 comments
21.
A step-by-step guide to the “World Models” AI paper
(
applied-data.science
)
261 points
by
datashrimp
on April 17, 2018
|
37 comments
22.
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via RL
(
arxiv.org
)
1351 points
by
gradus_ad
6 months ago
|
1056 comments
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: