Hacker News new | past | comments | ask | show | jobs | submit | from login
Phase behavior of Cacio and Pepe sauce (arxiv.org)
3 points by pizza 7 days ago | past | discuss
Unlocking the Potential of Large Language Models in Data-Scarce Contexts (arxiv.org)
1 point by PaulHoule 7 days ago | past | discuss
European Space Agency Benchmark for Anomaly Detection in Satellite Telemetry (arxiv.org)
3 points by sarusso 7 days ago | past | discuss
[flagged] A path to O1 open source (arxiv.org)
133 points by bchelli 7 days ago | past | 80 comments
Algorithmic Language Models with Neurally Compiled Libraries (arxiv.org)
1 point by wseqyrku 7 days ago | past | discuss
Faster Positional-Population Counts for AVX2, AVX-512, and Asimd (arxiv.org)
2 points by mfiguiere 7 days ago | past | discuss
The Prompt Canvas: A Literature-Based Guide for Effective Prompts in LLMs (arxiv.org)
2 points by geox 7 days ago | past | discuss
Phase behavior of Cacio and Pepe sauce (arxiv.org)
378 points by rev13013 7 days ago | past | 192 comments
2 OLMo 2 Furious (arxiv.org)
4 points by lavabender 8 days ago | past | 1 comment
Medec: A Benchmark for Medical Error Detection and Correction in Clinical Notes (arxiv.org)
2 points by gone35 8 days ago | past | discuss
InvestorBench: A Benchmark for Financial Decision-Making Tasks with Agents (arxiv.org)
1 point by xianshou 8 days ago | past | discuss
Meta: Memory Layers at Scale (arxiv.org)
4 points by georgehill 8 days ago | past | discuss
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? (2023) (arxiv.org)
218 points by tzury 8 days ago | past | 104 comments
Reinforcement Learning for Multi-Intersection Traffic Signal Control (arxiv.org)
1 point by PaulHoule 8 days ago | past | discuss
MVQ: Efficient DNN Compression and Acceleration with Masked Vector Quantization (arxiv.org)
2 points by PaulHoule 8 days ago | past | discuss
Generative Modeling with Explicit Memory (arxiv.org)
2 points by PaulHoule 8 days ago | past | discuss
The Overthinking of O1-Like LLMs (arxiv.org)
3 points by omarsar 8 days ago | past | 1 comment
Would You Donate to a Chatbot? (arxiv.org)
2 points by frikskit 8 days ago | past | discuss
Why transformers are obviously good models of language (arxiv.org)
6 points by jxmorris12 8 days ago | past | discuss
Scaling of Search and Learning (arxiv.org)
2 points by jonbaer 9 days ago | past | discuss
Princ-wiki-a Mathematica: Wikipedia editing and mathematics (arxiv.org)
2 points by belter 9 days ago | past | discuss
4.5M Fake Stars in GitHub: Popularity Contests, Scams, and Malware [pdf] (arxiv.org)
4 points by caust1c 9 days ago | past | 2 comments
Experimental evidence photon can spend negative amount of time in an atom cloud (arxiv.org)
4 points by belter 9 days ago | past | discuss
Potential Perturbation of the Ionosphere by Megaconstellations (arxiv.org)
4 points by belter 9 days ago | past | 1 comment
1.58-Bit Flux (arxiv.org)
2 points by reynaldi 9 days ago | past | 1 comment
DeepSeek-V2: A Strong, Economical, and Efficient MOE Language Model (arxiv.org)
3 points by sonabinu 9 days ago | past | discuss
Identifying and Manipulating LLM Personality Traits via Activation Engineering (arxiv.org)
23 points by rntn 10 days ago | past | 9 comments
A 6 Years' Experience in Mitigating Cross-Core Interference in Linux (arxiv.org)
2 points by belter 10 days ago | past | discuss
Hint at an axion-like particle from GRB 221009A (arxiv.org)
2 points by belter 10 days ago | past | discuss
Re-Bench: Evaluating ML agents against human ML experts (arxiv.org)
2 points by marojejian 10 days ago | past | 1 comment

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: