rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

s-macke · 2025-01-10T13:29:09 1736515749

> Notably, no self-reflection training data or prompt was included, suggesting that advanced System 2 reasoning can foster intrinsic self-reflection.

They suggest that self-reflection is an emergent phenomenon of reasoning. Impressive. Can't wait to see the code.

helltone · 2025-01-10T14:57:33 1736521053

Off topic, but how is MCTS usually implemented efficiently? Its branching structure doesn't seem easy to parallelize on a GPU.

throwaway81523 · 2025-01-10T00:51:15 1736470275

Abstract is impressive. I'm surprised this post hasn't gotten more attention.

roboboffin · 2025-01-10T09:10:34 1736500234

Yeah, that's what I thought.

dantodor · 2025-01-10T01:31:03 1736472663

The repo gives a 404?

funcDropShadow · 2025-01-10T10:38:33 1736505513

The abstract says the code will be available.