Hacker News new | past | comments | ask | show | jobs | submit login
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking (arxiv.org)
36 points by roboboffin 2 days ago | hide | past | favorite | 6 comments





> Notably, no self-reflection training data or prompt was included, suggesting that advanced System 2 reasoning can foster intrinsic self-reflection.

They suggest, that self-reflection is an emergent phenomena of reasoning. Impressive. Can't wait to see the code.


Off topic but how is MCTS usually implemented efficiently? It has a branching structure that doesn't seem parallelizable (GPU).

Abstract is impressive. I'm surprised this post hasn't gotten more attention.

Yeah, that's what I thought.

The repo gives 404?

The abstract says the code will be available.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: