*>Imagine having 300.* Would it not be useful to have multiple independent AIs o...

JoshTko · 2025-01-27T20:11:44 1738008704

This is exactly where Deepseeks enhancements come into play. Essentially deepseek lets the model think out loud via chain of thought (o1 and Claude also do this) but DS also does not supervise the chain of thought, and simply rewards CoT that get the answer correctly. This is just one of the half dozen training optimization that Deepseek has come up with.

tomrod · 2025-01-27T19:00:36 1738004436

Yes; to my understanding that is MoE.