The problem is not the quality of the code; it simply almost never works. I just tried some models still in beta on the code of one of the C projects and, while some pieces were okay and could be used as is, others were complete garbage and wouldn't even compile. Maybe in a few years this will no longer be a problem, but right now they are useless. AI-assisted research and internet search are a different thing, and they already work pretty well.