Your experience echoes my own for sufficiently trivial tasks, but I haven't gotten any of this to work for the actual time-consuming parts of my job. It's so reliably bad for some tasks that I've reworked them into screening questions for candidates trying to skate by with AI without knowing the fundamentals. Is that really not your experience, even with Claude Code?
Right, and I wasn't able to get this to work for any of the actual time-consuming parts of my job until last weekend, when I started using sub-agents: running them in head-to-head battles, selecting the best one, and repeating.
Last weekend I did nothing but pit different ideas against each other, pick the most successful one, and repeat.
And now my experience is no longer the same. Before last weekend, I had the same experience you are describing.