It does actually work. For some of the experiments I did with GPT-4, it made some mistakes because my initial prompt wasn't sufficiently precise. After discussing its mistakes with it, I asked it to write a better prompt that would prevent them. Sure enough, it did just that.