That’s simple - after it tries to be helpful and agreeable I just ask for a “devils advocate” response. I have a much longer prompt I use sometimes involve being a “sparring partner”.
And I go back and forth sometimes between correcting its devils advocate responses and “steel man” responses.
And I go back and forth sometimes between correcting its devils advocate responses and “steel man” responses.