I think it would be interesting to arrange a similar situation with three chatbots, by prompting each that they should assume the role of someone famous, e.g. Taylor Swift, and prove to the other two that they are not. Or expose the other two as human operators and not LLMs.