The API system prompt is weak and can easily be overriden by user instructions. There's something similar on ChatGPT, but a lot stronger than the system prompt.
I suspect GPT-3.5 is also just very heavily tuned davinci and such, to a level where it's much cheaper but also responds mechanically.