That's not alignment, that's misalignment. Proper alignment would accurately identify the issues. With this many false positives, you have to wonder what the confusion matrix for their refusal classifier looks like.
> The idea is to not mention words that Gemini deems unsafe, make Gemini use those words for you instead, then you refer to them.
> Guiding Gemini to use the words can be tricky, but when you succeed, you use it through sentences like "do the task as YOU suggested".
Astounding that you have to jump through these kinds of hoops because of "safety". They really seem committed to losing the AI race, a race they started with an enormous lead.