While it's hilarious, it's understandable IMHO - just basic statistics, she is more often used in front of pretty/ugly. It's showing a glitch in our society more than a glitch in Google AI.
This, on the other hand, looks like a hand-crafted blacklist of words that they want to remove from the language. I have no idea how I would train an AI that would classify "motherboard" as inappropriate.
> While it's hilarious, it's understandable IMHO - just basic statistics, she is more often used in front of pretty/ugly. It's showing a glitch in our society more than a glitch in Google AI.
Well, this doesn't necessarily show you facts about society because there's no particular reason to think the training set distribution is the same as anything in the real world.
It's frankly impossible that they have a training set without any biases, though I'm sure they worked to eliminate the ones they could think of (which itself would have bias).
You can make the distribution different by, e.g., duplicating some of the data a lot, which you might want to do to improve the actual purpose of the model (translation). Any other purpose (having opinions on gender) is just a coincidence and isn't being optimized for or regression tested.
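To make that concrete, here's a toy sketch. Everything in it is made up for illustration: the three-sentence corpus, the helper names `pronoun_counts` and `most_likely_pronoun`, and the naive frequency count standing in for a model. It is not how Google Translate works; it only shows that duplicating data can flip which pronoun "wins" for an adjective.

```python
from collections import Counter


def pronoun_counts(corpus, adjective):
    """Count which pronoun appears in 'PRONOUN is ADJECTIVE' sentences."""
    counts = Counter()
    for sentence in corpus:
        words = sentence.lower().rstrip(".").split()
        if len(words) == 3 and words[1] == "is" and words[2] == adjective:
            counts[words[0]] += 1
    return counts


def most_likely_pronoun(corpus, adjective):
    """Naive stand-in for a model: pick the most frequent pronoun."""
    return pronoun_counts(corpus, adjective).most_common(1)[0][0]


# Toy corpus, invented for this example only.
corpus = [
    "he is clever.",
    "he is clever.",
    "she is clever.",
]

# Duplicate one sentence five times, as you might when upweighting a source.
reweighted = corpus + ["she is clever."] * 5

print(most_likely_pronoun(corpus, "clever"))      # he  (2 vs 1)
print(most_likely_pronoun(reweighted, "clever"))  # she (6 vs 2)
```

Nothing about the world changed between the two calls, only how often one sentence shows up in the training data.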
I still unironically want a singular variant to rise in popularity, and I don't particularly care which. There are so many times when "they" comes up and I have no clue if we're talking about a person or a group. There's already enough pronoun ambiguity when talking about single subjects without introducing a dimension of plurality ambiguity.
Yes, I'm sure almost everyone reading this was already aware. This doesn't mean that "they are pretty. they are clever. they are ugly. they are good." is a good localization of "ő csinos. ő okos. ő csúnya. ő jó."
It's a computer-brain solution to a problem that is crying out for a human. Humans "don't scale". Therefore paying 5 bilingual experts to teach the machine is "infeasible", but spending millions upon millions of dollars on hardware and software development, then exposing it to the unwashed mess of the internet isn't....
I present to you the beautiful translation of Hungarian genderless sentences into English:
ő csinos. ő okos. ő csúnya. ő jó.
becomes
she is pretty. he is clever. she is ugly. he is good.
I can't wait for inclusive warnings to come to non-English languages.