I also got a few characters of would-be-text into a couple of images. As you said, the net was probably trained with images that included cat memes and it "learned" that meme-text means "cat".
Makes me wonder if it is possible for deep learning to create an artificial language. Supposedly Lojban's vocabulary was created by an algorithm, although how it got "mlatu" for cat is obscure.
> At least one word was found in each of the six source languages (Chinese, English, Hindi, Spanish, Russian, Arabic) corresponding to the proposed gismu. This word was rendered into Lojban phonetics rather liberally: consonant clusters consisting of a stop and the corresponding fricative were simplified to just the fricative (“tc” became “c”, “dj” became “j”) and non-Lojban vowels were mapped onto Lojban ones. Furthermore, morphological endings were dropped. The same mapping rules were applied to all six languages for the sake of consistency.
> ...
https://imgur.com/a/IBudXk6