>*NOTE: You can always find a character boundary from an arbitrary point in a st...

SethMLarson · on Feb 8, 2022

You're right, that should read "codepoint boundary" not "character boundary". I can fix that.

I do briefly mention grapheme clusters near the end, didn't want to introduce them as this article was more about the encoding mechanism itself. Maybe a future article after more research :)

nabla9 · on Feb 8, 2022

Please do. You have the best visualizations of UTF-8 I have seen so far.

Usually people write just the UTF-8 encoding part, then don't mention the rest of the Unicode, because it's clearly not as good and simple.