That has nothing to do with UTF-8; that's a Unicode issue, and one that's entire... | Hacker News


amake 16 days ago | on: It’s not wrong that "\u{1F926}\u{1F3FC}\u200D\u264...

That has nothing to do with UTF-8; it's a Unicode issue, and one that's entirely inescapable if you are the Unicode Consortium and your goal is to be compatible with all legacy charsets.

degamad 15 days ago

Yep, that's the point I was making: choosing fixed 4-byte code points doesn't significantly reduce the complexity of capturing everything that Unicode does.
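A minimal Python sketch of that point, using an illustrative pair of strings that is not taken from the linked article: the precomposed "é" (U+00E9, which Unicode carries over from Latin-1) and "e" followed by a combining acute accent (U+0301). The helper name `utf32_units` is made up for this example. Even when every code point occupies exactly 4 bytes, the same user-perceived character can still have two different lengths.

```python
import unicodedata

# Two spellings of the same user-perceived character (illustrative choice,
# not the string discussed in the linked article).
composed = "\u00e9"     # precomposed "é", present for legacy charset compatibility
decomposed = "e\u0301"  # "e" followed by COMBINING ACUTE ACCENT


def utf32_units(s: str) -> int:
    """Count fixed 4-byte units in the UTF-32 encoding of s (hypothetical helper)."""
    # "utf-32-le" writes no byte order mark, so bytes / 4 equals the code point count.
    return len(s.encode("utf-32-le")) // 4


print(utf32_units(composed))    # 1
print(utf32_units(decomposed))  # 2
print(composed == decomposed)   # False

# Normalization, not the choice of encoding, is what reconciles the two spellings.
print(unicodedata.normalize("NFC", decomposed) == composed)  # True
```

Swapping UTF-8 for a fixed-width encoding changes how many bytes each code point takes, but not the fact that comparing or counting "characters" needs normalization and grapheme segmentation on top.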

eru 13 days ago

Thanks for explaining!
