Under the new string model in java > 8 a fairly frequent workflow is: 1) get ext...

Twirrim · on Oct 20, 2018

You might be interested in his blog on the same subject a few days ago: https://lemire.me/blog/2018/10/16/validating-utf-8-bytes-jav...

adamretter · on Oct 20, 2018

If you are given the external string as bytes, which is all you can have if you don't know the encoding. Then steps 2,3,4 can all be done as one step I would have thought. Something like - https://github.com/adamretter/utf8-validator/blob/optimize-u...