http://www.perlmonks.org?node_id=1084198


in reply to Re^3: length() miscounting UTF8 characters?
in thread length() miscounting UTF8 characters?

Representing the written languages of the world on computers is complex. The Unicode Standard is complex. Programming Unicode is complex. There's a limit to how much of this complexity can be hidden from computer programmers.

If you want to understand Unicode better, and how to think correctly about programming Unicode, read Tom Christiansen's excellent Stack Overflow post here. If, after reading his well-known post, you find you need more of Tom's Perl Unicode wisdom, then come back to PerlMonks and read what tchrist has written about the topic here.

  • Comment on Re^4: length() miscounting UTF8 characters?

Replies are listed 'Best First'.
Re^5: length() miscounting UTF8 characters?
by AppleFritter (Vicar) on Apr 28, 2014 at 22:15 UTC
    OK, thanks again. Whoa, that's a lot of information to digest. I'll have to meditate on this, as it were.