http://www.perlmonks.org?node_id=1115132


in reply to Re: Alphabetize in Esperanto
in thread Alphabetize in Esperanto

The entities were inserted by PM, the original contains the UTF-8 characters. To insert code with UTF-8, don't use <code>, use <pre>.
لսႽ† ᥲᥒ⚪⟊Ⴙᘓᖇ Ꮅᘓᖇ⎱ Ⴙᥲ𝇋ƙᘓᖇ

Replies are listed 'Best First'.
Re^3: Alphabetize in Esperanto
by aplonis (Pilgrim) on Jan 31, 2015 at 01:37 UTC
    So I updated using pre instead of code, but it uglified the regex brackets. I must read up on what to escape, I guess. Meanwhile, I went back to code. Will fix later, after some study.
      Perlmonks doesn't unicode, perlmonks does windows-1252, your browser does conversion to windows-1252 ... and at some point html entities are used ... perlmonks faq doesn't unicode or utf8 or utf-8 it only latin1 or windows-1252 or something like that

      so when posting to perlmonks stuff in unicode I always dd()dumper it

      use strict; use warnings; use Data::Dump; my @zam = qw( / A B C &#264; D E F G &#284; H &#292; I J &#308; K L M +N O P R S &#348; T U &#364; V Z a á b c &#265; d e é f g &#285; h &#293; i ï j &#309; k l m n o ó p r s &#349; t u &#365 +; ú v z ); Possible attempt to put comments in qw() list at - line 2. s{&#(\d+);}{chr $1}ge for @zam; dd( \@zam ); __END__ [ "/", "A", "B", "C", "\x{108}", "D" .. "G", "\x{11C}", "H", "\x{124}", "I", "J", "\x{134}", "K" .. "P", "R", "S", "\x{15C}", "T", "U", "\x{16C}", "V", "Z", "a", "\xA0", "b", "c", "\x{109}", "d", "e", "\x82", "f", "g", "\x{11D}", "h", "\x{125}", "i", "\x8B", "j", "\x{135}", "k" .. "o", "\xA2", "p", "r", "s", "\x{15D}", "t", "u", "\x{16D}", "\xA3", "v", "z", ]