Re: Alphabetize in Esperanto

by ohcamacj (Beadle)
on Jan 30, 2015 at 19:28 UTC (#1115121)

in reply to Alphabetize in Esperanto

split(//, ) will break apart html entities into "&" "#" "0" "0" "0" ";" or similar. And the "�" in @zam will never be used.
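To illustrate the point about split, here is a minimal sketch. The sample string is hypothetical (it is not taken from the original post), and the token regex is just one possible way to keep an entity in one piece; the decoding substitution is the one used further down in this thread.

```perl
use strict;
use warnings;

# Hypothetical sample: "&#293;" is the numeric entity for U+0125.
my $word = 'e&#293;o';

# Splitting on the empty pattern yields single characters,
# so the entity falls apart into '&', '#', '2', '9', '3', ';'.
my @naive = split //, $word;

# Matching an entity as one token keeps it whole.
my @tokens = $word =~ /(&#\d+;|.)/gs;

# Decoding the entity first gives real characters to split.
(my $decoded = $word) =~ s{&#(\d+);}{chr $1}ge;
my @chars = split //, $decoded;

printf "%d naive, %d tokens, %d chars\n",
    scalar @naive, scalar @tokens, scalar @chars;
# prints "8 naive, 3 tokens, 3 chars"
```

Either approach leaves one list element per letter, which is what a per-character alphabet lookup needs.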

Besides, I am happy that Esperanto is growing more popular.

Re^2: Alphabetize in Esperanto
by choroba (Archbishop) on Jan 30, 2015 at 20:36 UTC
    The entities were inserted by PM, the original contains the UTF-8 characters. To insert code with UTF-8, don't use <code>, use <pre>.
    لսႽ ᥲᥒ⚪⟊Ⴙᘓᖇ Ꮅᘓᖇ⎱ Ⴙᥲ𝇋ƙᘓᖇ
      So I updated using pre instead of code, but it uglified the regex brackets. I must read up on what to escape, I guess. Meanwhile, I went back to code. Will fix later, after some study.
        PerlMonks doesn't do Unicode; it does windows-1252. Your browser converts what you post to windows-1252, and at some point HTML entities get used for whatever doesn't fit. The PerlMonks FAQ doesn't cover Unicode or UTF-8; the site is Latin-1 or windows-1252 only, or something like that.

        So when I post Unicode material to PerlMonks, I always dump it with dd() from Data::Dump first:

        use strict; use warnings; use Data::Dump;
        my @zam = qw(
            / A B C &#264; D E F G &#284; H &#292; I J &#308; K L M N O P R S &#348; T U &#364; V Z
            a b c &#265; d e f g &#285; h &#293; i j &#309; k l m n o p r s &#349; t u &#365; v z
        );
        s{&#(\d+);}{chr $1}ge for @zam;
        dd( \@zam );
        __END__
        Possible attempt to put comments in qw() list at - line 2.
        [
          "/", "A", "B", "C", "\x{108}", "D" .. "G", "\x{11C}", "H", "\x{124}",
          "I", "J", "\x{134}", "K" .. "P", "R", "S", "\x{15C}", "T", "U",
          "\x{16C}", "V", "Z", "a", "\xA0", "b", "c", "\x{109}", "d", "e",
          "\x82", "f", "g", "\x{11D}", "h", "\x{125}", "i", "\x8B", "j",
          "\x{135}", "k" .. "o", "\xA2", "p", "r", "s", "\x{15D}", "t", "u",
          "\x{16D}", "\xA3", "v", "z",
        ]
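        For the other direction, a rough sketch of the round trip follows. It is only an illustration: to_entities and from_entities are hypothetical helper names, and the encoding step is an assumption about what such a conversion could look like, not the code PerlMonks or a browser actually runs. The decoding step is the same substitution as in the post above.

```perl
use strict;
use warnings;

# Encode: replace every non-ASCII character with a decimal numeric entity.
# (Hypothetical helper, not PerlMonks' own code.)
sub to_entities {
    my ($text) = @_;
    $text =~ s/([^\x00-\x7F])/sprintf '&#%d;', ord $1/ge;
    return $text;
}

# Decode: turn decimal numeric entities back into characters.
sub from_entities {
    my ($text) = @_;
    $text =~ s{&#(\d+);}{chr $1}ge;
    return $text;
}

# U+0109 followed by "u", written with an escape so this source stays ASCII.
my $original = "\x{109}u";
my $encoded  = to_entities($original);
my $restored = from_entities($encoded);

print "$encoded\n";    # prints "&#265;u"
print $restored eq $original ? "round trip ok\n" : "round trip failed\n";
```

        Because the encoded form is plain ASCII, it should survive a paste into code tags, and readers can decode it again on their side.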
Re^2: Alphabetize in Esperanto
by japh2963 (Novice) on Feb 05, 2015 at 03:37 UTC
    I work with Unicode often enough to hate it, but not frequently enough to remember what I did last time. Thanks for the post. I'm sure this will come in handy one day.