Beefy Boxes and Bandwidth Generously Provided by pair Networks
Come for the quick hacks, stay for the epiphanies.
 
PerlMonks  

comment on

( [id://3333]=superdoc: print w/replies, xml ) Need Help??

Getting a bit OT here, but what the heck.

The Unicode standard has always guaranteed a 1:1 two-way mapping with existing character sets, and it contains a number of duplicate characters (in special compatibility regions) to cope with precisely the problem you mention.

No, the perceived problem with Unicode, when it was first introduced, was that certain characters are traditionally written slightly differently in China and in Japan, and Unicode unified each set of equivalent characters to have a single codepoint. IIRC the Japanese mistakenly assumed that this meant Unicode was trying to force them to use Chinese character shapes, which would obviously have signalled the end of the world.

Once an anti-Unicode movement had come into existence, they subsequently came up with various other kinds of anti-Unicode FUD; for example, one popular criticism is that Unicode does not necessarily include certain incredibly rare characters used only for writing ancient personal names (not that these were included in the then-standard JIS encodings either), while another, slightly more valid complaint was that UTF-8 required three bytes for every Japanese character rather than two bytes in Shift_JIS (of course, UTF-16 solves that).

Anyway, the upshot of all this was that many came to believe that Unicode was inherently unsuitable for encoding Japanese text, hence Ruby's failure to support it initially. Thankfully the resistance seems to be dying, perhaps aided by the fact that the Japanese language still exists even though Windows has been using Unicode internally for many years now...


In reply to Re^5: "When closures capture their context" and "scope gotchas in Javascript" by Porculus
in thread "When closures capture their context" and "scope gotchas in Javascript" by clinton

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post; it's "PerlMonks-approved HTML":



  • Are you posting in the right place? Check out Where do I post X? to know for sure.
  • Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
    <code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
  • Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
  • Want more info? How to link or How to display code and escape characters are good places to start.
Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others musing on the Monastery: (5)
As of 2024-04-26 08:09 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found