Beefy Boxes and Bandwidth Generously Provided by pair Networks
Your skill will accomplish
what the force of many cannot
 
PerlMonks  

Re: Help needed understanding unicode in perl

by moritz (Cardinal)
on Dec 11, 2009 at 13:45 UTC ( #812378=note: print w/ replies, xml ) Need Help??


in reply to Help needed understanding unicode in perl

First question. If I call a function, how do I know if it returns a unicode string or not

By reading the documentation. Core functions in perl (like for example uc) return a string of the same kind as they receive: Unicode in, Unicode out. Modules should(!) document what they return.

How is a unicode string encoded (utf8 or 16 etc?)

If you mean by "Unicode string" a string in perl's internal format, you shouldn't (and don't have to) care how it's encoded.

Second question. If I have a unicode string, how do I output it to my console window so that it appears correctly?

If it's a string in perl's internal format, you need to add an encoding layer. For testing, use this small snippet:

use strict; use warnings; use charnames qw(:full); binmode STDOUT, ":encoding(UTF-8)"; print "Euro: \N{EURO SIGN}\n"

If this prints an Euro sign, your terminal works with UTF-8. If it shows Mojibake, configure your terminal to accept UTF-8.

See also: Encodings, Unicode and Perl. It seems that you grasped the most important concepts about Unicode and Perl already, you just need to get a bit more familiar with how Perl handles it.

Perl 6 - links to (nearly) everything that is Perl 6.


Comment on Re: Help needed understanding unicode in perl
Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://812378]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others musing on the Monastery: (15)
As of 2015-07-31 17:39 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (279 votes), past polls