Beefy Boxes and Bandwidth Generously Provided by pair Networks
Keep It Simple, Stupid
 
PerlMonks  

Re: Help needed understanding unicode in perl

by moritz (Cardinal)
on Dec 11, 2009 at 13:45 UTC ( #812378=note: print w/replies, xml ) Need Help??


in reply to Help needed understanding unicode in perl

First question. If I call a function, how do I know if it returns a unicode string or not

By reading the documentation. Core functions in perl (like for example uc) return a string of the same kind as they receive: Unicode in, Unicode out. Modules should(!) document what they return.

How is a unicode string encoded (utf8 or 16 etc?)

If you mean by "Unicode string" a string in perl's internal format, you shouldn't (and don't have to) care how it's encoded.

Second question. If I have a unicode string, how do I output it to my console window so that it appears correctly?

If it's a string in perl's internal format, you need to add an encoding layer. For testing, use this small snippet:

use strict; use warnings; use charnames qw(:full); binmode STDOUT, ":encoding(UTF-8)"; print "Euro: \N{EURO SIGN}\n"

If this prints an Euro sign, your terminal works with UTF-8. If it shows Mojibake, configure your terminal to accept UTF-8.

See also: Encodings, Unicode and Perl. It seems that you grasped the most important concepts about Unicode and Perl already, you just need to get a bit more familiar with how Perl handles it.

Perl 6 - links to (nearly) everything that is Perl 6.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://812378]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others examining the Monastery: (4)
As of 2016-10-01 21:49 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    How many different varieties (color, size, etc) of socks do you have in your sock drawer?






    Results (8 votes). Check out past polls.