Beefy Boxes and Bandwidth Generously Provided by pair Networks
Do you know where your variables are?
 
PerlMonks  

Re: Mapping ACCEPT_LANG, USER_AGENT & GeoIP to Encode's character sets

by Your Mother (Canon)
on Jun 22, 2012 at 04:26 UTC ( #977762=note: print w/ replies, xml ) Need Help??


in reply to Mapping ACCEPT_LANG, USER_AGENT & GeoIP to Encode's character sets

This might get you started–

use strictures; use Encode; my $name = shift || die "Give an encoding!\n"; my $input = shift || "Some string..."; my $encoding = find_encoding($name) or die "No encoding found for $name\n"; binmode STDOUT, ":encoding(UTF-8)"; print $encoding->decode($input), $/; __END__
perl pm-977749 MacIcelandic "OHAI Ƌ"
OHAI ∆

perl pm-977749 MacRoman  "OHAI Ƌ"
OHAI ∆

perl pm-977749 UTF-8 "OHAI Ƌ"
OHAI �

Basically, just find_encoding as declared by client, rejecting unknowns or customizing to handle them, and then decode. For customizing see the Pod for Encode and realize that of the thousands of named encodings out there, they mostly line up with the stock list Encode is aware of, you just might have to do some mapping of your own; I seem to recall the EUC-KR set having several different names in various standards for example.


Comment on Re: Mapping ACCEPT_LANG, USER_AGENT & GeoIP to Encode's character sets
Select or Download Code

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://977762]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others musing on the Monastery: (5)
As of 2014-09-17 23:49 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    How do you remember the number of days in each month?











    Results (100 votes), past polls