
Re: Mapping ACCEPT_LANG, USER_AGENT & GeoIP to Encode's character sets

by Your Mother (Canon)
on Jun 22, 2012 at 04:26 UTC (#977762)


in reply to Mapping ACCEPT_LANG, USER_AGENT & GeoIP to Encode's character sets

This might get you started:

use strictures;
use Encode;

my $name = shift || die "Give an encoding!\n";
my $input = shift || "Some string...";

my $encoding = find_encoding($name)
    or die "No encoding found for $name\n";

binmode STDOUT, ":encoding(UTF-8)";
print $encoding->decode($input), $/;

__END__
perl pm-977749 MacIcelandic "OHAI Ƌ"
OHAI ∆

perl pm-977749 MacRoman  "OHAI Ƌ"
OHAI ∆

perl pm-977749 UTF-8 "OHAI Ƌ"
OHAI �

Basically, just call find_encoding on the encoding name the client declares, reject unknown names (or customize to handle them), and then decode. For customizing, see the Pod for Encode. Of the thousands of named encodings out there, most line up with the stock list Encode is aware of; you just might have to do some mapping of your own. I seem to recall the EUC-KR set having several different names in various standards, for example.
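A rough sketch of that "mapping of your own" step, in case it helps. The %ALIAS table and the decode_declared helper are made up for illustration, and the label in the table is only an example of a name a client might send; check what your clients actually declare. The only Encode calls used are find_encoding and the decode method.

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# Hypothetical alias table: lowercased client-declared label => a name
# Encode knows. Fill it in from whatever labels you see in the wild.
my %ALIAS = (
    'x-mac-roman' => 'MacRoman',
);

# Hypothetical helper: returns the decoded string, or nothing when the
# label is unknown, so the caller can reject it or fall back.
sub decode_declared {
    my ($label, $bytes) = @_;
    my $name     = $ALIAS{ lc $label } // $label;
    my $encoding = find_encoding($name) or return;
    return $encoding->decode($bytes);
}

binmode STDOUT, ":encoding(UTF-8)";
my $text = decode_declared('x-mac-roman', "OHAI \xC6");
print defined $text ? $text : "unknown encoding", "\n";
```

Anything not in the table falls through to find_encoding unchanged, so the stock names keep working and you only list the oddballs.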

