PerlMonks  

Re: The unicode / utf8 struggle, part 2: regexes

by mattr (Curate)
on May 22, 2007 at 09:41 UTC (#616710)


in reply to The unicode / utf8 struggle, part 2: regexes

Hi. The comments above are masterful, but since I noticed this module in the CPAN Nodelet I thought I'd mention HTML::Encoding. Apparently it helps you work out which encoding the incoming content is in, using the function mentioned above. It might even work, though I haven't used it myself. Good luck!
HTML::Encoding helps to determine the encoding of HTML and XML/XHTML documents...
use HTML::Encoding 'encoding_from_http_message';
use LWP::UserAgent;
use Encode;

my $resp = LWP::UserAgent->new->get('http://www.example.org');
my $enco = encoding_from_http_message($resp);
my $utf8 = decode($enco => $resp->content);
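
One thing the synopsis code does not cover is what happens when no usable encoding name comes back. I assume (untested, like the rest of this post) that the detection step can return undef or a name Encode does not recognise, in which case decode would die. A minimal sketch of a guard, using only the core Encode module and no network access; decode_with_fallback is a hypothetical helper of my own, not part of HTML::Encoding or Encode, and the UTF-8 default is just an assumption:

```perl
use strict;
use warnings;
use Encode qw(find_encoding);

# Hypothetical helper, not part of HTML::Encoding or Encode:
# decode $bytes using $name when Encode recognises it, otherwise
# fall back to $default (assumed here to be UTF-8).
sub decode_with_fallback {
    my ($name, $bytes, $default) = @_;
    $default = 'UTF-8' unless defined $default;

    # find_encoding returns undef for names Encode does not know.
    my $enc = defined $name ? find_encoding($name) : undef;
    $enc = find_encoding($default) unless defined $enc;
    die "Unknown fallback encoding: $default\n" unless defined $enc;

    # $bytes is already a private copy, so decoding cannot
    # disturb the caller's buffer.
    return $enc->decode($bytes);
}

my $bytes = "caf\xC3\xA9";    # "café" as UTF-8 octets
my $text  = decode_with_fallback(undef, $bytes);
printf "%d characters\n", length $text;
```

In the synopsis code this would stand in for the last line, along the lines of decode_with_fallback($enco, $resp->content). Whether silently falling back to UTF-8 is the right policy depends on your data; dying loudly may be the better choice.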
