Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation
 
PerlMonks  

Re^3: HTML parsing module handles known and unknown encoding

by ambrus (Abbot)
on Nov 17, 2011 at 08:50 UTC ( #938544=note: print w/ replies, xml ) Need Help??


in reply to Re^2: HTML parsing module handles known and unknown encoding
in thread HTML parsing module handles known and unknown encoding

I don't see any way of specifying the encoding of an HTML document, which is weird because XML::LibXML supposedly handles HTML.

The docs of XML::LibXML::Parser says under the heading PARSER OPTIONS that there's a parser option encoding which sets the “character encoding of the input” for HTML.


Comment on Re^3: HTML parsing module handles known and unknown encoding
Download Code
Re^4: HTML parsing module handles known and unknown encoding
by ikegami (Pope) on Nov 17, 2011 at 09:56 UTC
    Awesome. Don't know how I missed it.

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://938544]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others surveying the Monastery: (10)
As of 2015-07-04 13:16 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    The top three priorities of my open tasks are (in descending order of likelihood to be worked on) ...









    Results (60 votes), past polls