Perl: the Markov chain saw | |
PerlMonks |
Re^2: HTML parsing module handles known and unknown encodingby ikegami (Patriarch) |
on Nov 16, 2011 at 18:55 UTC ( [id://938438]=note: print w/replies, xml ) | Need Help?? |
That works fine for XML since XML must specify its encoding within the document (binary format), but not so much with HTML where the encoding is specified outside of the document (text format). I don't see any way of specifying the encoding of an HTML document, which is weird because XML::LibXML supposedly handles HTML. XML::LibXML handles UTF-16 just fine.
In Section
Seekers of Perl Wisdom
|
|