good chemistry is complicated, and a little bit messy -LW |
|
PerlMonks |
Re: HTML parsing module handles known and unknown encodingby Corion (Patriarch) |
on Nov 16, 2011 at 15:49 UTC ( [id://938400]=note: print w/replies, xml ) | Need Help?? |
It seems that XML::LibXML has thought about the problem and solved in the way that you should always pass octets to XML::LibXML. If you have an encoding handy, you're allowed to tell XML::LibXML about it, but it's not necessary. I'm not sure how well XML::LibXML works with UTF-16LE and/or UTF-16BE and BOMs - you might need to use some regular (byte-)expressions to handle the BOM yourself.
In Section
Seekers of Perl Wisdom
|
|