in reply to Re^2: HTML parsing module handles known and unknown encoding
in thread HTML parsing module handles known and unknown encoding
I don't see any way of specifying the encoding of an HTML document
Yes, HTML encoding is specified in the HTTP headers, but you can use the 'http-equiv' attribute on a <meta> tag to include arbitrary headers in your HTML. For example:
<meta http-equiv="content-type" content="text/html; charset=utf-8" + />
Of course this will really only work in cases where the encoding is some superset of ASCII (like iso8859-*, utf8 etc).
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^4: HTML parsing module handles known and unknown encoding
by ikegami (Patriarch) on Nov 16, 2011 at 22:19 UTC | |
by grantm (Parson) on Nov 17, 2011 at 23:42 UTC |
In Section
Seekers of Perl Wisdom