PerlMonks
Re^2: UTF-8: Trying to make sense of form input
by ikegami (Pope) on Sep 17, 2009 at 21:35 UTC (#795984)

I think you mean UTF-8. UTF-8 is a character encoding: a means of converting characters to and from bytes for use in media that have no concept of characters.

HTML::Entities works with characters, not with the bytes those characters become once they are encoded. It knows nothing about any character encoding (such as UTF-8), since it only works with characters.

The HTML portions you pass to decode_entities must first be decoded from bytes into characters (based on the encoding specified in the Content-Type header). Similarly, the HTML portions you receive from encode_entities must then be encoded from characters into bytes (based on the encoding specified in the Content-Type header).
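
The order of operations described above can be sketched roughly as follows. This is only an illustration, not code from the original thread: the sample input, the variable names, and the assumption that the Content-Type header declared UTF-8 are all made up for the example. It uses the core Encode module alongside HTML::Entities.

```perl
use strict;
use warnings;
use Encode qw(decode encode);
use HTML::Entities qw(decode_entities encode_entities);

# Hypothetical input: the raw bytes of an HTML fragment as received,
# assuming the Content-Type header declared charset=UTF-8.
my $html_bytes = "caf\xC3\xA9 &amp; cr\xC3\xA8me";

# Incoming: first bytes -> characters, then entities -> characters.
my $html_chars = decode('UTF-8', $html_bytes);
my $text       = decode_entities($html_chars);

# Outgoing: first characters -> entities, then characters -> bytes.
# Only the markup-significant characters are escaped here (an assumption
# made for this example), so the accented letters stay literal and the
# final encode step is what turns them into UTF-8 bytes.
my $escaped   = encode_entities($text, '<>&"');
my $out_bytes = encode('UTF-8', $escaped);

printf "%d bytes in, %d characters of text, %d bytes out\n",
    length($html_bytes), length($text), length($out_bytes);
```

If the page were served in some other encoding, the same two calls to decode and encode would take that encoding's name instead of 'UTF-8'; the calls to HTML::Entities would not change, since they never see bytes.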