PerlMonks
Encoding/decoding question
by slugger415 (Monk)
on Sep 11, 2011 at 15:33 UTC ([id://925358]=perlquestion)
slugger415 has asked for the wisdom of the Perl Monks concerning the following question:
Hello Monks,
I am processing some XHTML pages with XML::Twig. The pages contain numerous character entities, such as the one for é.

When I parse these files, the entities turn into all sorts of wonky characters that look nothing like they did in the original HTML. For example, réserve (with the é written as an entity in the source) comes out with garbled characters in place of the é.

I've tried setting keep_encoding in Twig, and the entities are preserved, but I get another set of wonky characters when that output goes to HTML.

I'm not sure how to proceed here. Any thoughts? I'm sure there's some kind of encoding/decoding step I need to do, but I'm unfamiliar with the process.

Many thanks.
Scott
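One possible reading of the symptom above, offered as an assumption rather than a diagnosis: the parsed text is written out as UTF-8, and whatever displays it treats those bytes as ISO-8859-1. The sketch below uses only the core Encode module (not XML::Twig) to show how a single é can turn into two unrelated characters that way. The variable names are illustrative only.

```perl
use strict;
use warnings;
use Encode qw(encode decode);

# "r\x{e9}serve" is the character string "réserve" (U+00E9 is e-acute).
my $text = "r\x{e9}serve";

# Written out as UTF-8, U+00E9 becomes the two bytes 0xC3 0xA9.
my $utf8_bytes = encode('UTF-8', $text);

# Assumed failure mode: a viewer treats those bytes as ISO-8859-1,
# so each byte is shown as its own character.
my $misread = decode('ISO-8859-1', $utf8_bytes);

# Decoding with the matching encoding recovers the original string.
my $roundtrip = decode('UTF-8', $utf8_bytes);

printf "characters in original: %d\n", length($text);
printf "bytes in UTF-8 output:  %d\n", length($utf8_bytes);
printf "characters if misread:  %d\n", length($misread);
print  "round trip matches\n" if $roundtrip eq $text;
```

If this is what is happening, the text itself is intact and the mismatch is between the encoding of the output and the encoding the HTML page declares.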