http://www.perlmonks.org?node_id=1045125


in reply to Re^2: Perl code for XML/HTML entities:
in thread Perl code for XML/HTML entities:

This is most likely an encoding problem.

Find out which encoding the data is in, and either declare the encoding at the top of the XML file using an XML declaration, or convert the data to UTF-8 before feeding it to the parser.

If you're in the US/UK, the encoding is probably ISO-8859-1 (or ISO-8859-15 if the data includes the euro symbol).

Replies are listed 'Best First'.
Re^4: Perl code for XML/HTML entities:
by Narend (Initiate) on Jul 18, 2013 at 16:20 UTC
    Thanks mirod, for pointing out. Yes, I did use the xml feed encoding as <?xml version="1.0" encoding="ISO-8859-1"?>. That throws interesting challenge. [XxXXXxx® II xxxxxx xxxxxxxxxx xxx but fails for XxXXXxx® 77 xxxxxx xxxxxxxxxx xxx So if there are numbers after the symbol it does not recognize now. If it is chars, seems to understand that. Any further pointers pls ?