Convert HTML symbols to equivalent Unicodeby jai_dgl (Beadle)
|on Apr 14, 2009 at 09:47 UTC||Need Help??|
jai_dgl has asked for the
wisdom of the Perl Monks concerning the following question:
I need to parse some HTML files and have to write a XML
output file. In some cases I get a XML parser error.
The symbol ® ( REGISTERED SIGN ) need to be convert to its equivalent unicode U00AE
Is there any module to convert all the special character into
its Equivalent Unicode.
I don't want Decimal Equivalent or HTML entities as this XML file should be parsed in JSON.