If your data is really encoded in UTF-16, not UTF-8 and you have characters with a code point above 127, you're going to be in a world of hurt. I also want to point out that XML::XPath has had it's last update in 2003, and still has several outstanding bugs in it (I know, I did some fixes to overcome a couple in my own processing, and even that was years ago). For XPath processing, it is HIGHLY recommended to use XML::LibXML which handles all the character encodings properly so that you don't have to mangle your input XML. Here's a PerlMonks intro to get you started: Stepping up from XML::Simple to XML::LibXML
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.
|