Re^2: Funny characters in nodes (exactly zero)

by dmitri (Curate)
on Jul 08, 2007 at 22:42 UTC

in reply to Re: Funny characters in nodes (exactly zero)
in thread Funny characters in nodes

Most of the problem characters I looked at can be safely ignored. They are not just linefeeds, however. What I'm afraid of is that some of them may be multi-byte characters that make sense in another character set (especially since the site uses Latin1 and not UTF-8).

Re^3: Funny characters in nodes (recode)
by tye (Sage) on Jul 08, 2007 at 23:19 UTC

    Then do option 2 or 3. Option 2 is pretty simple:

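    # Escape: double each backslash, turn each problem character into \XX (hex)
    s/(\\)|([...])/ $1 ? "\\\\" : sprintf "\\%02X", ord $2 /ge;
    my @elements= parseXML();
    # Unescape each parsed element: \\ becomes \, \XX becomes the original character
    s/\\(\\|..)/ length $1 == 1 ? "\\" : chr hex $1 /ge for @elements;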

    - tye        
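
For anyone who wants to try the escape/unescape idea from the reply above in isolation, here is a minimal round-trip sketch. The names `$bad`, `escape_bad`, and `unescape_bad` are hypothetical, and the character class is only an assumption standing in for the elided `[...]` (here: control characters other than tab, LF, and CR). The XML parsing step is left out, so this only checks that escaping followed by unescaping returns the original string.

```perl
use strict;
use warnings;

# Assumption: the "problem" characters are control characters other than
# tab, LF, and CR. Replace this class with whatever breaks your parser.
my $bad = qr/[\x00-\x08\x0B\x0C\x0E-\x1F]/;

sub escape_bad {
    my ($text) = @_;
    # Double literal backslashes; turn each problem character into \XX (hex).
    $text =~ s/(\\)|($bad)/ $1 ? "\\\\" : sprintf "\\%02X", ord $2 /ge;
    return $text;
}

sub unescape_bad {
    my ($text) = @_;
    # Reverse step: \\ becomes one backslash, \XX becomes the original character.
    $text =~ s/\\(\\|[0-9A-F]{2})/ $1 eq "\\" ? "\\" : chr hex $1 /ge;
    return $text;
}

my $original = "a\\b\x01c\x1Fd";
my $escaped  = escape_bad($original);
my $restored = unescape_bad($escaped);

print "escaped: $escaped\n";    # prints: escaped: a\\b\01c\1Fd
print "round trip ok\n" if $restored eq $original;
```

Doubling the backslash first is what keeps the scheme unambiguous: a literal backslash followed by two hex digits in the input comes back as itself rather than being decoded as a character.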

