Beefy Boxes and Bandwidth Generously Provided by pair Networks
No such thing as a small change

Re^3: Using encoding

by MorayJ (Beadle)
on Jan 14, 2013 at 12:37 UTC ( #1013207=note: print w/replies, xml ) Need Help??

in reply to Re^2: Using encoding
in thread Using encoding

Ok, I think that makes sense. So ord is not what I'm after

What's the best way to find 'funny' characters in a text file, and to translate them into meaningful characters in a text/unicode file?

I'm assuming that it's me that's making this difficult and it's probably quite straight forward

Replies are listed 'Best First'.
Re^4: Using encoding
by nikosv (Chaplain) on Jan 14, 2013 at 12:56 UTC

      I tried the UTF16LE and it was producing the error "UTF-16LE:Partial character", so it wouldn't work

      Taking all encoding off the input file (presumably letting it read as ASCII) allowed me to substitute

      $col =~ s/\x{B6}\x{9C}/\x{A3}/g;

      as long as my output file was encoded with ISO-8859-1

      That encoding (ISO-8859-1) works for the input file as well. So I guess I was leading people astray by suggesting using utf8.

      I now have signs appearing in the finished file. However, it also contains little squares which seem to be from return markers the users have put in.

      I would have thought that with this file able to be input without encoding that it would be a case of s/\r\n/\n/, but this doesn't seem to work

      Alternatively, is there a way to say "Perl, if you don't recognise the character, blitz it!"?

      I've read quite a few web pages about the whole thing, but I don't seem to be quite getting it.

      Any further help much appreciated

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://1013207]
[Discipulus]: sure! probably the last chance to choice my holidays, 3 weeks

How do I use this? | Other CB clients
Other Users?
Others meditating upon the Monastery: (9)
As of 2018-06-25 18:22 GMT
Find Nodes?
    Voting Booth?
    Should cpanminus be part of the standard Perl release?

    Results (128 votes). Check out past polls.