PerlMonks  

Re^4: Using encoding

by nikosv (Hermit)
on Jan 14, 2013 at 12:56 UTC (#1013211)


in reply to Re^3: Using encoding
in thread Using encoding

Try importing the text as UTF-16LE, or check the thread "Mixed character encoding issues" and my answers there for a relevant workaround.
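For illustration, reading text as UTF-16LE might look roughly like the sketch below. The sample bytes and the in-memory filehandle are assumptions made only so the snippet is self-contained; with a real file you would pass its path to open instead of the scalar reference.

```perl
use strict;
use warnings;

# Hypothetical sample data: "£5" plus a newline, encoded as UTF-16LE
# (each character is two bytes, low byte first).
my $bytes = "\xA3\x00\x35\x00\x0A\x00";

# Open an in-memory filehandle with a decoding layer.
# For a real file, replace \$bytes with the file name.
open my $in, '<:encoding(UTF-16LE)', \$bytes
    or die "Cannot open input: $!";
my $line = <$in>;
close $in;

# $line now holds decoded characters, not raw bytes.
chomp $line;
printf "%d characters, first is U+%04X\n", length($line), ord($line);
```

Note that the UTF-16LE layer does not strip a byte order mark; if the file starts with one, the plain UTF-16 encoding name may be the better fit, since it uses the mark to pick the byte order.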


Re^5: Using encoding
by MorayJ (Acolyte) on Jan 14, 2013 at 18:03 UTC

    I tried UTF-16LE, but it produced the error "UTF-16LE:Partial character", so that didn't work.

    Taking all encoding off the input file (presumably letting it read as ASCII) allowed me to substitute

    $col =~ s/\x{B6}\x{9C}/\x{A3}/g;

    as long as my output file was encoded with ISO-8859-1

    That encoding (ISO-8859-1) works for the input file as well. So I guess I was leading people astray by suggesting using utf8.

    I now have £ signs appearing in the finished file. However, it also contains little squares, which seem to come from return markers the users have put in.

    Since the file can be read without an encoding layer, I would have thought it would just be a case of s/\r\n/\n/, but this doesn't seem to work.

    Alternatively, is there a way to say "Perl, if you don't recognise the character, blitz it!"?

    I've read quite a few web pages about the whole thing, but I don't seem to be quite getting it.

    Any further help much appreciated
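The workflow described in the post above can be sketched as follows. This is only a guess at the setup, not the poster's actual script: the sample line, the in-memory filehandles, and the variable names other than $col are hypothetical. It also assumes the "little squares" are carriage returns, which the post does not confirm.

```perl
use strict;
use warnings;

# Hypothetical sample line: the bytes B6 9C stand in for the pound sign,
# and the line ends with a Windows-style CR LF.
my $raw = "Price: \xB6\x9C" . "10\r\n";

# Read with :raw so no decoding and no CRLF translation takes place.
open my $in, '<:raw', \$raw or die "Cannot open input: $!";
my $col = <$in>;
close $in;

# The substitution from the post: replace the byte pair with \xA3,
# which is the pound sign in ISO-8859-1.
$col =~ s/\x{B6}\x{9C}/\x{A3}/g;

# Normalise line endings. The /g flag handles every occurrence, and the
# optional \n means lone carriage returns are converted as well.
$col =~ s/\r\n?/\n/g;

# Write the result as ISO-8859-1 (here into an in-memory buffer).
my $out_bytes = '';
open my $out, '>:raw:encoding(ISO-8859-1)', \$out_bytes
    or die "Cannot open output: $!";
print {$out} $col;
close $out;
```

One possible reason the plain s/\r\n/\n/ had no visible effect is that, without /g, it replaces only the first match in the string, and it never touches a carriage return that is not followed by a newline.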
