in reply to Re^2: Sorting/Cleansing a Duplicate File in thread Sorting/Cleansing a Duplicate File
- see current thread Problems handling UTF8 ! And removing accents.
-
I have bookmarked Re: UTF-8 for Everything (mostly because of the links in there)
-
In my scripts (mostly Windows console), for data (text) files I often use
open (my $fh, '<:encoding(Windows-1252)', $fname) or die "cannot open
+$fname: $!";
and for output intended for the console window I use binmode STDOUT, ':encoding(cp437)'; # or cp850
(if your data really is UTF-8, then "encoding(utf-8)" should do the right thing
-
Also of interest: perluniintro and perlopentut
|