slugger415 has asked for the wisdom of the Perl Monks concerning the following question:
Hello esteemed monks, I am using Text::CSV to parse an array of text strings (pipe delimited) and want to use UTF-8 encoding to read the strings. In the doc at https://metacpan.org/pod/Text::CSV#new I see this instruction:
On parsing (both for "getline" and "parse"), if the source is marked being UTF8, then all fields that are marked binary will also be marked UTF8.
I have set my 'new' instance to binary, and it mostly works, except some accented characters are showing up in the resulting web page as black diamond question marks, e.g. conexi�n. (Japanese and other language characters look fine.) Is there something else I need to set? If I don't use Text::CSV and just 'split' the strings, those characters look fine, and correct.
my $csv = Text::CSV->new ({ binary => 1, sep_char => "|" }); foreach my $row (@sorted_urls){ $csv->parse($row); # processing }
Thank you.
|
---|
Replies are listed 'Best First'. | |
---|---|
Re: Text::CSV encoding parse()
by haukex (Archbishop) on Aug 13, 2019 at 18:05 UTC | |
by slugger415 (Monk) on Aug 13, 2019 at 18:45 UTC | |
by haukex (Archbishop) on Aug 13, 2019 at 19:44 UTC | |
by slugger415 (Monk) on Aug 14, 2019 at 17:43 UTC | |
by haukex (Archbishop) on Aug 14, 2019 at 19:55 UTC | |
| |
by choroba (Cardinal) on Aug 14, 2019 at 19:35 UTC | |
| |
by jcb (Parson) on Aug 14, 2019 at 03:24 UTC | |
by slugger415 (Monk) on Aug 14, 2019 at 17:48 UTC | |
by afoken (Chancellor) on Aug 14, 2019 at 23:03 UTC | |
by jcb (Parson) on Aug 14, 2019 at 23:55 UTC | |
|