Beefy Boxes and Bandwidth Generously Provided by pair Networks
Problems? Is your data what you think it is?

Re: UTF8 support for CSV, PDF formats

by Eliya (Vicar)
on Jan 06, 2012 at 14:11 UTC ( #946601=note: print w/replies, xml ) Need Help??

in reply to UTF8 support for CSV, PDF formats

As for the CSV files, perhaps adding a BOM would help?  AFAIK, it is widely used on the Windows platform (even with UTF-8) to both indicate that the files do have Unicode content, and to specify the particular encoding being used (UTF-8, UTF-16le, etc.).

For this, the first thing you write to the file should be the BOM (\x{feff}):

my $fname = 'output.csv'; open my $fh, ">:utf8", $fname or die "couldn't open '$fname': $!"; print $fh "\x{feff}"; ...

Replies are listed 'Best First'.
Re^2: UTF8 support for CSV, PDF formats
by vishi83 (Pilgrim) on Jan 10, 2012 at 11:52 UTC
    Thanks for your response. Using BOM, did work for CSV format. I'm able to see the data properly now!
    However, it dint work for PDFs. I"m still trying to figure out a way for that.

    Any thoughts?
    A perl Script without 'strict' is like a House without Roof; Both are not Safe;

      yes, buy a pdf library  open (FILE, ">:encoding(UTF-8)", 'output.pdf'); is a mistake 99% of the time

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: note [id://946601]
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others meditating upon the Monastery: (4)
As of 2017-01-17 17:13 GMT
Find Nodes?
    Voting Booth?
    Do you watch meteor showers?

    Results (158 votes). Check out past polls.