Beefy Boxes and Bandwidth Generously Provided by pair Networks
Perl Monk, Perl Meditation

Unicode in bz2 compressed files

by Anonymous Monk
on Sep 04, 2011 at 20:52 UTC ( #924119=perlquestion: print w/replies, xml ) Need Help??
Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Dear Monks,

I have been using Compress::Bzip2 to uncompress & compress files for a while now. However, now I need to work with files in unicode.

I know about

binmode(STDIN, ":utf8"); binmode(STDOUT, ":utf8"); binmode(STDERR, ":utf8");
and open FILE, '<:utf8', $file when reading from/writing to normal text files, but I could not find anything with Google in this case.

Is there a way to tell Perl the files are in Unicode when reading from/writing to them using

my $bzIn = bzopen($in, "rb") or die "Can't open stdin: $bzerrno\n"; my $bzOut = bzopen($out, "wb") or die "Can't open stdout: $bzerrno\n";

Am I going to have to switch to another package for compression, provided I find one that can deal with unicode, or am I going to have to do it the old fashioned way by decompressing first and then processing the plain text files?

Any suggestion here?

Replies are listed 'Best First'.
Re: Unicode in bz2 compressed files
by Anonymous Monk on Sep 05, 2011 at 06:26 UTC

      Thank you, wise friend.

      Although I had read the tutorials, I hadn't quite understood what this manual encoding and decoding was about, and how it applied in my case. Your example has made it clear to me.

Log In?

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://924119]
Approved by davies
and all is quiet...

How do I use this? | Other CB clients
Other Users?
Others wandering the Monastery: (4)
As of 2018-06-18 00:56 GMT
Find Nodes?
    Voting Booth?
    Should cpanminus be part of the standard Perl release?

    Results (107 votes). Check out past polls.