Beefy Boxes and Bandwidth Generously Provided by pair Networks
XP is just a number
 
PerlMonks  

crlf mess in unicode utf-16le

by b4swine (Pilgrim)
on Oct 10, 2006 at 02:30 UTC ( #577311=perlquestion: print w/ replies, xml ) Need Help??
b4swine has asked for the wisdom of the Perl Monks concerning the following question:

Windows once again does wonderful things without being asked: I am writing unicode to a file, to be read back in again. utf-16le has 2 byte or 4 byte characters, but on windows, if one of these bytes is a CR, it is automatically "corrected" to a CR LF, and I end up with an odd number of bytes, and a mess when I try to read this in with the same encoding. What a mess. And since I am already using binmode, I don't know how to set it to 'binary'. It does not work to just add a ':raw' or ':bytes' to the encodoing.
$_ = "\x{ff0a}\x{1234}"; open OUT, ">temp.txt"; binmode OUT, ":encoding(UTF-16le)"; print OUT; close OUT; # The output file contains FIVE bytes: 0D 0A FF 34 12 open IN, "<temp.txt"; binmode IN, ":encoding(UTF-16le)"; <IN>; # this line causes: UTF-16LE:Partial character at temp.pl lin +e 12. close IN;

Comment on crlf mess in unicode utf-16le
Download Code
Re: crlf mess in unicode utf-16le
by ikegami (Pope) on Oct 10, 2006 at 03:18 UTC

    Apparently, the encoding layer does not remove underlying layers. A solution (workaround?) is to push raw first.

    open OUT, ">temp.txt"; binmode OUT, ":raw:encoding(UTF-16le)";

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: perlquestion [id://577311]
Approved by GrandFather
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others avoiding work at the Monastery: (15)
As of 2014-07-31 20:25 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    My favorite superfluous repetitious redundant duplicative phrase is:









    Results (253 votes), past polls